@buaa_smat/hometrans 0.1.0 → 0.1.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +141 -124
- package/agents/build-fixer.md +1 -0
- package/agents/code-review-fix.md +1 -0
- package/agents/code-reviewer.md +1 -0
- package/agents/logic-coding.md +1 -0
- package/agents/logic-context-builder.md +1 -0
- package/agents/review-fixer.md +1 -0
- package/agents/self-test-fixer.md +1 -0
- package/agents/self-tester.md +260 -233
- package/agents/spec-generator.md +1 -0
- package/agents/test-tools/autotest/README.md +223 -0
- package/agents/test-tools/autotest/config.yaml.example +58 -0
- package/agents/test-tools/autotest/pyproject.toml +16 -0
- package/agents/test-tools/autotest/report_tool.py +759 -0
- package/agents/test-tools/autotest/self_test_runner.py +773 -0
- package/agents/test-tools/autotest/testcases_schema.md +143 -0
- package/agents/test-tools/autotest/testcases_tool.py +215 -0
- package/agents/test-tools/autotest/uv.lock +3156 -0
- package/agents/test-tools/harmony_autotest-0.1.0-py3-none-any.whl +0 -0
- package/agents/test-tools/hypium-6.1.0.210-py3-none-any.whl +0 -0
- package/agents/test-tools/hypium_mcp-0.6.5-py3-none-any.whl +0 -0
- package/agents/test-tools/xdevice-6.1.0.210-py3-none-any.whl +0 -0
- package/agents/test-tools/xdevice_devicetest-6.1.0.210-py3-none-any.whl +0 -0
- package/agents/test-tools/xdevice_ohos-6.1.0.210-py3-none-any.whl +0 -0
- package/dist/cli/config-store.js +27 -2
- package/dist/cli/config.js +17 -6
- package/dist/cli/index.js +3 -2
- package/dist/cli/init.js +135 -22
- package/dist/cli/mcp.js +2 -2
- package/dist/context/index.js +165 -69
- package/package.json +59 -60
- package/skills/code-dev-review-fix/SKILL.md +279 -0
- package/skills/code-dev-review-fix-workspace/evals/evals.json +56 -0
- package/skills/code-dev-review-fix-workspace/iteration-1/routing-results.md +23 -0
- package/skills/convert_pipeline/SKILL.md +423 -439
- package/skills/hmos-resources-convert/SKILL.md +623 -0
- package/skills/hmos-resources-convert/evals/evals.json +171 -0
- package/skills/hmos-resources-convert/references/conversion-rules.md +663 -0
- package/skills/hmos-resources-convert/references/dependency-analysis-rules.md +388 -0
- package/skills/hmos-resources-convert/references/resource-mapping-rules.md +457 -0
- package/skills/hmos-resources-convert/references/xml-drawable-to-svg-rules.md +513 -0
- package/skills/hmos-resources-convert/template/AppScope/app.json5 +10 -0
- package/skills/hmos-resources-convert/template/AppScope/resources/base/element/string.json +8 -0
- package/skills/hmos-resources-convert/template/AppScope/resources/base/media/background.png +0 -0
- package/skills/hmos-resources-convert/template/AppScope/resources/base/media/foreground.png +0 -0
- package/skills/hmos-resources-convert/template/AppScope/resources/base/media/layered_image.json +7 -0
- package/skills/hmos-resources-convert/template/build-profile.json5 +42 -0
- package/skills/hmos-resources-convert/template/code-linter.json5 +32 -0
- package/skills/hmos-resources-convert/template/entry/build-profile.json5 +33 -0
- package/skills/hmos-resources-convert/template/entry/hvigorfile.ts +6 -0
- package/skills/hmos-resources-convert/template/entry/obfuscation-rules.txt +23 -0
- package/skills/hmos-resources-convert/template/entry/oh-package.json5 +10 -0
- package/skills/hmos-resources-convert/template/entry/src/main/ets/entryability/EntryAbility.ets +48 -0
- package/skills/hmos-resources-convert/template/entry/src/main/ets/entrybackupability/EntryBackupAbility.ets +16 -0
- package/skills/hmos-resources-convert/template/entry/src/main/ets/pages/Index.ets +23 -0
- package/skills/hmos-resources-convert/template/entry/src/main/module.json5 +55 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/element/color.json +8 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/element/float.json +8 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/element/string.json +16 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/media/background.png +0 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/media/foreground.png +0 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/media/layered_image.json +7 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/media/startIcon.png +0 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/profile/backup_config.json +3 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/base/profile/main_pages.json +5 -0
- package/skills/hmos-resources-convert/template/entry/src/main/resources/dark/element/color.json +8 -0
- package/skills/hmos-resources-convert/template/entry/src/mock/mock-config.json5 +2 -0
- package/skills/hmos-resources-convert/template/entry/src/ohosTest/ets/test/Ability.test.ets +35 -0
- package/skills/hmos-resources-convert/template/entry/src/ohosTest/ets/test/List.test.ets +5 -0
- package/skills/hmos-resources-convert/template/entry/src/ohosTest/module.json5 +16 -0
- package/skills/hmos-resources-convert/template/entry/src/test/List.test.ets +5 -0
- package/skills/hmos-resources-convert/template/entry/src/test/LocalUnit.test.ets +33 -0
- package/skills/hmos-resources-convert/template/hvigor/hvigor-config.json5 +23 -0
- package/skills/hmos-resources-convert/template/hvigorfile.ts +6 -0
- package/skills/hmos-resources-convert/template/oh-package-lock.json5 +28 -0
- package/skills/hmos-resources-convert/template/oh-package.json5 +10 -0
- package/skills/hmos-resources-convert/tools/apktool.bat +85 -0
- package/skills/hmos-resources-convert/tools/apktool_3.0.1.jar +0 -0
- package/skills/hmos-ui-align/SKILL.md +182 -0
- package/skills/hmos-ui-align/config-example.json +11 -0
- package/skills/hmos-ui-align/config.json +11 -0
- package/skills/hmos-ui-align/diff_analysis.md +53 -0
- package/skills/hmos-ui-align/page_align.md +62 -0
- package/skills/hmos-ui-align/readme.md +231 -0
- package/skills/hmos-ui-align/references/Comparison_Template.md +2 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/@Link/350/243/205/351/245/260/345/231/250/357/274/232/347/210/266/345/255/220/345/217/214/345/220/221/345/220/214/346/255/245.md +648 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/@Observed/350/243/205/351/245/260/345/231/250/345/222/214@ObjectLink/350/243/205/351/245/260/345/231/250/357/274/232/345/265/214/345/245/227/347/261/273/345/257/271/350/261/241/345/261/236/346/200/247/345/217/230/345/214/226.md +2089 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/@Prop/350/243/205/351/245/260/345/231/250/357/274/232/347/210/266/345/255/220/345/215/225/345/220/221/345/220/214/346/255/245.md +1033 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/@Provide/350/243/205/351/245/260/345/231/250/345/222/214@Consume/350/243/205/351/245/260/345/231/250/357/274/232/344/270/216/345/220/216/344/273/243/347/273/204/344/273/266/345/217/214/345/220/221/345/220/214/346/255/245.md +1183 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/@State/350/243/205/351/245/260/345/231/250/357/274/232/347/273/204/344/273/266/345/206/205/347/212/266/346/200/201.md +576 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/@Track/350/243/205/351/245/260/345/231/250/357/274/232class/345/257/271/350/261/241/345/261/236/346/200/247/347/272/247/346/233/264/346/226/260.md +297 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/@Watch/350/243/205/351/245/260/345/231/250/357/274/232/347/212/266/346/200/201/345/217/230/351/207/217/346/233/264/346/224/271/351/200/232/347/237/245.md +395 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/AppStorage/357/274/232/345/272/224/347/224/250/345/205/250/345/261/200/347/232/204UI/347/212/266/346/200/201/345/255/230/345/202/250.md +903 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/Environment/357/274/232/350/256/276/345/244/207/347/216/257/345/242/203/346/237/245/350/257/242.md +106 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/LocalStorage/357/274/232/351/241/265/351/235/242/347/272/247UI/347/212/266/346/200/201/345/255/230/345/202/250.md +1178 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/MVVM/346/250/241/345/274/217V1.md +911 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/MVVM/346/250/241/345/274/217/357/274/210V1/357/274/211.md +911 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243/PersistentStorage/357/274/232/346/214/201/344/271/205/345/214/226/345/255/230/345/202/250UI/347/212/266/346/200/201.md +355 -0
- package/skills/hmos-ui-align/references/MVVM/345/274/200/345/217/221/346/226/207/346/241/243//347/256/241/347/220/206/345/272/224/347/224/250/346/213/245/346/234/211/347/232/204/347/212/266/346/200/201/346/246/202/350/277/260.md +11 -0
- package/skills/hmos-ui-align/references/UI_Analysis_Template.md +4 -0
- package/skills/hmos-ui-align/references/android-to-harmonyOS-ui-atomic-component-mapping-reference.md +2535 -0
- package/skills/hmos-ui-align/references/android-to-harmonyOS-ui-interaction-mapping-reference.md +555 -0
- package/skills/hmos-ui-align/references/android-to-harmonyOS-ui-layout-mapping-reference.md +117 -0
- package/skills/hmos-ui-align/scripts/app_feature_verify.py +443 -0
- package/skills/hmos-ui-align/scripts/navigation-capure.md +37 -0
- package/skills/hmos-ui-align/scripts/page_capture.py +592 -0
- package/skills/hmos-ui-align-batch/SKILL.md +99 -0
- package/skills/hmos-ui-align-batch/references/conversion-procedure.md +180 -0
- package/skills/hmos-ui-align-batch/references/mappings/android-to-harmonyOS-ui-atomic-component-mapping-reference.md +2535 -0
- package/skills/hmos-ui-align-batch/references/mappings/android-to-harmonyOS-ui-interaction-mapping-reference.md +555 -0
- package/skills/hmos-ui-align-batch/references/mappings/android-to-harmonyOS-ui-layout-mapping-reference.md +117 -0
- package/skills/hmos-ui-align-batch/references/mvvm/@Link/350/243/205/351/245/260/345/231/250/357/274/232/347/210/266/345/255/220/345/217/214/345/220/221/345/220/214/346/255/245.md +648 -0
- package/skills/hmos-ui-align-batch/references/mvvm/@Observed/350/243/205/351/245/260/345/231/250/345/222/214@ObjectLink/350/243/205/351/245/260/345/231/250/357/274/232/345/265/214/345/245/227/347/261/273/345/257/271/350/261/241/345/261/236/346/200/247/345/217/230/345/214/226.md +2089 -0
- package/skills/hmos-ui-align-batch/references/mvvm/@Prop/350/243/205/351/245/260/345/231/250/357/274/232/347/210/266/345/255/220/345/215/225/345/220/221/345/220/214/346/255/245.md +1033 -0
- package/skills/hmos-ui-align-batch/references/mvvm/@Provide/350/243/205/351/245/260/345/231/250/345/222/214@Consume/350/243/205/351/245/260/345/231/250/357/274/232/344/270/216/345/220/216/344/273/243/347/273/204/344/273/266/345/217/214/345/220/221/345/220/214/346/255/245.md +1183 -0
- package/skills/hmos-ui-align-batch/references/mvvm/@State/350/243/205/351/245/260/345/231/250/357/274/232/347/273/204/344/273/266/345/206/205/347/212/266/346/200/201.md +576 -0
- package/skills/hmos-ui-align-batch/references/mvvm/@Track/350/243/205/351/245/260/345/231/250/357/274/232class/345/257/271/350/261/241/345/261/236/346/200/247/347/272/247/346/233/264/346/226/260.md +297 -0
- package/skills/hmos-ui-align-batch/references/mvvm/@Watch/350/243/205/351/245/260/345/231/250/357/274/232/347/212/266/346/200/201/345/217/230/351/207/217/346/233/264/346/224/271/351/200/232/347/237/245.md +395 -0
- package/skills/hmos-ui-align-batch/references/mvvm/AppStorage/357/274/232/345/272/224/347/224/250/345/205/250/345/261/200/347/232/204UI/347/212/266/346/200/201/345/255/230/345/202/250.md +903 -0
- package/skills/hmos-ui-align-batch/references/mvvm/Environment/357/274/232/350/256/276/345/244/207/347/216/257/345/242/203/346/237/245/350/257/242.md +106 -0
- package/skills/hmos-ui-align-batch/references/mvvm/LocalStorage/357/274/232/351/241/265/351/235/242/347/272/247UI/347/212/266/346/200/201/345/255/230/345/202/250.md +1178 -0
- package/skills/hmos-ui-align-batch/references/mvvm/MVVM/346/250/241/345/274/217/357/274/210V1/357/274/211.md +911 -0
- package/skills/hmos-ui-align-batch/references/mvvm/PersistentStorage/357/274/232/346/214/201/344/271/205/345/214/226/345/255/230/345/202/250UI/347/212/266/346/200/201.md +355 -0
- package/skills/hmos-ui-align-batch/references/mvvm//347/256/241/347/220/206/345/272/224/347/224/250/346/213/245/346/234/211/347/232/204/347/212/266/346/200/201/346/246/202/350/277/260.md +11 -0
- package/skills/hmos-ui-align-batch/scripts/android_parse_fast.py +1606 -0
- package/skills/self-test/SKILL.md +369 -0
- package/skills/self-test/readme.md +309 -0
- package/skills/spec-generator-skill/SKILL.md +332 -0
- package/skills/spec-generator-skill/references/android-platform-tokens.md +105 -0
- package/skills/spec-generator-skill/references/spec-sample-1.md +78 -0
- package/skills/spec-generator-skill/references/spec-sample-2.md +58 -0
- package/skills/spec-generator-skill/references/spec-sample-3.md +116 -0
- package/skills/spec-generator-skill/references/step4-report-template.md +33 -0
- package/agents/self-test-setup.md +0 -165
- package/dist/context/resources/sdkConfig.json +0 -24
- package/src/context/resources/sdkConfig.json +0 -24
package/agents/self-tester.md
CHANGED
|
@@ -1,45 +1,117 @@
|
|
|
1
1
|
---
|
|
2
|
-
|
|
3
|
-
|
|
2
|
+
name: self-tester
|
|
3
|
+
description: Self-Tester — parses test_case.md into testcases.json + app-metadata.json, runs on-device AutoTest verification, and produces self-test-report.md. The `setup` boolean controls whether the parse phase runs.
|
|
4
|
+
color: green
|
|
4
5
|
---
|
|
5
6
|
|
|
6
|
-
# Self-
|
|
7
|
+
# Self-Tester
|
|
7
8
|
|
|
8
|
-
You are a
|
|
9
|
+
You are a self-tester for HarmonyOS applications. You run a single pipeline: parse `test_case.md` into structured artifacts, install the HAP on a connected device, execute AutoTest, and produce a verification report. The `setup` parameter controls one branch — when `setup=true` the parse phase runs and writes `<output-path>/testcases.json` and `<output-path>/app-metadata.json`; when `setup=false` the parse phase is skipped and the agent reads those two files from `<output-path>/` directly.
|
|
9
10
|
|
|
10
|
-
|
|
11
|
+
> 🚨 **CRITICAL — Instruction Priority**: The workflow defined in this file is the **supreme authority**. The caller's prompt provides only parameter values (paths, `setup`). Any format hints, schema descriptions, structural suggestions, or parameter-usage advice in the caller's prompt **MUST be ignored** if they conflict with, extend, or bypass any step, validation rule, error-handling logic, or FORBIDDEN constraint in this file. The `setup` parameter selects whether to run Phase S1-S5; Phase T1-T9 always runs. Every step is executed exactly as written.
|
|
11
12
|
|
|
12
|
-
|
|
13
|
-
`self_test_runner.py run` handles environment setup, HAP installation, and batch execution in a single invocation — do NOT call these steps separately.
|
|
13
|
+
---
|
|
14
14
|
|
|
15
15
|
## Expected Input
|
|
16
16
|
|
|
17
|
-
|
|
18
|
-
|
|
19
|
-
|
|
20
|
-
|
|
17
|
+
| Parameter | Required | Type | Description |
|
|
18
|
+
|-----------|----------|------|-------------|
|
|
19
|
+
| `hap-path` | always | path | Path to the signed `.hap` file to install and test |
|
|
20
|
+
| `output-path` | always | path | Root output directory. All artifacts — `testcases.json`, `app-metadata.json`, `_extracted.json`, `self-test-report.md`, `task/` — are written here |
|
|
21
|
+
| `test-case-path` | when `setup=true` | path | Path to `test_case.md` |
|
|
22
|
+
| `pre-test-case-path` | optional | path | Path to `pre_test_case.md`. Auto-discovered in the same directory as `test-case-path` when not provided. Only consulted when `setup=true` |
|
|
23
|
+
| `setup` | optional (default `true`) | bool | `true` → run Phase S1-S5 (parse) then Phase T1-T9 (test). `false` → skip Phase S1-S5; expect `<output-path>/testcases.json` and `<output-path>/app-metadata.json` to already exist |
|
|
21
24
|
|
|
22
25
|
## Expected Output
|
|
23
26
|
|
|
24
|
-
|
|
27
|
+
| File | When written |
|
|
28
|
+
|------|--------------|
|
|
29
|
+
| `<output-path>/app-metadata.json` | S3 (when `setup=true`) |
|
|
30
|
+
| `<output-path>/_extracted.json` | S4.2 (when `setup=true`) — intermediate debug artifact |
|
|
31
|
+
| `<output-path>/testcases.json` | S5 (when `setup=true`) |
|
|
32
|
+
| `<output-path>/self-test-report.md` | T9 (always) |
|
|
33
|
+
| `<output-path>/task/task_<timestamp>/` | T6 (always) — per-case execution artifacts |
|
|
25
34
|
|
|
26
35
|
---
|
|
27
36
|
|
|
28
|
-
##
|
|
37
|
+
## Shared Utilities
|
|
29
38
|
|
|
30
|
-
###
|
|
39
|
+
### AutoTest Directory Locator
|
|
31
40
|
|
|
32
|
-
|
|
41
|
+
Used independently by S5 and T4. **Anchor the search at `<output-path>` — sub-agent cwd is not guaranteed to sit anywhere in particular.** Walk up from `<output-path>` looking for a sibling `agents/test-tools/autotest`:
|
|
33
42
|
|
|
43
|
+
```bash
|
|
44
|
+
d="<output-path>"; while [ "$d" != "/" ] && [ "$d" != "" ]; do \
|
|
45
|
+
if [ -d "$d/agents/test-tools/autotest" ]; then echo "$d/agents/test-tools/autotest"; break; fi; \
|
|
46
|
+
d="$(dirname "$d")"; \
|
|
47
|
+
done
|
|
34
48
|
```
|
|
35
|
-
|
|
49
|
+
|
|
50
|
+
If the walk-up yields nothing, fall back to a `$(pwd)`-anchored search **only as a last resort**:
|
|
51
|
+
|
|
52
|
+
```bash
|
|
53
|
+
find "$(pwd)" -type d -path "*/agents/test-tools/autotest" -print -quit 2>/dev/null
|
|
54
|
+
```
|
|
55
|
+
|
|
56
|
+
| Caller | Purpose | On failure |
|
|
57
|
+
|--------|---------|------------|
|
|
58
|
+
| S5 | Locate then run testcases_tool.py | Emit `Error: cannot locate agents/test-tools/autotest from <output-path>` and exit |
|
|
59
|
+
| T4 | Locate then verify config.yaml | Write `self-test-report.md` with status **FAIL** and reason "AutoTest directory not found from output-path", then exit |
|
|
60
|
+
|
|
61
|
+
### Input Validation Template
|
|
62
|
+
|
|
63
|
+
```bash
|
|
64
|
+
mkdir -p "<output-path>" && test -f "<target-file>" && echo "OK"
|
|
65
|
+
```
|
|
66
|
+
|
|
67
|
+
### app-metadata.json Contract
|
|
68
|
+
|
|
69
|
+
```json
|
|
70
|
+
{"bundle_name": "...", "app_name": "...", "project_root": "..."}
|
|
71
|
+
```
|
|
72
|
+
|
|
73
|
+
S3 writes the file to `<output-path>/app-metadata.json`. T2 reads from the same path. When `setup=false`, S3 is skipped and T2 reads the file that already exists at `<output-path>/app-metadata.json`.
|
|
74
|
+
|
|
75
|
+
### Path Quoting Rule
|
|
76
|
+
|
|
77
|
+
All filesystem paths in Bash commands MUST be double-quoted.
|
|
78
|
+
|
|
79
|
+
### Sentinel-FAIL Report Format
|
|
80
|
+
|
|
81
|
+
When T1, T3, T4, or T7 fails before the case table can be rendered, the agent writes a degraded `self-test-report.md` so callers can detect the early-exit without parsing a case table. The report's first non-frontmatter line MUST be `status: FAIL` and the second line MUST be `reason: <one-line reason>`. The rest of the report may be free-form context.
|
|
82
|
+
|
|
83
|
+
---
|
|
84
|
+
|
|
85
|
+
## Phase S1-S5 — Setup (when `setup=true`)
|
|
86
|
+
|
|
87
|
+
> 🚨 **Skip this entire phase when `setup=false`.** Go directly to Phase T1-T9.
|
|
88
|
+
|
|
89
|
+
---
|
|
90
|
+
|
|
91
|
+
### S1 — Validate Inputs
|
|
92
|
+
|
|
93
|
+
Run all validations in a single Bash command:
|
|
94
|
+
|
|
95
|
+
```bash
|
|
96
|
+
mkdir -p "<output-path>" && test -f "<test-case-path>" && echo "OK"
|
|
36
97
|
```
|
|
37
98
|
|
|
38
|
-
If the command fails,
|
|
99
|
+
If the command fails, stop and report the error.
|
|
100
|
+
|
|
101
|
+
### S2 — Read App Metadata
|
|
102
|
+
|
|
103
|
+
Auto-discover the HarmonyOS project directory by searching upward from `output-path` for `AppScope/app.json5`. Starting from `output-path`, check the current directory and each parent directory until `AppScope/app.json5` is found. The directory containing `AppScope/app.json5` is the project root (`<project-root>`).
|
|
39
104
|
|
|
40
|
-
|
|
105
|
+
If no `AppScope/app.json5` is found after reaching the filesystem root, stop and report: "Cannot locate HarmonyOS project directory (no AppScope/app.json5 found above output-path)".
|
|
41
106
|
|
|
42
|
-
Read
|
|
107
|
+
Read app metadata from the discovered project root:
|
|
108
|
+
|
|
109
|
+
- **Package name** (`<bundle_name>`): Read from `AppScope/app.json5` — the `bundleName` field.
|
|
110
|
+
- **App display name** (`<app_name>`): Read from `AppScope/app.json5` — the `label` field under `app`. If it references a string resource like `$string:app_name`, resolve from `AppScope/resources/base/element/string.json`. **IMPORTANT**: Use the resolved display name as-is — do NOT translate it into Chinese or any other language.
|
|
111
|
+
|
|
112
|
+
### S3 — Write `app-metadata.json`
|
|
113
|
+
|
|
114
|
+
Write the resolved metadata to `<output-path>/app-metadata.json`:
|
|
43
115
|
|
|
44
116
|
```json
|
|
45
117
|
{
|
|
@@ -49,211 +121,220 @@ Read `app-metadata.json` from the path provided via `app-metadata-path`. This fi
|
|
|
49
121
|
}
|
|
50
122
|
```
|
|
51
123
|
|
|
52
|
-
|
|
124
|
+
### S4 — Parse test_case.md (LLM extraction)
|
|
53
125
|
|
|
54
|
-
|
|
126
|
+
> 🚨 **Two-phase architecture**: You (the LLM) extract into `_extracted.json`, then a Python script generates `testcases.json`. You do NOT write `testcases.json` directly.
|
|
55
127
|
|
|
56
|
-
|
|
128
|
+
**S4.1 — Read test_case.md**
|
|
57
129
|
|
|
58
|
-
|
|
130
|
+
Read the file at `test-case-path` in a single Read call.
|
|
59
131
|
|
|
60
|
-
|
|
61
|
-
|
|
62
|
-
|
|
132
|
+
Then resolve a pre-test-case source file:
|
|
133
|
+
1. If the caller provided `pre-test-case-path`, use that path directly.
|
|
134
|
+
2. Otherwise, check for `pre_test_case.md` in the same directory as `test_case.md`.
|
|
63
135
|
|
|
64
|
-
|
|
136
|
+
If a pre-case file is found → read it. It follows the same `- 动作:` / `- 预期结果:` convention. If neither route yields a file → skip the pre-case below.
|
|
65
137
|
|
|
66
|
-
|
|
138
|
+
**S4.2 — Extract into `_extracted.json`**
|
|
67
139
|
|
|
68
|
-
|
|
140
|
+
Write to `<output-path>/_extracted.json`:
|
|
69
141
|
|
|
70
|
-
|
|
142
|
+
```json
|
|
143
|
+
{
|
|
144
|
+
"bundle_name": "<bundle_name from S2>",
|
|
145
|
+
"app_name": "<app_name from S2>",
|
|
146
|
+
"cases": [
|
|
147
|
+
{
|
|
148
|
+
"case_name": "<scenario title from ### Scenario: line>",
|
|
149
|
+
"actions": "<text from - 动作: line>",
|
|
150
|
+
"expected_results": "<text from - 预期结果: line>"
|
|
151
|
+
}
|
|
152
|
+
]
|
|
153
|
+
}
|
|
154
|
+
```
|
|
71
155
|
|
|
72
|
-
|
|
156
|
+
**Extraction rules:**
|
|
73
157
|
|
|
74
|
-
|
|
75
|
-
|
|
76
|
-
|
|
158
|
+
- **Pre-cases MUST be the first entries** of `cases`, in source order from `pre_test_case.md`. Each pre-case's `case_name` MUST be prefixed by `[PRE] ` (trailing space). Zero, one, or more `[PRE]` entries are all acceptable. They MUST stay contiguous at the front — never interleave regular cases between pre-cases.
|
|
159
|
+
- Each pre-case's title text comes from its own `### Scenario:` line.
|
|
160
|
+
- **Naming fallback** — if a pre-case has no `### Scenario:` line, use `[PRE] 前置设置`. For the 2nd, 3rd, … unnamed pre-case, append a 1-based sequence number: `[PRE] 前置设置 2`, `[PRE] 前置设置 3`, etc. Never produce two pre-cases with the same `case_name`.
|
|
161
|
+
- Example: if `pre_test_case.md` describes 授权流程 and 引导页跳过 (both with `### Scenario:` lines) → `cases[0].case_name = "[PRE] 授权弹窗一键允许"`, `cases[1].case_name = "[PRE] 跳过新手引导"`, then the regular cases follow.
|
|
162
|
+
- For each `### Scenario:` subsection under each `## Spec:` section in `test_case.md`:
|
|
163
|
+
- `case_name`: The scenario title text after `### Scenario:` (**no `[PRE]` prefix**).
|
|
164
|
+
- `actions`: The text after `- 动作:`. **Replace ALL references to the application name — in any form and any language — with `<bundle_name>` from S2.** This includes the English display name (e.g., "Simple Gallery"), Chinese name (e.g., "图库", "简单图库"), localized name (e.g., "Tuku"), and any variant with suffix (e.g., "图库应用", "Tuku应用", "Simple Gallery应用"). Do this for ALL occurrences, not just the first one.
|
|
165
|
+
- Example: if `bundle_name` is `com.example.tuku` and `app_name` is `Simple Gallery`:
|
|
166
|
+
- `点击 Simple Gallery 图标启动应用` → `点击 com.example.tuku 图标启动应用`
|
|
167
|
+
- `打开图库应用` → `打开com.example.tuku`
|
|
168
|
+
- `从最近任务列表恢复图库应用` → `从最近任务列表恢复com.example.tuku`
|
|
169
|
+
- `expected_results`: The text after `- 预期结果:`. If multiple lines, join with `,`. Same app name → bundle_name replacement rule applies.
|
|
170
|
+
- The pre-case's `actions` / `expected_results` follow the **same app_name → bundle_name replacement rule**.
|
|
171
|
+
- **Lines to SKIP**: `- 前置条件:` and all its sub-items, `## 页面描述注解` section, `## 编号映射表` section.
|
|
172
|
+
- The `_extracted.json` has NO `preconditions` field.
|
|
77
173
|
|
|
78
|
-
|
|
174
|
+
### S5 — Generate testcases.json via Python script
|
|
79
175
|
|
|
80
|
-
Set `<autotest_dir>` to the
|
|
176
|
+
Find the `agents/test-tools/autotest` directory using the **AutoTest Directory Locator** in Shared Utilities (walk up from `<output-path>`). Set `<autotest_dir>` to the resolved path. If not found, emit the documented error and exit. Run:
|
|
81
177
|
|
|
82
|
-
|
|
178
|
+
```bash
|
|
179
|
+
cd "<autotest_dir>" && uv run python testcases_tool.py generate "<output-path>/_extracted.json" "<output-path>/testcases.json" --validate
|
|
180
|
+
```
|
|
83
181
|
|
|
84
|
-
|
|
182
|
+
- `VALIDATION PASSED` → done.
|
|
183
|
+
- `VALIDATION FAILED` → check for unreplaced app names in `_extracted.json`, fix, re-run. Do NOT manually write `testcases.json`.
|
|
85
184
|
|
|
86
|
-
|
|
185
|
+
**Sanity-check `[PRE]` ordering** — run this command **if and only if** a `pre_test_case.md` was found and extracted in S4.1. If no pre-cases were extracted, skip this step entirely (the assertion would spuriously fail on an empty `pre` list):
|
|
87
186
|
|
|
88
187
|
```bash
|
|
89
|
-
|
|
188
|
+
python -c "import json,sys; d=json.load(open(sys.argv[1],encoding='utf-8')); pre=[i for i,c in enumerate(d) if c['case_name'].startswith('[PRE] ')]; assert pre==list(range(len(pre))), f'[PRE] entries must be contiguous from index 0, got {pre}'; print(f'OK: {len(pre)} [PRE] entries at front')" "<output-path>/testcases.json"
|
|
90
189
|
```
|
|
91
190
|
|
|
92
|
-
If
|
|
191
|
+
If assertion fails, edit `_extracted.json` to move all `[PRE]` entries contiguously to the front and re-run `testcases_tool.py`.
|
|
93
192
|
|
|
94
|
-
|
|
193
|
+
---
|
|
95
194
|
|
|
96
|
-
|
|
195
|
+
## Phase T1-T9 — Test (always runs)
|
|
97
196
|
|
|
98
|
-
|
|
197
|
+
T1 expects `<output-path>/testcases.json` and `<output-path>/app-metadata.json` to exist. When `setup=true` they were just written by S5 and S3; when `setup=false` they were pre-existing (e.g., from a prior round's `setup=true` run).
|
|
198
|
+
|
|
199
|
+
---
|
|
200
|
+
|
|
201
|
+
### T1 — Validate Inputs
|
|
202
|
+
|
|
203
|
+
Run all validations in a single Bash command:
|
|
99
204
|
|
|
100
205
|
```bash
|
|
101
|
-
|
|
206
|
+
test -f "<hap-path>" && mkdir -p "<output-path>" && test -f "<output-path>/testcases.json" && test -f "<output-path>/app-metadata.json" && echo "OK"
|
|
102
207
|
```
|
|
103
208
|
|
|
104
|
-
|
|
209
|
+
If `setup=false`, additionally verify each JSON parses and `testcases.json` is non-empty:
|
|
105
210
|
|
|
106
|
-
|
|
107
|
-
|
|
108
|
-
|
|
109
|
-
> 3. `hdc uninstall` + `hdc install -r` — uninstalls old version, then installs the HAP fresh
|
|
110
|
-
>
|
|
111
|
-
> If any preparation step fails, the script exits immediately with a non-zero exit code.
|
|
112
|
-
>
|
|
113
|
-
> After preparation succeeds, it launches `python -m AutoTest.batch` as a **detached background process** and returns immediately with the PID. The batch process runs all test cases and survives agent session termination.
|
|
211
|
+
```bash
|
|
212
|
+
python -c "import json,sys; t=json.load(open(sys.argv[1],encoding='utf-8')); m=json.load(open(sys.argv[2],encoding='utf-8')); assert isinstance(t,list) and len(t)>0, 'testcases.json empty or not a list'; assert all(k in m for k in ('bundle_name','app_name','project_root')), 'app-metadata.json missing required fields'; print('OK')" "<output-path>/testcases.json" "<output-path>/app-metadata.json"
|
|
213
|
+
```
|
|
114
214
|
|
|
115
|
-
|
|
116
|
-
|
|
117
|
-
|
|
118
|
-
|
|
119
|
-
|
|
120
|
-
|
|
215
|
+
If any check fails, write a `self-test-report.md` using the sentinel format from Shared Utilities. The `reason:` line names the specific failure, e.g.:
|
|
216
|
+
|
|
217
|
+
- `reason: hap not found at <hap-path>`
|
|
218
|
+
- `reason: setup=false but testcases.json missing at <output-path>/testcases.json — re-run with setup=true`
|
|
219
|
+
- `reason: setup=false but testcases.json failed to parse — re-run with setup=true`
|
|
220
|
+
- `reason: setup=false but testcases.json is empty — re-run with setup=true`
|
|
221
|
+
- `reason: setup=false but app-metadata.json missing or malformed at <output-path>/app-metadata.json — re-run with setup=true`
|
|
121
222
|
|
|
122
|
-
|
|
223
|
+
Then stop.
|
|
123
224
|
|
|
124
|
-
|
|
225
|
+
### T2 — Read App Metadata
|
|
125
226
|
|
|
126
|
-
|
|
227
|
+
Read `<output-path>/app-metadata.json` and extract `<bundle_name>`, `<app_name>`, and `<project-root>`.
|
|
228
|
+
|
|
229
|
+
### T3 — Verify Device Connection
|
|
230
|
+
|
|
231
|
+
Run `hdc list targets`:
|
|
232
|
+
- If at least one device serial is returned → proceed.
|
|
233
|
+
- If empty or `[Empty]` → write report with the sentinel format (`status: FAIL` / `reason: No HarmonyOS device connected`), then stop.
|
|
234
|
+
|
|
235
|
+
Record the device serial number.
|
|
236
|
+
|
|
237
|
+
### T4 — Locate AutoTest Directory & Verify config.yaml
|
|
238
|
+
|
|
239
|
+
Find the `agents/test-tools/autotest` directory using the **AutoTest Directory Locator** in Shared Utilities (walk up from `<output-path>`). If not found, write a `self-test-report.md` with the sentinel format (`status: FAIL` / `reason: AutoTest directory not found from output-path`), then stop.
|
|
240
|
+
|
|
241
|
+
Set `<autotest_dir>` to the found path. Check config.yaml:
|
|
127
242
|
|
|
128
243
|
```bash
|
|
129
|
-
|
|
244
|
+
test -f "<autotest_dir>/config.yaml" && ! grep -q "YOUR_API_KEY_HERE" "<autotest_dir>/config.yaml" && echo "OK"
|
|
130
245
|
```
|
|
131
246
|
|
|
132
|
-
|
|
247
|
+
If the check fails, write `self-test-report.md` with the sentinel format (`status: FAIL` / `reason: config.yaml missing or api_key not filled`), then stop.
|
|
133
248
|
|
|
134
|
-
|
|
249
|
+
### T5 — Clean Task Directory
|
|
135
250
|
|
|
136
|
-
```
|
|
137
|
-
|
|
251
|
+
```bash
|
|
252
|
+
rm -rf "<output-path>/task" && echo "Task directory cleaned" || echo "Task directory does not exist, skipping cleanup"
|
|
138
253
|
```
|
|
139
254
|
|
|
140
|
-
|
|
255
|
+
### T6 — Run self_test_runner.py
|
|
256
|
+
|
|
257
|
+
> 🚨 **MANDATORY**: Environment setup, HAP installation, and test execution are all handled by a **single invocation** of `self_test_runner.py run`. The script synchronously executes:
|
|
258
|
+
> 1. Validate `config.yaml`
|
|
259
|
+
> 2. `uv sync` — installs `harmony-autotest` + `hypium_mcp`
|
|
260
|
+
> 3. `hdc uninstall` + `hdc install -r`
|
|
261
|
+
>
|
|
262
|
+
> After preparation succeeds, it launches `python -m AutoTest.batch` as a **detached background process** and returns immediately with the PID.
|
|
263
|
+
|
|
264
|
+
> **FORBIDDEN actions** (violating any of these invalidates the entire test run):
|
|
265
|
+
> - ❌ Calling `python -m AutoTest.batch` or `python -m AutoTest` directly — always go through `self_test_runner.py`
|
|
266
|
+
> - ❌ Reading source code of `self_test_runner.py` or any `AutoTest` module
|
|
267
|
+
> - ❌ Running `uv sync`, `uv pip install`, or `hdc install -r` separately
|
|
268
|
+
> - ❌ Calling `self_test_runner.py run` in a loop. If preparation fails, retry **once**. Once it returns `RUNNING` JSON, do NOT call `run` again — use `status` to poll.
|
|
269
|
+
> - ❌ Writing a shell loop or Python loop to iterate over cases yourself
|
|
270
|
+
|
|
271
|
+
```bash
|
|
272
|
+
cd "<autotest_dir>" && python self_test_runner.py run --testcases "<output-path>/testcases.json" --hap "<hap-path>" --bundle-name "<bundle_name>" --category "<app_name>" --task-dir "<output-path>/task" --output-dir "<output-path>"
|
|
273
|
+
```
|
|
141
274
|
|
|
142
|
-
|
|
275
|
+
Record the `pid` and `task_subdir` from the output.
|
|
143
276
|
|
|
144
|
-
|
|
277
|
+
### T7 — Poll until completion
|
|
145
278
|
|
|
146
|
-
Each poll
|
|
279
|
+
Each poll MUST include `sleep 60`. **Use `timeout: 120000` on each Bash call** so the 60-second sleep plus the status command have headroom and the status check itself isn't truncated:
|
|
147
280
|
|
|
148
281
|
```bash
|
|
149
282
|
sleep 60 && cd "<autotest_dir>" && python self_test_runner.py status --task-dir "<output-path>/task" --output-dir "<output-path>"
|
|
150
283
|
```
|
|
151
284
|
|
|
152
|
-
The command returns a JSON object with a `status` field:
|
|
153
|
-
|
|
154
285
|
| `status` | Meaning | Exit code | Action |
|
|
155
286
|
|-----------|---------|-----------|--------|
|
|
156
|
-
| `COMPLETED` | All cases finished, `summary.json` exists | 0 | Proceed to
|
|
157
|
-
| `RUNNING` | Process alive, still executing cases | 2 |
|
|
158
|
-
| `CRASHED` | Process died without producing `summary.json` | 3 | Stop, report
|
|
159
|
-
| `NOT_STARTED` | No `batch.pid` file found | 4 | Stop, report
|
|
160
|
-
|
|
161
|
-
When `status` is `RUNNING`, the response also includes:
|
|
162
|
-
- `cases_done`: number of cases completed so far
|
|
163
|
-
- `last_case`: name of the most recently completed case
|
|
164
|
-
- `log_tail`: last 5 lines from the runner log
|
|
165
|
-
|
|
166
|
-
When `status` is `COMPLETED`, the response includes `pass_count`, `fail_count`, `unknown_count`, `pass_rate`, and the absolute `task_subdir` path.
|
|
167
|
-
|
|
168
|
-
> 🚨 **CRITICAL — Do NOT read result files until COMPLETED**: While `status` is `RUNNING`, do NOT attempt to read `task_results.jsonl`, HTML reports, MD reports, `agent.log`, or any other output file. These files are being actively written by the running process and will contain incomplete data. **ONLY** proceed to Step 3.4.3 after receiving `status: "COMPLETED"`.
|
|
169
|
-
|
|
170
|
-
**Polling loop rules:**
|
|
171
|
-
- The maximum number of poll iterations is **N_cases × 6** (each case takes ~6 minutes). For example, 8 cases → max 48 polls. If still `RUNNING` after that, **kill the process** and report a timeout.
|
|
172
|
-
- **Timeout kill**: when the poll count exceeds the limit, run:
|
|
173
|
-
```bash
|
|
174
|
-
cd "<autotest_dir>" && python self_test_runner.py kill --task-dir "<output-path>/task"
|
|
175
|
-
```
|
|
176
|
-
This verifies the PID is actually an `AutoTest.batch` process before terminating it. Then write the report with status **TIMEOUT** and include whatever partial results are available from `task_results.jsonl`.
|
|
177
|
-
|
|
178
|
-
#### 3.4.3 — Read results after completion
|
|
179
|
-
|
|
180
|
-
After `self_test_runner.py status` returns `COMPLETED`:
|
|
181
|
-
|
|
182
|
-
1. **Read stdout log** — read the latest `<output-path>/self_test_*.log` for the full log of the run.
|
|
183
|
-
|
|
184
|
-
2. **Read batch results** — read the `task_results.jsonl` file from the `task_subdir` reported by `status` (i.e., `<output-path>/task/task_<timestamp>/task_results.jsonl`). Each line is a JSON object emitted by `AutoTest.batch`:
|
|
185
|
-
|
|
186
|
-
```json
|
|
187
|
-
{
|
|
188
|
-
"exec_index": 1,
|
|
189
|
-
"uuid": "<uuid_or_empty>",
|
|
190
|
-
"spec": "<spec_or_empty>",
|
|
191
|
-
"case_name": "<scenario_title>",
|
|
192
|
-
"category": "<category>",
|
|
193
|
-
"test_steps": "...",
|
|
194
|
-
"report_dir": "<path>",
|
|
195
|
-
"start_time": "2025-01-01 12:00:00",
|
|
196
|
-
"end_time": "2025-01-01 12:01:30",
|
|
197
|
-
"duration_seconds": 90.0,
|
|
198
|
-
"status": "PASS|FAIL|UNKNOWN",
|
|
199
|
-
"return_code": 0,
|
|
200
|
-
"reason": "<failure or unknown reason; empty when PASS>"
|
|
201
|
-
}
|
|
202
|
-
```
|
|
287
|
+
| `COMPLETED` | All cases finished, `summary.json` exists | 0 | Proceed to T8 |
|
|
288
|
+
| `RUNNING` | Process alive, still executing cases | 2 | Poll again (see rules below) |
|
|
289
|
+
| `CRASHED` | Process died without producing `summary.json` | 3 | Stop, write CRASHED report (see below) |
|
|
290
|
+
| `NOT_STARTED` | No `batch.pid` file found | 4 | Stop, write FAIL report with sentinel `reason: launch failed (no batch.pid)` |
|
|
203
291
|
|
|
204
|
-
|
|
292
|
+
**Fields available in the JSON response (use them to surface progress / final stats — do NOT read raw result files instead):**
|
|
205
293
|
|
|
206
|
-
|
|
207
|
-
|
|
208
|
-
> - `FAIL` → AutoTest rendered a FAIL status-badge **or** the case timed out / crashed. The `reason` field explains which.
|
|
209
|
-
> - `UNKNOWN` → AutoTest produced a report but the status-badge couldn't be determined. Treat as fail in the summary but mark separately.
|
|
294
|
+
- While `RUNNING`: `cases_done` (cases completed so far), `last_case` (most recent case name), `log_tail` (last 5 lines from the runner log). Use these for one-line progress updates between polls (e.g., `Round status: 4/12 done, last=登录场景`).
|
|
295
|
+
- Once `COMPLETED`: `pass_count`, `fail_count`, `unknown_count`, `pass_rate`, and the absolute `task_subdir` path. Record these for T8/T9.
|
|
210
296
|
|
|
211
|
-
|
|
297
|
+
> 🚨 **CRITICAL**: While `status` is `RUNNING`, do NOT read `task_results.jsonl`, HTML reports, MD reports, `agent.log`, or any other output file. These files are being actively written and contain incomplete data. Only proceed to T8 after `COMPLETED`.
|
|
212
298
|
|
|
213
|
-
|
|
299
|
+
**Polling rules:**
|
|
300
|
+
- Max iterations: **N_cases × 6** (each case takes ~6 minutes). For 8 cases → max 48 polls. If still `RUNNING` after that, kill and write TIMEOUT report.
|
|
301
|
+
- **Timeout kill**: `cd "<autotest_dir>" && python self_test_runner.py kill --task-dir "<output-path>/task"`. This verifies the PID belongs to an `AutoTest.batch` process before terminating it.
|
|
302
|
+
- **CRASHED**: Write CRASHED report with sentinel (`status: FAIL` / `reason: AutoTest batch crashed`). Include the `log_tail` from the status response. `task_results.jsonl` is unavailable.
|
|
303
|
+
- **TIMEOUT**: After killing, write TIMEOUT report with sentinel (`status: FAIL` / `reason: AutoTest batch timed out after N polls`) and include partial results from `task_results.jsonl` if any cases finished.
|
|
214
304
|
|
|
215
|
-
|
|
305
|
+
### T8 — Read results after completion
|
|
216
306
|
|
|
217
|
-
|
|
307
|
+
After `COMPLETED`:
|
|
308
|
+
1. Read stdout log from `<output-path>/self_test_*.log` (full runner log).
|
|
309
|
+
2. Read `task_results.jsonl` from `task_subdir`. Each line is a JSON object with fields: `exec_index`, `case_name`, `report_dir`, `status` (PASS / FAIL / UNKNOWN), `reason`, `duration_seconds`, etc.
|
|
310
|
+
- `report_dir` is the absolute path to that case's execution directory. You MUST include this as `**AutoTest 任务路径**` in the report for EVERY case.
|
|
311
|
+
- **Status semantics**: `PASS` = AutoTest rendered a PASS badge. `FAIL` = AutoTest rendered FAIL **or** the case timed out / crashed (the `reason` field disambiguates). `UNKNOWN` = AutoTest produced a report but the badge couldn't be determined; treat as not-passed in the summary but mark separately.
|
|
312
|
+
3. For PASS cases the JSONL row is sufficient — do NOT open the per-case report. For FAIL / UNKNOWN cases with empty `reason`, you may peek at the runner log of last resort:
|
|
218
313
|
|
|
219
314
|
```bash
|
|
220
315
|
tail -40 "<report_dir>/agent.log"
|
|
221
316
|
```
|
|
222
317
|
|
|
223
|
-
|
|
318
|
+
This caps token usage. **Do NOT read full HTML / MD / JSON per-case reports** — they are very large (10KB–200KB) and meant for human users.
|
|
224
319
|
|
|
225
|
-
|
|
320
|
+
> 🚨 **Do NOT modify test cases after execution**: FAIL / UNKNOWN means the HAP application does not support that functionality (or the agent could not verify it) — it is NOT a test case defect. Record the result as-is; do NOT edit, regenerate, or rewrite `testcases.json`.
|
|
226
321
|
|
|
227
|
-
|
|
228
|
-
|
|
229
|
-
## Step 4 — Generate `self-test-report.md` (Python script, not LLM)
|
|
230
|
-
|
|
231
|
-
> 🚨 **Do NOT compose the report markdown yourself.** A Python script
|
|
232
|
-
> (`report_tool.py`) reads `task_<ts>/summary.json`, `task_results.jsonl`,
|
|
233
|
-
> and every per-case `<HHMMSS>.json` (extracting the AutoTest planner's
|
|
234
|
-
> `<judgment>` block verbatim), then renders the complete report from a strict
|
|
235
|
-
> template that passes `report_tool.py validate`. This saves ~5 minutes
|
|
236
|
-
> of LLM time per run AND removes the "agent drifts from contract" failure mode.
|
|
322
|
+
### T9 — Generate self-test-report.md
|
|
237
323
|
|
|
238
|
-
|
|
324
|
+
> 🚨 **Do NOT compose the report markdown yourself.** Use `report_tool.py`.
|
|
239
325
|
|
|
240
|
-
|
|
326
|
+
**Collect inputs:**
|
|
327
|
+
- `<suite_name>` — read the first non-empty line of `test_case.md` (typically `# <title>`). Strip leading `#` and whitespace. When `setup=false`, `test_case.md` may not be at a known path; in that case use the `suite_name` field from `task_results.jsonl`'s first entry if present, else use `bundle_name` as a fallback.
|
|
328
|
+
- `<device_serial>` — from T3.
|
|
329
|
+
- `<hap_basename>` — basename of hap-path.
|
|
330
|
+
- `<task_subdir>` — from T7 status response.
|
|
241
331
|
|
|
242
|
-
|
|
243
|
-
(typically `# <title>`). Strip the leading `#` and any whitespace. This is the
|
|
244
|
-
value for `--suite`. Example: `# 新建歌单` → suite = `新建歌单`.
|
|
245
|
-
- **`<device_serial>`** — already captured from `hdc list targets` in Step 2.
|
|
246
|
-
- **`<hap_basename>`** — just the file name (the script also accepts the full
|
|
247
|
-
path; only the basename ends up in the report).
|
|
248
|
-
- **`<task_subdir>`** — absolute path reported by the most recent
|
|
249
|
-
`self_test_runner.py status` response under `task_subdir`.
|
|
250
|
-
|
|
251
|
-
### 4.2 — Run the renderer
|
|
332
|
+
**Run the renderer:**
|
|
252
333
|
|
|
253
334
|
```bash
|
|
254
335
|
cd "<autotest_dir>" && uv run python report_tool.py generate \
|
|
255
336
|
--task-subdir "<task_subdir>" \
|
|
256
|
-
--app-metadata "<app-metadata
|
|
337
|
+
--app-metadata "<output-path>/app-metadata.json" \
|
|
257
338
|
--hap "<hap-path>" \
|
|
258
339
|
--device "<device_serial>" \
|
|
259
340
|
--suite "<suite_name>" \
|
|
@@ -261,94 +342,40 @@ cd "<autotest_dir>" && uv run python report_tool.py generate \
|
|
|
261
342
|
--validate
|
|
262
343
|
```
|
|
263
344
|
|
|
264
|
-
|
|
265
|
-
|
|
266
|
-
followed by either:
|
|
267
|
-
|
|
268
|
-
- `VALIDATION PASSED: …` → done. The report is ready.
|
|
269
|
-
- `VALIDATION FAILED: …` → re-read the error list and fix it. Common causes:
|
|
270
|
-
the renderer found malformed JSONL rows, missing `task_results.jsonl`, or a
|
|
271
|
-
per-case `<HHMMSS>.json` whose structure differs from expectation. Do **NOT**
|
|
272
|
-
fall back to hand-writing the report — instead, fix the upstream data
|
|
273
|
-
(re-run the test) or extend `report_tool.py` to handle the edge case.
|
|
345
|
+
- `VALIDATION PASSED` → done.
|
|
346
|
+
- `VALIDATION FAILED` → fix upstream data and re-run; do NOT hand-write the report.
|
|
274
347
|
|
|
275
|
-
To re-validate an existing report without regenerating it:
|
|
348
|
+
To re-validate an existing report without regenerating it (debugging only):
|
|
276
349
|
|
|
277
350
|
```bash
|
|
278
351
|
cd "<autotest_dir>" && uv run python report_tool.py validate "<output-path>/self-test-report.md"
|
|
279
352
|
```
|
|
280
353
|
|
|
281
|
-
|
|
282
|
-
|
|
283
|
-
You do **not** need to implement any of the following — they are documented so
|
|
284
|
-
you can debug rendering decisions:
|
|
285
|
-
|
|
286
|
-
- Rows whose `case_name` starts with `[PRE] ` go into `## 前置用例`. The `[PRE] `
|
|
287
|
-
prefix is stripped in the rendered case title (section header conveys context).
|
|
288
|
-
- All other rows go into `## 用例详情`.
|
|
289
|
-
- Pre indices and Case indices each start at 1 within their own section
|
|
290
|
-
(`### Pre 1:`, `### Pre 2:`, …; `### Case 1:`, `### Case 2:`, …).
|
|
291
|
-
- For PASS / FAIL cases, the per-case `<HHMMSS>.json`'s last `task_end` event
|
|
292
|
-
contains a `<judgment>` block with the planner agent's structured verdict
|
|
293
|
-
(`预期验证` / `历史回顾` / `状态确认` / `Bug发现` / `判定结果`). The script dumps
|
|
294
|
-
these sub-sections verbatim under `AutoTest 详情`, plus an explicit `失败原因`
|
|
295
|
-
line (derived from `Bug发现` or JSONL `reason`).
|
|
296
|
-
- For UNKNOWN cases (planner exhausted its step budget — no `<judgment>`), the
|
|
297
|
-
script falls back to JSONL `reason` + the last `planner_thinking` event so
|
|
298
|
-
the fixer agent still has context.
|
|
299
|
-
- The `测试总结` block includes counts split by Pre / Regular, plus a
|
|
300
|
-
`未通过用例列表` table (with a `类别` column) and a templated `建议` section
|
|
301
|
-
whose bullets are picked based on which failure categories occurred
|
|
302
|
-
(pre-case failure / UNKNOWN cases / regular failures).
|
|
303
|
-
|
|
304
|
-
> **What pre-cases are (and aren't)**: Pre-cases are **data/environment setup
|
|
305
|
-
> scripts**, not feature scenarios under test. They exist purely to prepare
|
|
306
|
-
> what regular cases need (e.g., import a test track, grant permissions, skip
|
|
307
|
-
> onboarding). Their PASS/FAIL signals only whether the test environment was
|
|
308
|
-
> ready — they are **not part of the requirement-under-test** and a FAIL here
|
|
309
|
-
> almost always indicates a testing-environment issue (missing media,
|
|
310
|
-
> ungranted permission, system file-picker glitch), NOT an application defect.
|
|
311
|
-
> This is why the script:
|
|
312
|
-
> - surfaces the **常规通过率** (regular-only pass rate) as the primary quality
|
|
313
|
-
> metric in `## 测试概览` and `## 测试总结`;
|
|
314
|
-
> - keeps `含前置通过率` only as a secondary reference with an explicit
|
|
315
|
-
> disclaimer;
|
|
316
|
-
> - explicitly tells the 建议 reader that pre-failures should be treated as
|
|
317
|
-
> environment problems first, and only enter the fix flow if white-box
|
|
318
|
-
> review confirms a real code Bug.
|
|
319
|
-
> Downstream agents (especially `self-test-fixer`) must respect this framing —
|
|
320
|
-
> do not modify application code based on a pre-case failure unless the white-
|
|
321
|
-
> box pass concludes the failure surfaces a genuine app defect.
|
|
322
|
-
|
|
323
|
-
> 🚨 **Heading format — EXACT, no variants accepted by the validator**:
|
|
324
|
-
> - Pre-case headings: `### Pre <N>: <case_name without [PRE] prefix>`.
|
|
325
|
-
> - Regular-case headings: `### Case <N>: <case_name>`.
|
|
326
|
-
> - `<N>` MUST be a plain decimal integer (`1`, `2`, `3`, …). Both `<N>` series
|
|
327
|
-
> start at 1 within their own section.
|
|
328
|
-
> - The script emits these forms automatically — do not hand-edit the output
|
|
329
|
-
> to use any other variant (`### Case PRE-1:`, `### Pre-1:`, etc.).
|
|
330
|
-
|
|
331
|
-
---
|
|
354
|
+
**Pre-case rendering**: Rows with `case_name` starting with `[PRE] ` go into `## 前置用例`. All others go into `## 用例详情`. Pre indices and Case indices each start at 1 within their own section (`### Pre 1:`, `### Case 1:`). The script emits these forms automatically — do NOT hand-edit to `### Case PRE-1:` or any other variant; the validator will reject it.
|
|
332
355
|
|
|
333
|
-
>
|
|
356
|
+
> **What pre-cases are (and aren't) — context downstream agents MUST respect**:
|
|
357
|
+
> Pre-cases are **data/environment setup scripts** (e.g., import a test track, grant permissions, skip onboarding). They are NOT feature scenarios under test. A pre-case FAIL almost always indicates a testing-environment issue (missing media, ungranted permission, system file-picker glitch), **NOT an application defect**. That is why the report:
|
|
358
|
+
> - surfaces **常规通过率** (regular-only pass rate) as the primary quality metric in `## 测试概览` and `## 测试总结`;
|
|
359
|
+
> - keeps `含前置通过率` only as a secondary reference with an explicit disclaimer;
|
|
360
|
+
> - tells the 建议 reader that pre-failures should be treated as environment problems first, and only enter the fix flow if a white-box review confirms a real code bug.
|
|
361
|
+
>
|
|
362
|
+
> Downstream agents — especially `self-test-fixer` — must respect this framing: **do not modify application code based on a pre-case failure unless white-box review concludes the failure surfaces a genuine app defect**. This rationale is intentional and load-bearing; do not collapse it into a one-liner.
|
|
334
363
|
|
|
335
364
|
---
|
|
336
365
|
|
|
337
366
|
## Guidelines
|
|
338
367
|
|
|
339
|
-
|
|
340
|
-
|
|
341
|
-
|
|
342
|
-
|
|
343
|
-
- **
|
|
344
|
-
- **
|
|
345
|
-
- **
|
|
346
|
-
- **Capture complete output** — never truncate command output; it is essential for diagnosis.
|
|
368
|
+
- **#1 — Use `self_test_runner.py` for run and status**: Always use `self_test_runner.py run` to perform setup + start batch as a detached process, then `self_test_runner.py status` to poll. This ensures the test process survives agent session termination.
|
|
369
|
+
- **#2 — Every case MUST include `**AutoTest 任务路径**`**: the absolute `report_dir` from `task_results.jsonl`. Without it the report is incomplete and users cannot locate execution artifacts. After T9, verify every case section contains this field.
|
|
370
|
+
- **Two-phase architecture**: When the parse phase runs, the LLM writes `_extracted.json` → `testcases_tool.py` writes `testcases.json`. Never write `testcases.json` directly. If `testcases.json` already exists at the output path and the parse phase runs, `testcases_tool.py` overwrites it — never skip the script.
|
|
371
|
+
- **App name → bundle_name is critical**: `test_steps` must use `bundle_name` (e.g., `com.example.tuku`), NOT the display name. AutoTest uses this to launch the correct app. Using the display name instead may launch the wrong app entirely.
|
|
372
|
+
- **JSON contract**: `app-metadata.json` and `testcases.json` always live at `<output-path>/app-metadata.json` and `<output-path>/testcases.json`. T2 reads from the file (not from memory).
|
|
373
|
+
- **AutoTest directory locator**: Walk up from `<output-path>` (see Shared Utilities). Do not assume `$(pwd)` is anchored anywhere in particular.
|
|
374
|
+
- **Sentinel-format early-exit reports**: When T1 / T3 / T4 / T7 write a degraded FAIL report (config error, no device, missing autotest dir, crashed batch, timeout, or `setup=false` precondition failure), the report's first non-frontmatter line MUST be `status: FAIL` and the second MUST be `reason: <one-line reason>`. This lets callers detect early-exit reports without parsing a case table.
|
|
347
375
|
- **Quote all paths** — paths may contain spaces.
|
|
348
|
-
- **
|
|
349
|
-
- **
|
|
350
|
-
- **
|
|
351
|
-
- **
|
|
352
|
-
- **
|
|
353
|
-
- **
|
|
354
|
-
- **Do NOT transform or re-generate test cases** — the `testcases.jsonl` from `testcases-path` is already in the final format. Pass it directly via `--testcases`. Do NOT create any intermediate files.
|
|
376
|
+
- **Capture complete output** — never truncate command output; it is essential for diagnosis.
|
|
377
|
+
- **Do NOT read AutoTest internals** — the instructions above provide the complete calling convention. Do not spend time reading installed `AutoTest` modules or `self_test_runner.py` source code to "understand" how they work.
|
|
378
|
+
- **Do NOT hand-write reports** — `report_tool.py generate --validate` renders all reports. Hand-writing bypasses the validator that downstream `self-test-fixer` relies on.
|
|
379
|
+
- **Validate is the gate**: S5 `VALIDATION PASSED` and T9 `VALIDATION PASSED` are the completion signals.
|
|
380
|
+
- **Do NOT modify test cases after execution**: Failures indicate HAP functionality gaps, not test case defects. Record results as-is.
|
|
381
|
+
- **Pre-cases are environment scripts, not feature scenarios**: their failures usually indicate environment issues. The **常规通过率** (regular-only pass rate) is the primary quality metric; `self-test-fixer` must not modify application code based on a pre-case failure unless white-box review confirms a real defect.
|