npm - @pzy560117/opentest - Versions diffs - 0.1.14 → 0.1.15 - Mend

@pzy560117/opentest 0.1.14 → 0.1.15

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (15) hide show

package/assets/manifest.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "version": "0.1.14",
+  "version": "0.1.15",
   "languages": [
     {
       "id": "en",

package/assets/skills/opentest/references/android-app-testing.md CHANGED Viewed

@@ -25,6 +25,32 @@ If a repository already has an Android harness, use the closest existing paths b
 The canonical Midscene slot is `tests/android/midscene/`.
+## Natural-language GUI scenario rules
+Midscene-facing GUI scenarios are natural-language Markdown, not command scripts.
+Use `tests/android/midscene/*.md` or `docs/opentest/acceptance/*.md` for concise
+Agent instructions, visual anchors, memory variables, expected visual assertions,
+self-healing notes, and cleanup intent.
+Do not put `adb`, `python -m pytest`, `npm run`, `aapt`, or `dumpsys` commands in GUI action steps.
+Those commands belong to runner/evidence setup, teardown, security review, and report collection.
+## Minimum Midscene harness
+For `android-app` rows with `visual-acceptance`, `e2e`, or GUI integration evidence, author/heal must create the minimum Midscene harness instead of leaving permanent pytest skips:
+- `package.json` with `npm run test:android`.
+- `tests/android/midscene/*.test.ts` or Midscene YAML that reads the natural-language scenario.
+- A pytest wrapper that checks device/APK prerequisites, runs ADB smoke, calls `npm run test:android` when model environment is ready, and archives `midscene_run/report/*.html`.
+If that harness is missing, record an author/heal gap and build it before claiming visual acceptance. ADB-only smoke is valid smoke evidence, not a substitute for Midscene visual evidence.
+## Model credential handling
+Do not ask the user to paste secrets into chat. When no recognized Midscene key is present, ask the user to set process environment variables or confirm the local mapping.
+Recognized OpenAI-compatible variables normally include `MIDSCENE_MODEL_API_KEY` or `OPENAI_API_KEY`, plus base URL/model variables when the provider needs them. For unknown key variables such as `SSS_API_KEY`, ask for provider, OpenAI-compatible base URL, model name/family, and permission to map that variable for the current process.
 ## Default Route
 ```text
@@ -40,6 +66,7 @@ opentest-android-app
 User-facing entry is pytest. Prefer `python -m pytest tests_py -v` in an existing Android harness, or `scripts/opentest-run-android.ps1` when the repository owns the fixed layout.
 Run `npm run test:android` only when Midscene model environment variables are complete or when debugging Midscene directly.
+When model environment variables are complete, the pytest wrapper must call `npm run test:android`; if it cannot, treat the missing bridge as an author/heal gap.
 ## Required Evidence

package/assets/skills/opentest/templates/android-app-acceptance-template.md CHANGED Viewed

@@ -12,20 +12,27 @@
 - fixture/reset:
 - status: pending
-### Environment
+### Natural-Language Scenario
-- ADB status:
-- model env status:
-- app version/build:
-- device/app metadata:
-### Steps
+- precondition:
+- AI memory variables:
+- self-healing notes: permission dialogs, onboarding, keyboard, overlays, scrolling
-1.
+| Step | Agent action instruction | Visual assertion |
+| --- | --- | --- |
+| 1 |  |  |
 ### Expected Outcome
--
+-
+### Runner/Evidence Setup
+- tool route: opentest-android-app | android-midscene-pytest | pytest wrapper | @midscene/android
+- ADB status:
+- model env status:
+- app version/build:
+- device/app metadata:
 ### Read-Back Contract

package/assets/skills/opentest-accept/SKILL.md CHANGED Viewed

@@ -27,7 +27,7 @@ Write PASS, FAIL, or blocked evidence to cases and matrix.
 1. Read the matrix, fixtures, and `docs/opentest/acceptance/`.
 2. Select the acceptance tool from the matrix execution surface.
 3. `web-browser`: Playwright MCP/CLI; `@playwright/test` for durable regression; Midscene only for visual assist.
-4. `android-app`: `opentest-android-app` plus `android-midscene-pytest`, `python -m pytest tests_py -v`, ADB smoke, Midscene HTML, logcat, and `midscene_run`; block missing prerequisites.
+4. `android-app`: execute the natural-language scenario through `opentest-android-app`/`android-midscene-pytest`; keep commands in runner/evidence (ADB smoke, logcat, `midscene_run`). Visual rows need Midscene HTML or blocked/heal evidence.
 5. `desktop-gui`: `opentest-desktop-gui`, project GUI automation or `@midscene/computer`, screenshot/recording, metadata, and read-back.
 6. `api`: `opentest-api`, project API command or `pytest` with `httpx`/`requests`, schema, fixtures, read-after-write, and cleanup/teardown.
 7. For CRUD/data changes, execute the full chain from the workflow reference.

package/assets/skills/opentest-android-app/SKILL.md CHANGED Viewed

@@ -17,8 +17,9 @@ Use this adapter for `android-app` matrix rows.
 1. Use `android-midscene-pytest` as the detailed Android execution skill when available.
 2. Keep durable Android assets in the fixed layout from `test-asset-layout.md`: `tests/android/tests_py/`, `tests/android/midscene/`, and `scripts/opentest-run-android.ps1` or an existing Android harness command.
 3. User-facing execution should enter through pytest, normally `python -m pytest tests_py -v` in an existing harness or the repository's Android test entry.
-4. Run `npm run test:android` only when model environment variables are ready or when debugging the Midscene layer.
-5. Mark blocked when ADB, emulator/device, APK path, package name, model credentials, fixture data, or stable result surface is missing.
+4. For visual/e2e rows, ensure a minimum Midscene harness exists (`package.json`, `npm run test:android`, and `tests/android/midscene/`); if missing, route to author/heal.
+5. Run `npm run test:android` only when model environment variables are ready or when debugging the Midscene layer.
+6. Mark blocked when ADB, emulator/device, APK path, package name, model credentials, fixture data, or stable result surface is missing.
 ## Evidence Contract

package/assets/skills/opentest-author/SKILL.md CHANGED Viewed

@@ -21,7 +21,9 @@ Turn the matrix into executable tests, fixtures, seed/teardown notes, and accept
 1. Read `matrix` and `fixtures` from `.opentest.yaml`.
 2. Preserve each row's requirement source and expected behavior. Do not rewrite acceptance cases around current implementation names, component internals, or existing test files.
 3. Place assets in the fixed layout from `test-asset-layout.md`; default to pytest under `tests/` when no project framework exists, and do not convert required framework evidence to `none`. Missing implementation means evidence stays pending.
-4. Create/update fixtures, seed, teardown, users, roles, entities, files/images, and assertion surfaces.
-5. For CRUD/data changes, author the full acceptance flow: create -> list -> detail -> update -> read back -> delete -> confirm absence -> teardown.
-6. Record any gap/blocker with reason and risk.
-7. Write `.opentest.yaml` fields: `fixtures`, `acceptance`, then run `bash "$OPENTEST_GUARD" author --apply`.
+4. For GUI surfaces, author a natural-language GUI scenario for the agent/Midscene, and keep commands in runner/evidence setup or reports.
+5. For Android visual/e2e rows, create the minimum Midscene harness: `package.json`, `npm run test:android`, `tests/android/midscene/`, and pytest bridge/report archiving.
+6. Create/update fixtures, seed, teardown, users, roles, entities, files/images, and assertion surfaces.
+7. For CRUD/data changes, author the full acceptance flow: create -> list -> detail -> update -> read back -> delete -> confirm absence -> teardown.
+8. Record any gap/blocker with reason and risk.
+9. Write `.opentest.yaml` fields: `fixtures`, `acceptance`, then run `bash "$OPENTEST_GUARD" author --apply`.

package/assets/skills/opentest-run/SKILL.md CHANGED Viewed

@@ -1,11 +1,11 @@
 ---
 name: opentest-run
-description: "OpenTest phase 3: run project verification commands in targeted, fast, full, ci-like, or pre-push mode."
+description: "OpenTest phase 3: run verification."
 ---
 # OpenTest Run
-Run matrix-driven commands and write reports under `docs/opentest/runs/`.
+Run matrix-driven commands and write run evidence.
 ## Required references
@@ -30,10 +30,11 @@ Run matrix-driven commands and write reports under `docs/opentest/runs/`.
 ## Steps
 1. Read `run_mode`, matrix, fixtures, required evidence, and fixed asset layout.
-2. Choose by matrix execution surface: MCP/CLI or `npx playwright test` for `web-browser`; `opentest-android-app` or `python -m pytest tests_py -v` for `android-app`, with `npm run test:android` only when model env is ready or debugging Midscene; `opentest-desktop-gui` for `desktop-gui`; `opentest-api` or `python -m pytest tests/api -v` for `api`.
-3. Prefer explicit project commands; otherwise use `python -m pytest` for code-level tests.
-4. For coverage, prefer `python -m pytest --cov=. --cov-report=term-missing`.
-5. Smoke evidence is required unless the matrix says not applicable.
-6. For `pre-push`, run or record format/check, lint, type, unit, targeted integration, smoke, and `git diff --check`.
-7. Write `run_report`, `coverage_report`, `smoke_report`, or `pre_push_report` when required.
-8. Run `bash "$OPENTEST_GUARD" run --apply`.
+2. Choose by matrix execution surface: MCP/CLI or `npx playwright test` for `web-browser`; `opentest-android-app` or `python -m pytest tests_py -v` for `android-app`; `opentest-desktop-gui` for `desktop-gui`; `opentest-api` or `python -m pytest tests/api -v` for `api`.
+3. For Android visual rows, the pytest wrapper must call `npm run test:android` when model env is ready/debugging; missing harness -> author/heal.
+4. Prefer explicit project commands; otherwise use `python -m pytest` for code-level tests.
+5. For coverage, prefer `python -m pytest --cov=. --cov-report=term-missing`.
+6. Smoke evidence is required unless the matrix says not applicable.
+7. For `pre-push`, run or record format/check, lint, type, unit, targeted integration, smoke, and `git diff --check`.
+8. Write `run_report`, `coverage_report`, `smoke_report`, or `pre_push_report` when required.
+9. Run `bash "$OPENTEST_GUARD" run --apply`.

package/assets/skills-zh/opentest/references/android-app-testing.md CHANGED Viewed

@@ -25,6 +25,31 @@ scripts/
 标准 Midscene 槽位是 `tests/android/midscene/`。
+## 自然语言 GUI 场景
+给 Midscene 执行的 GUI 场景必须是自然语言 Markdown，不是命令脚本。
+使用 `tests/android/midscene/*.md` 或 `docs/opentest/acceptance/*.md` 记录简短的
+Agent 动作指令、视觉锚点、AI 记忆变量、期望视觉断言、自愈说明和清理意图。
+不要把 `adb`、`python -m pytest`、`npm run`、`aapt` 或 `dumpsys` 命令写进 GUI 操作步骤。
+这些命令只属于 runner/evidence 的环境准备、teardown、安全检查和报告采集。
+## 最小 Midscene harness
+对 `android-app` 中 `visual-acceptance`、`e2e` 或 GUI 集成证据行，author/heal 必须创建最小 Midscene harness，不能只留下永久 pytest skip：
+- `package.json`，包含 `npm run test:android`。
+- `tests/android/midscene/*.test.ts` 或读取自然语言场景的 Midscene YAML。
+- pytest wrapper 负责检查设备/APK 前置、执行 ADB 冒烟、在模型环境就绪时调用 `npm run test:android`，并归档 `midscene_run/report/*.html`。
+缺少该 harness 时，记录 author/heal 缺口并先补齐，再宣称视觉验收。只跑 ADB smoke 可以作为冒烟证据，但不能替代 Midscene 视觉证据。
+## 模型凭据处理
+不要要求用户把密钥贴到聊天里。没有识别到 Midscene key 时，让用户设置进程环境变量，或确认本地变量映射方式。
+常见 OpenAI-compatible 变量包括 `MIDSCENE_MODEL_API_KEY` 或 `OPENAI_API_KEY`，provider 需要时还要 base URL/model 变量。遇到未知 key 变量（如 `SSS_API_KEY`），先询问 provider、OpenAI-compatible base URL、模型名/family，以及是否允许在当前进程做映射许可。
 ## 默认路线
 ```text
@@ -40,6 +65,7 @@ opentest-android-app
 面向用户的入口是 pytest。优先使用现有 Android harness 中的 `python -m pytest tests_py -v`，或仓库采用固定目录时的 `scripts/opentest-run-android.ps1`。
 只有 Midscene 模型环境变量齐全，或需要直接排查 Midscene 层时，才运行 `npm run test:android`。
+模型环境变量齐全时，pytest wrapper 必须调用 `npm run test:android`；不能调用时，把缺失桥接记录为 author/heal 缺口。
 ## 必需证据

package/assets/skills-zh/opentest/templates/android-app-acceptance-template.md CHANGED Viewed

@@ -12,20 +12,27 @@
 - fixture/reset:
 - 状态: pending
-### 环境
+### 自然语言场景
-- ADB 状态:
-- 模型环境状态:
-- App 版本/build:
-- 设备/App 元数据:
-### 步骤
+- 前置状态:
+- AI 记忆变量:
+- 自愈说明: 权限弹窗、首次引导、键盘遮挡、浮层、滚动查找
-1.
+| 步序 | Agent 动作指令 | 视觉断言 |
+| --- | --- | --- |
+| 1 |  |  |
 ### 期望结果
--
+-
+### Runner/Evidence 准备
+- 工具路线: opentest-android-app | android-midscene-pytest | pytest wrapper | @midscene/android
+- ADB 状态:
+- 模型环境状态:
+- App 版本/build:
+- 设备/App 元数据:
 ### 回读契约

package/assets/skills-zh/opentest-accept/SKILL.md CHANGED Viewed

@@ -27,7 +27,7 @@ description: "OpenTest 阶段 4：执行自然语言验收、MCP 验收或真实
 1. 读取矩阵、fixtures 和 `docs/opentest/acceptance/`。
 2. 根据矩阵里的执行面选择验收工具。
 3. `web-browser` 使用 `opentest-web-browser`：优先 Playwright MCP，失败时降级 Playwright CLI；稳定回归证据用 `@playwright/test`；视觉补充才用 Midscene。
-4. `android-app` 使用 `opentest-android-app` 和 `android-midscene-pytest`；入口是 `python -m pytest tests_py -v`，收集 pytest、ADB 冒烟、Midscene HTML、截图、logcat、设备/App 元数据和可用的 `midscene_run` 日志。缺前置条件时标为 blocked。
+4. `android-app` 通过 `opentest-android-app`/`android-midscene-pytest` 执行自然语言场景；`python -m pytest`、ADB 冒烟、logcat 和 `midscene_run` 只属于 runner/evidence。视觉行必须有 Midscene HTML 报告，或记录 blocked/heal evidence。
 5. `desktop-gui` 使用 `opentest-desktop-gui`：优先项目 GUI 自动化，视觉桌面/原生/RDP 流程用 `@midscene/computer`，并收集截图或录屏、GUI 操作日志、窗口/App 元数据和确定性回读。缺 display/RDP、App 启动、目标窗口标识、模型凭据或稳定结果面时标为 blocked。
 6. `api` 使用 `opentest-api`：优先项目 API 命令；否则用 `pytest` + `httpx`/`requests`、schema 校验、fixtures、写后读和 cleanup/teardown 证据。
 7. CRUD/数据变更执行 workflow 引用中的完整链路。

package/assets/skills-zh/opentest-android-app/SKILL.md CHANGED Viewed

@@ -17,8 +17,9 @@ description: "OpenTest 的 Android App 执行面适配器。用于通过 android
 1. 已安装 `android-midscene-pytest` 时，用它作为详细 Android 执行 skill。
 2. 稳定 Android 资产放入 `test-asset-layout.md` 的固定目录：`tests/android/tests_py/`、`tests/android/midscene/`，以及 `scripts/opentest-run-android.ps1` 或现有 Android harness 命令。
 3. 面向用户的执行入口应通过 pytest，通常是现有 harness 中的 `python -m pytest tests_py -v` 或仓库 Android 测试入口。
-4. 只有模型环境变量齐全或排查 Midscene 层时，才运行 `npm run test:android`。
-5. 缺 ADB、模拟器/真机、APK 路径、package name、模型凭据、fixture 数据或稳定结果面时，记录 `blocked`。
+4. visual/e2e 行必须有最小 Midscene harness（`package.json`、`npm run test:android` 和 `tests/android/midscene/`）；缺失时转入 author/heal。
+5. 只有模型环境变量齐全或排查 Midscene 层时，才运行 `npm run test:android`。
+6. 缺 ADB、模拟器/真机、APK 路径、package name、模型凭据、fixture 数据或稳定结果面时，记录 `blocked`。
 ## 证据契约

package/assets/skills-zh/opentest-author/SKILL.md CHANGED Viewed

@@ -21,7 +21,9 @@ description: "OpenTest 阶段 2：根据矩阵补齐测试资产、fixtures 和
 1. 读取 `.opentest.yaml` 的 `matrix` 和 `fixtures`。
 2. 保留每条矩阵行的需求来源和期望行为。不要围绕当前实现命名、组件内部结构或已有测试文件重写验收用例。
 3. 资产必须放入 `test-asset-layout.md` 的固定目录；没有项目框架时默认使用 pytest + `tests/`，不得把必需框架证据降级为 `none`。实现缺失只表示证据 pending。
-4. 创建/更新 fixtures、seed、teardown、用户、角色、实体、文件/图片和断言界面。
-5. CRUD/数据变更必须补全链路：新增 -> 列表 -> 详情 -> 修改 -> 回读 -> 删除 -> 确认消失 -> 清理。
-6. 记录 gap/blocker 的原因和风险。
-7. 写入 `.opentest.yaml` 的 `fixtures`、`acceptance`，再运行 `bash "$OPENTEST_GUARD" author --apply`。
+4. GUI 执行面要写给 Agent/Midscene 的自然语言 GUI 场景，命令只放在 runner/evidence 准备或报告里。
+5. Android visual/e2e 行必须创建最小 Midscene harness：`package.json`、`npm run test:android`、`tests/android/midscene/`，以及 pytest 桥接/报告归档。
+6. 创建/更新 fixtures、seed、teardown、用户、角色、实体、文件/图片和断言界面。
+7. CRUD/数据变更必须补全链路：新增 -> 列表 -> 详情 -> 修改 -> 回读 -> 删除 -> 确认消失 -> 清理。
+8. 记录 gap/blocker 的原因和风险。
+9. 写入 `.opentest.yaml` 的 `fixtures`、`acceptance`，再运行 `bash "$OPENTEST_GUARD" author --apply`。

package/assets/skills-zh/opentest-run/SKILL.md CHANGED Viewed

@@ -30,10 +30,11 @@ description: "OpenTest 阶段 3：按 targeted、fast、full、ci-like 或 pre-p
 ## 步骤
 1. 读取 `run_mode`、矩阵、fixtures、必需证据和固定资产目录。
-2. 根据矩阵执行面和验收模式选择命令/工具：`web-browser` 走 MCP/CLI 或 `npx playwright test`，`android-app` 走 `opentest-android-app` 或 `python -m pytest tests_py -v`；只有模型环境变量齐全或排查 Midscene 层时才跑 `npm run test:android`；`desktop-gui` 走 `opentest-desktop-gui`，`api` 走 `opentest-api` 或 `python -m pytest tests/api -v`。
-3. 优先使用项目命令；没有代码级命令时用 `python -m pytest`。
-4. 覆盖率优先用 `python -m pytest --cov=. --cov-report=term-missing`。
-5. 除非矩阵写明不适用，否则必须有冒烟证据。
-6. `pre-push` 运行或记录 format/check、lint、type、unit、targeted integration、smoke、`git diff --check`。
-7. 写入 `run_report`，必要时写入 `coverage_report`、`smoke_report`、`pre_push_report`。
-8. 运行 `bash "$OPENTEST_GUARD" run --apply`。
+2. 根据矩阵执行面和验收模式选择命令/工具：`web-browser` 走 MCP/CLI 或 `npx playwright test`，`android-app` 走 `opentest-android-app` 或 `python -m pytest tests_py -v`；`desktop-gui` 走 `opentest-desktop-gui`，`api` 走 `opentest-api` 或 `python -m pytest tests/api -v`。
+3. Android 视觉行在模型环境变量就绪时，pytest wrapper 必须调用 `npm run test:android`；桥接/harness 缺失时停止并转入 author/heal。
+4. 优先使用项目命令；没有代码级命令时用 `python -m pytest`。
+5. 覆盖率优先用 `python -m pytest --cov=. --cov-report=term-missing`。
+6. 除非矩阵写明不适用，否则必须有冒烟证据。
+7. `pre-push` 运行或记录 format/check、lint、type、unit、targeted integration、smoke、`git diff --check`。
+8. 写入 `run_report`，必要时写入 `coverage_report`、`smoke_report`、`pre_push_report`。
+9. 运行 `bash "$OPENTEST_GUARD" run --apply`。

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@pzy560117/opentest",
-  "version": "0.1.14",
+  "version": "0.1.15",
   "description": "OpenTest quality evidence lifecycle skills for Codex",
   "keywords": [
     "opentest",

package/scripts/smoke-test.js CHANGED Viewed

@@ -359,6 +359,8 @@ function assertAndroidAppContracts() {
   const chineseTemplate = readRequiredText('assets/skills-zh/opentest/templates/android-app-acceptance-template.md', '[ANDROID] missing Chinese Android app template');
   const englishPlan = readFileSync('assets/skills/opentest-plan/SKILL.md', 'utf8');
   const chinesePlan = readFileSync('assets/skills-zh/opentest-plan/SKILL.md', 'utf8');
+  const englishAuthor = readFileSync('assets/skills/opentest-author/SKILL.md', 'utf8');
+  const chineseAuthor = readFileSync('assets/skills-zh/opentest-author/SKILL.md', 'utf8');
   const englishRun = readFileSync('assets/skills/opentest-run/SKILL.md', 'utf8');
   const chineseRun = readFileSync('assets/skills-zh/opentest-run/SKILL.md', 'utf8');
   const englishAccept = readFileSync('assets/skills/opentest-accept/SKILL.md', 'utf8');
@@ -374,6 +376,26 @@ function assertAndroidAppContracts() {
   assert(chineseReference.includes('适配边界') && chineseReference.includes('tests/android/midscene/') && chineseReference.includes('不得只凭静态截图'), '[ANDROID] Chinese reference must define adapter boundary, layout, and no-static-screenshot rule');
   assert(englishTemplate.includes('tool route: opentest-android-app | android-midscene-pytest') && englishTemplate.includes('ADB smoke'), '[ANDROID] English template must include route and ADB evidence');
   assert(chineseTemplate.includes('工具路线: opentest-android-app | android-midscene-pytest') && chineseTemplate.includes('ADB 冒烟'), '[ANDROID] Chinese template must include route and ADB evidence');
+  assert(englishReference.includes('Natural-language GUI scenario') && englishReference.includes('Do not put `adb`, `python -m pytest`, `npm run`, `aapt`, or `dumpsys` commands in GUI action steps'), '[ANDROID] English reference must separate natural-language GUI scenarios from runner/evidence commands');
+  assert(chineseReference.includes('自然语言 GUI 场景') && chineseReference.includes('不要把 `adb`、`python -m pytest`、`npm run`、`aapt` 或 `dumpsys` 命令写进 GUI 操作步骤'), '[ANDROID] Chinese reference must separate natural-language GUI scenarios from runner/evidence commands');
+  assert(englishReference.includes('Minimum Midscene harness') && englishReference.includes('package.json') && englishReference.includes('tests/android/midscene/*.test.ts') && englishReference.includes('npm run test:android'), '[ANDROID] English reference must require a minimum Midscene harness');
+  assert(chineseReference.includes('最小 Midscene harness') && chineseReference.includes('package.json') && chineseReference.includes('tests/android/midscene/*.test.ts') && chineseReference.includes('npm run test:android'), '[ANDROID] Chinese reference must require a minimum Midscene harness');
+  assert(englishReference.includes('pytest wrapper must call `npm run test:android`') && englishReference.includes('author/heal gap'), '[ANDROID] English reference must make missing harness a heal/author gap instead of permanent skip');
+  assert(chineseReference.includes('pytest wrapper 必须调用 `npm run test:android`') && chineseReference.includes('author/heal 缺口'), '[ANDROID] Chinese reference must make missing harness a heal/author gap instead of permanent skip');
+  assert(englishReference.includes('unknown key variables such as `SSS_API_KEY`') && englishReference.includes('provider') && englishReference.includes('base URL') && englishReference.includes('permission to map'), '[ANDROID] English reference must ask for unknown model key mapping details');
+  assert(chineseReference.includes('未知 key 变量（如 `SSS_API_KEY`）') && chineseReference.includes('provider') && chineseReference.includes('base URL') && chineseReference.includes('映射许可'), '[ANDROID] Chinese reference must ask for unknown model key mapping details');
+  assert(englishTemplate.includes('### Natural-Language Scenario') && englishTemplate.includes('Agent action instruction') && englishTemplate.includes('Visual assertion'), '[ANDROID] English Android template must make Midscene scenarios natural-language first');
+  assert(chineseTemplate.includes('### 自然语言场景') && chineseTemplate.includes('Agent 动作指令') && chineseTemplate.includes('视觉断言'), '[ANDROID] Chinese Android template must make Midscene scenarios natural-language first');
+  assert(englishSkill.includes('minimum Midscene harness') && englishSkill.includes('author/heal'), '[ANDROID] English Android skill must route missing Midscene harness to author/heal');
+  assert(chineseSkill.includes('最小 Midscene harness') && chineseSkill.includes('author/heal'), '[ANDROID] Chinese Android skill must route missing Midscene harness to author/heal');
+  assert(englishAuthor.includes('natural-language GUI scenario') && englishAuthor.includes('runner/evidence'), '[ANDROID] English author skill must keep GUI cases separate from runner/evidence commands');
+  assert(chineseAuthor.includes('自然语言 GUI 场景') && chineseAuthor.includes('runner/evidence'), '[ANDROID] Chinese author skill must keep GUI cases separate from runner/evidence commands');
+  assert(englishAuthor.includes('minimum Midscene harness') && englishAuthor.includes('npm run test:android'), '[ANDROID] English author skill must create minimum Midscene harness for Android visual rows');
+  assert(chineseAuthor.includes('最小 Midscene harness') && chineseAuthor.includes('npm run test:android'), '[ANDROID] Chinese author skill must create minimum Midscene harness for Android visual rows');
+  assert(englishRun.includes('pytest wrapper must call `npm run test:android`') && englishRun.includes('author/heal'), '[ANDROID] English run skill must call Midscene when ready and heal missing harness');
+  assert(chineseRun.includes('pytest wrapper 必须调用 `npm run test:android`') && chineseRun.includes('author/heal'), '[ANDROID] Chinese run skill must call Midscene when ready and heal missing harness');
+  assert(englishAccept.includes('execute the natural-language scenario') && englishAccept.includes('runner/evidence'), '[ANDROID] English accept skill must execute natural-language scenarios through runner/evidence separation');
+  assert(chineseAccept.includes('执行自然语言场景') && chineseAccept.includes('runner/evidence'), '[ANDROID] Chinese accept skill must execute natural-language scenarios through runner/evidence separation');
   for (const content of [englishPlan, chinesePlan, englishRun, chineseRun, englishAccept, chineseAccept]) {
     assert(content.includes('opentest/references/android-app-testing.md'), '[ANDROID] plan/run/accept skills must read Android app reference');