@pencil-agent/nano-pencil 2.0.1 → 2.0.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (186) hide show
  1. package/README.md +267 -267
  2. package/dist/build-meta.json +3 -3
  3. package/dist/core/export-html/AGENT.md +11 -11
  4. package/dist/core/export-html/template.css +971 -971
  5. package/dist/core/export-html/template.html +54 -54
  6. package/dist/core/model/custom-providers.js +1 -1
  7. package/dist/core/model-registry.js +5 -5
  8. package/dist/extensions/builtin/AGENT.md +115 -115
  9. package/dist/extensions/builtin/browser/AGENT.md +17 -17
  10. package/dist/extensions/builtin/browser/agent-workspace/agent_helpers.py +12 -12
  11. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/amazon/product-search.md +198 -198
  12. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/archive-org/scraping.md +341 -341
  13. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/arxiv/scraping.md +311 -311
  14. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/arxiv-bulk/scraping.md +333 -333
  15. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/atlas/overview.md +70 -70
  16. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/booking-com/scraping.md +578 -578
  17. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/capterra/scraping.md +440 -440
  18. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/centilebrain/generate-estimates.md +110 -110
  19. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coingecko/scraping.md +325 -325
  20. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coinmarketcap/scraping.md +463 -463
  21. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coursera/scraping.md +360 -360
  22. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/craigslist/scraping.md +390 -390
  23. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/crossref/scraping.md +568 -568
  24. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/dev-to/scraping.md +323 -323
  25. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/duckduckgo/scraping.md +349 -349
  26. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/ebay/scraping.md +435 -435
  27. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/etsy/scraping.md +506 -506
  28. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/eventbrite/scraping.md +363 -363
  29. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/expedia/automation.md +168 -168
  30. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/facebook/groups.md +236 -236
  31. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/facebook/pages.md +295 -295
  32. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/framer/editor.md +108 -108
  33. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/fred/scraping.md +493 -493
  34. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/g2/scraping.md +580 -580
  35. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/genius/scraping.md +511 -511
  36. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/github/repo-actions.md +65 -65
  37. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/github/scraping.md +184 -184
  38. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/glassdoor/scraping.md +543 -543
  39. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/gmail/compose.md +122 -122
  40. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/goodreads/scraping.md +461 -461
  41. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/gutenberg/scraping.md +383 -383
  42. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/hackernews/scraping.md +243 -243
  43. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/howlongtobeat/scraping.md +473 -473
  44. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/imdb/scraping.md +271 -271
  45. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/itch-io/scraping.md +436 -436
  46. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/job-boards/indeed-glassdoor.md +1021 -1021
  47. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/letterboxd/scraping.md +349 -349
  48. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/linkedin/invitation-manager.md +109 -109
  49. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/loom/folder-enumeration.md +170 -170
  50. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/macrotrends/scraping.md +537 -537
  51. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/medium/article-hydration.md +120 -120
  52. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/medium/scraping.md +414 -414
  53. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/metacritic/scraping.md +477 -477
  54. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/musicbrainz/scraping.md +478 -478
  55. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/nasa/scraping.md +339 -339
  56. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/news-aggregation/multi-source.md +205 -205
  57. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/open-library/scraping.md +472 -472
  58. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/openalex/scraping.md +470 -470
  59. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/openstreetmap/scraping.md +490 -490
  60. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/package-registries/npm-pypi.md +478 -478
  61. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/polymarket/scraping.md +234 -234
  62. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/producthunt/scraping.md +307 -307
  63. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/pubmed/scraping.md +421 -421
  64. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/quora/scraping.md +364 -364
  65. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/rawg/scraping.md +352 -352
  66. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/reddit/scraping.md +124 -124
  67. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/rest-countries/scraping.md +233 -233
  68. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/sec-edgar/scraping.md +361 -361
  69. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/README.md +36 -36
  70. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/embedded-apps.md +72 -72
  71. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/knowledge-base.md +109 -109
  72. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/polaris-inputs.md +137 -137
  73. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/soundcloud/scraping.md +362 -362
  74. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/spotify/scraping.md +339 -339
  75. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/stackoverflow/scraping.md +435 -435
  76. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/steam/scraping.md +575 -575
  77. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/substack/scraping.md +338 -338
  78. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/thetechgeeks/pricing.md +52 -52
  79. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/tiktok/upload.md +107 -107
  80. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/tradingview/scraping.md +309 -309
  81. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/trello/boards-and-lists.md +88 -88
  82. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/trustpilot/scraping.md +375 -375
  83. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/walmart/scraping.md +444 -444
  84. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/wayback-machine/scraping.md +306 -306
  85. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/weather/scraping.md +398 -398
  86. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/wellfound/scraping.md +596 -596
  87. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/world-bank/scraping.md +356 -356
  88. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/xiaohongshu/scraping.md +84 -84
  89. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/youtube/scraping.md +418 -418
  90. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/zillow/scraping.md +433 -433
  91. package/dist/extensions/builtin/browser/browser.md +73 -73
  92. package/dist/extensions/builtin/browser/install.md +142 -142
  93. package/dist/extensions/builtin/browser/interaction-skills/connection.md +48 -48
  94. package/dist/extensions/builtin/browser/interaction-skills/cookies.md +3 -3
  95. package/dist/extensions/builtin/browser/interaction-skills/cross-origin-iframes.md +3 -3
  96. package/dist/extensions/builtin/browser/interaction-skills/dialogs.md +64 -64
  97. package/dist/extensions/builtin/browser/interaction-skills/downloads.md +3 -3
  98. package/dist/extensions/builtin/browser/interaction-skills/drag-and-drop.md +3 -3
  99. package/dist/extensions/builtin/browser/interaction-skills/dropdowns.md +3 -3
  100. package/dist/extensions/builtin/browser/interaction-skills/iframes.md +3 -3
  101. package/dist/extensions/builtin/browser/interaction-skills/network-requests.md +3 -3
  102. package/dist/extensions/builtin/browser/interaction-skills/print-as-pdf.md +3 -3
  103. package/dist/extensions/builtin/browser/interaction-skills/profile-sync.md +90 -90
  104. package/dist/extensions/builtin/browser/interaction-skills/screenshots.md +17 -17
  105. package/dist/extensions/builtin/browser/interaction-skills/scrolling.md +3 -3
  106. package/dist/extensions/builtin/browser/interaction-skills/shadow-dom.md +3 -3
  107. package/dist/extensions/builtin/browser/interaction-skills/tabs.md +69 -69
  108. package/dist/extensions/builtin/browser/interaction-skills/uploads.md +1 -1
  109. package/dist/extensions/builtin/browser/interaction-skills/viewport.md +3 -3
  110. package/dist/extensions/builtin/browser/src/browser_harness/AGENT.md +15 -15
  111. package/dist/extensions/builtin/browser/src/browser_harness/__init__.py +8 -8
  112. package/dist/extensions/builtin/browser/src/browser_harness/_ipc.py +90 -90
  113. package/dist/extensions/builtin/browser/src/browser_harness/admin.py +722 -722
  114. package/dist/extensions/builtin/browser/src/browser_harness/daemon.py +328 -328
  115. package/dist/extensions/builtin/browser/src/browser_harness/helpers.py +396 -396
  116. package/dist/extensions/builtin/browser/src/browser_harness/run.py +103 -103
  117. package/dist/extensions/builtin/discipline/skills/brainstorming/SKILL.md +33 -33
  118. package/dist/extensions/builtin/discipline/skills/executing-plans/SKILL.md +25 -25
  119. package/dist/extensions/builtin/discipline/skills/finishing-development-branch/SKILL.md +25 -25
  120. package/dist/extensions/builtin/discipline/skills/receiving-code-review/SKILL.md +22 -22
  121. package/dist/extensions/builtin/discipline/skills/requesting-code-review/SKILL.md +31 -31
  122. package/dist/extensions/builtin/discipline/skills/systematic-debugging/SKILL.md +28 -28
  123. package/dist/extensions/builtin/discipline/skills/test-driven-development/SKILL.md +32 -32
  124. package/dist/extensions/builtin/discipline/skills/using-git-worktrees/SKILL.md +25 -25
  125. package/dist/extensions/builtin/discipline/skills/verification-before-completion/SKILL.md +27 -27
  126. package/dist/extensions/builtin/discipline/skills/writing-plans/SKILL.md +26 -26
  127. package/dist/extensions/builtin/goal/README.md +67 -67
  128. package/dist/extensions/builtin/grub/README.md +112 -112
  129. package/dist/extensions/builtin/link-world/agent-workspace/README.md +16 -16
  130. package/dist/extensions/builtin/link-world/internet-search/internet-search.md +65 -65
  131. package/dist/extensions/builtin/link-world/link-world-agent.md +82 -82
  132. package/dist/extensions/builtin/link-world/linkworld.md +313 -313
  133. package/dist/extensions/builtin/link-world/network-routing/network-routing.md +67 -67
  134. package/dist/extensions/builtin/loop/README.md +92 -92
  135. package/dist/extensions/builtin/mcp/figma-design.md +68 -68
  136. package/dist/extensions/builtin/mcp/mcp-management.md +85 -85
  137. package/dist/extensions/builtin/recap/AGENT.md +15 -15
  138. package/dist/extensions/builtin/sal/README.md +72 -72
  139. package/dist/extensions/builtin/security-audit/README.md +289 -289
  140. package/dist/extensions/builtin/team/AGENT.md +112 -112
  141. package/dist/extensions/builtin/team/TESTING.md +299 -299
  142. package/dist/extensions/builtin/token-save/README.md +56 -56
  143. package/dist/extensions/optional/AGENT.md +10 -10
  144. package/dist/modes/interactive/controllers/input-submit-controller.js +2 -2
  145. package/dist/modes/interactive/controllers/stream-render-controller.js +2 -2
  146. package/dist/modes/interactive/interactive-mode.js +19 -19
  147. package/dist/modes/interactive/theme/dark.json +85 -85
  148. package/dist/modes/interactive/theme/light.json +84 -84
  149. package/dist/modes/interactive/theme/theme-schema.json +335 -335
  150. package/dist/modes/interactive/theme/warm.json +81 -81
  151. package/dist/node_modules/@pencil-agent/ai/dist/cli.js +0 -0
  152. package/dist/node_modules/@pencil-agent/ai/dist/models.generated.js +1 -1
  153. package/docs/ACP/345/215/217/350/256/256/351/233/206/346/210/220/345/274/200/345/217/221/346/226/207/346/241/243.md +851 -0
  154. package/docs/SDK-TESTING.md +364 -0
  155. package/docs/codex-goal-command-impl.md +1055 -1055
  156. package/docs/codex-goal-vs-grub.md +500 -500
  157. package/docs/custom-provider.md +27 -27
  158. package/docs/extensions.md +27 -27
  159. package/docs/keybindings.md +27 -27
  160. package/docs/loop /351/207/215/346/236/204/345/256/214/346/210/220/346/200/273/347/273/223.md" +250 -250
  161. package/docs/loop /351/207/215/346/236/204/345/256/214/346/210/220/346/212/245/345/221/212.md" +122 -122
  162. package/docs/loop /351/207/215/346/236/204/346/226/271/346/241/210.md" +1222 -1222
  163. package/docs/loop /351/207/215/346/236/204/346/226/271/346/241/210/345/256/236/347/216/260/346/212/245/345/221/212.md" +158 -158
  164. package/docs/loop /351/207/215/346/236/204/346/226/271/346/241/210/345/257/271/346/257/224/345/210/206/346/236/220.md" +128 -128
  165. package/docs/loop /351/207/215/346/236/204/350/256/241/345/210/222.md" +320 -320
  166. package/docs/loop-usage-examples.md +214 -214
  167. package/docs/mem-core/346/212/200/346/234/257/346/226/207/346/241/243.md +593 -0
  168. package/docs/models.md +27 -27
  169. package/docs/packages.md +27 -27
  170. package/docs/pi-design-philosophy.md +457 -457
  171. package/docs/planmode.md +1987 -1987
  172. package/docs/prompt-templates.md +27 -27
  173. package/docs/providers.md +27 -27
  174. package/docs/sdk.md +27 -27
  175. package/docs/skills.md +27 -27
  176. package/docs/startup-performance-optimization.md +301 -0
  177. package/docs/themes.md +27 -27
  178. package/docs/tui.md +27 -27
  179. package/docs//350/256/244/347/237/245/345/234/260/345/233/276.md +47 -0
  180. package/package.json +190 -190
  181. package/docs/cc-agent-design.md +0 -1297
  182. package/docs/cc-tui-design.md +0 -1333
  183. package/docs/nanoPencil-/345/255/246/344/271/240/350/256/241/345/210/222.md +0 -170
  184. package/docs/scan-report.md +0 -3820
  185. package/docs//345/257/271/346/240/207Claude-Code.md +0 -1775
  186. package/docs//351/230/277/351/207/214/345/267/264/345/267/264/350/264/242/346/212/245/345/210/206/346/236/220/344/271/246.md +0 -261
@@ -1,27 +1,27 @@
1
- ---
2
- name: verification-before-completion
3
- description: Use before saying work is complete, fixed, passing, implemented, ready, or safe to merge.
4
- ---
5
-
6
- # Verification Before Completion
7
-
8
- Evidence must precede completion claims.
9
-
10
- ## Gate
11
-
12
- Do not claim success from intent, plausibility, previous output, or another agent's report. Verify against the current state.
13
-
14
- ## Process
15
-
16
- 1. Identify each claim you are about to make.
17
- 2. Identify the command, file inspection, diff, runtime check, or rendered artifact that would prove it.
18
- 3. Run or inspect that evidence freshly.
19
- 4. Read the output, exit code, or artifact carefully.
20
- 5. Report the actual state:
21
- - If verified, name the evidence.
22
- - If not verified, say what remains unverified.
23
- - If failed, report the failure and continue work.
24
-
25
- ## Evidence Matching
26
-
27
- Use focused checks for narrow claims and broader checks for broad claims. A passing unit test does not prove a full build, and a successful build does not prove the requested behavior unless the behavior is covered.
1
+ ---
2
+ name: verification-before-completion
3
+ description: Use before saying work is complete, fixed, passing, implemented, ready, or safe to merge.
4
+ ---
5
+
6
+ # Verification Before Completion
7
+
8
+ Evidence must precede completion claims.
9
+
10
+ ## Gate
11
+
12
+ Do not claim success from intent, plausibility, previous output, or another agent's report. Verify against the current state.
13
+
14
+ ## Process
15
+
16
+ 1. Identify each claim you are about to make.
17
+ 2. Identify the command, file inspection, diff, runtime check, or rendered artifact that would prove it.
18
+ 3. Run or inspect that evidence freshly.
19
+ 4. Read the output, exit code, or artifact carefully.
20
+ 5. Report the actual state:
21
+ - If verified, name the evidence.
22
+ - If not verified, say what remains unverified.
23
+ - If failed, report the failure and continue work.
24
+
25
+ ## Evidence Matching
26
+
27
+ Use focused checks for narrow claims and broader checks for broad claims. A passing unit test does not prove a full build, and a successful build does not prove the requested behavior unless the behavior is covered.
@@ -1,26 +1,26 @@
1
- ---
2
- name: writing-plans
3
- description: Use when a task needs multiple implementation steps, multiple files, handoff to another agent, or careful verification sequencing.
4
- ---
5
-
6
- # Writing Plans
7
-
8
- Create an implementation plan that can be executed without rediscovering context.
9
-
10
- ## Required Sections
11
-
12
- - Goal: one sentence describing the user-visible outcome.
13
- - Context: what the codebase currently does and why the change is needed.
14
- - Architecture: the recommended approach and why it fits existing boundaries.
15
- - Files: exact files to create or modify and each file's responsibility.
16
- - Tasks: small ordered steps with verification after meaningful changes.
17
- - Test plan: exact commands and expected evidence.
18
- - Documentation impact: P1/P2/P3 or user docs that must change.
19
-
20
- ## Task Quality
21
-
22
- Each task should be independently understandable. Include exact paths, APIs, data shapes, and expected behavior. Avoid placeholders such as "handle edge cases" or "add tests"; state the actual edge cases and tests.
23
-
24
- ## Handoff
25
-
26
- When the plan is approved, execute it directly or use `executing-plans` for inline execution. Use subagents when tasks are independent and reviewable.
1
+ ---
2
+ name: writing-plans
3
+ description: Use when a task needs multiple implementation steps, multiple files, handoff to another agent, or careful verification sequencing.
4
+ ---
5
+
6
+ # Writing Plans
7
+
8
+ Create an implementation plan that can be executed without rediscovering context.
9
+
10
+ ## Required Sections
11
+
12
+ - Goal: one sentence describing the user-visible outcome.
13
+ - Context: what the codebase currently does and why the change is needed.
14
+ - Architecture: the recommended approach and why it fits existing boundaries.
15
+ - Files: exact files to create or modify and each file's responsibility.
16
+ - Tasks: small ordered steps with verification after meaningful changes.
17
+ - Test plan: exact commands and expected evidence.
18
+ - Documentation impact: P1/P2/P3 or user docs that must change.
19
+
20
+ ## Task Quality
21
+
22
+ Each task should be independently understandable. Include exact paths, APIs, data shapes, and expected behavior. Avoid placeholders such as "handle edge cases" or "add tests"; state the actual edge cases and tests.
23
+
24
+ ## Handoff
25
+
26
+ When the plan is approved, execute it directly or use `executing-plans` for inline execution. Use subagents when tasks are independent and reviewable.
@@ -1,67 +1,67 @@
1
- # Goal Extension
2
-
3
- Long-running task management for nanoPencil. Set a goal with `/goal <objective>` and the
4
- agent will auto-continue working on it across turns until the objective is achieved,
5
- the token budget runs out, or you pause/clear it.
6
-
7
- This extension mirrors the Codex `/goal` command semantics: a per-thread goal, persisted
8
- to disk, with idle-continuation prompts, token accounting, and budget enforcement.
9
-
10
- ## Usage
11
-
12
- ```
13
- /goal Show current goal summary menu
14
- /goal <objective> Set or replace the goal
15
- /goal clear Clear the goal
16
- /goal edit Open the editor to change the objective
17
- /goal pause Pause auto-continuation
18
- /goal resume Resume auto-continuation
19
- /goal help Show usage help
20
- ```
21
-
22
- ## LLM Tools
23
-
24
- The extension registers three LLM-facing tools:
25
-
26
- | Tool | Purpose | Who can call |
27
- |------|---------|--------------|
28
- | `get_goal` | Read the current goal | LLM |
29
- | `create_goal` | Create a new goal (only when the user explicitly asks) | LLM |
30
- | `update_goal` | Mark the goal `complete` or `blocked` | LLM |
31
-
32
- The LLM is only allowed to set the goal's status to `complete` or `blocked`. Pause /
33
- resume / budget limits are user-driven and happen exclusively through `/goal`.
34
-
35
- ## Lifecycle
36
-
37
- The extension subscribes to `turn_start`, `turn_end`, `message_end`, `tool_execution_end`,
38
- and `agent_end` to track token usage and time per turn. When a turn ends with an
39
- `active` goal, the extension injects a follow-up user message containing the
40
- continuation prompt so the agent keeps working on the objective.
41
-
42
- When a turn causes the goal to cross its token budget, the extension injects a
43
- budget-limit steering prompt and marks the goal `budget_limited`. Once budget-limited,
44
- auto-continuation stops and the goal is terminal.
45
-
46
- ## Persistence
47
-
48
- Goals are stored as JSON files under `<agentDir>/goals/<threadId>.json`. They survive
49
- session restarts and are keyed by the active session ID.
50
-
51
- ## Status
52
-
53
- `Status: active` shows in the footer while a goal is running.
54
-
55
- ## Architecture
56
-
57
- | File | Responsibility |
58
- |------|----------------|
59
- | `goal-types.ts` | `ThreadGoalStatus`, `ThreadGoal`, helper predicates |
60
- | `goal-store.ts` | Atomic JSON-file persistence (replace / insert / update / delete / account_usage) |
61
- | `goal-format.ts` | Time/token formatting, summary lines, status indicator, validators |
62
- | `goal-prompts.ts` | Continuation / budget-limit / objective-updated prompt templates |
63
- | `goal-controller.ts` | Per-thread runtime: mutex, turn accounting, idle continuation |
64
- | `goal-tools.ts` | `get_goal`, `create_goal`, `update_goal` LLM tool definitions |
65
- | `goal-parser.ts` | `/goal` slash-command argument parsing |
66
- | `goal-command.ts` | `/goal` slash-command handler (UI + controller dispatch) |
67
- | `index.ts` | Extension entry: tools, command, lifecycle hooks, status indicator |
1
+ # Goal Extension
2
+
3
+ Long-running task management for nanoPencil. Set a goal with `/goal <objective>` and the
4
+ agent will auto-continue working on it across turns until the objective is achieved,
5
+ the token budget runs out, or you pause/clear it.
6
+
7
+ This extension mirrors the Codex `/goal` command semantics: a per-thread goal, persisted
8
+ to disk, with idle-continuation prompts, token accounting, and budget enforcement.
9
+
10
+ ## Usage
11
+
12
+ ```
13
+ /goal Show current goal summary menu
14
+ /goal <objective> Set or replace the goal
15
+ /goal clear Clear the goal
16
+ /goal edit Open the editor to change the objective
17
+ /goal pause Pause auto-continuation
18
+ /goal resume Resume auto-continuation
19
+ /goal help Show usage help
20
+ ```
21
+
22
+ ## LLM Tools
23
+
24
+ The extension registers three LLM-facing tools:
25
+
26
+ | Tool | Purpose | Who can call |
27
+ |------|---------|--------------|
28
+ | `get_goal` | Read the current goal | LLM |
29
+ | `create_goal` | Create a new goal (only when the user explicitly asks) | LLM |
30
+ | `update_goal` | Mark the goal `complete` or `blocked` | LLM |
31
+
32
+ The LLM is only allowed to set the goal's status to `complete` or `blocked`. Pause /
33
+ resume / budget limits are user-driven and happen exclusively through `/goal`.
34
+
35
+ ## Lifecycle
36
+
37
+ The extension subscribes to `turn_start`, `turn_end`, `message_end`, `tool_execution_end`,
38
+ and `agent_end` to track token usage and time per turn. When a turn ends with an
39
+ `active` goal, the extension injects a follow-up user message containing the
40
+ continuation prompt so the agent keeps working on the objective.
41
+
42
+ When a turn causes the goal to cross its token budget, the extension injects a
43
+ budget-limit steering prompt and marks the goal `budget_limited`. Once budget-limited,
44
+ auto-continuation stops and the goal is terminal.
45
+
46
+ ## Persistence
47
+
48
+ Goals are stored as JSON files under `<agentDir>/goals/<threadId>.json`. They survive
49
+ session restarts and are keyed by the active session ID.
50
+
51
+ ## Status
52
+
53
+ `Status: active` shows in the footer while a goal is running.
54
+
55
+ ## Architecture
56
+
57
+ | File | Responsibility |
58
+ |------|----------------|
59
+ | `goal-types.ts` | `ThreadGoalStatus`, `ThreadGoal`, helper predicates |
60
+ | `goal-store.ts` | Atomic JSON-file persistence (replace / insert / update / delete / account_usage) |
61
+ | `goal-format.ts` | Time/token formatting, summary lines, status indicator, validators |
62
+ | `goal-prompts.ts` | Continuation / budget-limit / objective-updated prompt templates |
63
+ | `goal-controller.ts` | Per-thread runtime: mutex, turn accounting, idle continuation |
64
+ | `goal-tools.ts` | `get_goal`, `create_goal`, `update_goal` LLM tool definitions |
65
+ | `goal-parser.ts` | `/goal` slash-command argument parsing |
66
+ | `goal-command.ts` | `/goal` slash-command handler (UI + controller dispatch) |
67
+ | `index.ts` | Extension entry: tools, command, lifecycle hooks, status indicator |
@@ -1,112 +1,112 @@
1
- # Grub Extension
2
-
3
- `/grub` runs one autonomous long-running task until the agent reports it
4
- complete, reports it is blocked, the user stops it, or a safety limit is
5
- reached. The harness design follows the pattern described in Anthropic's
6
- [Effective harnesses for long-running agents](https://www.anthropic.com/engineering/effective-harnesses-for-long-running-agents):
7
- a structured on-disk state lets coding agents pick up where they left off
8
- even across fresh context windows and full process restarts.
9
-
10
- ## Commands
11
-
12
- - `/grub <goal> [--max-iter N] [--max-fail N]` — start one autonomous task
13
- - `/grub status [--json]` — show the active or last finished grub task
14
- - `/grub resume` — resume dispatch for an adopted/persisted task
15
- - `/grub stop` — stop the active grub task
16
-
17
- ## Harness artifacts
18
-
19
- Each task owns a directory at `.grub/<task-id>/`:
20
-
21
- | File | Purpose | Who may write |
22
- |------|---------|---------------|
23
- | `feature-list.json` | Structured list of end-to-end features | Initializer writes the whole file once; coding agents may only flip `passes` and set `evidence` |
24
- | `progress-log.md` | Dated notes describing each iteration | Agent, append-only |
25
- | `init.sh` | Get-bearings + project smoke script run at the start of every iteration | Initializer; later agents may add project-specific smoke commands |
26
- | `state.json` | Durable `GrubTaskState` snapshot for cross-session resume | Controller, atomic writes |
27
-
28
- `feature-list.json` schema (version 1):
29
-
30
- ```json
31
- {
32
- "version": 1,
33
- "goal": "<user goal>",
34
- "features": [
35
- {
36
- "id": "kebab-slug",
37
- "category": "functional|verification|polish",
38
- "description": "observable behavior",
39
- "steps": ["actionable", "verification", "steps"],
40
- "passes": false,
41
- "evidence": "optional git sha or short proof"
42
- }
43
- ]
44
- }
45
- ```
46
-
47
- The controller validates every mutation: changing `description`, `steps`,
48
- `category`, `id`, list length, or reordering counts as a violation. Agents
49
- are told up front that the only permitted edits are toggling `passes` and
50
- setting `evidence`.
51
-
52
- ## How it works
53
-
54
- - Each grub iteration is tagged with a `[GRUB:<id>:<n>]` prompt prefix so
55
- the extension can recognise its own injected turns.
56
- - On start, grub creates the harness directory and writes the initial
57
- artifacts without creating a git commit. The harness keeps durable state on
58
- disk and leaves source changes visible in the working tree, avoiding noisy
59
- `grub(...)` commits unless the user explicitly asks for them.
60
- - Two phase-specialized system prompts are injected via
61
- `before_agent_start`:
62
- - **Initializer prompt** (first successful turn): expand
63
- `feature-list.json` into 15-40 concrete features, harden `init.sh`,
64
- seed `progress-log.md`. No broad implementation yet.
65
- - **Coding prompt** (remaining turns): run `init.sh`, pick exactly one
66
- pending feature, implement + verify end-to-end, flip `passes` +
67
- `evidence`, and append to `progress-log.md`.
68
- - At the end of every grub turn the assistant must emit a single
69
- `<loop-state>{"status":"continue|complete|blocked", "summary":"...", "nextStep":"..."}</loop-state>`
70
- block. The extension parses it and dispatches the next iteration or
71
- stops with a terminal status.
72
- - **Initializer sanitize-not-fail**: while in the initializer phase, only
73
- genuinely unfixable structural problems (feature count out of 15-40,
74
- unreplaced placeholder, non-kebab ids, duplicate ids) fail the turn and
75
- force a retry. Recoverable hygiene issues are auto-corrected instead of
76
- killing the task: a wrong `goal` is restored to the authoritative task goal,
77
- pre-marked `passes:true` are reset to `false`, and stray `evidence` is
78
- dropped. The sanitized list is written back to disk and becomes the baseline,
79
- so the phase always advances once the structure is valid.
80
- - **Phase-aware failure budget**: the initializer gets a more forgiving budget
81
- (default 5) than execution (default 3, via `--max-fail`), because standing up
82
- a valid harness is a distinct, retry-friendly activity from execution work.
83
- - **Mutation guard**: after the initializer creates the first real
84
- `feature-list.json`, each subsequent turn is diffed against the persisted
85
- baseline. Rewriting feature ids, descriptions, categories, steps, count, or
86
- order is rejected and retried; only `passes` and `evidence` may change.
87
- - **Completion guard**: if the decision says `complete` but
88
- `feature-list.json` still has `passes:false` entries, the controller
89
- rewrites the decision to `continue` with a synthetic `nextStep`
90
- pointing at the first pending feature. The harness will not allow
91
- premature "done".
92
- - **Cross-session resume**: `GrubTaskState` is written atomically to
93
- `state.json` on every transition. On the next session, `session_start`
94
- calls `discoverActiveTasks()` and adopts the most recent running task
95
- without auto-dispatching — the user types `/grub resume` to continue.
96
- - **Safety limits**: 25 iterations and 3 consecutive failures by default;
97
- override with `--max-iter` / `--max-fail`.
98
- - **Stale harness cleanup**: on extension load, terminal harnesses older
99
- than 30 days are pruned from `.grub/`.
100
-
101
- ## Legacy migration
102
-
103
- Earlier versions wrote `feature-checklist.md` (markdown checkboxes). When a
104
- new iteration starts and `feature-list.json` is missing but the legacy file
105
- exists, its checkbox items are migrated into the JSON format (category
106
- defaults to `functional`; `steps` start empty so the initializer can refine
107
- later).
108
-
109
- ## Related
110
-
111
- For the recurring scheduler that runs prompts or slash commands on an
112
- interval, see the sibling [`loop` extension](../loop/README.md).
1
+ # Grub Extension
2
+
3
+ `/grub` runs one autonomous long-running task until the agent reports it
4
+ complete, reports it is blocked, the user stops it, or a safety limit is
5
+ reached. The harness design follows the pattern described in Anthropic's
6
+ [Effective harnesses for long-running agents](https://www.anthropic.com/engineering/effective-harnesses-for-long-running-agents):
7
+ a structured on-disk state lets coding agents pick up where they left off
8
+ even across fresh context windows and full process restarts.
9
+
10
+ ## Commands
11
+
12
+ - `/grub <goal> [--max-iter N] [--max-fail N]` — start one autonomous task
13
+ - `/grub status [--json]` — show the active or last finished grub task
14
+ - `/grub resume` — resume dispatch for an adopted/persisted task
15
+ - `/grub stop` — stop the active grub task
16
+
17
+ ## Harness artifacts
18
+
19
+ Each task owns a directory at `.grub/<task-id>/`:
20
+
21
+ | File | Purpose | Who may write |
22
+ |------|---------|---------------|
23
+ | `feature-list.json` | Structured list of end-to-end features | Initializer writes the whole file once; coding agents may only flip `passes` and set `evidence` |
24
+ | `progress-log.md` | Dated notes describing each iteration | Agent, append-only |
25
+ | `init.sh` | Get-bearings + project smoke script run at the start of every iteration | Initializer; later agents may add project-specific smoke commands |
26
+ | `state.json` | Durable `GrubTaskState` snapshot for cross-session resume | Controller, atomic writes |
27
+
28
+ `feature-list.json` schema (version 1):
29
+
30
+ ```json
31
+ {
32
+ "version": 1,
33
+ "goal": "<user goal>",
34
+ "features": [
35
+ {
36
+ "id": "kebab-slug",
37
+ "category": "functional|verification|polish",
38
+ "description": "observable behavior",
39
+ "steps": ["actionable", "verification", "steps"],
40
+ "passes": false,
41
+ "evidence": "optional git sha or short proof"
42
+ }
43
+ ]
44
+ }
45
+ ```
46
+
47
+ The controller validates every mutation: changing `description`, `steps`,
48
+ `category`, `id`, list length, or reordering counts as a violation. Agents
49
+ are told up front that the only permitted edits are toggling `passes` and
50
+ setting `evidence`.
51
+
52
+ ## How it works
53
+
54
+ - Each grub iteration is tagged with a `[GRUB:<id>:<n>]` prompt prefix so
55
+ the extension can recognise its own injected turns.
56
+ - On start, grub creates the harness directory and writes the initial
57
+ artifacts without creating a git commit. The harness keeps durable state on
58
+ disk and leaves source changes visible in the working tree, avoiding noisy
59
+ `grub(...)` commits unless the user explicitly asks for them.
60
+ - Two phase-specialized system prompts are injected via
61
+ `before_agent_start`:
62
+ - **Initializer prompt** (first successful turn): expand
63
+ `feature-list.json` into 15-40 concrete features, harden `init.sh`,
64
+ seed `progress-log.md`. No broad implementation yet.
65
+ - **Coding prompt** (remaining turns): run `init.sh`, pick exactly one
66
+ pending feature, implement + verify end-to-end, flip `passes` +
67
+ `evidence`, and append to `progress-log.md`.
68
+ - At the end of every grub turn the assistant must emit a single
69
+ `<loop-state>{"status":"continue|complete|blocked", "summary":"...", "nextStep":"..."}</loop-state>`
70
+ block. The extension parses it and dispatches the next iteration or
71
+ stops with a terminal status.
72
+ - **Initializer sanitize-not-fail**: while in the initializer phase, only
73
+ genuinely unfixable structural problems (feature count out of 15-40,
74
+ unreplaced placeholder, non-kebab ids, duplicate ids) fail the turn and
75
+ force a retry. Recoverable hygiene issues are auto-corrected instead of
76
+ killing the task: a wrong `goal` is restored to the authoritative task goal,
77
+ pre-marked `passes:true` are reset to `false`, and stray `evidence` is
78
+ dropped. The sanitized list is written back to disk and becomes the baseline,
79
+ so the phase always advances once the structure is valid.
80
+ - **Phase-aware failure budget**: the initializer gets a more forgiving budget
81
+ (default 5) than execution (default 3, via `--max-fail`), because standing up
82
+ a valid harness is a distinct, retry-friendly activity from execution work.
83
+ - **Mutation guard**: after the initializer creates the first real
84
+ `feature-list.json`, each subsequent turn is diffed against the persisted
85
+ baseline. Rewriting feature ids, descriptions, categories, steps, count, or
86
+ order is rejected and retried; only `passes` and `evidence` may change.
87
+ - **Completion guard**: if the decision says `complete` but
88
+ `feature-list.json` still has `passes:false` entries, the controller
89
+ rewrites the decision to `continue` with a synthetic `nextStep`
90
+ pointing at the first pending feature. The harness will not allow
91
+ premature "done".
92
+ - **Cross-session resume**: `GrubTaskState` is written atomically to
93
+ `state.json` on every transition. On the next session, `session_start`
94
+ calls `discoverActiveTasks()` and adopts the most recent running task
95
+ without auto-dispatching — the user types `/grub resume` to continue.
96
+ - **Safety limits**: 25 iterations and 3 consecutive failures by default;
97
+ override with `--max-iter` / `--max-fail`.
98
+ - **Stale harness cleanup**: on extension load, terminal harnesses older
99
+ than 30 days are pruned from `.grub/`.
100
+
101
+ ## Legacy migration
102
+
103
+ Earlier versions wrote `feature-checklist.md` (markdown checkboxes). When a
104
+ new iteration starts and `feature-list.json` is missing but the legacy file
105
+ exists, its checkbox items are migrated into the JSON format (category
106
+ defaults to `functional`; `steps` start empty so the initializer can refine
107
+ later).
108
+
109
+ ## Related
110
+
111
+ For the recurring scheduler that runs prompts or slash commands on an
112
+ interval, see the sibling [`loop` extension](../loop/README.md).
@@ -1,16 +1,16 @@
1
- # Link-world Workspace
2
-
3
- This directory is copied to:
4
-
5
- ```text
6
- .nanopencil/link-world-workspace/
7
- ```
8
-
9
- Use it to store project-local, reusable internet-access knowledge for NanoPencil.
10
-
11
- Suggested layout:
12
-
13
- - `domain-skills/<site>/...` for durable site-specific notes
14
- - `notes/` for broader task or provider observations that should not live in the shipped extension
15
-
16
- Keep public-safe content only. Do not store secrets, cookies, tokens, or user-specific private data here.
1
+ # Link-world Workspace
2
+
3
+ This directory is copied to:
4
+
5
+ ```text
6
+ .nanopencil/link-world-workspace/
7
+ ```
8
+
9
+ Use it to store project-local, reusable internet-access knowledge for NanoPencil.
10
+
11
+ Suggested layout:
12
+
13
+ - `domain-skills/<site>/...` for durable site-specific notes
14
+ - `notes/` for broader task or provider observations that should not live in the shipped extension
15
+
16
+ Keep public-safe content only. Do not store secrets, cookies, tokens, or user-specific private data here.
@@ -1,65 +1,65 @@
1
- ---
2
- name: internet-search
3
- description: 当用户需要联网搜索最新信息、新闻、在线内容时使用此技能。包括搜索 Twitter、YouTube、Bilibili、知乎、微博、小红书等平台。
4
- ---
5
-
6
- # 互联网搜索
7
-
8
- ## 触发条件
9
-
10
- 当用户请求以下内容时使用此技能:
11
- - 搜索互联网、联网搜索、网络搜索
12
- - 查找最新新闻、资讯、信息
13
- - 搜索 YouTube、Bilibili、Twitter、小红书、微博等内容
14
- - 查找某个话题的最新讨论
15
- - 获取实时信息或热点内容
16
-
17
- ## 执行步骤
18
-
19
- ### 步骤 1:检查 agent-reach 是否安装
20
-
21
- 运行以下命令检查 agent-reach 是否可用:
22
-
23
- ```bash
24
- agent-reach --version
25
- ```
26
-
27
- ### 步骤 2:如果未安装
28
-
29
- 如果 agent-reach 命令不可用,告知用户需要先安装:
30
-
31
- > link-world (agent-reach) 未安装。请先运行 `/link-world` 安装 agent-reach 以支持联网搜索功能。
32
-
33
- ### 步骤 3:如果已安装
34
-
35
- 根据用户需求,使用 linkworld.md 中定义的工具进行搜索。
36
-
37
- **完整命令参考请查阅:extensions/link-world/linkworld.md**
38
-
39
- | 场景 | 推荐工具 | 示例命令 |
40
- |------|----------|----------|
41
- | 搜索 Twitter/X 推文 | xreach | `xreach search "关键词" --json` |
42
- | 解析 YouTube 视频 | yt-dlp | `yt-dlp --dump-json "视频URL"` |
43
- | 解析 Bilibili 视频 | yt-dlp | `yt-dlp --dump-json "视频URL"` |
44
- | 搜索 Reddit 帖子 | curl | `curl -s "https://reddit.com/r/xxx/search.json?q=关键词"` |
45
- | 搜索 GitHub 仓库 | gh | `gh search repos "关键词" --limit 5` |
46
- | 读取网页内容 | curl + Jina | `curl -s "https://r.jina.ai/网页URL"` |
47
- | 通用搜索引擎 | Exa (MCP) | `mcporter call 'exa.web_search_exa(query: "关键词", num_results: 5)'` |
48
- | 搜索小红书笔记 | 小红书 (MCP) | `mcporter call 'xiaohongshu.search_feeds(keyword: "关键词", limit: 5)'` |
49
- | 解析抖音视频 | 抖音 (MCP) | `mcporter call 'douyin.parse_douyin_video_info(url: "视频URL")'` |
50
-
51
- ### 步骤 4:返回结果
52
-
53
- 将搜索结果整理后返回给用户,包括:
54
- - 来源平台
55
- - 标题
56
- - 内容摘要
57
- - 链接
58
-
59
- ## 注意事项
60
-
61
- 1. 遵守平台的 API 使用规范
62
- 2. 如需登录平台的搜索功能,确保用户已配置认证信息
63
- 3. 如果搜索失败,尝试其他平台或告知用户可能的错误原因
64
- 4. YouTube/Bilibili 需要提供具体的视频 URL,不能直接搜索关键词
65
- 5. 详细的配置说明(Cookie、代理等)请参考 linkworld.md
1
+ ---
2
+ name: internet-search
3
+ description: 当用户需要联网搜索最新信息、新闻、在线内容时使用此技能。包括搜索 Twitter、YouTube、Bilibili、知乎、微博、小红书等平台。
4
+ ---
5
+
6
+ # 互联网搜索
7
+
8
+ ## 触发条件
9
+
10
+ 当用户请求以下内容时使用此技能:
11
+ - 搜索互联网、联网搜索、网络搜索
12
+ - 查找最新新闻、资讯、信息
13
+ - 搜索 YouTube、Bilibili、Twitter、小红书、微博等内容
14
+ - 查找某个话题的最新讨论
15
+ - 获取实时信息或热点内容
16
+
17
+ ## 执行步骤
18
+
19
+ ### 步骤 1:检查 agent-reach 是否安装
20
+
21
+ 运行以下命令检查 agent-reach 是否可用:
22
+
23
+ ```bash
24
+ agent-reach --version
25
+ ```
26
+
27
+ ### 步骤 2:如果未安装
28
+
29
+ 如果 agent-reach 命令不可用,告知用户需要先安装:
30
+
31
+ > link-world (agent-reach) 未安装。请先运行 `/link-world` 安装 agent-reach 以支持联网搜索功能。
32
+
33
+ ### 步骤 3:如果已安装
34
+
35
+ 根据用户需求,使用 linkworld.md 中定义的工具进行搜索。
36
+
37
+ **完整命令参考请查阅:extensions/link-world/linkworld.md**
38
+
39
+ | 场景 | 推荐工具 | 示例命令 |
40
+ |------|----------|----------|
41
+ | 搜索 Twitter/X 推文 | xreach | `xreach search "关键词" --json` |
42
+ | 解析 YouTube 视频 | yt-dlp | `yt-dlp --dump-json "视频URL"` |
43
+ | 解析 Bilibili 视频 | yt-dlp | `yt-dlp --dump-json "视频URL"` |
44
+ | 搜索 Reddit 帖子 | curl | `curl -s "https://reddit.com/r/xxx/search.json?q=关键词"` |
45
+ | 搜索 GitHub 仓库 | gh | `gh search repos "关键词" --limit 5` |
46
+ | 读取网页内容 | curl + Jina | `curl -s "https://r.jina.ai/网页URL"` |
47
+ | 通用搜索引擎 | Exa (MCP) | `mcporter call 'exa.web_search_exa(query: "关键词", num_results: 5)'` |
48
+ | 搜索小红书笔记 | 小红书 (MCP) | `mcporter call 'xiaohongshu.search_feeds(keyword: "关键词", limit: 5)'` |
49
+ | 解析抖音视频 | 抖音 (MCP) | `mcporter call 'douyin.parse_douyin_video_info(url: "视频URL")'` |
50
+
51
+ ### 步骤 4:返回结果
52
+
53
+ 将搜索结果整理后返回给用户,包括:
54
+ - 来源平台
55
+ - 标题
56
+ - 内容摘要
57
+ - 链接
58
+
59
+ ## 注意事项
60
+
61
+ 1. 遵守平台的 API 使用规范
62
+ 2. 如需登录平台的搜索功能,确保用户已配置认证信息
63
+ 3. 如果搜索失败,尝试其他平台或告知用户可能的错误原因
64
+ 4. YouTube/Bilibili 需要提供具体的视频 URL,不能直接搜索关键词
65
+ 5. 详细的配置说明(Cookie、代理等)请参考 linkworld.md