@pencil-agent/nano-pencil 2.0.0 → 2.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (195) hide show
  1. package/README.md +267 -267
  2. package/dist/build-meta.json +3 -3
  3. package/dist/core/export-html/AGENT.md +11 -11
  4. package/dist/core/export-html/template.css +971 -971
  5. package/dist/core/export-html/template.html +54 -54
  6. package/dist/core/mcp/mcp-client.d.ts +3 -1
  7. package/dist/core/mcp/mcp-client.js +6 -6
  8. package/dist/core/mcp/mcp-config.d.ts +3 -3
  9. package/dist/core/mcp/mcp-config.js +1 -1
  10. package/dist/core/mcp/mcp-manager.d.ts +5 -1
  11. package/dist/core/mcp/mcp-manager.js +1 -1
  12. package/dist/core/platform/config/resource-loader.d.ts +2 -0
  13. package/dist/core/platform/config/resource-loader.js +2 -2
  14. package/dist/core/runtime/agent-session.d.ts +12 -0
  15. package/dist/core/runtime/agent-session.js +8 -8
  16. package/dist/core/runtime/sdk.d.ts +8 -0
  17. package/dist/core/runtime/sdk.js +1 -1
  18. package/dist/extensions/builtin/AGENT.md +115 -115
  19. package/dist/extensions/builtin/browser/AGENT.md +17 -17
  20. package/dist/extensions/builtin/browser/agent-workspace/agent_helpers.py +12 -12
  21. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/amazon/product-search.md +198 -198
  22. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/archive-org/scraping.md +341 -341
  23. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/arxiv/scraping.md +311 -311
  24. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/arxiv-bulk/scraping.md +333 -333
  25. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/atlas/overview.md +70 -70
  26. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/booking-com/scraping.md +578 -578
  27. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/capterra/scraping.md +440 -440
  28. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/centilebrain/generate-estimates.md +110 -110
  29. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coingecko/scraping.md +325 -325
  30. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coinmarketcap/scraping.md +463 -463
  31. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/coursera/scraping.md +360 -360
  32. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/craigslist/scraping.md +390 -390
  33. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/crossref/scraping.md +568 -568
  34. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/dev-to/scraping.md +323 -323
  35. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/duckduckgo/scraping.md +349 -349
  36. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/ebay/scraping.md +435 -435
  37. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/etsy/scraping.md +506 -506
  38. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/eventbrite/scraping.md +363 -363
  39. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/expedia/automation.md +168 -168
  40. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/facebook/groups.md +236 -236
  41. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/facebook/pages.md +295 -295
  42. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/framer/editor.md +108 -108
  43. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/fred/scraping.md +493 -493
  44. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/g2/scraping.md +580 -580
  45. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/genius/scraping.md +511 -511
  46. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/github/repo-actions.md +65 -65
  47. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/github/scraping.md +184 -184
  48. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/glassdoor/scraping.md +543 -543
  49. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/gmail/compose.md +122 -122
  50. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/goodreads/scraping.md +461 -461
  51. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/gutenberg/scraping.md +383 -383
  52. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/hackernews/scraping.md +243 -243
  53. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/howlongtobeat/scraping.md +473 -473
  54. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/imdb/scraping.md +271 -271
  55. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/itch-io/scraping.md +436 -436
  56. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/job-boards/indeed-glassdoor.md +1021 -1021
  57. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/letterboxd/scraping.md +349 -349
  58. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/linkedin/invitation-manager.md +109 -109
  59. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/loom/folder-enumeration.md +170 -170
  60. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/macrotrends/scraping.md +537 -537
  61. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/medium/article-hydration.md +120 -120
  62. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/medium/scraping.md +414 -414
  63. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/metacritic/scraping.md +477 -477
  64. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/musicbrainz/scraping.md +478 -478
  65. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/nasa/scraping.md +339 -339
  66. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/news-aggregation/multi-source.md +205 -205
  67. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/open-library/scraping.md +472 -472
  68. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/openalex/scraping.md +470 -470
  69. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/openstreetmap/scraping.md +490 -490
  70. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/package-registries/npm-pypi.md +478 -478
  71. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/polymarket/scraping.md +234 -234
  72. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/producthunt/scraping.md +307 -307
  73. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/pubmed/scraping.md +421 -421
  74. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/quora/scraping.md +364 -364
  75. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/rawg/scraping.md +352 -352
  76. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/reddit/scraping.md +124 -124
  77. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/rest-countries/scraping.md +233 -233
  78. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/sec-edgar/scraping.md +361 -361
  79. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/README.md +36 -36
  80. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/embedded-apps.md +72 -72
  81. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/knowledge-base.md +109 -109
  82. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/shopify-admin/polaris-inputs.md +137 -137
  83. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/soundcloud/scraping.md +362 -362
  84. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/spotify/scraping.md +339 -339
  85. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/stackoverflow/scraping.md +435 -435
  86. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/steam/scraping.md +575 -575
  87. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/substack/scraping.md +338 -338
  88. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/thetechgeeks/pricing.md +52 -52
  89. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/tiktok/upload.md +107 -107
  90. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/tradingview/scraping.md +309 -309
  91. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/trello/boards-and-lists.md +88 -88
  92. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/trustpilot/scraping.md +375 -375
  93. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/walmart/scraping.md +444 -444
  94. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/wayback-machine/scraping.md +306 -306
  95. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/weather/scraping.md +398 -398
  96. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/wellfound/scraping.md +596 -596
  97. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/world-bank/scraping.md +356 -356
  98. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/xiaohongshu/scraping.md +84 -84
  99. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/youtube/scraping.md +418 -418
  100. package/dist/extensions/builtin/browser/agent-workspace/domain-skills/zillow/scraping.md +433 -433
  101. package/dist/extensions/builtin/browser/browser.md +73 -73
  102. package/dist/extensions/builtin/browser/install.md +142 -142
  103. package/dist/extensions/builtin/browser/interaction-skills/connection.md +48 -48
  104. package/dist/extensions/builtin/browser/interaction-skills/cookies.md +3 -3
  105. package/dist/extensions/builtin/browser/interaction-skills/cross-origin-iframes.md +3 -3
  106. package/dist/extensions/builtin/browser/interaction-skills/dialogs.md +64 -64
  107. package/dist/extensions/builtin/browser/interaction-skills/downloads.md +3 -3
  108. package/dist/extensions/builtin/browser/interaction-skills/drag-and-drop.md +3 -3
  109. package/dist/extensions/builtin/browser/interaction-skills/dropdowns.md +3 -3
  110. package/dist/extensions/builtin/browser/interaction-skills/iframes.md +3 -3
  111. package/dist/extensions/builtin/browser/interaction-skills/network-requests.md +3 -3
  112. package/dist/extensions/builtin/browser/interaction-skills/print-as-pdf.md +3 -3
  113. package/dist/extensions/builtin/browser/interaction-skills/profile-sync.md +90 -90
  114. package/dist/extensions/builtin/browser/interaction-skills/screenshots.md +17 -17
  115. package/dist/extensions/builtin/browser/interaction-skills/scrolling.md +3 -3
  116. package/dist/extensions/builtin/browser/interaction-skills/shadow-dom.md +3 -3
  117. package/dist/extensions/builtin/browser/interaction-skills/tabs.md +69 -69
  118. package/dist/extensions/builtin/browser/interaction-skills/uploads.md +1 -1
  119. package/dist/extensions/builtin/browser/interaction-skills/viewport.md +3 -3
  120. package/dist/extensions/builtin/browser/src/browser_harness/AGENT.md +15 -15
  121. package/dist/extensions/builtin/browser/src/browser_harness/__init__.py +8 -8
  122. package/dist/extensions/builtin/browser/src/browser_harness/_ipc.py +90 -90
  123. package/dist/extensions/builtin/browser/src/browser_harness/admin.py +722 -722
  124. package/dist/extensions/builtin/browser/src/browser_harness/daemon.py +328 -328
  125. package/dist/extensions/builtin/browser/src/browser_harness/helpers.py +396 -396
  126. package/dist/extensions/builtin/browser/src/browser_harness/run.py +103 -103
  127. package/dist/extensions/builtin/discipline/skills/brainstorming/SKILL.md +33 -33
  128. package/dist/extensions/builtin/discipline/skills/executing-plans/SKILL.md +25 -25
  129. package/dist/extensions/builtin/discipline/skills/finishing-development-branch/SKILL.md +25 -25
  130. package/dist/extensions/builtin/discipline/skills/receiving-code-review/SKILL.md +22 -22
  131. package/dist/extensions/builtin/discipline/skills/requesting-code-review/SKILL.md +31 -31
  132. package/dist/extensions/builtin/discipline/skills/systematic-debugging/SKILL.md +28 -28
  133. package/dist/extensions/builtin/discipline/skills/test-driven-development/SKILL.md +32 -32
  134. package/dist/extensions/builtin/discipline/skills/using-git-worktrees/SKILL.md +25 -25
  135. package/dist/extensions/builtin/discipline/skills/verification-before-completion/SKILL.md +27 -27
  136. package/dist/extensions/builtin/discipline/skills/writing-plans/SKILL.md +26 -26
  137. package/dist/extensions/builtin/goal/README.md +67 -67
  138. package/dist/extensions/builtin/grub/README.md +112 -112
  139. package/dist/extensions/builtin/link-world/agent-workspace/README.md +16 -16
  140. package/dist/extensions/builtin/link-world/internet-search/internet-search.md +65 -65
  141. package/dist/extensions/builtin/link-world/link-world-agent.md +82 -82
  142. package/dist/extensions/builtin/link-world/linkworld.md +313 -313
  143. package/dist/extensions/builtin/link-world/network-routing/network-routing.md +67 -67
  144. package/dist/extensions/builtin/loop/README.md +92 -92
  145. package/dist/extensions/builtin/mcp/figma-design.md +68 -68
  146. package/dist/extensions/builtin/mcp/mcp-management.md +85 -85
  147. package/dist/extensions/builtin/recap/AGENT.md +15 -15
  148. package/dist/extensions/builtin/sal/README.md +72 -72
  149. package/dist/extensions/builtin/security-audit/README.md +289 -289
  150. package/dist/extensions/builtin/team/AGENT.md +112 -112
  151. package/dist/extensions/builtin/team/TESTING.md +299 -299
  152. package/dist/extensions/builtin/token-save/README.md +56 -56
  153. package/dist/extensions/optional/AGENT.md +10 -10
  154. package/dist/modes/interactive/interactive-mode.js +36 -36
  155. package/dist/modes/interactive/theme/dark.json +85 -85
  156. package/dist/modes/interactive/theme/light.json +84 -84
  157. package/dist/modes/interactive/theme/theme-schema.json +335 -335
  158. package/dist/modes/interactive/theme/warm.json +81 -81
  159. package/dist/node_modules/@pencil-agent/agent-core/dist/agent-loop.js +3 -2
  160. package/dist/node_modules/@pencil-agent/agent-core/dist/structured-adaptive-agent-loop.js +2 -1
  161. package/dist/node_modules/@pencil-agent/ai/dist/cli.js +0 -0
  162. package/docs/cc-agent-design.md +1297 -0
  163. package/docs/cc-tui-design.md +1333 -0
  164. package/docs/codex-goal-command-impl.md +1055 -1055
  165. package/docs/codex-goal-vs-grub.md +500 -500
  166. package/docs/custom-provider.md +27 -27
  167. package/docs/extensions.md +27 -27
  168. package/docs/keybindings.md +27 -27
  169. package/docs/loop /351/207/215/346/236/204/345/256/214/346/210/220/346/200/273/347/273/223.md" +250 -250
  170. package/docs/loop /351/207/215/346/236/204/345/256/214/346/210/220/346/212/245/345/221/212.md" +122 -122
  171. package/docs/loop /351/207/215/346/236/204/346/226/271/346/241/210.md" +1222 -1222
  172. package/docs/loop /351/207/215/346/236/204/346/226/271/346/241/210/345/256/236/347/216/260/346/212/245/345/221/212.md" +158 -158
  173. package/docs/loop /351/207/215/346/236/204/346/226/271/346/241/210/345/257/271/346/257/224/345/210/206/346/236/220.md" +128 -128
  174. package/docs/loop /351/207/215/346/236/204/350/256/241/345/210/222.md" +320 -320
  175. package/docs/loop-usage-examples.md +214 -214
  176. package/docs/models.md +27 -27
  177. package/docs/nanoPencil-/345/255/246/344/271/240/350/256/241/345/210/222.md +170 -0
  178. package/docs/packages.md +27 -27
  179. package/docs/pi-design-philosophy.md +457 -457
  180. package/docs/planmode.md +1987 -1987
  181. package/docs/prompt-templates.md +27 -27
  182. package/docs/providers.md +27 -27
  183. package/docs/scan-report.md +3820 -0
  184. package/docs/sdk.md +27 -27
  185. package/docs/skills.md +27 -27
  186. package/docs/themes.md +27 -27
  187. package/docs/tui.md +27 -27
  188. package/docs//345/257/271/346/240/207Claude-Code.md +1775 -0
  189. package/docs//351/230/277/351/207/214/345/267/264/345/267/264/350/264/242/346/212/245/345/210/206/346/236/220/344/271/246.md +261 -0
  190. package/package.json +190 -190
  191. package/docs/ACP/345/215/217/350/256/256/351/233/206/346/210/220/345/274/200/345/217/221/346/226/207/346/241/243.md +0 -851
  192. package/docs/SDK-TESTING.md +0 -364
  193. package/docs/mem-core/346/212/200/346/234/257/346/226/207/346/241/243.md +0 -593
  194. package/docs/startup-performance-optimization.md +0 -301
  195. package/docs//350/256/244/347/237/245/345/234/260/345/233/276.md +0 -47
@@ -1,27 +1,27 @@
1
- ---
2
- name: verification-before-completion
3
- description: Use before saying work is complete, fixed, passing, implemented, ready, or safe to merge.
4
- ---
5
-
6
- # Verification Before Completion
7
-
8
- Evidence must precede completion claims.
9
-
10
- ## Gate
11
-
12
- Do not claim success from intent, plausibility, previous output, or another agent's report. Verify against the current state.
13
-
14
- ## Process
15
-
16
- 1. Identify each claim you are about to make.
17
- 2. Identify the command, file inspection, diff, runtime check, or rendered artifact that would prove it.
18
- 3. Run or inspect that evidence freshly.
19
- 4. Read the output, exit code, or artifact carefully.
20
- 5. Report the actual state:
21
- - If verified, name the evidence.
22
- - If not verified, say what remains unverified.
23
- - If failed, report the failure and continue work.
24
-
25
- ## Evidence Matching
26
-
27
- Use focused checks for narrow claims and broader checks for broad claims. A passing unit test does not prove a full build, and a successful build does not prove the requested behavior unless the behavior is covered.
1
+ ---
2
+ name: verification-before-completion
3
+ description: Use before saying work is complete, fixed, passing, implemented, ready, or safe to merge.
4
+ ---
5
+
6
+ # Verification Before Completion
7
+
8
+ Evidence must precede completion claims.
9
+
10
+ ## Gate
11
+
12
+ Do not claim success from intent, plausibility, previous output, or another agent's report. Verify against the current state.
13
+
14
+ ## Process
15
+
16
+ 1. Identify each claim you are about to make.
17
+ 2. Identify the command, file inspection, diff, runtime check, or rendered artifact that would prove it.
18
+ 3. Run or inspect that evidence freshly.
19
+ 4. Read the output, exit code, or artifact carefully.
20
+ 5. Report the actual state:
21
+ - If verified, name the evidence.
22
+ - If not verified, say what remains unverified.
23
+ - If failed, report the failure and continue work.
24
+
25
+ ## Evidence Matching
26
+
27
+ Use focused checks for narrow claims and broader checks for broad claims. A passing unit test does not prove a full build, and a successful build does not prove the requested behavior unless the behavior is covered.
@@ -1,26 +1,26 @@
1
- ---
2
- name: writing-plans
3
- description: Use when a task needs multiple implementation steps, multiple files, handoff to another agent, or careful verification sequencing.
4
- ---
5
-
6
- # Writing Plans
7
-
8
- Create an implementation plan that can be executed without rediscovering context.
9
-
10
- ## Required Sections
11
-
12
- - Goal: one sentence describing the user-visible outcome.
13
- - Context: what the codebase currently does and why the change is needed.
14
- - Architecture: the recommended approach and why it fits existing boundaries.
15
- - Files: exact files to create or modify and each file's responsibility.
16
- - Tasks: small ordered steps with verification after meaningful changes.
17
- - Test plan: exact commands and expected evidence.
18
- - Documentation impact: P1/P2/P3 or user docs that must change.
19
-
20
- ## Task Quality
21
-
22
- Each task should be independently understandable. Include exact paths, APIs, data shapes, and expected behavior. Avoid placeholders such as "handle edge cases" or "add tests"; state the actual edge cases and tests.
23
-
24
- ## Handoff
25
-
26
- When the plan is approved, execute it directly or use `executing-plans` for inline execution. Use subagents when tasks are independent and reviewable.
1
+ ---
2
+ name: writing-plans
3
+ description: Use when a task needs multiple implementation steps, multiple files, handoff to another agent, or careful verification sequencing.
4
+ ---
5
+
6
+ # Writing Plans
7
+
8
+ Create an implementation plan that can be executed without rediscovering context.
9
+
10
+ ## Required Sections
11
+
12
+ - Goal: one sentence describing the user-visible outcome.
13
+ - Context: what the codebase currently does and why the change is needed.
14
+ - Architecture: the recommended approach and why it fits existing boundaries.
15
+ - Files: exact files to create or modify and each file's responsibility.
16
+ - Tasks: small ordered steps with verification after meaningful changes.
17
+ - Test plan: exact commands and expected evidence.
18
+ - Documentation impact: P1/P2/P3 or user docs that must change.
19
+
20
+ ## Task Quality
21
+
22
+ Each task should be independently understandable. Include exact paths, APIs, data shapes, and expected behavior. Avoid placeholders such as "handle edge cases" or "add tests"; state the actual edge cases and tests.
23
+
24
+ ## Handoff
25
+
26
+ When the plan is approved, execute it directly or use `executing-plans` for inline execution. Use subagents when tasks are independent and reviewable.
@@ -1,67 +1,67 @@
1
- # Goal Extension
2
-
3
- Long-running task management for nanoPencil. Set a goal with `/goal <objective>` and the
4
- agent will auto-continue working on it across turns until the objective is achieved,
5
- the token budget runs out, or you pause/clear it.
6
-
7
- This extension mirrors the Codex `/goal` command semantics: a per-thread goal, persisted
8
- to disk, with idle-continuation prompts, token accounting, and budget enforcement.
9
-
10
- ## Usage
11
-
12
- ```
13
- /goal Show current goal summary menu
14
- /goal <objective> Set or replace the goal
15
- /goal clear Clear the goal
16
- /goal edit Open the editor to change the objective
17
- /goal pause Pause auto-continuation
18
- /goal resume Resume auto-continuation
19
- /goal help Show usage help
20
- ```
21
-
22
- ## LLM Tools
23
-
24
- The extension registers three LLM-facing tools:
25
-
26
- | Tool | Purpose | Who can call |
27
- |------|---------|--------------|
28
- | `get_goal` | Read the current goal | LLM |
29
- | `create_goal` | Create a new goal (only when the user explicitly asks) | LLM |
30
- | `update_goal` | Mark the goal `complete` or `blocked` | LLM |
31
-
32
- The LLM is only allowed to set the goal's status to `complete` or `blocked`. Pause /
33
- resume / budget limits are user-driven and happen exclusively through `/goal`.
34
-
35
- ## Lifecycle
36
-
37
- The extension subscribes to `turn_start`, `turn_end`, `message_end`, `tool_execution_end`,
38
- and `agent_end` to track token usage and time per turn. When a turn ends with an
39
- `active` goal, the extension injects a follow-up user message containing the
40
- continuation prompt so the agent keeps working on the objective.
41
-
42
- When a turn causes the goal to cross its token budget, the extension injects a
43
- budget-limit steering prompt and marks the goal `budget_limited`. Once budget-limited,
44
- auto-continuation stops and the goal is terminal.
45
-
46
- ## Persistence
47
-
48
- Goals are stored as JSON files under `<agentDir>/goals/<threadId>.json`. They survive
49
- session restarts and are keyed by the active session ID.
50
-
51
- ## Status
52
-
53
- `Status: active` shows in the footer while a goal is running.
54
-
55
- ## Architecture
56
-
57
- | File | Responsibility |
58
- |------|----------------|
59
- | `goal-types.ts` | `ThreadGoalStatus`, `ThreadGoal`, helper predicates |
60
- | `goal-store.ts` | Atomic JSON-file persistence (replace / insert / update / delete / account_usage) |
61
- | `goal-format.ts` | Time/token formatting, summary lines, status indicator, validators |
62
- | `goal-prompts.ts` | Continuation / budget-limit / objective-updated prompt templates |
63
- | `goal-controller.ts` | Per-thread runtime: mutex, turn accounting, idle continuation |
64
- | `goal-tools.ts` | `get_goal`, `create_goal`, `update_goal` LLM tool definitions |
65
- | `goal-parser.ts` | `/goal` slash-command argument parsing |
66
- | `goal-command.ts` | `/goal` slash-command handler (UI + controller dispatch) |
67
- | `index.ts` | Extension entry: tools, command, lifecycle hooks, status indicator |
1
+ # Goal Extension
2
+
3
+ Long-running task management for nanoPencil. Set a goal with `/goal <objective>` and the
4
+ agent will auto-continue working on it across turns until the objective is achieved,
5
+ the token budget runs out, or you pause/clear it.
6
+
7
+ This extension mirrors the Codex `/goal` command semantics: a per-thread goal, persisted
8
+ to disk, with idle-continuation prompts, token accounting, and budget enforcement.
9
+
10
+ ## Usage
11
+
12
+ ```
13
+ /goal Show current goal summary menu
14
+ /goal <objective> Set or replace the goal
15
+ /goal clear Clear the goal
16
+ /goal edit Open the editor to change the objective
17
+ /goal pause Pause auto-continuation
18
+ /goal resume Resume auto-continuation
19
+ /goal help Show usage help
20
+ ```
21
+
22
+ ## LLM Tools
23
+
24
+ The extension registers three LLM-facing tools:
25
+
26
+ | Tool | Purpose | Who can call |
27
+ |------|---------|--------------|
28
+ | `get_goal` | Read the current goal | LLM |
29
+ | `create_goal` | Create a new goal (only when the user explicitly asks) | LLM |
30
+ | `update_goal` | Mark the goal `complete` or `blocked` | LLM |
31
+
32
+ The LLM is only allowed to set the goal's status to `complete` or `blocked`. Pause /
33
+ resume / budget limits are user-driven and happen exclusively through `/goal`.
34
+
35
+ ## Lifecycle
36
+
37
+ The extension subscribes to `turn_start`, `turn_end`, `message_end`, `tool_execution_end`,
38
+ and `agent_end` to track token usage and time per turn. When a turn ends with an
39
+ `active` goal, the extension injects a follow-up user message containing the
40
+ continuation prompt so the agent keeps working on the objective.
41
+
42
+ When a turn causes the goal to cross its token budget, the extension injects a
43
+ budget-limit steering prompt and marks the goal `budget_limited`. Once budget-limited,
44
+ auto-continuation stops and the goal is terminal.
45
+
46
+ ## Persistence
47
+
48
+ Goals are stored as JSON files under `<agentDir>/goals/<threadId>.json`. They survive
49
+ session restarts and are keyed by the active session ID.
50
+
51
+ ## Status
52
+
53
+ `Status: active` shows in the footer while a goal is running.
54
+
55
+ ## Architecture
56
+
57
+ | File | Responsibility |
58
+ |------|----------------|
59
+ | `goal-types.ts` | `ThreadGoalStatus`, `ThreadGoal`, helper predicates |
60
+ | `goal-store.ts` | Atomic JSON-file persistence (replace / insert / update / delete / account_usage) |
61
+ | `goal-format.ts` | Time/token formatting, summary lines, status indicator, validators |
62
+ | `goal-prompts.ts` | Continuation / budget-limit / objective-updated prompt templates |
63
+ | `goal-controller.ts` | Per-thread runtime: mutex, turn accounting, idle continuation |
64
+ | `goal-tools.ts` | `get_goal`, `create_goal`, `update_goal` LLM tool definitions |
65
+ | `goal-parser.ts` | `/goal` slash-command argument parsing |
66
+ | `goal-command.ts` | `/goal` slash-command handler (UI + controller dispatch) |
67
+ | `index.ts` | Extension entry: tools, command, lifecycle hooks, status indicator |
@@ -1,112 +1,112 @@
1
- # Grub Extension
2
-
3
- `/grub` runs one autonomous long-running task until the agent reports it
4
- complete, reports it is blocked, the user stops it, or a safety limit is
5
- reached. The harness design follows the pattern described in Anthropic's
6
- [Effective harnesses for long-running agents](https://www.anthropic.com/engineering/effective-harnesses-for-long-running-agents):
7
- a structured on-disk state lets coding agents pick up where they left off
8
- even across fresh context windows and full process restarts.
9
-
10
- ## Commands
11
-
12
- - `/grub <goal> [--max-iter N] [--max-fail N]` — start one autonomous task
13
- - `/grub status [--json]` — show the active or last finished grub task
14
- - `/grub resume` — resume dispatch for an adopted/persisted task
15
- - `/grub stop` — stop the active grub task
16
-
17
- ## Harness artifacts
18
-
19
- Each task owns a directory at `.grub/<task-id>/`:
20
-
21
- | File | Purpose | Who may write |
22
- |------|---------|---------------|
23
- | `feature-list.json` | Structured list of end-to-end features | Initializer writes the whole file once; coding agents may only flip `passes` and set `evidence` |
24
- | `progress-log.md` | Dated notes describing each iteration | Agent, append-only |
25
- | `init.sh` | Get-bearings + project smoke script run at the start of every iteration | Initializer; later agents may add project-specific smoke commands |
26
- | `state.json` | Durable `GrubTaskState` snapshot for cross-session resume | Controller, atomic writes |
27
-
28
- `feature-list.json` schema (version 1):
29
-
30
- ```json
31
- {
32
- "version": 1,
33
- "goal": "<user goal>",
34
- "features": [
35
- {
36
- "id": "kebab-slug",
37
- "category": "functional|verification|polish",
38
- "description": "observable behavior",
39
- "steps": ["actionable", "verification", "steps"],
40
- "passes": false,
41
- "evidence": "optional git sha or short proof"
42
- }
43
- ]
44
- }
45
- ```
46
-
47
- The controller validates every mutation: changing `description`, `steps`,
48
- `category`, `id`, list length, or reordering counts as a violation. Agents
49
- are told up front that the only permitted edits are toggling `passes` and
50
- setting `evidence`.
51
-
52
- ## How it works
53
-
54
- - Each grub iteration is tagged with a `[GRUB:<id>:<n>]` prompt prefix so
55
- the extension can recognise its own injected turns.
56
- - On start, grub creates the harness directory and writes the initial
57
- artifacts without creating a git commit. The harness keeps durable state on
58
- disk and leaves source changes visible in the working tree, avoiding noisy
59
- `grub(...)` commits unless the user explicitly asks for them.
60
- - Two phase-specialized system prompts are injected via
61
- `before_agent_start`:
62
- - **Initializer prompt** (first successful turn): expand
63
- `feature-list.json` into 15-40 concrete features, harden `init.sh`,
64
- seed `progress-log.md`. No broad implementation yet.
65
- - **Coding prompt** (remaining turns): run `init.sh`, pick exactly one
66
- pending feature, implement + verify end-to-end, flip `passes` +
67
- `evidence`, and append to `progress-log.md`.
68
- - At the end of every grub turn the assistant must emit a single
69
- `<loop-state>{"status":"continue|complete|blocked", "summary":"...", "nextStep":"..."}</loop-state>`
70
- block. The extension parses it and dispatches the next iteration or
71
- stops with a terminal status.
72
- - **Initializer sanitize-not-fail**: while in the initializer phase, only
73
- genuinely unfixable structural problems (feature count out of 15-40,
74
- unreplaced placeholder, non-kebab ids, duplicate ids) fail the turn and
75
- force a retry. Recoverable hygiene issues are auto-corrected instead of
76
- killing the task: a wrong `goal` is restored to the authoritative task goal,
77
- pre-marked `passes:true` are reset to `false`, and stray `evidence` is
78
- dropped. The sanitized list is written back to disk and becomes the baseline,
79
- so the phase always advances once the structure is valid.
80
- - **Phase-aware failure budget**: the initializer gets a more forgiving budget
81
- (default 5) than execution (default 3, via `--max-fail`), because standing up
82
- a valid harness is a distinct, retry-friendly activity from execution work.
83
- - **Mutation guard**: after the initializer creates the first real
84
- `feature-list.json`, each subsequent turn is diffed against the persisted
85
- baseline. Rewriting feature ids, descriptions, categories, steps, count, or
86
- order is rejected and retried; only `passes` and `evidence` may change.
87
- - **Completion guard**: if the decision says `complete` but
88
- `feature-list.json` still has `passes:false` entries, the controller
89
- rewrites the decision to `continue` with a synthetic `nextStep`
90
- pointing at the first pending feature. The harness will not allow
91
- premature "done".
92
- - **Cross-session resume**: `GrubTaskState` is written atomically to
93
- `state.json` on every transition. On the next session, `session_start`
94
- calls `discoverActiveTasks()` and adopts the most recent running task
95
- without auto-dispatching — the user types `/grub resume` to continue.
96
- - **Safety limits**: 25 iterations and 3 consecutive failures by default;
97
- override with `--max-iter` / `--max-fail`.
98
- - **Stale harness cleanup**: on extension load, terminal harnesses older
99
- than 30 days are pruned from `.grub/`.
100
-
101
- ## Legacy migration
102
-
103
- Earlier versions wrote `feature-checklist.md` (markdown checkboxes). When a
104
- new iteration starts and `feature-list.json` is missing but the legacy file
105
- exists, its checkbox items are migrated into the JSON format (category
106
- defaults to `functional`; `steps` start empty so the initializer can refine
107
- later).
108
-
109
- ## Related
110
-
111
- For the recurring scheduler that runs prompts or slash commands on an
112
- interval, see the sibling [`loop` extension](../loop/README.md).
1
+ # Grub Extension
2
+
3
+ `/grub` runs one autonomous long-running task until the agent reports it
4
+ complete, reports it is blocked, the user stops it, or a safety limit is
5
+ reached. The harness design follows the pattern described in Anthropic's
6
+ [Effective harnesses for long-running agents](https://www.anthropic.com/engineering/effective-harnesses-for-long-running-agents):
7
+ a structured on-disk state lets coding agents pick up where they left off
8
+ even across fresh context windows and full process restarts.
9
+
10
+ ## Commands
11
+
12
+ - `/grub <goal> [--max-iter N] [--max-fail N]` — start one autonomous task
13
+ - `/grub status [--json]` — show the active or last finished grub task
14
+ - `/grub resume` — resume dispatch for an adopted/persisted task
15
+ - `/grub stop` — stop the active grub task
16
+
17
+ ## Harness artifacts
18
+
19
+ Each task owns a directory at `.grub/<task-id>/`:
20
+
21
+ | File | Purpose | Who may write |
22
+ |------|---------|---------------|
23
+ | `feature-list.json` | Structured list of end-to-end features | Initializer writes the whole file once; coding agents may only flip `passes` and set `evidence` |
24
+ | `progress-log.md` | Dated notes describing each iteration | Agent, append-only |
25
+ | `init.sh` | Get-bearings + project smoke script run at the start of every iteration | Initializer; later agents may add project-specific smoke commands |
26
+ | `state.json` | Durable `GrubTaskState` snapshot for cross-session resume | Controller, atomic writes |
27
+
28
+ `feature-list.json` schema (version 1):
29
+
30
+ ```json
31
+ {
32
+ "version": 1,
33
+ "goal": "<user goal>",
34
+ "features": [
35
+ {
36
+ "id": "kebab-slug",
37
+ "category": "functional|verification|polish",
38
+ "description": "observable behavior",
39
+ "steps": ["actionable", "verification", "steps"],
40
+ "passes": false,
41
+ "evidence": "optional git sha or short proof"
42
+ }
43
+ ]
44
+ }
45
+ ```
46
+
47
+ The controller validates every mutation: changing `description`, `steps`,
48
+ `category`, `id`, list length, or reordering counts as a violation. Agents
49
+ are told up front that the only permitted edits are toggling `passes` and
50
+ setting `evidence`.
51
+
52
+ ## How it works
53
+
54
+ - Each grub iteration is tagged with a `[GRUB:<id>:<n>]` prompt prefix so
55
+ the extension can recognise its own injected turns.
56
+ - On start, grub creates the harness directory and writes the initial
57
+ artifacts without creating a git commit. The harness keeps durable state on
58
+ disk and leaves source changes visible in the working tree, avoiding noisy
59
+ `grub(...)` commits unless the user explicitly asks for them.
60
+ - Two phase-specialized system prompts are injected via
61
+ `before_agent_start`:
62
+ - **Initializer prompt** (first successful turn): expand
63
+ `feature-list.json` into 15-40 concrete features, harden `init.sh`,
64
+ seed `progress-log.md`. No broad implementation yet.
65
+ - **Coding prompt** (remaining turns): run `init.sh`, pick exactly one
66
+ pending feature, implement + verify end-to-end, flip `passes` +
67
+ `evidence`, and append to `progress-log.md`.
68
+ - At the end of every grub turn the assistant must emit a single
69
+ `<loop-state>{"status":"continue|complete|blocked", "summary":"...", "nextStep":"..."}</loop-state>`
70
+ block. The extension parses it and dispatches the next iteration or
71
+ stops with a terminal status.
72
+ - **Initializer sanitize-not-fail**: while in the initializer phase, only
73
+ genuinely unfixable structural problems (feature count out of 15-40,
74
+ unreplaced placeholder, non-kebab ids, duplicate ids) fail the turn and
75
+ force a retry. Recoverable hygiene issues are auto-corrected instead of
76
+ killing the task: a wrong `goal` is restored to the authoritative task goal,
77
+ pre-marked `passes:true` are reset to `false`, and stray `evidence` is
78
+ dropped. The sanitized list is written back to disk and becomes the baseline,
79
+ so the phase always advances once the structure is valid.
80
+ - **Phase-aware failure budget**: the initializer gets a more forgiving budget
81
+ (default 5) than execution (default 3, via `--max-fail`), because standing up
82
+ a valid harness is a distinct, retry-friendly activity from execution work.
83
+ - **Mutation guard**: after the initializer creates the first real
84
+ `feature-list.json`, each subsequent turn is diffed against the persisted
85
+ baseline. Rewriting feature ids, descriptions, categories, steps, count, or
86
+ order is rejected and retried; only `passes` and `evidence` may change.
87
+ - **Completion guard**: if the decision says `complete` but
88
+ `feature-list.json` still has `passes:false` entries, the controller
89
+ rewrites the decision to `continue` with a synthetic `nextStep`
90
+ pointing at the first pending feature. The harness will not allow
91
+ premature "done".
92
+ - **Cross-session resume**: `GrubTaskState` is written atomically to
93
+ `state.json` on every transition. On the next session, `session_start`
94
+ calls `discoverActiveTasks()` and adopts the most recent running task
95
+ without auto-dispatching — the user types `/grub resume` to continue.
96
+ - **Safety limits**: 25 iterations and 3 consecutive failures by default;
97
+ override with `--max-iter` / `--max-fail`.
98
+ - **Stale harness cleanup**: on extension load, terminal harnesses older
99
+ than 30 days are pruned from `.grub/`.
100
+
101
+ ## Legacy migration
102
+
103
+ Earlier versions wrote `feature-checklist.md` (markdown checkboxes). When a
104
+ new iteration starts and `feature-list.json` is missing but the legacy file
105
+ exists, its checkbox items are migrated into the JSON format (category
106
+ defaults to `functional`; `steps` start empty so the initializer can refine
107
+ later).
108
+
109
+ ## Related
110
+
111
+ For the recurring scheduler that runs prompts or slash commands on an
112
+ interval, see the sibling [`loop` extension](../loop/README.md).
@@ -1,16 +1,16 @@
1
- # Link-world Workspace
2
-
3
- This directory is copied to:
4
-
5
- ```text
6
- .nanopencil/link-world-workspace/
7
- ```
8
-
9
- Use it to store project-local, reusable internet-access knowledge for NanoPencil.
10
-
11
- Suggested layout:
12
-
13
- - `domain-skills/<site>/...` for durable site-specific notes
14
- - `notes/` for broader task or provider observations that should not live in the shipped extension
15
-
16
- Keep public-safe content only. Do not store secrets, cookies, tokens, or user-specific private data here.
1
+ # Link-world Workspace
2
+
3
+ This directory is copied to:
4
+
5
+ ```text
6
+ .nanopencil/link-world-workspace/
7
+ ```
8
+
9
+ Use it to store project-local, reusable internet-access knowledge for NanoPencil.
10
+
11
+ Suggested layout:
12
+
13
+ - `domain-skills/<site>/...` for durable site-specific notes
14
+ - `notes/` for broader task or provider observations that should not live in the shipped extension
15
+
16
+ Keep public-safe content only. Do not store secrets, cookies, tokens, or user-specific private data here.
@@ -1,65 +1,65 @@
1
- ---
2
- name: internet-search
3
- description: 当用户需要联网搜索最新信息、新闻、在线内容时使用此技能。包括搜索 Twitter、YouTube、Bilibili、知乎、微博、小红书等平台。
4
- ---
5
-
6
- # 互联网搜索
7
-
8
- ## 触发条件
9
-
10
- 当用户请求以下内容时使用此技能:
11
- - 搜索互联网、联网搜索、网络搜索
12
- - 查找最新新闻、资讯、信息
13
- - 搜索 YouTube、Bilibili、Twitter、小红书、微博等内容
14
- - 查找某个话题的最新讨论
15
- - 获取实时信息或热点内容
16
-
17
- ## 执行步骤
18
-
19
- ### 步骤 1:检查 agent-reach 是否安装
20
-
21
- 运行以下命令检查 agent-reach 是否可用:
22
-
23
- ```bash
24
- agent-reach --version
25
- ```
26
-
27
- ### 步骤 2:如果未安装
28
-
29
- 如果 agent-reach 命令不可用,告知用户需要先安装:
30
-
31
- > link-world (agent-reach) 未安装。请先运行 `/link-world` 安装 agent-reach 以支持联网搜索功能。
32
-
33
- ### 步骤 3:如果已安装
34
-
35
- 根据用户需求,使用 linkworld.md 中定义的工具进行搜索。
36
-
37
- **完整命令参考请查阅:extensions/link-world/linkworld.md**
38
-
39
- | 场景 | 推荐工具 | 示例命令 |
40
- |------|----------|----------|
41
- | 搜索 Twitter/X 推文 | xreach | `xreach search "关键词" --json` |
42
- | 解析 YouTube 视频 | yt-dlp | `yt-dlp --dump-json "视频URL"` |
43
- | 解析 Bilibili 视频 | yt-dlp | `yt-dlp --dump-json "视频URL"` |
44
- | 搜索 Reddit 帖子 | curl | `curl -s "https://reddit.com/r/xxx/search.json?q=关键词"` |
45
- | 搜索 GitHub 仓库 | gh | `gh search repos "关键词" --limit 5` |
46
- | 读取网页内容 | curl + Jina | `curl -s "https://r.jina.ai/网页URL"` |
47
- | 通用搜索引擎 | Exa (MCP) | `mcporter call 'exa.web_search_exa(query: "关键词", num_results: 5)'` |
48
- | 搜索小红书笔记 | 小红书 (MCP) | `mcporter call 'xiaohongshu.search_feeds(keyword: "关键词", limit: 5)'` |
49
- | 解析抖音视频 | 抖音 (MCP) | `mcporter call 'douyin.parse_douyin_video_info(url: "视频URL")'` |
50
-
51
- ### 步骤 4:返回结果
52
-
53
- 将搜索结果整理后返回给用户,包括:
54
- - 来源平台
55
- - 标题
56
- - 内容摘要
57
- - 链接
58
-
59
- ## 注意事项
60
-
61
- 1. 遵守平台的 API 使用规范
62
- 2. 如需登录平台的搜索功能,确保用户已配置认证信息
63
- 3. 如果搜索失败,尝试其他平台或告知用户可能的错误原因
64
- 4. YouTube/Bilibili 需要提供具体的视频 URL,不能直接搜索关键词
65
- 5. 详细的配置说明(Cookie、代理等)请参考 linkworld.md
1
+ ---
2
+ name: internet-search
3
+ description: 当用户需要联网搜索最新信息、新闻、在线内容时使用此技能。包括搜索 Twitter、YouTube、Bilibili、知乎、微博、小红书等平台。
4
+ ---
5
+
6
+ # 互联网搜索
7
+
8
+ ## 触发条件
9
+
10
+ 当用户请求以下内容时使用此技能:
11
+ - 搜索互联网、联网搜索、网络搜索
12
+ - 查找最新新闻、资讯、信息
13
+ - 搜索 YouTube、Bilibili、Twitter、小红书、微博等内容
14
+ - 查找某个话题的最新讨论
15
+ - 获取实时信息或热点内容
16
+
17
+ ## 执行步骤
18
+
19
+ ### 步骤 1:检查 agent-reach 是否安装
20
+
21
+ 运行以下命令检查 agent-reach 是否可用:
22
+
23
+ ```bash
24
+ agent-reach --version
25
+ ```
26
+
27
+ ### 步骤 2:如果未安装
28
+
29
+ 如果 agent-reach 命令不可用,告知用户需要先安装:
30
+
31
+ > link-world (agent-reach) 未安装。请先运行 `/link-world` 安装 agent-reach 以支持联网搜索功能。
32
+
33
+ ### 步骤 3:如果已安装
34
+
35
+ 根据用户需求,使用 linkworld.md 中定义的工具进行搜索。
36
+
37
+ **完整命令参考请查阅:extensions/link-world/linkworld.md**
38
+
39
+ | 场景 | 推荐工具 | 示例命令 |
40
+ |------|----------|----------|
41
+ | 搜索 Twitter/X 推文 | xreach | `xreach search "关键词" --json` |
42
+ | 解析 YouTube 视频 | yt-dlp | `yt-dlp --dump-json "视频URL"` |
43
+ | 解析 Bilibili 视频 | yt-dlp | `yt-dlp --dump-json "视频URL"` |
44
+ | 搜索 Reddit 帖子 | curl | `curl -s "https://reddit.com/r/xxx/search.json?q=关键词"` |
45
+ | 搜索 GitHub 仓库 | gh | `gh search repos "关键词" --limit 5` |
46
+ | 读取网页内容 | curl + Jina | `curl -s "https://r.jina.ai/网页URL"` |
47
+ | 通用搜索引擎 | Exa (MCP) | `mcporter call 'exa.web_search_exa(query: "关键词", num_results: 5)'` |
48
+ | 搜索小红书笔记 | 小红书 (MCP) | `mcporter call 'xiaohongshu.search_feeds(keyword: "关键词", limit: 5)'` |
49
+ | 解析抖音视频 | 抖音 (MCP) | `mcporter call 'douyin.parse_douyin_video_info(url: "视频URL")'` |
50
+
51
+ ### 步骤 4:返回结果
52
+
53
+ 将搜索结果整理后返回给用户,包括:
54
+ - 来源平台
55
+ - 标题
56
+ - 内容摘要
57
+ - 链接
58
+
59
+ ## 注意事项
60
+
61
+ 1. 遵守平台的 API 使用规范
62
+ 2. 如需登录平台的搜索功能,确保用户已配置认证信息
63
+ 3. 如果搜索失败,尝试其他平台或告知用户可能的错误原因
64
+ 4. YouTube/Bilibili 需要提供具体的视频 URL,不能直接搜索关键词
65
+ 5. 详细的配置说明(Cookie、代理等)请参考 linkworld.md