@groupby/ai-dev 0.5.8 → 0.5.9

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (28) hide show
  1. package/package.json +1 -1
  2. package/teams/agentic-checkout/prompts/AGENTS.md +103 -0
  3. package/teams/agentic-checkout/prompts/create-plan.md +103 -0
  4. package/teams/agentic-checkout/prompts/create-pull-request.md +157 -0
  5. package/teams/agentic-checkout/prompts/fix-pr-comments.md +170 -0
  6. package/teams/agentic-checkout/prompts/fix-review-findings.md +1 -12
  7. package/teams/agentic-checkout/prompts/implement-task.md +62 -0
  8. package/teams/agentic-checkout/prompts/new-workspace.md +12 -0
  9. package/teams/agentic-checkout/prompts/orchestrate-component-change.md +25 -0
  10. package/teams/agentic-checkout/prompts/review-change.md +8 -2
  11. package/teams/agentic-checkout/scripts/check-secrets +51 -0
  12. package/teams/agentic-checkout/scripts/install-git-hooks +15 -0
  13. package/teams/agentic-checkout/scripts/local-fast-report +5 -0
  14. package/teams/agentic-checkout/scripts/local-report +205 -0
  15. package/teams/agentic-checkout/scripts/local-summarize +47 -0
  16. package/teams/agentic-checkout/scripts/logs-deps +9 -0
  17. package/teams/agentic-checkout/scripts/setup-local-fast-model +20 -0
  18. package/teams/agentic-checkout/scripts/start-deps +15 -0
  19. package/teams/agentic-checkout/scripts/status-deps +9 -0
  20. package/teams/agentic-checkout/scripts/stop-deps +9 -0
  21. package/teams/agentic-checkout/scripts/sync-components +110 -0
  22. package/teams/agentic-checkout/skills/approval-gated-task-execution/SKILL.md +57 -0
  23. package/teams/agentic-checkout/skills/component-verification/SKILL.md +34 -0
  24. package/teams/agentic-checkout/skills/grill-me/SKILL.md +23 -0
  25. package/teams/agentic-checkout/skills/karpathy-guidelines/SKILL.md +67 -0
  26. package/teams/agentic-checkout/skills/secret-safety/SKILL.md +41 -0
  27. package/teams/agentic-checkout/skills/sync-components/SKILL.md +23 -60
  28. package/teams/agentic-checkout/skills/tdd/SKILL.md +48 -0
@@ -0,0 +1,67 @@
1
+ ---
2
+ name: karpathy-guidelines
3
+ description: Behavioral guidelines to reduce common LLM coding mistakes. Use when writing, reviewing, or refactoring code to avoid overcomplication, make surgical changes, surface assumptions, and define verifiable success criteria.
4
+ license: MIT
5
+ ---
6
+
7
+ # Karpathy Guidelines
8
+
9
+ Behavioral guidelines to reduce common LLM coding mistakes, derived from [Andrej Karpathy's observations](https://x.com/karpathy/status/2015883857489522876) on LLM coding pitfalls.
10
+
11
+ **Tradeoff:** These guidelines bias toward caution over speed. For trivial tasks, use judgment.
12
+
13
+ ## 1. Think Before Coding
14
+
15
+ **Don't assume. Don't hide confusion. Surface tradeoffs.**
16
+
17
+ Before implementing:
18
+ - State your assumptions explicitly. If uncertain, ask.
19
+ - If multiple interpretations exist, present them - don't pick silently.
20
+ - If a simpler approach exists, say so. Push back when warranted.
21
+ - If something is unclear, stop. Name what's confusing. Ask.
22
+
23
+ ## 2. Simplicity First
24
+
25
+ **Minimum code that solves the problem. Nothing speculative.**
26
+
27
+ - No features beyond what was asked.
28
+ - No abstractions for single-use code.
29
+ - No "flexibility" or "configurability" that wasn't requested.
30
+ - No error handling for impossible scenarios.
31
+ - If you write 200 lines and it could be 50, rewrite it.
32
+
33
+ Ask yourself: "Would a senior engineer say this is overcomplicated?" If yes, simplify.
34
+
35
+ ## 3. Surgical Changes
36
+
37
+ **Touch only what you must. Clean up only your own mess.**
38
+
39
+ When editing existing code:
40
+ - Don't "improve" adjacent code, comments, or formatting.
41
+ - Don't refactor things that aren't broken.
42
+ - Match existing style, even if you'd do it differently.
43
+ - If you notice unrelated dead code, mention it - don't delete it.
44
+
45
+ When your changes create orphans:
46
+ - Remove imports/variables/functions that YOUR changes made unused.
47
+ - Don't remove pre-existing dead code unless asked.
48
+
49
+ The test: Every changed line should trace directly to the user's request.
50
+
51
+ ## 4. Goal-Driven Execution
52
+
53
+ **Define success criteria. Loop until verified.**
54
+
55
+ Transform tasks into verifiable goals:
56
+ - "Add validation" → "Write tests for invalid inputs, then make them pass"
57
+ - "Fix the bug" → "Write a test that reproduces it, then make it pass"
58
+ - "Refactor X" → "Ensure tests pass before and after"
59
+
60
+ For multi-step tasks, state a brief plan:
61
+ ```
62
+ 1. [Step] → verify: [check]
63
+ 2. [Step] → verify: [check]
64
+ 3. [Step] → verify: [check]
65
+ ```
66
+
67
+ Strong success criteria let you loop independently. Weak criteria ("make it work") require constant clarification.
@@ -0,0 +1,41 @@
1
+ ---
2
+ name: secret-safety
3
+ description: Prevent credentials and sensitive tokens from being committed, logged, or pushed. Use before commits, pushes, PRs, and when editing config or examples.
4
+ license: MIT
5
+ ---
6
+
7
+ # Secret Safety
8
+
9
+ Use this skill before committing, pushing, creating PRs, or editing configuration, examples, logs, credentials, or local
10
+ environment files.
11
+
12
+ ## Rules
13
+
14
+ - Never commit real credentials, API keys, private keys, tokens, session cookies, certificates, keystores, or populated `.env` files.
15
+ - Keep secrets in environment variables, local untracked files, Vault, Azure Key Vault, GitHub Actions secrets, or another approved secret store.
16
+ - Only commit placeholder examples such as `.env.example`, `.env.sample`, or documentation values that are clearly non-secret.
17
+ - Do not paste secrets into prompts, PR descriptions, logs, commit messages, plans, or generated documentation.
18
+ - If a secret appears in a tracked file, stop and remove it before committing or pushing.
19
+ - If a real secret was committed or pushed, treat it as compromised: rotate/revoke it and remove it from git history using an approved process.
20
+
21
+ ## Checks
22
+
23
+ Run the harness secret scanner before commit or push:
24
+
25
+ ```bash
26
+ scripts/check-secrets .
27
+ ```
28
+
29
+ When checking a component repository from the harness root, pass the component path:
30
+
31
+ ```bash
32
+ scripts/check-secrets components/<component>
33
+ ```
34
+
35
+ Install the harness pre-push hook for the harness checkout:
36
+
37
+ ```bash
38
+ scripts/install-git-hooks
39
+ ```
40
+
41
+ The scanner is a guardrail, not a guarantee. Also inspect diffs manually for sensitive values.
@@ -1,76 +1,39 @@
1
1
  ---
2
2
  name: sync-components
3
- description: Sync a multi-repository component workspace by cloning repositories listed in components.txt with minimal inspection. Use when asked to sync up, clone, or refresh project components.
3
+ description: Sync component repositories from components.txt using the deterministic harness script.
4
4
  license: MIT
5
5
  ---
6
6
 
7
7
  # Sync Components
8
8
 
9
- Use this skill when the user asks to sync up a project's component repositories.
10
-
11
- ## Goal
12
-
13
- Populate `components/` from the entries in `components.txt` using `git clone`, and leave each synced component on the `main` branch.
14
-
15
- Keep this workflow lightweight. Do not inspect component source trees, run builds, run tests, scan every repository, or perform broad Git status checks unless the user explicitly asks.
16
-
17
- Do not create a shell script, temporary script, generated helper, Makefile target, or wrapper command for this workflow. Running generated scripts often triggers extra approval prompts. Use direct shell commands only.
18
-
19
- ## Source Of Truth
20
-
21
- Read `components.txt` from the framework or workspace checkout.
22
-
23
- Each non-empty, non-comment line is either:
24
-
25
- - A full Git remote URL, used as-is.
26
- - A repository name. If the file uses bare repository names, infer the Git remote from the project instructions or ask the user for the organization/remote prefix before cloning.
27
-
28
- The local checkout path for each entry is:
29
-
30
- ```text
31
- components/<repo-name>
32
- ```
33
-
34
- Derive `<repo-name>` from the entry basename and remove a trailing `.git`.
9
+ Use when the user asks to sync, clone, refresh, or update component repositories.
35
10
 
36
11
  ## Workflow
37
12
 
38
- 1. Confirm the current directory is the framework checkout.
39
- 2. Create `components/` if it does not exist.
40
- 3. Read `components.txt`.
41
- 4. For each component entry:
42
- - If `components/<repo-name>` does not exist, run:
43
-
44
- ```bash
45
- git clone <remote> components/<repo-name>
46
- ```
47
-
48
- - If the directory already exists, keep it and report `exists`.
49
- - After clone or exists, run:
50
-
51
- ```bash
52
- git -C components/<repo-name> checkout main
53
- ```
13
+ 1. Ask for the user's command approval through the normal Copilot CLI approval flow; do not enable `/allow-all`.
14
+ 2. Run exactly one of these commands after approval:
54
15
 
55
- - If `main` is not available locally, fetch only the main branch and try again:
16
+ ```bash
17
+ scripts/sync-components
18
+ ```
56
19
 
57
- ```bash
58
- git -C components/<repo-name> fetch origin main
59
- git -C components/<repo-name> checkout main
60
- ```
20
+ or, when a local fast model summary is useful:
61
21
 
62
- 5. Report only cloned, existing, checked-out-main, skipped, and failed component names.
22
+ ```bash
23
+ scripts/local-fast-report sync
24
+ ```
63
25
 
64
- Prefer issuing the `git clone` commands directly from the terminal. It is acceptable to run them one at a time. Do not generate a loop script or ask the user to run a script.
26
+ 3. Label `scripts/local-fast-report sync` runs as using the local fast model in any shell command title/description.
27
+ 4. Do not run both commands; `scripts/local-fast-report sync` already runs `scripts/sync-components`.
28
+ 5. Report cloned, existing, checked-out-main, rebased-main, skipped-dirty, and failed components.
29
+ 6. If any component is `skipped-dirty`, stop and ask before destructive cleanup or retry.
65
30
 
66
- ## Constraints
31
+ ## Rules
67
32
 
68
- - Use direct `git clone` commands for this workflow.
69
- - Do not create, modify, chmod, or execute any sync script.
70
- - Do not use heredocs, shell redirection, or generated files to automate cloning.
71
- - Do not delete, reset, clean, or force checkout any component repository.
72
- - Do not pull existing repositories unless the user asks to update already-cloned components.
73
- - Do not fetch existing repositories except `git fetch origin main` when needed to make `main` available for checkout.
74
- - Do not open or validate component-local instruction files during this sync workflow.
75
- - Do not run dependency installation, tests, formatters, linters, or builds.
76
- - Keep all cloned repositories directly under `components/`.
33
+ - Do not reimplement sync with model-driven `git clone`/`fetch`/`checkout`/`rebase` sequences unless the script fails and
34
+ the user approves manual recovery.
35
+ - Do not bypass the user's normal command approval prompts.
36
+ - Never delete, reset, clean, force-checkout, or discard existing component work without explicit approval for that
37
+ component.
38
+ - Do not inspect component source, install dependencies, run tests/builds/formatters/linters, or read component-local
39
+ instructions during sync.
@@ -0,0 +1,48 @@
1
+ ---
2
+ name: tdd
3
+ description: Use a test-driven workflow for implementation tasks. Write or update a failing test first, implement the smallest change to pass, then verify and refactor.
4
+ license: MIT
5
+ ---
6
+
7
+ # TDD
8
+
9
+ Use this skill when the user asks for TDD, test-first development, bug fixes with regression tests, or implementation
10
+ where behavior should be proven by tests.
11
+
12
+ ## Workflow
13
+
14
+ 1. Identify the smallest observable behavior change.
15
+ 2. Find the closest existing test file, fixture style, and assertion pattern.
16
+ 3. Write or update a focused test before changing production code.
17
+ 4. Run the narrowest relevant test command and confirm the test fails for the expected reason.
18
+ 5. Implement the smallest production change needed to pass.
19
+ 6. Re-run the focused test.
20
+ 7. Run the nearest broader relevant test set when the change touches shared behavior, contracts, event payloads, or
21
+ public APIs.
22
+ 8. Refactor only after tests pass, then re-run the focused test.
23
+
24
+ ## Component Commands
25
+
26
+ Prefer component-local instructions when present. Otherwise use the verification matrix in `AGENTS.md` as the canonical source for install/test/build/lint commands.
27
+
28
+ For focused tests, use the component's existing test runner syntax for a single file, module, class, or test case.
29
+
30
+ ## Constraints
31
+
32
+ - Do not make broad refactors before the red/green loop is complete.
33
+ - Do not add new test frameworks or dependencies unless the component already uses them and the change requires it.
34
+ - Do not add large test scaffolding when an existing test style can cover the behavior.
35
+ - Do not skip the failing-test step unless tests cannot run locally or the change is documentation/configuration only.
36
+ - If the failing-test step is skipped, explain why before editing production code.
37
+ - Keep tests focused on observable behavior, not private implementation details.
38
+ - For bug fixes, the first test should fail on the original bug.
39
+ - For new behavior, the first test should encode the expected contract or user-visible behavior.
40
+
41
+ ## Reporting
42
+
43
+ In the final response, include:
44
+
45
+ - The test added or changed.
46
+ - The initial failing command and failure summary, if run.
47
+ - The passing verification command.
48
+ - Any broader tests that were skipped and why.