agentflight 0.3.0 → 0.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md ADDED
@@ -0,0 +1,140 @@
1
+ # Changelog
2
+
3
+ All notable AgentFlight changes are documented here.
4
+
5
+ ## [0.3.1] - 2026-06-13
6
+
7
+ ### Changed
8
+
9
+ - Reworked the README around a clearer 60-second AgentFlight workflow.
10
+ - Added sample CLI output for status, report, replay, and resume.
11
+ - Added a replay timeline screenshot to make the v0.3.0 experience easier to understand.
12
+ - Added a basic example session walkthrough and v0.3.0 launch-note drafts.
13
+ - Improved npm keywords and package metadata.
14
+ - Narrowed packaged files to include useful README-linked docs/assets without shipping marketing drafts.
15
+
16
+ ### Documentation
17
+
18
+ - Added launch notes draft for v0.3.0 public/demo positioning.
19
+ - Updated development log with post-v0.3.0 polish verification.
20
+
21
+ ### Verification
22
+
23
+ - `npm run verify` passed.
24
+ - `npm run format:check` passed.
25
+ - `npm pack --dry-run` passed.
26
+ - `npm audit --audit-level=moderate` found `0 vulnerabilities`.
27
+ - ProjScan preflight passed.
28
+ - AgentLoopKit verification passed.
29
+
30
+ ## [0.3.0] - 2026-06-13
31
+
32
+ AgentFlight now records session events and snapshots so reports and replays show how a coding session evolved.
33
+
34
+ ### Added
35
+
36
+ - Added `agentflight snapshot --note "..."` to record a local checkpoint for the active session.
37
+ - Added session-level `events` with backward compatibility for older sessions.
38
+ - Added event recording for session start, verification attempts, snapshots, report generation, replay generation, resume prompt generation, and doctor runs.
39
+ - Added timeline sections to Markdown reports and HTML replays.
40
+ - Added latest snapshot context to status and resume prompts.
41
+ - Added changed-file groups to replay output.
42
+ - Added tests for session events, snapshot creation, missing active sessions, timeline rendering, older session compatibility, and verification events.
43
+
44
+ ### Changed
45
+
46
+ - Replays now feel more like flight recorder timelines instead of final-state summaries.
47
+ - Reports now include a concise event timeline before changed files and proof evidence.
48
+ - Resume prompts now include latest snapshot notes and current verification state.
49
+
50
+ ### Verification
51
+
52
+ - `npm run verify` passed.
53
+ - `npm run format:check` passed.
54
+ - `npm pack --dry-run` passed for `agentflight@0.3.0`.
55
+ - `npm audit --audit-level=moderate` found `0 vulnerabilities`.
56
+ - ProjScan preflight passed with health `100/100`.
57
+ - AgentLoopKit verification passed.
58
+
59
+ ## [0.2.1] - 2026-06-13
60
+
61
+ Patch release candidate focused on friction found while dogfooding the v0.2.0 core workflow.
62
+
63
+ ### Changed
64
+
65
+ - `agentflight verify` now prints the stdout and stderr evidence paths immediately after each recorded run.
66
+ - Failed verification output and downstream next actions now include the exact command to rerun.
67
+ - `agentflight report`, `agentflight replay`, and `agentflight resume` now avoid stale "generate a report" next actions once proof is ready.
68
+ - HTML replays now include a compact summary strip for risk, changed files, proof counts, and review readiness.
69
+ - AgentLoopKit workflow files under `.agentloop/` are treated as low-risk dogfooding artifacts instead of unknown code changes.
70
+
71
+ ### Fixed
72
+
73
+ - ProjScan and AgentLoopKit adapters now prefer repo-local `node_modules/.bin` binaries before PATH-global commands, preventing stale global versions from appearing in reports.
74
+ - Tool adapter version output is normalized when CLIs include decorated version text.
75
+
76
+ ### Verification
77
+
78
+ - Targeted v0.2.1 regression tests passed.
79
+ - `npm run verify` passed.
80
+ - `npm run format:check` passed.
81
+ - `npm pack --dry-run` passed for `agentflight@0.2.1`.
82
+ - `npm audit --audit-level=moderate` found `0 vulnerabilities`.
83
+ - ProjScan preflight passed with health `100/100`.
84
+ - AgentLoopKit verification passed.
85
+
86
+ ## [0.2.0] - 2026-06-13
87
+
88
+ AgentFlight now captures real verification evidence and uses it across status, report, replay, and resume.
89
+
90
+ ### Added
91
+
92
+ - Added `agentflight verify` to run verification commands and capture local evidence.
93
+ - Added verification run persistence in session records, including command, timestamps, duration, exit code, status, stdout path, and stderr path.
94
+ - Added `.agentflight/evidence/` for local stdout/stderr artifacts.
95
+ - Added evidence-aware status, report, replay, and resume outputs.
96
+ - Added tests for verification success, failure, persistence, evidence-aware outputs, and v0.1 session compatibility.
97
+
98
+ ### Changed
99
+
100
+ - `agentflight status` now reports changed areas, proof gaps, review readiness, and a next action based on captured evidence.
101
+ - `agentflight report` now includes verification evidence and honest review recommendations.
102
+ - `agentflight replay` now renders verification cards in the local HTML artifact.
103
+ - `agentflight resume` now includes verification gaps, the exact next command, and stronger continuation guardrails.
104
+
105
+ ### Fixed
106
+
107
+ - `agentflight doctor` now checks write permission for `.agentflight/`, not just path existence.
108
+ - `agentflight --version` now reports the package version instead of the stale `0.1.0` value.
109
+
110
+ ### Verification
111
+
112
+ - `npm run verify` passed.
113
+ - `npm run format:check` passed.
114
+ - `npm pack --dry-run` passed.
115
+ - `npm audit --audit-level=moderate` found 0 vulnerabilities.
116
+ - ProjScan preflight passed.
117
+ - AgentLoopKit verification passed.
118
+
119
+ ## [0.1.1] - 2026-06-13
120
+
121
+ ### Fixed
122
+
123
+ - Fixed npm `.bin` symlink invocation so `npx agentflight --help` runs the CLI instead of exiting silently.
124
+
125
+ ### Added
126
+
127
+ - Added CI verification on pushes and pull requests.
128
+ - Added tag-based npm publishing through GitHub Actions Trusted Publishing.
129
+ - Added release process documentation.
130
+
131
+ ## [0.1.0] - 2026-06-13
132
+
133
+ ### Added
134
+
135
+ - First AgentFlight MVP CLI.
136
+ - Added `init`, `start`, `status`, `report`, `replay`, `resume`, and `doctor` commands.
137
+ - Added local `.agentflight/` config, session, report, replay, and resume prompt artifacts.
138
+ - Added ProjScan and AgentLoopKit adapters with graceful fallbacks.
139
+ - Added TypeScript, Vitest, ESLint, Prettier, and npm package setup.
140
+ - Added README, architecture docs, dogfooding docs, verification docs, roadmap, and monetisation notes.
package/README.md CHANGED
@@ -2,23 +2,163 @@
2
2
 
3
3
  See what your coding agent did. Prove it works. Know what to do next.
4
4
 
5
- AgentFlight is a local-first flight recorder for AI coding agents from Baseframe Labs. It records AI-assisted coding sessions, explains what changed, highlights risk, captures proof gaps, generates replay artifacts, and produces a clean handoff for the next agent or human reviewer.
5
+ AgentFlight is a local-first flight recorder for AI coding agents from Baseframe Labs. It sits around Codex, Claude Code, Cursor, Windsurf, Gemini CLI, Aider, OpenCode, and similar tools so you can review the session instead of guessing what happened.
6
6
 
7
- AgentFlight is not another coding agent. It is the trust, observability, replay, and proof layer around tools like Codex, Claude Code, Cursor, Windsurf, Gemini CLI, Aider, OpenCode, and similar coding agents.
7
+ AgentFlight helps you:
8
8
 
9
- ## Why AgentFlight Exists
9
+ - start an AI coding session
10
+ - capture verification evidence
11
+ - see changed files and risk
12
+ - create snapshots during the session
13
+ - generate a proof report
14
+ - generate a local replay timeline
15
+ - create a resume prompt for the next agent or reviewer
10
16
 
11
- AI coding agents can move quickly, but developers still need to answer practical review questions:
17
+ ![AgentFlight replay timeline](docs/assets/agentflight-replay-timeline.png)
12
18
 
13
- - What changed?
14
- - Which files are risky?
15
- - What proof exists?
16
- - What proof is missing?
17
- - What should the next agent or reviewer do?
19
+ ## 60-Second Workflow
18
20
 
19
- AgentFlight makes those answers local, inspectable, and repeatable without uploading source code or adding cloud infrastructure.
21
+ ```bash
22
+ npx agentflight@latest init
23
+ npx agentflight@latest start --task "Add password reset flow"
24
+
25
+ # Run Codex, Claude Code, Cursor, or your coding agent normally
26
+
27
+ npx agentflight@latest verify -- npm test
28
+ npx agentflight@latest snapshot --note "Initial implementation verified"
29
+ npx agentflight@latest status
30
+ npx agentflight@latest report
31
+ npx agentflight@latest replay
32
+ npx agentflight@latest resume
33
+ ```
34
+
35
+ What you get:
36
+
37
+ - `init` creates local `.agentflight/` project files.
38
+ - `start` records the task, git branch, commit, dirty state, package manager, and tool availability.
39
+ - `verify -- npm test` runs the command and stores stdout, stderr, exit code, timing, and pass/fail status.
40
+ - `snapshot --note "..."` records the current git, risk, and proof state as a timeline event.
41
+ - `status` answers what changed, how risky it is, what proof exists, what proof is missing, and what to do next.
42
+ - `report` writes a Markdown proof report for review.
43
+ - `replay` writes a local HTML timeline you can open in a browser.
44
+ - `resume` writes a Codex/Claude-ready prompt for the next safe step.
45
+
46
+ ## Why This Exists
47
+
48
+ AI coding agents move fast. After a few prompts, you can lose track of:
49
+
50
+ - what changed
51
+ - whether the agent drifted from the task
52
+ - what was verified
53
+ - what failed
54
+ - what is safe to review
55
+ - how to resume the work later
56
+
57
+ AgentFlight gives you a local control room for that work. It records the session, captures proof, shows risk, and creates handoff artifacts without uploading source code.
58
+
59
+ ## Sample Outputs
60
+
61
+ `agentflight status`:
62
+
63
+ ```text
64
+ AgentFlight status
65
+
66
+ Task:
67
+ Add password reset flow
68
+
69
+ Changed files:
70
+ 3
71
+
72
+ Risk: medium
73
+ - Dependency, backend, or unknown files changed.
74
+
75
+ Verification Evidence:
76
+ 1 passed, 0 failed
77
+
78
+ Latest snapshot:
79
+ - Note: Initial implementation verified
80
+ - Risk: medium
81
+ - Changed files: 3
82
+
83
+ Review readiness: Ready for review
84
+
85
+ Next action:
86
+ Generate a proof report with agentflight report
87
+ ```
88
+
89
+ `agentflight report`:
90
+
91
+ ```text
92
+ # AgentFlight Proof Report
93
+
94
+ ## Recommendation
95
+ Ready for review
96
+
97
+ ## Verification Evidence
98
+ - passed: npm test
99
+ - stdout: .agentflight/evidence/.../verification-1.stdout.txt
100
+ - stderr: .agentflight/evidence/.../verification-1.stderr.txt
101
+ ```
102
+
103
+ `agentflight replay`:
104
+
105
+ ```text
106
+ Replay saved:
107
+ .agentflight/reports/af-...-replay.html
108
+
109
+ Timeline:
110
+ session_started -> verification_passed -> snapshot_created -> replay_generated
111
+ ```
112
+
113
+ `agentflight resume`:
114
+
115
+ ```text
116
+ Continue the AgentFlight session for: Add password reset flow
117
+
118
+ Latest snapshot:
119
+ Initial implementation verified
120
+
121
+ Verification state:
122
+ 1 passed, 0 failed
123
+
124
+ Guardrails:
125
+ - Stay scoped to the current task.
126
+ - Do not claim completion without proof.
127
+ - Run relevant verification before declaring success.
128
+ ```
20
129
 
21
- ## How It Works
130
+ ## Current Capabilities
131
+
132
+ The current AgentFlight release supports:
133
+
134
+ - local session setup
135
+ - active session tracking
136
+ - git branch, commit, dirty state, and changed file detection
137
+ - changed file risk categorisation
138
+ - verification evidence capture with `agentflight verify`
139
+ - session events
140
+ - snapshots with `agentflight snapshot --note "..."`
141
+ - Markdown proof reports
142
+ - self-contained HTML replay timelines
143
+ - resume prompts for Codex, Claude Code, or a human reviewer
144
+ - doctor checks for local setup
145
+ - defensive ProjScan and AgentLoopKit adapters
146
+ - no telemetry, cloud sync, or source upload
147
+
148
+ ## What AgentFlight Is Not
149
+
150
+ AgentFlight is:
151
+
152
+ - not a coding agent
153
+ - not a cloud service
154
+ - not a replacement for tests
155
+ - not a security scanner
156
+ - not a CI platform
157
+ - not a code review replacement
158
+
159
+ Use your coding agent to make changes. Use AgentFlight to understand, verify, replay, and hand off the work.
160
+
161
+ ## How It Works Locally
22
162
 
23
163
  AgentFlight creates a local `.agentflight/` directory in your repo:
24
164
 
@@ -28,65 +168,32 @@ AgentFlight creates a local `.agentflight/` directory in your repo:
28
168
  - `reports/` stores Markdown proof reports and HTML replays.
29
169
  - `evidence/` stores stdout and stderr from captured verification runs.
30
170
 
31
- Sessions also store an `events` timeline with meaningful moments such as session start, verification attempts, snapshots, and generated artifacts. Reports include filenames and summaries by default, not full source diffs. AgentFlight does not collect telemetry and does not upload source code.
32
-
33
- ## Quick Start
171
+ Sessions store an `events` timeline with meaningful moments such as session start, verification attempts, snapshots, and generated artifacts. Reports include filenames and summaries by default, not full source diffs.
34
172
 
35
- ```bash
36
- npm install
37
- npm run build
38
- npx agentflight init
39
- npx agentflight start --task "Add example feature"
40
- npx agentflight status
41
- npx agentflight verify -- npm test
42
- npx agentflight snapshot --note "Tests passing locally"
43
- npx agentflight report
44
- npx agentflight replay
45
- npx agentflight resume
46
- npx agentflight doctor
47
- ```
173
+ Runtime session data is ignored by git by default in this repo:
48
174
 
49
- During local development of this repo, use:
175
+ - `.agentflight/sessions/`
176
+ - `.agentflight/reports/`
177
+ - `.agentflight/evidence/`
178
+ - `.agentflight/current/`
50
179
 
51
- ```bash
52
- npm run agentflight -- init
53
- npm run agentflight -- start --task "Add example feature"
54
- ```
180
+ `.agentflight/config.json` is intentionally not ignored, so a project can commit its local AgentFlight defaults when useful.
55
181
 
56
182
  ## Commands
57
183
 
58
184
  - `agentflight init` initializes `.agentflight/` with safe writes.
59
185
  - `agentflight start --task "..."` starts a session and writes the current handoff.
60
- - `agentflight status` summarizes changed files, risk, verification status, and next action.
186
+ - `agentflight status` summarizes changed files, risk, verification status, snapshots, and next action.
61
187
  - `agentflight verify -- <command>` runs a proof command and records stdout/stderr evidence.
62
188
  - `agentflight verify` runs commands from `.agentflight/config.json`.
63
189
  - `agentflight snapshot --note "..."` records current git, risk, and verification state as a timeline event.
64
190
  - `agentflight report` generates a Markdown proof report.
65
191
  - `agentflight replay` generates a local self-contained HTML replay.
66
- - `agentflight resume` prints and saves a Codex/Claude-ready continuation prompt.
192
+ - `agentflight resume` prints and saves a continuation prompt.
67
193
  - `agentflight doctor` checks local setup, scripts, tools, config, and current session state.
68
194
 
69
195
  Future placeholders exist for `upgrade`, `license`, and `login`; AgentFlight Pro/Team is not available yet.
70
196
 
71
- ## Example Workflow With Codex Or Claude Code
72
-
73
- ```bash
74
- agentflight init
75
- agentflight start --task "Add password reset flow"
76
-
77
- # Run Codex, Claude Code, Cursor, or another coding agent normally.
78
-
79
- agentflight status
80
- agentflight verify -- npm run typecheck
81
- agentflight verify -- npm test
82
- agentflight snapshot --note "Implementation and proof complete"
83
- agentflight report
84
- agentflight replay
85
- agentflight resume
86
- ```
87
-
88
- Use `agentflight verify -- <command>` when you want AgentFlight to capture proof. The command records exit code, timing, stdout path, and stderr path in the current session, then prints the evidence paths. Use `agentflight snapshot --note "..."` at meaningful milestones so replay and report artifacts show the session timeline. Use the generated report for review and the resume prompt when handing the work to another agent or human.
89
-
90
197
  ## Powered By ProjScan And AgentLoopKit
91
198
 
92
199
  AgentFlight is powered by two open engines from Baseframe Labs:
@@ -94,7 +201,7 @@ AgentFlight is powered by two open engines from Baseframe Labs:
94
201
  - ProjScan provides repo intelligence, risk analysis, codebase understanding, and preflight signals.
95
202
  - AgentLoopKit provides task discipline, verification evidence, policies, and handoffs.
96
203
 
97
- This repository dogfoods both tools from day one. See [docs/development/dogfooding.md](docs/development/dogfooding.md).
204
+ This repository dogfoods both tools. See [docs/development/dogfooding.md](docs/development/dogfooding.md).
98
205
 
99
206
  Strategic architecture:
100
207
 
@@ -102,50 +209,22 @@ Strategic architecture:
102
209
  - AgentLoopKit: agent workflow discipline engine
103
210
  - AgentFlight: commercial and user-facing experience layer
104
211
 
105
- ## Local-First And Privacy
106
-
107
- AgentFlight runs locally. It does not add telemetry, login, billing, cloud sync, or source upload. The MVP reads git status and package metadata, writes human-readable local artifacts, and calls local or `npx` ProjScan/AgentLoopKit commands with graceful fallbacks.
108
-
109
- Runtime session data is ignored by git by default:
110
-
111
- - `.agentflight/sessions/`
112
- - `.agentflight/reports/`
113
- - `.agentflight/evidence/`
114
- - `.agentflight/current/`
212
+ ## Example Session
115
213
 
116
- `.agentflight/config.json` is intentionally not ignored, so a project can commit its local AgentFlight defaults if that is useful.
214
+ Read [docs/examples/basic-agentflight-session.md](docs/examples/basic-agentflight-session.md) for a short password-reset walkthrough with status, report, replay, and resume artifacts.
117
215
 
118
- ## Current Status
216
+ ## Roadmap
119
217
 
120
- AgentFlight current package version is `0.3.0`, centered on local session timelines, snapshots, and verification evidence.
218
+ See [docs/roadmap.md](docs/roadmap.md).
121
219
 
122
- Implemented:
220
+ Not built yet:
123
221
 
124
- - TypeScript ESM npm CLI package
125
- - Safe local initialization
126
- - Session start and current handoff
127
- - Status risk summary
128
- - Markdown proof reports
129
- - Self-contained HTML replay
130
- - Resume prompt generation
131
- - Doctor checks
132
- - Verification evidence capture with `agentflight verify`
133
- - Session events and snapshots with `agentflight snapshot`
134
- - Defensive ProjScan and AgentLoopKit adapters
135
- - Vitest coverage for core behavior, renderers, adapters, and command workflow
136
-
137
- Not implemented:
138
-
139
- - Cloud sync
140
- - Login
141
- - Billing
222
+ - cloud sync
223
+ - login
224
+ - billing
142
225
  - GitHub App
143
226
  - Team dashboards
144
- - Paid feature gates
145
-
146
- ## Roadmap
147
-
148
- See [docs/roadmap.md](docs/roadmap.md).
227
+ - paid feature gates
149
228
 
150
229
  ## Releases
151
230
 
@@ -0,0 +1,173 @@
1
+ # Dogfooding
2
+
3
+ AgentFlight dogfoods Baseframe Labs tooling from day one.
4
+
5
+ ## ProjScan
6
+
7
+ Version used during setup: `4.3.1`.
8
+
9
+ Commands discovered and used:
10
+
11
+ ```bash
12
+ npx projscan@latest --help
13
+ npx projscan@latest --version
14
+ npx projscan@latest init
15
+ npx projscan@latest start --intent "Build AgentFlight, a local-first flight recorder for AI coding agents"
16
+ npx projscan@latest coordinate --format json
17
+ npx projscan@latest preflight --mode before_edit --format json
18
+ npx projscan@latest privacy-check --offline
19
+ npx projscan@latest doctor
20
+ npx projscan@latest start --mode before_commit --intent "Review AgentFlight MVP core commands after package and build setup"
21
+ ```
22
+
23
+ Evidence:
24
+
25
+ - `projscan init` created `.projscanrc.json`.
26
+ - Initial health was `68/100`, reflecting the fresh repository state.
27
+ - Pre-edit preflight verdict was `proceed`.
28
+ - Offline privacy check confirmed telemetry disabled for the run.
29
+ - Post-core health improved to `94/100` before README completion.
30
+
31
+ ## AgentLoopKit
32
+
33
+ Version used during setup: `0.28.7`.
34
+
35
+ Commands discovered and used:
36
+
37
+ ```bash
38
+ npx agentloopkit@latest --help
39
+ npx agentloopkit@latest --version
40
+ npx agentloopkit@latest init
41
+ npx agentloopkit@latest doctor
42
+ npx agentloopkit@latest create-task --title "Build AgentFlight MVP" --type feature ...
43
+ npx agentloopkit@latest task status .agentloop/tasks/2026-06-13-build-agentflight-mvp.md in-progress
44
+ npx agentloopkit@latest status
45
+ ```
46
+
47
+ Evidence:
48
+
49
+ - AgentLoopKit initialized `.agentloop/`, `AGENTS.md`, `AGENTLOOP.md`, and `agentloop.config.json`.
50
+ - Active task: `.agentloop/tasks/2026-06-13-build-agentflight-mvp.md`.
51
+ - The task is marked `in-progress`.
52
+ - `agentloop.config.json` now maps test, lint, typecheck, build, and format checks to npm scripts.
53
+
54
+ ## AgentFlight Self-Dogfooding
55
+
56
+ The AgentFlight self-dogfood run was performed after `npm run build` produced `dist/cli.js`.
57
+
58
+ Commands run:
59
+
60
+ ```bash
61
+ npm run build
62
+ node dist/cli.js init
63
+ node dist/cli.js start --task "Dogfood AgentFlight MVP"
64
+ node dist/cli.js status
65
+ node dist/cli.js report
66
+ node dist/cli.js replay
67
+ node dist/cli.js resume
68
+ node dist/cli.js doctor
69
+ ```
70
+
71
+ Evidence:
72
+
73
+ - `node dist/cli.js init` initialized `.agentflight/` and detected ProjScan and AgentLoopKit.
74
+ - `node dist/cli.js start --task "Dogfood AgentFlight MVP"` created session `af-20260613-123923-dogfood-agentflight-mvp`.
75
+ - `status` reported `114` changed files and `high` risk because package/dependency and config files changed.
76
+ - `status` was honest that verification evidence was not recorded in the AgentFlight session.
77
+ - `report` wrote `.agentflight/reports/af-20260613-123923-dogfood-agentflight-mvp-proof.md`.
78
+ - `replay` wrote `.agentflight/reports/af-20260613-123923-dogfood-agentflight-mvp-replay.html`.
79
+ - `resume` wrote `.agentflight/current/resume-prompt.md`.
80
+ - `doctor` reported overall `OK`.
81
+
82
+ Self-dogfooding fixes made:
83
+
84
+ - Fixed ESM CLI entrypoint detection for repository paths with spaces.
85
+ - Fixed git porcelain parsing so untracked filenames are not truncated.
86
+ - Tuned risk categorisation so Markdown policy docs are not flagged as secret material solely because their filenames mention secrets.
87
+
88
+ ## Final Tooling Evidence
89
+
90
+ Final commands:
91
+
92
+ ```bash
93
+ npm run verify
94
+ npm run format:check
95
+ npx projscan@latest doctor
96
+ npx projscan@latest preflight --mode before_commit --format json
97
+ npx agentloopkit@latest doctor
98
+ npx agentloopkit@latest verify --task .agentloop/tasks/2026-06-13-build-agentflight-mvp.md
99
+ ```
100
+
101
+ Final results:
102
+
103
+ - npm verification passed with `14` test files and `36` tests.
104
+ - Formatting check passed.
105
+ - ProjScan reported health `100/100`.
106
+ - ProjScan preflight returned verdict `proceed`.
107
+ - AgentLoopKit verification passed and wrote a local report.
108
+ - AgentLoopKit doctor remained `warn` because the worktree is intentionally dirty during MVP construction and generated runtime artifacts exist locally.
109
+
110
+ ## v0.2.0 In-Progress Dogfooding
111
+
112
+ The v0.2.0 work uses AgentLoopKit task discipline and ProjScan checkpoints before final release preparation.
113
+
114
+ Commands run so far:
115
+
116
+ ```bash
117
+ npx agentloopkit@latest create-task --title "Prepare AgentFlight v0.2.0 verification evidence" --type feature ...
118
+ npx projscan@latest start --mode before_edit --intent "Prepare AgentFlight v0.2.0 verification evidence capture"
119
+ npm test -- tests/core/session.test.ts tests/core/verification.test.ts tests/commands/verify.test.ts
120
+ npm test -- tests/commands/evidence-output.test.ts
121
+ npm test -- tests/core/fs-safe.test.ts
122
+ npm run typecheck
123
+ ```
124
+
125
+ Evidence so far:
126
+
127
+ - AgentLoopKit created `.agentloop/tasks/2026-06-13-prepare-agentflight-v0-2-0-verification-evidence.md`.
128
+ - ProjScan reported health `100/100` before the implementation pass.
129
+ - Red tests captured the missing verification evidence model, missing `verify` command, evidence-blind outputs, and doctor writable-check bug.
130
+ - Targeted implementation tests now pass for verification capture and evidence-aware outputs.
131
+
132
+ Built CLI self-dogfood commands:
133
+
134
+ ```bash
135
+ npm run build
136
+ node dist/cli.js start --task "Dogfood AgentFlight v0.2.0 verification evidence"
137
+ node dist/cli.js verify -- npm run typecheck
138
+ node dist/cli.js verify -- npm run lint
139
+ node dist/cli.js verify -- npm test
140
+ node dist/cli.js verify -- npm run build
141
+ node dist/cli.js status
142
+ node dist/cli.js report
143
+ node dist/cli.js replay
144
+ node dist/cli.js resume
145
+ node dist/cli.js doctor
146
+ ```
147
+
148
+ Built CLI evidence:
149
+
150
+ - Session: `af-20260613-132334-dogfood-agentflight-v0-2-0-verification-evidence`.
151
+ - Verification evidence captured: `npm run typecheck`, `npm run lint`, `npm test`, and `npm run build`.
152
+ - `npm test` passed with `16` test files and `49` tests.
153
+ - `status` reported `4 passed, 0 failed`, no configured verification gaps, and `Ready for review`.
154
+ - `report` and `replay` were generated under `.agentflight/reports/`.
155
+ - `resume` included no configured verification gaps and the scoped-work guardrails.
156
+ - `doctor` reported overall `OK`.
157
+ - No v0.2.0 release, tag, or npm publish was performed.
158
+
159
+ Post-implementation tool checks:
160
+
161
+ ```bash
162
+ npx projscan@latest preflight --mode before_commit --format json
163
+ npx projscan@latest start --mode hardening --intent "Review AgentFlight v0.2.0 verification evidence implementation"
164
+ npx agentloopkit@latest status
165
+ npx agentloopkit@latest verify
166
+ ```
167
+
168
+ Tool evidence:
169
+
170
+ - ProjScan preflight returned `proceed` with health `100/100`.
171
+ - ProjScan hardening returned health `100/100` with attention requested for the dirty worktree and hotspot review suggestions.
172
+ - AgentLoopKit status found all configured commands and requested verification.
173
+ - AgentLoopKit verification passed and wrote `.agentloop/reports/2026-06-13-15-25-verification-report.md`.
@@ -0,0 +1,66 @@
1
+ # Release Process
2
+
3
+ AgentFlight releases use npm Trusted Publishing from GitHub Actions.
4
+
5
+ ## Why Tags, Not Every Push
6
+
7
+ npm versions are immutable. Publishing on every push would either fail whenever the version is unchanged or force noisy version churn. AgentFlight verifies every push, but publishes only from version tags.
8
+
9
+ ## npm Trusted Publisher Settings
10
+
11
+ Configure these values on npmjs.com for the `agentflight` package:
12
+
13
+ - Publisher: `GitHub Actions`
14
+ - Organization or user: `abhiyoheswaran1`
15
+ - Repository: `AgentFlight`
16
+ - Workflow filename: `release.yml`
17
+ - Environment name: `npm`
18
+ - Allowed actions: `Allow npm publish`
19
+
20
+ Do not enable `Allow npm stage publish` unless AgentFlight intentionally adopts npm staged releases later.
21
+
22
+ ## Release Checklist
23
+
24
+ 1. Update `package.json` and `package-lock.json` to the new version.
25
+ 2. Update `CHANGELOG.md`.
26
+ 3. Run verification:
27
+
28
+ ```bash
29
+ npm run verify
30
+ npm run format:check
31
+ npm pack --dry-run
32
+ ```
33
+
34
+ 4. Commit the release:
35
+
36
+ ```bash
37
+ git add .
38
+ git commit -m "chore: release v0.1.1"
39
+ ```
40
+
41
+ 5. Tag and push:
42
+
43
+ ```bash
44
+ git tag v0.1.1
45
+ git push origin main --tags
46
+ ```
47
+
48
+ 6. Confirm:
49
+
50
+ ```bash
51
+ npm view agentflight version
52
+ npx --yes agentflight@latest --help
53
+ ```
54
+
55
+ ## Manual Emergency Publish
56
+
57
+ Manual publishing is allowed for emergencies while the package is young:
58
+
59
+ ```bash
60
+ npm run verify
61
+ npm run format:check
62
+ npm pack --dry-run
63
+ npm publish --access public
64
+ ```
65
+
66
+ Record any manual publish in `CHANGELOG.md` and `AGENTFLIGHT_DEVLOG.md`.
@@ -0,0 +1,164 @@
1
+ # Basic AgentFlight Session
2
+
3
+ This example shows a fictional local session for adding a password reset flow.
4
+
5
+ ## 1. Start The Flight
6
+
7
+ ```bash
8
+ npx agentflight@latest init
9
+ npx agentflight@latest start --task "Add password reset flow"
10
+ ```
11
+
12
+ AgentFlight records the task, current branch, commit, dirty state, package manager, and available tools. It also writes `.agentflight/current/handoff.md` so the coding agent or human reviewer has a starting point.
13
+
14
+ ## 2. Run The Coding Agent
15
+
16
+ Run Codex, Claude Code, Cursor, or another coding agent normally.
17
+
18
+ In this example the agent changes:
19
+
20
+ ```text
21
+ src/password-reset.ts
22
+ tests/password-reset.test.ts
23
+ docs/password-reset.md
24
+ ```
25
+
26
+ ## 3. Capture Proof
27
+
28
+ ```bash
29
+ npx agentflight@latest verify -- npm test
30
+ ```
31
+
32
+ AgentFlight runs the command and stores local evidence:
33
+
34
+ ```text
35
+ Verification recorded
36
+
37
+ Command:
38
+ npm test
39
+
40
+ Status:
41
+ passed
42
+
43
+ Evidence:
44
+ stdout: .agentflight/evidence/.../verification-1.stdout.txt
45
+ stderr: .agentflight/evidence/.../verification-1.stderr.txt
46
+ ```
47
+
48
+ If the command fails, AgentFlight records it as failed. It does not turn failed proof into success.
49
+
50
+ ## 4. Create A Snapshot
51
+
52
+ ```bash
53
+ npx agentflight@latest snapshot --note "Initial implementation verified"
54
+ ```
55
+
56
+ The snapshot records:
57
+
58
+ - branch
59
+ - commit
60
+ - dirty state
61
+ - changed files
62
+ - risk level
63
+ - verification summary
64
+ - note
65
+
66
+ AgentFlight stores this as a session event. It does not store full code diffs by default.
67
+
68
+ ## 5. Check Status
69
+
70
+ ```bash
71
+ npx agentflight@latest status
72
+ ```
73
+
74
+ Example:
75
+
76
+ ```text
77
+ AgentFlight status
78
+
79
+ Task:
80
+ Add password reset flow
81
+
82
+ Changed files:
83
+ 3
84
+
85
+ Changed areas:
86
+ - docs: docs/password-reset.md
87
+ - tests: tests/password-reset.test.ts
88
+ - unknown: src/password-reset.ts
89
+
90
+ Risk: medium
91
+ - Dependency, backend, or unknown files changed.
92
+
93
+ Verification Evidence:
94
+ 1 passed, 0 failed
95
+
96
+ Latest snapshot:
97
+ - Note: Initial implementation verified
98
+ - Risk: medium
99
+ - Changed files: 3
100
+
101
+ Review readiness: Ready for review
102
+
103
+ Next action:
104
+ Generate a proof report with agentflight report
105
+ ```
106
+
107
+ ## 6. Generate Review Artifacts
108
+
109
+ ```bash
110
+ npx agentflight@latest report
111
+ npx agentflight@latest replay
112
+ ```
113
+
114
+ The Markdown report gives a reviewer the task, session metadata, changed files, risk summary, timeline, verification evidence, and recommendation.
115
+
116
+ The HTML replay gives a browser-friendly timeline:
117
+
118
+ ```text
119
+ session_started
120
+ verification_started
121
+ verification_passed
122
+ snapshot_created
123
+ report_generated
124
+ replay_generated
125
+ ```
126
+
127
+ Both artifacts stay local under `.agentflight/reports/`.
128
+
129
+ ## 7. Resume Safely
130
+
131
+ ```bash
132
+ npx agentflight@latest resume
133
+ ```
134
+
135
+ The resume prompt includes:
136
+
137
+ - original task
138
+ - current state
139
+ - changed files
140
+ - risk
141
+ - latest snapshot note
142
+ - verification state
143
+ - next recommended action
144
+ - guardrails for the next agent
145
+
146
+ Example guardrails:
147
+
148
+ ```text
149
+ - Stay scoped to the current task.
150
+ - Do not start unrelated work.
151
+ - Do not claim completion without proof.
152
+ - Run relevant verification before declaring success.
153
+ ```
154
+
155
+ ## What This Gives You
156
+
157
+ By the end of the session, you know:
158
+
159
+ - what changed
160
+ - how risky it looks
161
+ - what proof exists
162
+ - what proof is missing
163
+ - whether the work is ready for review
164
+ - how to hand it off without losing context
@@ -0,0 +1,49 @@
1
+ # Roadmap
2
+
3
+ ## MVP
4
+
5
+ - Local CLI package.
6
+ - Safe `.agentflight/` initialization.
7
+ - Session start and current handoff.
8
+ - Changed-file risk summary.
9
+ - Markdown proof report.
10
+ - Self-contained HTML replay.
11
+ - Resume prompt.
12
+ - Doctor checks.
13
+ - Defensive ProjScan and AgentLoopKit adapters.
14
+ - Verification evidence capture.
15
+ - Session events and snapshots.
16
+
17
+ ## Future Free/Open-Core
18
+
19
+ - Basic local session history.
20
+ - Structured `--json` output.
21
+ - Richer configurable verification command profiles.
22
+ - More precise git diff stats without including full code by default.
23
+ - Timeline filtering and richer local replay navigation.
24
+
25
+ ## Future Pro
26
+
27
+ - Searchable local session history.
28
+ - Advanced replay exports.
29
+ - Multi-repo workspace.
30
+ - Custom report themes.
31
+ - Advanced policy packs.
32
+ - Agent mistake analytics.
33
+
34
+ ## Future Team
35
+
36
+ - GitHub PR comments.
37
+ - Shared policies.
38
+ - Team dashboard.
39
+ - Audit trail.
40
+ - Required verification gates.
41
+ - Slack, Linear, and Jira integrations.
42
+
43
+ ## Future Enterprise
44
+
45
+ - Self-hosted deployment.
46
+ - SSO.
47
+ - Compliance exports.
48
+ - Custom risk rules.
49
+ - Support.
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "agentflight",
3
- "version": "0.3.0",
3
+ "version": "0.3.1",
4
4
  "description": "Local-first flight recorder for AI coding agents.",
5
5
  "type": "module",
6
6
  "main": "./dist/cli.js",
@@ -11,6 +11,12 @@
11
11
  "files": [
12
12
  "dist",
13
13
  "README.md",
14
+ "CHANGELOG.md",
15
+ "docs/assets/agentflight-replay-timeline.png",
16
+ "docs/development/dogfooding.md",
17
+ "docs/development/release.md",
18
+ "docs/examples/basic-agentflight-session.md",
19
+ "docs/roadmap.md",
14
20
  "LICENSE"
15
21
  ],
16
22
  "scripts": {
@@ -34,8 +40,13 @@
34
40
  "keywords": [
35
41
  "ai",
36
42
  "coding-agents",
43
+ "codex",
44
+ "claude-code",
37
45
  "cli",
38
46
  "developer-tools",
47
+ "agentic-coding",
48
+ "verification",
49
+ "code-review",
39
50
  "local-first",
40
51
  "observability"
41
52
  ],