npm - agentweaver - Versions diffs - 0.1.3 → 0.1.5 - Mend

agentweaver 0.1.3 → 0.1.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (33) hide show

package/README.md +48 -14
package/dist/artifacts.js +86 -3
package/dist/executors/verify-build-executor.js +110 -9
package/dist/index.js +170 -18
package/dist/interactive-ui.js +525 -33
package/dist/pipeline/checks.js +5 -0
package/dist/pipeline/context.js +1 -0
package/dist/pipeline/declarative-flow-runner.js +16 -0
package/dist/pipeline/flow-specs/auto.json +191 -3
package/dist/pipeline/flow-specs/bug-analyze.json +140 -0
package/dist/pipeline/flow-specs/bug-fix.json +44 -0
package/dist/pipeline/flow-specs/implement.json +12 -0
package/dist/pipeline/flow-specs/mr-description.json +89 -0
package/dist/pipeline/flow-specs/plan.json +52 -0
package/dist/pipeline/flow-specs/preflight.json +32 -0
package/dist/pipeline/flow-specs/review-fix.json +79 -1
package/dist/pipeline/flow-specs/review.json +79 -0
package/dist/pipeline/flow-specs/run-linter-loop.json +149 -0
package/dist/pipeline/flow-specs/run-tests-loop.json +149 -0
package/dist/pipeline/flow-specs/task-describe.json +89 -0
package/dist/pipeline/node-registry.js +19 -0
package/dist/pipeline/nodes/flow-run-node.js +40 -0
package/dist/pipeline/nodes/review-findings-form-node.js +65 -0
package/dist/pipeline/nodes/user-input-node.js +93 -0
package/dist/pipeline/nodes/verify-build-node.js +1 -0
package/dist/pipeline/prompt-registry.js +6 -1
package/dist/pipeline/spec-compiler.js +13 -0
package/dist/pipeline/spec-validator.js +12 -0
package/dist/pipeline/value-resolver.js +49 -4
package/dist/prompts.js +46 -14
package/dist/structured-artifacts.js +272 -0
package/dist/user-input.js +171 -0
package/package.json +1 -1

package/README.md CHANGED Viewed

@@ -11,9 +11,10 @@ The package is designed to run as an npm CLI and includes an interactive termina
 ## What It Does
 - Fetches a Jira issue by key or browse URL
-- Generates workflow artifacts such as design, implementation plan, QA plan, reviews, and summaries
-- Runs workflow stages like `plan`, `implement`, `review`, `review-fix`, `test`, and `auto`
-- Persists `auto` pipeline state on disk so runs can resume
+- Generates workflow artifacts such as design, implementation plan, QA plan, bug analysis, reviews, and summaries
+- Machine-readable JSON artifacts are stored under `.agentweaver-<TASK>/.artifacts/` and act as the source of truth between workflow steps; Markdown artifacts remain for human inspection
+- Runs workflow stages like `bug-analyze`, `bug-fix`, `mr-description`, `plan`, `task-describe`, `implement`, `review`, `review-fix`, `test`, and `auto`
+- Persists compact `auto` pipeline state on disk so runs can resume without storing large agent outputs
 - Uses Docker runtime services for isolated Codex execution and build verification
 ## Architecture
@@ -23,7 +24,7 @@ The CLI now uses an executor + node + declarative flow architecture.
 - `src/index.ts` remains the CLI entrypoint and high-level orchestration layer
 - `src/executors/` contains first-class executors for external actions such as Jira fetch, local Codex, Docker-based build verification, Claude, Claude summaries, and process execution
 - `src/pipeline/nodes/` contains reusable runtime nodes built on top of executors
-- `src/pipeline/flow-specs/` contains declarative JSON flow specs for `preflight`, `plan`, `implement`, `review`, `review-fix`, `test`, `test-fix`, `test-linter-fix`, and `auto`
+- `src/pipeline/flow-specs/` contains declarative JSON flow specs for `preflight`, `bug-analyze`, `bug-fix`, `mr-description`, `plan`, `task-describe`, `implement`, `review`, `review-fix`, `test`, `test-fix`, `test-linter-fix`, `run-tests-loop`, `run-linter-loop`, and `auto`
 - `src/runtime/` contains shared runtime services such as command resolution, Docker runtime environment setup, and subprocess execution
 This keeps command handlers focused on choosing a flow and providing parameters instead of assembling prompts and subprocess wiring inline.
@@ -41,7 +42,9 @@ This keeps command handlers focused on choosing a flow and providing parameters
 - `src/runtime/` — shared runtime services used by executors
 - `docker-compose.yml` — runtime services for Codex and build verification
 - `Dockerfile.codex` — container image for Codex runtime
-- `verify_build.sh` — project-specific verification entrypoint used by `verify-build`
+- `verify_build.sh` — aggregated verification entrypoint used by `verify-build`
+- `run_tests.sh` — isolated test and coverage verification entrypoint
+- `run_linter.sh` — isolated generate + lint verification entrypoint
 - `package.json` — npm package metadata and scripts
 - `tsconfig.json` — TypeScript configuration
@@ -50,7 +53,7 @@ This keeps command handlers focused on choosing a flow and providing parameters
 - Node.js `>= 18.19.0`
 - npm
 - Docker with `docker compose` or `docker-compose`
-- `codex` CLI for `plan` and Codex-driven steps
+- `codex` CLI for `bug-analyze`, `bug-fix`, `mr-description`, `plan`, and other Codex-driven steps
 - `claude` CLI for review and summary steps
 ## Installation
@@ -112,8 +115,14 @@ Direct CLI usage:
 ```bash
 agentweaver plan DEMO-3288
+agentweaver bug-analyze DEMO-3288
+agentweaver bug-fix DEMO-3288
+agentweaver mr-description DEMO-3288
+agentweaver task-describe DEMO-3288
 agentweaver implement DEMO-3288
 agentweaver review DEMO-3288
+agentweaver run-tests-loop DEMO-3288
+agentweaver run-linter-loop DEMO-3288
 agentweaver auto DEMO-3288
 ```
@@ -121,6 +130,10 @@ From source checkout:
 ```bash
 node dist/index.js plan DEMO-3288
+node dist/index.js bug-analyze DEMO-3288
+node dist/index.js bug-fix DEMO-3288
+node dist/index.js mr-description DEMO-3288
+node dist/index.js task-describe DEMO-3288
 node dist/index.js auto DEMO-3288
 ```
@@ -145,28 +158,35 @@ agentweaver auto-status DEMO-3288
 agentweaver auto-reset DEMO-3288
 ```
+Notes:
+- `--verbose` streams child process `stdout/stderr` in direct CLI mode
+- the interactive `Activity` pane is intentionally structured: it shows launch separators, prompts, summaries, and short status messages instead of raw Codex/Claude logs by default
 ## Interactive TUI
 Interactive mode opens a full-screen terminal UI with:
-- command input
+- flow list
+- current flow progress
 - activity log
 - task summary pane
-- command list/help
 - keyboard navigation between panes
 Current navigation:
-- `Enter` — run command
+- `Enter` — run selected flow
 - `Tab` / `Shift+Tab` — switch panes
-- `Ctrl+J` — focus activity log
-- `Ctrl+K` — focus command input
-- `Ctrl+U` — focus task summary
-- `Ctrl+H` — focus commands pane
 - `PgUp` / `PgDn` / `Home` / `End` — scroll focused panes
-- `?` or `F1` — help overlay
+- `h` — help overlay
 - `q` or `Ctrl+C` — exit
+Activity pane behavior:
+- each external launch is separated with a framed block that shows the current `node`, `executor`, and `model` when available
+- prompts and summaries are rendered as plain text for readability
+- live raw Codex/Claude output is not shown there in normal mode
 ## Docker Runtime
 Docker is used as an isolated execution environment for Codex and build/test verification.
@@ -176,6 +196,8 @@ Main services:
 - `codex` — interactive Codex container
 - `codex-exec` — non-interactive `codex exec`
 - `verify-build` — project verification script inside container
+- `run-tests` — isolated `run_tests.sh` execution inside container
+- `run-linter` — isolated `run_linter.sh` execution inside container
 - `codex-login` — interactive login container
 - `dockerd` — internal Docker daemon for testcontainers/build flows
@@ -205,6 +227,18 @@ Build verification:
 PROJECT_DIR="$PWD" docker compose -f "$AGENTWEAVER_HOME/docker-compose.yml" run --rm verify-build
 ```
+Tests only:
+```bash
+PROJECT_DIR="$PWD" docker compose -f "$AGENTWEAVER_HOME/docker-compose.yml" run --rm run-tests
+```
+Linter only:
+```bash
+PROJECT_DIR="$PWD" docker compose -f "$AGENTWEAVER_HOME/docker-compose.yml" run --rm run-linter
+```
 ## Development
 Install dependencies and build:

package/dist/artifacts.js CHANGED Viewed

@@ -11,37 +11,120 @@ export function taskWorkspaceDir(taskKey) {
 export function ensureTaskWorkspaceDir(taskKey) {
     const workspaceDir = taskWorkspaceDir(taskKey);
     mkdirSync(workspaceDir, { recursive: true });
+    mkdirSync(taskArtifactsDir(taskKey), { recursive: true });
     return workspaceDir;
 }
 export function taskWorkspaceFile(taskKey, fileName) {
     return path.join(taskWorkspaceDir(taskKey), fileName);
 }
+export function taskArtifactsDir(taskKey) {
+    return path.join(taskWorkspaceDir(taskKey), ".artifacts");
+}
+export function taskArtifactsFile(taskKey, fileName) {
+    return path.join(taskArtifactsDir(taskKey), fileName);
+}
 export function artifactFile(prefix, taskKey, iteration) {
     return taskWorkspaceFile(taskKey, `${prefix}-${taskKey}-${iteration}.md`);
 }
+export function artifactJsonFile(prefix, taskKey, iteration) {
+    return taskArtifactsFile(taskKey, `${prefix}-${taskKey}-${iteration}.json`);
+}
 export function designFile(taskKey) {
     return artifactFile("design", taskKey, 1);
 }
+export function designJsonFile(taskKey) {
+    return artifactJsonFile("design", taskKey, 1);
+}
 export function planFile(taskKey) {
     return artifactFile("plan", taskKey, 1);
 }
+export function planJsonFile(taskKey) {
+    return artifactJsonFile("plan", taskKey, 1);
+}
+export function bugAnalyzeFile(taskKey) {
+    return taskWorkspaceFile(taskKey, `bug-analyze-${taskKey}.md`);
+}
+export function bugAnalyzeJsonFile(taskKey) {
+    return taskArtifactsFile(taskKey, `bug-analyze-${taskKey}.json`);
+}
+export function bugFixDesignFile(taskKey) {
+    return taskWorkspaceFile(taskKey, `bug-fix-design-${taskKey}.md`);
+}
+export function bugFixDesignJsonFile(taskKey) {
+    return taskArtifactsFile(taskKey, `bug-fix-design-${taskKey}.json`);
+}
+export function bugFixPlanFile(taskKey) {
+    return taskWorkspaceFile(taskKey, `bug-fix-plan-${taskKey}.md`);
+}
+export function bugFixPlanJsonFile(taskKey) {
+    return taskArtifactsFile(taskKey, `bug-fix-plan-${taskKey}.json`);
+}
 export function qaFile(taskKey) {
     return artifactFile("qa", taskKey, 1);
 }
+export function qaJsonFile(taskKey) {
+    return artifactJsonFile("qa", taskKey, 1);
+}
 export function taskSummaryFile(taskKey) {
     return artifactFile("task", taskKey, 1);
 }
+export function taskSummaryJsonFile(taskKey) {
+    return artifactJsonFile("task", taskKey, 1);
+}
 export function readyToMergeFile(taskKey) {
     return taskWorkspaceFile(taskKey, READY_TO_MERGE_FILE);
 }
 export function jiraTaskFile(taskKey) {
-    return taskWorkspaceFile(taskKey, `${taskKey}.json`);
+    return taskArtifactsFile(taskKey, `${taskKey}.json`);
+}
+export function jiraDescriptionFile(taskKey) {
+    return taskWorkspaceFile(taskKey, `jira-${taskKey}-description.md`);
+}
+export function jiraDescriptionJsonFile(taskKey) {
+    return taskArtifactsFile(taskKey, `jira-${taskKey}-description.json`);
+}
+export function mrDescriptionFile(taskKey) {
+    return taskWorkspaceFile(taskKey, `mr-description-${taskKey}.md`);
+}
+export function mrDescriptionJsonFile(taskKey) {
+    return taskArtifactsFile(taskKey, `mr-description-${taskKey}.json`);
 }
 export function autoStateFile(taskKey) {
-    return taskWorkspaceFile(taskKey, `.agentweaver-state-${taskKey}.json`);
+    return taskArtifactsFile(taskKey, `.agentweaver-state-${taskKey}.json`);
 }
 export function planArtifacts(taskKey) {
-    return [designFile(taskKey), planFile(taskKey), qaFile(taskKey)];
+    return [designFile(taskKey), designJsonFile(taskKey), planFile(taskKey), planJsonFile(taskKey), qaFile(taskKey), qaJsonFile(taskKey)];
+}
+export function bugAnalyzeArtifacts(taskKey) {
+    return [
+        bugAnalyzeFile(taskKey),
+        bugAnalyzeJsonFile(taskKey),
+        bugFixDesignFile(taskKey),
+        bugFixDesignJsonFile(taskKey),
+        bugFixPlanFile(taskKey),
+        bugFixPlanJsonFile(taskKey),
+    ];
+}
+export function reviewFile(taskKey, iteration) {
+    return artifactFile("review", taskKey, iteration);
+}
+export function reviewJsonFile(taskKey, iteration) {
+    return artifactJsonFile("review", taskKey, iteration);
+}
+export function reviewReplyFile(taskKey, iteration) {
+    return artifactFile("review-reply", taskKey, iteration);
+}
+export function reviewReplyJsonFile(taskKey, iteration) {
+    return artifactJsonFile("review-reply", taskKey, iteration);
+}
+export function reviewFixFile(taskKey, iteration) {
+    return artifactFile("review-fix", taskKey, iteration);
+}
+export function reviewFixJsonFile(taskKey, iteration) {
+    return artifactJsonFile("review-fix", taskKey, iteration);
+}
+export function reviewFixSelectionJsonFile(taskKey, iteration) {
+    return artifactJsonFile("review-fix-selection", taskKey, iteration);
 }
 export function requireArtifacts(paths, message) {
     const missing = paths.filter((filePath) => !existsSync(filePath));

package/dist/executors/verify-build-executor.js CHANGED Viewed

@@ -1,22 +1,123 @@
 import { verifyBuildExecutorDefaultConfig } from "./configs/verify-build-config.js";
+import { TaskRunnerError } from "../errors.js";
 import { processExecutor } from "./process-executor.js";
+function parseStructuredResult(output, service) {
+    const lines = output
+        .split(/\r?\n/)
+        .map((line) => line.replace(/\u001b\[[0-9;]*m/g, "").trim())
+        .filter(Boolean);
+    if (lines.length === 0) {
+        throw new TaskRunnerError(`Structured result is missing from service '${service}' output.`);
+    }
+    for (let index = lines.length - 1; index >= 0; index -= 1) {
+        const line = lines[index];
+        if (!line) {
+            continue;
+        }
+        const candidates = [];
+        if (line.startsWith("{") && line.endsWith("}")) {
+            candidates.push(line);
+        }
+        const firstBrace = line.indexOf("{");
+        const lastBrace = line.lastIndexOf("}");
+        if (firstBrace >= 0 && lastBrace > firstBrace) {
+            const slice = line.slice(firstBrace, lastBrace + 1).trim();
+            if (slice && !candidates.includes(slice)) {
+                candidates.push(slice);
+            }
+        }
+        for (const rawJson of candidates) {
+            let parsed;
+            try {
+                parsed = JSON.parse(rawJson);
+            }
+            catch {
+                continue;
+            }
+            if (!parsed || typeof parsed !== "object" || Array.isArray(parsed)) {
+                continue;
+            }
+            const candidate = parsed;
+            if (typeof candidate.ok !== "boolean" ||
+                typeof candidate.kind !== "string" ||
+                typeof candidate.stage !== "string" ||
+                typeof candidate.exitCode !== "number" ||
+                typeof candidate.summary !== "string" ||
+                typeof candidate.command !== "string") {
+                continue;
+            }
+            const details = candidate.details;
+            if (details !== undefined && (!details || typeof details !== "object" || Array.isArray(details))) {
+                continue;
+            }
+            return {
+                ok: candidate.ok,
+                kind: candidate.kind,
+                stage: candidate.stage,
+                exitCode: candidate.exitCode,
+                summary: candidate.summary,
+                command: candidate.command,
+                details: details ?? {},
+            };
+        }
+    }
+    throw new TaskRunnerError(`Structured result is missing or invalid in service '${service}' output.`);
+}
 export const verifyBuildExecutor = {
     kind: "verify-build",
     version: 1,
     defaultConfig: verifyBuildExecutorDefaultConfig,
     async execute(context, input, config) {
         const composeCommand = context.runtime.resolveDockerComposeCmd();
-        const result = await processExecutor.execute(context, {
-            argv: [...composeCommand, config.composeFileFlag, input.dockerComposeFile, ...config.runArgs, config.service],
-            env: context.runtime.dockerRuntimeEnv(),
-            verbose: config.verbose,
-            label: config.service,
-        }, {
-            printFailureOutput: config.printFailureOutput,
-        });
+        const service = input.service ?? config.service;
+        if (context.dryRun) {
+            await processExecutor.execute(context, {
+                argv: [...composeCommand, config.composeFileFlag, input.dockerComposeFile, ...config.runArgs, service],
+                env: context.runtime.dockerRuntimeEnv(),
+                verbose: config.verbose,
+                label: service,
+            }, {
+                printFailureOutput: config.printFailureOutput,
+            });
+            return {
+                output: "",
+                composeCommand,
+                parsed: {
+                    ok: true,
+                    kind: service,
+                    stage: "dry_run",
+                    exitCode: 0,
+                    summary: `Dry run for service '${service}'`,
+                    command: [...composeCommand, config.composeFileFlag, input.dockerComposeFile, ...config.runArgs, service].join(" "),
+                    details: {},
+                },
+            };
+        }
+        let output = "";
+        let exitCode = 0;
+        try {
+            const result = await processExecutor.execute(context, {
+                argv: [...composeCommand, config.composeFileFlag, input.dockerComposeFile, ...config.runArgs, service],
+                env: context.runtime.dockerRuntimeEnv(),
+                verbose: config.verbose,
+                label: service,
+            }, {
+                printFailureOutput: config.printFailureOutput,
+            });
+            output = result.output;
+        }
+        catch (error) {
+            output = String(error.output ?? "");
+            exitCode = Number(error.returnCode ?? 1);
+        }
+        const parsed = parseStructuredResult(output, service);
+        if (parsed.exitCode !== exitCode && exitCode !== 0) {
+            throw new TaskRunnerError(`Structured result exit code mismatch for service '${service}': script=${parsed.exitCode}, runtime=${exitCode}.`);
+        }
         return {
-            output: result.output,
+            output,
             composeCommand,
+            parsed,
         };
     },
 };