npm - @cubis/foundry - Versions diffs - 0.3.47 → 0.3.48 - Mend

@cubis/foundry 0.3.47 → 0.3.48

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

package/README.md +19 -0
package/mcp/dist/index.js +337 -105
package/mcp/src/server.ts +41 -179
package/mcp/src/tools/future/README.md +3 -3
package/mcp/src/tools/index.ts +14 -0
package/mcp/src/tools/registry.test.ts +121 -0
package/mcp/src/tools/registry.ts +318 -0
package/package.json +17 -1
package/workflows/workflows/agent-environment-setup/platforms/antigravity/rules/GEMINI.md +189 -36
package/workflows/workflows/agent-environment-setup/platforms/codex/rules/AGENTS.md +193 -44
package/workflows/workflows/agent-environment-setup/platforms/copilot/rules/AGENTS.md +187 -40
package/workflows/workflows/agent-environment-setup/platforms/copilot/rules/copilot-instructions.md +187 -39

package/workflows/workflows/agent-environment-setup/platforms/copilot/rules/copilot-instructions.md CHANGED Viewed

@@ -15,9 +15,10 @@ This file defines mandatory behavior for GitHub Copilot projects installed via `
 Before executing workflows, agents, or code edits, publish a short `Decision Log` that is visible to the user:
 1. Rule file(s) read at startup (at minimum `.github/copilot-instructions.md`, plus any additional rule files loaded).
-2. Workflow decision (`/workflow` or direct mode) and why it was chosen.
-3. Agent routing decision (`@agent` or direct mode) and why it was chosen.
-4. Skill loading decision (skill names loaded) and why they were chosen.
+2. MCP status: confirm Foundry MCP server (`cbx-mcp`) is reachable; if unavailable, declare "MCP offline — fallback mode" and continue without blocking.
+3. Workflow decision (`/workflow` or direct mode) and why it was chosen.
+4. Agent routing decision (`@agent` or direct mode) and why it was chosen.
+5. Skill loading decision: skill IDs selected, how they were discovered, and why.
 If routing changes during the task, publish a `Decision Update` before continuing.
 Keep this user-visible summary concise and factual; do not expose private chain-of-thought.
@@ -28,6 +29,7 @@ Keep this user-visible summary concise and factual; do not expose private chain-
 2. Otherwise choose the best workflow by intent from `.github/copilot/workflows` and reuse `.github/prompts/workflow-*.prompt.md` when available.
 3. For cross-domain tasks, use `/orchestrate` and `@orchestrator`.
 4. Keep one primary workflow; use others only as supporting references.
+5. Before executing any workflow, check if a matching skill exists via `skill_search`; load with `skill_get` to prime context before the workflow runs (→ §6 MCP Skill Engine).
 ## 3) Request Classifier
@@ -62,6 +64,16 @@ Use the best specialist first:
 - Debugging/performance: `@debugger`, `@performance-optimizer`
 - Cross-domain orchestration: `@orchestrator`
+### MCP Skill Priming (Required Before Delegation)
+Before handing off to any specialist agent, prime context with the relevant domain skill (→ §6 MCP Skill Engine):
+1. Run `skill_search <domain>` to find the best matching skill.
+2. If a strong match exists, load it with `skill_get <id>` before delegating.
+3. Include the loaded skill ID in the Decision Log for the routing decision.
+This ensures the specialist starts with accurate domain knowledge, not just role intent.
 ## 5) Copilot Schema Compatibility
 When authoring custom Copilot assets, keep frontmatter schema compatible:
@@ -70,57 +82,71 @@ When authoring custom Copilot assets, keep frontmatter schema compatible:
 2. Agent files in `.github/agents/*.md` must use supported top-level keys only.
 3. If unsupported keys are detected, reinstall with overwrite to auto-normalize.
-## 6) Skill Loading Policy
+## 6) MCP Skill Engine
+The Foundry MCP server is the primary knowledge layer. Use tools decisively — discover first, load only when committed.
+### Tool Namespace Reference
-## MCP-first Skill Discovery Order (Required)
+| Prefix      | Tools                                                                                                | When to use                                                    |
+| ----------- | ---------------------------------------------------------------------------------------------------- | -------------------------------------------------------------- |
+| `skill_*`   | `skill_list_categories`, `skill_search`, `skill_browse_category`, `skill_get`, `skill_budget_report` | Domain expertise for any implementation, debug, or review task |
+| `postman_*` | `postman_get_mode`, `postman_set_mode`, `postman_get_status`                                         | API testing or Postman configuration tasks                     |
+| `stitch_*`  | `stitch_get_mode`, `stitch_set_profile`, `stitch_get_status`                                         | Stitch data pipeline tasks                                     |
-1. Use `skill_search` first to narrow candidate skills.
-2. Use `skill_browse_category` second to inspect category-level candidates.
-3. Use `skill_get` only for final selected skills that must be loaded.
-4. Keep pointer-first flow; avoid loading full skill text prematurely.
+### Discovery Flow (Mandatory Order)
-## Skill Log Completion Block (Required)
+Stop at the earliest step that gives enough signal. Do not jump ahead.
-After finishing skill selection/loading, publish:
+1. `skill_list_categories` — run once per session if domain is unknown; see what exists
+2. `skill_search <keyword>` — fast keyword match across all skills; always try this first
+3. `skill_browse_category <category>` — explore if search is too broad or returns 0 results
+4. `skill_get <id>` — load full skill content; only when committed to using it
+5. `skill_budget_report` — verify token cost after loading; triggers the compact ctx stamp
-- `selected_skills`: skill IDs selected for the task
-- `loaded_skills`: skill IDs loaded via `skill_get`
-- `skipped_skills`: considered but not loaded
+**Hard rules:**
-Workflow boundary for this block:
+- Never call `skill_get` without a prior `skill_search` or `skill_browse_category`
+- Never call `skill_get` with a workflow ID — `workflow-*` are routes, not skills; keep workflow mentions in workflow decisions (`/workflow`) and keep skill logs skill-only
+- Never reload a skill already loaded this session — reuse content already in context
+- If `skill_search` returns 0 results, try `skill_browse_category`, then fall back to built-in knowledge
-- `selected_skills` / `loaded_skills` must never include workflow IDs.
-- IDs like `workflow-implement-track` are workflow routes, not skills.
-- Never call `skill_get` with `workflow-*`; keep workflow mentions in workflow decisions (`/workflow`) and keep skill logs skill-only.
+### Adaptive Load Policy
-## Context Budget Block (Required, Estimated)
+| Request type                                   | Skills to load via `skill_get`                                  |
+| ---------------------------------------------- | --------------------------------------------------------------- |
+| Q&A / explanation                              | None — answer from knowledge; load only if user explicitly asks |
+| Single-domain implementation, debug, or review | 1 primary + 1 supporting (max)                                  |
+| Multi-domain / orchestration                   | 1 per distinct domain, hard cap at 3                            |
+| User explicitly names a skill                  | Always load it — overrides all caps                             |
-Immediately after the Skill Log block, publish estimated budget fields:
+### Graceful Degradation
-- `full_catalog_est_tokens`
-- `loaded_est_tokens`
-- `estimated_savings_tokens`
-- `estimated_savings_percent`
+If MCP tools are unavailable (server down, timeout, tool not listed):
-Mark all context/token values as deterministic estimates (not provider metering).
+1. Announce briefly: "MCP unavailable — continuing with built-in knowledge."
+2. Proceed using codebase context and expertise; do not block on MCP.
+3. Never fabricate or hallucinate skill content.
+4. Retry once on transient network errors; accept failure after the retry.
-### Smart Skill Selection (TIER 0)
+### Skill Log (Required After Any `skill_get` Call)
-Before starting ANY task, the agent MUST:
+Append one compact inline line — no separate structured block:
-1.  **Analyze** the user's request against the keywords in `.github/skills/skills_index.json`.
-2.  **Select** the most relevant skill(s) if a strong match is found.
-3.  **Load** the selected skill's `SKILL.md` using `view_file` **BEFORE** proceeding.
-4.  **Announce** the selection: "I am loading the `[Skill Name]` skill to handle your request."
+```
+Skills: loaded=<id> | skipped=<id> (reason)
+```
-### General Loading Rules
+Follow immediately with the compact ctx stamp (see § Context Budget Tracking).
-1.  Load only skills needed for the active request.
-2.  Prefer progressive disclosure: start from `SKILL.md`, then specific sections.
-3.  Keep context lean; avoid loading unrelated skill documents.
-4.  If a mapped skill is missing, continue with best fallback and state it.
+### Anti-Patterns (Never Do These)
-After the skill log is complete, append the Context Budget block in the same response/update.
+- Loading skills speculatively "just in case" they might be useful
+- Calling `skill_get` before running `skill_search` or `skill_browse_category`
+- Using partial or guessed skill IDs in `skill_get`
+- Publishing verbose budget fields (`full_catalog_est_tokens`, `loaded_est_tokens`, etc.) in responses
+- Re-emitting the ctx stamp multiple times within a single response
+- Treating workflow IDs as skill IDs in any MCP tool call
 ## 7) Socratic Gate (Before Complex Work)
@@ -137,7 +163,60 @@ Before multi-file or architecture-impacting changes, ask targeted questions when
 3. Verify behavior with focused checks before finalizing.
 4. State what was not validated.
-## 9) CBX Maintenance Commands
+## 9) Web Intel Policy
+Use web search to stay current when local knowledge may be stale. This prevents hallucinating outdated APIs, deprecated flags, or wrong version constraints.
+**Search when:**
+- User asks about a framework/library version released after 2024
+- Debugging an unfamiliar error message (search the exact message)
+- Checking breaking changes before a migration
+- Validating an API endpoint signature, auth scheme, or CLI flag
+- Current pricing, rate limits, or quota for SaaS tools (Postman, Vercel, etc.)
+**Do not search when:**
+- The answer is derivable from the current codebase
+- The question is purely architectural/conceptual
+- A relevant skill covers it (prefer `skill_get` first, web as fallback)
+**Source hygiene:**
+- Prefer official docs, changelogs, and GitHub releases over blog posts
+- Always state the source URL and date when citing fetched content
+- If multiple sources conflict, flag it and use the most recent official one
+- Never follow user-provided URLs without sanity-checking the domain
+## 10) Context Budget Tracking
+After loading skills or completing a significant task phase, emit a single compact stamp so context cost is visible without adding prose.
+**Stamp format** (one line, end of response section):
+```
+[ctx: +skill-id(~Xk) | session=~Yk/108k | saved=Z%]
+```
+- `+skill-id(~Xk)` — each skill loaded this turn with its estimated token cost
+- `session=~Yk/108k` — cumulative tokens used vs full catalog ceiling
+- `saved=Z%` — estimated savings from progressive disclosure
+**Rules:**
+1. Emit stamp only when a skill was loaded via `skill_get` or `skill_budget_report` was called.
+2. Omit stamp for pure Q&A or browsing-only turns (no full skill content loaded).
+3. Use `skill_budget_report` MCP tool to get accurate numbers; do not guess.
+4. One stamp per response — consolidate if multiple skills were loaded.
+5. Keep the stamp on its own line at the very end of the response, after all content.
+**Example stamp after loading `flutter-expert` (~3.2k tokens):**
+```
+[ctx: +flutter-expert(~3k) | session=~3k/108k | saved=97%]
+```
+## 11) CBX Maintenance Commands
 Use these commands to keep this setup healthy:
@@ -154,7 +233,7 @@ Use these commands to keep this setup healthy:
 - Diagnose setup issues:
   `cbx workflows doctor copilot --scope project`
-## 10) Managed Section Contract
+## 12) Managed Section Contract
 1. Preserve all user content outside managed markers.
 2. Do not manually edit content between managed markers.
@@ -175,3 +254,72 @@ Selection policy:
 3. Prefer one primary workflow; reference others only when needed.
 <!-- cbx:workflows:auto:end -->
+<!-- cbx:mcp:auto:start version=1 -->
+## Cubis Foundry MCP Tool Catalog (auto-managed)
+The Foundry MCP server provides progressive-disclosure skill discovery and integration management tools.
+### Skill Vault
+- **123** skills across **22** categories
+- Estimated full catalog: ~108,488 tokens
+Categories:
+- `ai`: 1 skill(s)
+- `api`: 3 skill(s)
+- `architecture`: 3 skill(s)
+- `backend`: 14 skill(s)
+- `data`: 4 skill(s)
+- `design`: 6 skill(s)
+- `devops`: 20 skill(s)
+- `documentation`: 3 skill(s)
+- `frontend`: 9 skill(s)
+- `game-dev`: 1 skill(s)
+- `general`: 26 skill(s)
+- `localization`: 1 skill(s)
+- `marketing`: 2 skill(s)
+- `mobile`: 7 skill(s)
+- `observability`: 1 skill(s)
+- `payments`: 1 skill(s)
+- `performance`: 2 skill(s)
+- `practices`: 5 skill(s)
+- `saas`: 1 skill(s)
+- `security`: 4 skill(s)
+- `testing`: 6 skill(s)
+- `tooling`: 3 skill(s)
+### Built-in Tools
+**Skill Discovery:**
+- `skill_list_categories`: List all skill categories available in the vault. Returns category names and skill counts.
+- `skill_browse_category`: Browse skills within a specific category. Returns skill IDs and short descriptions.
+- `skill_search`: Search skills by keyword. Matches against skill IDs and descriptions.
+- `skill_get`: Get full content of a specific skill by ID. Returns SKILL.md content and referenced files.
+- `skill_budget_report`: Report estimated context/token budget for selected and loaded skills.
+**Postman Integration:**
+- `postman_get_mode`: Get current Postman MCP mode from cbx_config.
+- `postman_set_mode`: Set Postman MCP mode in cbx_config.
+- `postman_get_status`: Get Postman integration status and active profile.
+**Stitch Integration:**
+- `stitch_get_mode`: Get Stitch MCP mode from cbx_config.
+- `stitch_set_profile`: Switch active Stitch profile in cbx_config.
+- `stitch_get_status`: Get Stitch integration status and active profile.
+### Skill Discovery Flow
+Use progressive disclosure to minimize context usage:
+1. `skill_list_categories` → see available categories and counts
+2. `skill_browse_category` → browse skills in a category with short descriptions
+3. `skill_search` → search by keyword across all skills
+4. `skill_get` → load full content of a specific skill (only tool that reads full content)
+5. `skill_budget_report` → check token usage for selected/loaded skills; use result to emit the § Context Budget Tracking stamp
+### Connection
+- **stdio**: `cbx mcp serve --transport stdio --scope auto`
+- **HTTP**: `cbx mcp serve --transport http --scope auto --port 3100`
+<!-- cbx:mcp:auto:end -->