@ara-commons/ara-skills 0.3.0 → 0.4.0

@@ -148,6 +148,7 @@ For each claim's Falsification criteria field:
 
  - **Over-claiming**: Does any Statement use universal scope markers ("all models", "any dataset", "state-of-the-art across all") while cited experiments cover only specific, narrow conditions? The gap must be substantial.
  - **Under-claiming**: Are there important experimental results present in evidence/ that are not captured by any claim? (Evidence without a corresponding claim.)
+ - **Attribution vs mechanism**: Does any Statement merely name *which* components of this one system rank highest/lowest (load-bearing, dominant, decorative, inert) without stating what that ranking *reveals*? Apply the name-deletion test — strike the system's component names; if no transferable relationship survives, the Statement is attribution, not insight. Flag as `major` (the claim is a league table of this system, not a reusable finding); suggest the generalization the ranking licenses.
  - **Assumption explicitness**: Are key assumptions stated in problem.md (Assumptions section) or constraints.md? Are there unstated assumptions implied by the experimental design?
  - **Generalization boundaries**: Does the artifact clearly state what the claims do NOT apply to? Check constraints.md and limitations in the exploration tree.
  - **Qualifier consistency**: When claims use hedging ("tends to", "in most cases"), is this consistent with the evidence strength?
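The name-deletion test added in this hunk can be pictured with a rough sketch. This is only an illustration of the idea, not how the rigor-reviewer skill performs the check (the diff shows it as reviewer guidance, not code). The function name `nameDeletionTest`, the ranking-word list, and the relational-cue list are all hypothetical and would need tuning for real use.

```javascript
// Hypothetical sketch of the name-deletion test: strike component names
// and ranking words, then see whether any relational language survives.
// Both word lists are illustrative assumptions, not part of the package.
const RANKING_WORDS = ['load-bearing', 'dominant', 'decorative', 'inert', 'highest', 'lowest'];
const RELATIONAL_CUES = ['because', 'when', 'whenever', 'implies', 'depends on', 'scales with', 'only if'];

// Escape characters that have special meaning inside a regular expression.
function escapeRegExp(text) {
  return text.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
}

function nameDeletionTest(statement, componentNames) {
  let residue = statement.toLowerCase();
  // Delete every component name and ranking word from the statement.
  for (const term of [...componentNames, ...RANKING_WORDS]) {
    residue = residue.replace(new RegExp(escapeRegExp(term.toLowerCase()), 'g'), ' ');
  }
  // Whatever relational cues remain hint at a transferable relationship.
  const survivingCues = RELATIONAL_CUES.filter((cue) => residue.includes(cue));
  const isMechanism = survivingCues.length > 0;
  return {
    verdict: isMechanism ? 'mechanism' : 'attribution',
    severity: isMechanism ? null : 'major', // severity taken from the added bullet
    survivingCues,
  };
}
```

Under these assumptions, a Statement such as "The retriever is load-bearing; the reranker is decorative." leaves no relational cue once the names are struck and would be flagged as attribution, while a Statement that explains why the ranking holds would pass. Simple substring matching is a deliberate simplification; a human or model reviewer would judge the surviving relationship, not keywords.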
@@ -65,6 +65,7 @@ resolution, field presence, YAML parsing) is handled entirely by Level 1.
  |-------|---------------|-----------------|
  | Over-claiming | Statement uses universal scope while evidence covers narrow conditions | critical if extreme, major if moderate |
  | Under-claiming | Evidence files or experiment results not captured by any claim | minor |
+ | Attribution vs mechanism | Statement names which components rank where (name-deletion test leaves nothing transferable) instead of what the ranking reveals | major |
  | Assumption explicitness | Key assumptions stated in problem.md or constraints.md | major if unstated assumptions affect validity |
  | Generalization boundaries | Artifact states what claims do NOT apply to | minor |
  | Qualifier consistency | Hedging language matches evidence strength | minor |
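The severity column of the table in this hunk, including the new row, can be read as a small lookup. The sketch below is one hypothetical way to encode it; the key spellings, the `severityFor` helper, and the context flags `extreme` and `affectsValidity` are assumptions made for illustration and do not appear in the package.

```javascript
// Hypothetical lookup mirroring the severity column of the table above.
// Keys and context flags (extreme, affectsValidity) are illustrative names.
const SEVERITY_RULES = new Map([
  ['over-claiming', (ctx) => (ctx.extreme ? 'critical' : 'major')],
  ['under-claiming', () => 'minor'],
  ['attribution-vs-mechanism', () => 'major'],
  // The table only specifies the case where validity is affected;
  // null marks the unspecified case rather than guessing a level.
  ['assumption-explicitness', (ctx) => (ctx.affectsValidity ? 'major' : null)],
  ['generalization-boundaries', () => 'minor'],
  ['qualifier-consistency', () => 'minor'],
]);

function severityFor(check, ctx = {}) {
  const rule = SEVERITY_RULES.get(check);
  if (!rule) throw new Error(`Unknown check: ${check}`);
  return rule(ctx);
}
```

For instance, `severityFor('attribution-vs-mechanism')` yields `'major'`, matching the row added in 0.4.0, and `severityFor('over-claiming', { extreme: true })` yields `'critical'`.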
package/src/index.js CHANGED
@@ -17,7 +17,7 @@ ${chalk.bold('Usage:')}
 
  ${chalk.bold('Install options:')}
  --all Install every skill (default if no --skill given)
- --skill <id> Install one skill (repeatable). Ids: compiler, research-manager, rigor-reviewer
+ --skill <id> Install one skill (repeatable). Ids: compiler, research-manager, rigor-reviewer, research-visualizer
  --agent <id> Target one agent (repeatable). Default: auto-detect, else claude-code
  --local Install into ./<agent>/skills instead of $HOME
  --force Overwrite existing installations
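The help text above describes repeatable `--skill` and `--agent` flags and an install-everything default. As a minimal sketch of how such options could be parsed, the code below uses Node's built-in `util.parseArgs`. It is not the package's actual parser, which this diff does not show; `parseInstallArgs` and `SKILL_IDS` are hypothetical names, and agent auto-detection is left out.

```javascript
const { parseArgs } = require('node:util');

// Skill ids as listed in the 0.4.0 help text.
const SKILL_IDS = ['compiler', 'research-manager', 'rigor-reviewer', 'research-visualizer'];

// Hypothetical helper: argv holds only the option tokens (no subcommand).
function parseInstallArgs(argv) {
  const { values } = parseArgs({
    args: argv,
    options: {
      all: { type: 'boolean' },
      skill: { type: 'string', multiple: true }, // repeatable
      agent: { type: 'string', multiple: true }, // repeatable
      local: { type: 'boolean' },
      force: { type: 'boolean' },
    },
    strict: true, // reject unknown flags
  });

  const requested = values.skill ?? [];
  const unknown = requested.filter((id) => !SKILL_IDS.includes(id));
  if (unknown.length > 0) {
    throw new Error(`Unknown skill id(s): ${unknown.join(', ')}`);
  }

  return {
    // --all, or no --skill at all, means every skill.
    skills: values.all || requested.length === 0 ? [...SKILL_IDS] : requested,
    // An empty list here stands for "auto-detect, else claude-code".
    agents: values.agent ?? [],
    local: Boolean(values.local),
    force: Boolean(values.force),
  };
}
```

Calling `parseInstallArgs(['--skill', 'research-visualizer', '--force'])` would select only the newly listed skill with overwriting enabled, while an empty argument list falls back to all four ids.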