npm - agentic-qe - Versions diffs - 3.6.9 → 3.6.10 - Mend

agentic-qe 3.6.9 → 3.6.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (55) hide show

package/.claude/skills/.validation/schemas/skill-eval.schema.json CHANGED Viewed

@@ -167,6 +167,11 @@
           "type": "string",
           "description": "Reason for skipping"
         },
+        "negative_control": {
+          "type": "boolean",
+          "default": false,
+          "description": "When true, grading logic inverts: test passes when must_contain items are ABSENT (skill correctly declines irrelevant prompts)"
+        },
         "input": {
           "$ref": "#/$defs/test_input"
         },
@@ -324,6 +329,11 @@
           "default": false,
           "description": "Allow partial matches"
         },
+        "adaptive_rubric": {
+          "type": "boolean",
+          "default": false,
+          "description": "When true, dynamically extracts keywords from test prompt (quoted strings, format words, standards) and adds them to must_contain checks"
+        },
         "grading_rubric": {
           "type": "object",
           "properties": {
@@ -331,7 +341,7 @@
             "accuracy": { "type": "number", "minimum": 0, "maximum": 1 },
             "actionability": { "type": "number", "minimum": 0, "maximum": 1 }
           },
-          "description": "Weighted grading rubric (weights should sum to 1.0)"
+          "description": "Weighted grading rubric (weights should sum to 1.0). Computes sub-scores: completeness (must_contain match ratio), accuracy (1 - violation ratio), actionability (code blocks, steps, recommendations)"
         }
       }
     },

package/.claude/skills/pr-review/SKILL.md CHANGED Viewed

@@ -24,8 +24,8 @@ Read the complete diff and PR description. Do not skim — read every changed fi
 ### 2. Scope Check
 - Only analyze AQE/QE skills (NOT Claude Flow platform skills)
-- Platform skills to EXCLUDE: v3-*, flow-nexus-*, agentdb-*, reasoningbank-*, swarm-advanced, swarm-orchestration
-- If the PR touches skills, verify the count/scope matches expectations (~63 AQE skills)
+- Platform skills to EXCLUDE: v3-*, flow-nexus-*, agentdb-*, reasoningbank-*, swarm-*, github-*, hive-mind-advanced, hooks-automation, iterative-loop, stream-chain, skill-builder, sparc-methodology, pair-programming, release, debug-loop, aqe-v2-v3-migration
+- If the PR touches skills, verify the count/scope matches expectations (~75 AQE skills)
 - Flag any platform skill changes that may have leaked into an AQE-focused PR
 ### 3. Summarize Changes