npm - @simplysm/sd-claude - Versions diffs - 13.0.41 → 13.0.42 - Mend

@simplysm/sd-claude 13.0.41 → 13.0.42

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/claude/skills/sd-skill/SKILL.md +13 -1
package/package.json +1 -1

package/claude/skills/sd-skill/SKILL.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 name: sd-skill
 description: Use when creating new skills, editing existing skills, or verifying skills work before deployment
-model: sonnet
+model: opus
 ---
 # Writing Skills
@@ -401,6 +401,15 @@ Different skill types need different test approaches:
 - Variation scenarios: Do they handle edge cases?
 - Missing information tests: Do instructions have gaps?
+**How to test:** Give a subagent a problem the technique solves, WITHOUT the skill. Observe what approach they use naturally. Then give the SAME problem WITH the skill and verify they apply the technique correctly.
+```
+Example: Testing a "condition-based-waiting" skill
+1. Ask subagent: "Fix this flaky test that uses setTimeout(500)"
+2. WITHOUT skill: Agent increases timeout to 2000ms (wrong approach)
+3. WITH skill: Agent replaces with polling/condition check (correct)
+```
 **Success criteria:** Agent successfully applies technique to new scenario
 ### Pattern Skills (mental models)
@@ -437,6 +446,7 @@ Different skill types need different test approaches:
 | "I'm confident it's good" | Overconfidence guarantees issues. Test anyway. |
 | "Academic review is enough" | Reading ≠ using. Test application scenarios. |
 | "No time to test" | Deploying untested skill wastes more time fixing it later. |
+| "I already know the baseline failures" | You know what YOU think the failures are. Run a subagent to see what ACTUALLY happens. Knowledge ≠ observation. |
 **All of these mean: Test before deploying. No exceptions.**
@@ -527,6 +537,8 @@ Run pressure scenario with subagent WITHOUT the skill. Document exact behavior:
 This is "watch the test fail" - you must see what agents naturally do before writing the skill.
+**You MUST actually run a subagent.** Do not substitute your own knowledge of "what agents would probably do." Your prediction of baseline behavior ≠ observed baseline behavior. Run the subagent, read the output, document what actually happened.
 ### GREEN: Write Minimal Skill
 Write skill that addresses those specific rationalizations. Don't add extra content for hypothetical cases.

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@simplysm/sd-claude",
-  "version": "13.0.41",
+  "version": "13.0.42",
   "description": "Simplysm Claude Code CLI — asset installer and cross-platform npx wrapper",
   "author": "김석래",
   "license": "Apache-2.0",