@rfxlamia/skillkit 1.0.0 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (269) hide show
  1. package/agents/agents/creative-copywriter.md +212 -0
  2. package/agents/agents/dario-amodei.md +135 -0
  3. package/agents/agents/doc-simplifier.md +63 -0
  4. package/agents/agents/kotlin-pro.md +433 -0
  5. package/agents/agents/red-team.md +136 -0
  6. package/agents/agents/sam-altman.md +121 -0
  7. package/agents/agents/seo-manager.md +184 -0
  8. package/package.json +7 -2
  9. package/skills/quick-spec/tests/__pycache__/test_skill.cpython-314-pytest-9.0.2.pyc +0 -0
  10. package/skills/skillkit/.claude/settings.local.json +7 -0
  11. package/skills/skillkit/scripts/__pycache__/decision_helper.cpython-314.pyc +0 -0
  12. package/skills/skillkit/scripts/__pycache__/quick_validate.cpython-312.pyc +0 -0
  13. package/skills/skillkit/scripts/__pycache__/quick_validate.cpython-314.pyc +0 -0
  14. package/skills/skillkit/scripts/__pycache__/test_generator.cpython-314-pytest-9.0.2.pyc +0 -0
  15. package/skills/skillkit/scripts/utils/__pycache__/__init__.cpython-312.pyc +0 -0
  16. package/skills/skillkit/scripts/utils/__pycache__/__init__.cpython-314.pyc +0 -0
  17. package/skills/skillkit/scripts/utils/__pycache__/budget_tracker.cpython-312.pyc +0 -0
  18. package/skills/skillkit/scripts/utils/__pycache__/budget_tracker.cpython-314.pyc +0 -0
  19. package/skills/skillkit/scripts/utils/__pycache__/output_formatter.cpython-312.pyc +0 -0
  20. package/skills/skillkit/scripts/utils/__pycache__/output_formatter.cpython-314.pyc +0 -0
  21. package/skills/skillkit/scripts/utils/__pycache__/reference_validator.cpython-312.pyc +0 -0
  22. package/skills/skillkit/scripts/utils/__pycache__/reference_validator.cpython-314.pyc +0 -0
  23. package/skills/skillkit-help/SKILL.md +81 -0
  24. package/skills/skillkit-help/knowledge/application/09-case-studies.md +257 -0
  25. package/skills/skillkit-help/knowledge/application/12-testing-and-validation.md +276 -0
  26. package/skills/skillkit-help/knowledge/foundation/01-why-skills-exist.md +246 -0
  27. package/skills/skillkit-help/knowledge/foundation/02-skills-vs-subagents-comparison.md +312 -0
  28. package/skills/skillkit-help/knowledge/foundation/03-skills-vs-subagents-decision-tree.md +346 -0
  29. package/skills/skillkit-help/knowledge/foundation/06-platform-constraints.md +237 -0
  30. package/skills/skillkit-help/knowledge/foundation/08-when-not-to-use-skills.md +270 -0
  31. package/skills/skillkit-help/template/SKILL.md +52 -0
  32. package/skills/skills/adversarial-review/SKILL.md +219 -0
  33. package/skills/skills/baby-education/SKILL.md +260 -0
  34. package/skills/skills/baby-education/references/advanced-techniques.md +323 -0
  35. package/skills/skills/baby-education/references/transformations.md +345 -0
  36. package/skills/skills/been-there-done-that/SKILL.md +455 -0
  37. package/skills/skills/been-there-done-that/references/analysis-patterns.md +162 -0
  38. package/skills/skills/been-there-done-that/references/git-commands.md +132 -0
  39. package/skills/skills/been-there-done-that/references/tree-insertion-logic.md +145 -0
  40. package/skills/skills/coolhunter/SKILL.md +270 -0
  41. package/skills/skills/coolhunter/assets/elicitation-methods.csv +51 -0
  42. package/skills/skills/coolhunter/knowledge/elicitation-methods.md +312 -0
  43. package/skills/skills/coolhunter/references/workflow-execution.md +238 -0
  44. package/skills/skills/coolhunter/workflow-plan-coolhunter.md +232 -0
  45. package/skills/skills/creative-copywriting/SKILL.md +324 -0
  46. package/skills/skills/creative-copywriting/databases/README.md +60 -0
  47. package/skills/skills/creative-copywriting/databases/carousel-structures.csv +16 -0
  48. package/skills/skills/creative-copywriting/databases/emotional-arcs.csv +11 -0
  49. package/skills/skills/creative-copywriting/databases/hook-formulas.csv +51 -0
  50. package/skills/skills/creative-copywriting/databases/power-words.csv +201 -0
  51. package/skills/skills/creative-copywriting/databases/psychological-triggers.csv +21 -0
  52. package/skills/skills/creative-copywriting/databases/read-more-patterns.csv +26 -0
  53. package/skills/skills/creative-copywriting/databases/swipe-triggers.csv +31 -0
  54. package/skills/skills/creative-copywriting/references/carousel-psychology.md +223 -0
  55. package/skills/skills/creative-copywriting/references/hook-anatomy.md +169 -0
  56. package/skills/skills/creative-copywriting/references/power-word-science.md +134 -0
  57. package/skills/skills/creative-copywriting/references/storytelling-frameworks.md +157 -0
  58. package/skills/skills/diverse-content-gen/SKILL.md +201 -0
  59. package/skills/skills/diverse-content-gen/references/advanced-techniques.md +320 -0
  60. package/skills/skills/diverse-content-gen/references/research-findings.md +379 -0
  61. package/skills/skills/diverse-content-gen/references/task-workflows.md +241 -0
  62. package/skills/skills/diverse-content-gen/references/tool-integration.md +419 -0
  63. package/skills/skills/diverse-content-gen/references/troubleshooting.md +426 -0
  64. package/skills/skills/diverse-content-gen/references/vs-core-technique.md +240 -0
  65. package/skills/skills/framework-critical-thinking/SKILL.md +220 -0
  66. package/skills/skills/framework-critical-thinking/references/bias_detector.md +375 -0
  67. package/skills/skills/framework-critical-thinking/references/fallback_handler.md +239 -0
  68. package/skills/skills/framework-critical-thinking/references/memory_curator.md +161 -0
  69. package/skills/skills/framework-critical-thinking/references/metacognitive_monitor.md +297 -0
  70. package/skills/skills/framework-critical-thinking/references/producer_critic_orchestrator.md +333 -0
  71. package/skills/skills/framework-critical-thinking/references/reasoning_router.md +235 -0
  72. package/skills/skills/framework-critical-thinking/references/reasoning_validator.md +97 -0
  73. package/skills/skills/framework-critical-thinking/references/reflection_trigger.md +78 -0
  74. package/skills/skills/framework-critical-thinking/references/self_verification.md +388 -0
  75. package/skills/skills/framework-critical-thinking/references/uncertainty_quantifier.md +207 -0
  76. package/skills/skills/framework-initiative/SKILL.md +231 -0
  77. package/skills/skills/framework-initiative/references/examples.md +150 -0
  78. package/skills/skills/framework-initiative/references/impact-analysis.md +157 -0
  79. package/skills/skills/framework-initiative/references/intent-patterns.md +145 -0
  80. package/skills/skills/framework-initiative/references/star-framework.md +165 -0
  81. package/skills/skills/humanize-docs/SKILL.md +203 -0
  82. package/skills/skills/humanize-docs/references/advanced-techniques.md +13 -0
  83. package/skills/skills/humanize-docs/references/core-transformations.md +368 -0
  84. package/skills/skills/humanize-docs/references/detection-patterns.md +400 -0
  85. package/skills/skills/humanize-docs/references/examples-gallery.md +374 -0
  86. package/skills/skills/imagine/SKILL.md +190 -0
  87. package/skills/skills/imagine/references/artstyle-corporate-memphis.md +625 -0
  88. package/skills/skills/imagine/references/artstyle-crewdson-hyperrealism.md +295 -0
  89. package/skills/skills/imagine/references/artstyle-iphone-social-media.md +426 -0
  90. package/skills/skills/imagine/references/artstyle-sciencesaru.md +276 -0
  91. package/skills/skills/pre-deploy-checklist/README.md +26 -0
  92. package/skills/skills/pre-deploy-checklist/SKILL.md +153 -0
  93. package/skills/skills/pre-deploy-checklist/references/checklist-categories.md +174 -0
  94. package/skills/skills/pre-deploy-checklist/references/domain-prompts.md +216 -0
  95. package/skills/skills/prompt-engineering/SKILL.md +209 -0
  96. package/skills/skills/prompt-engineering/references/advanced-combinations.md +444 -0
  97. package/skills/skills/prompt-engineering/references/chain-of-thought.md +140 -0
  98. package/skills/skills/prompt-engineering/references/decision_matrix.md +220 -0
  99. package/skills/skills/prompt-engineering/references/few-shot.md +346 -0
  100. package/skills/skills/prompt-engineering/references/json-format.md +270 -0
  101. package/skills/skills/prompt-engineering/references/natural-language.md +420 -0
  102. package/skills/skills/prompt-engineering/references/pitfalls.md +365 -0
  103. package/skills/skills/prompt-engineering/references/prompt-chaining.md +498 -0
  104. package/skills/skills/prompt-engineering/references/react.md +108 -0
  105. package/skills/skills/prompt-engineering/references/self-consistency.md +322 -0
  106. package/skills/skills/prompt-engineering/references/tree-of-thoughts.md +386 -0
  107. package/skills/skills/prompt-engineering/references/xml-format.md +220 -0
  108. package/skills/skills/prompt-engineering/references/yaml-format.md +488 -0
  109. package/skills/skills/prompt-engineering/references/zero-shot.md +74 -0
  110. package/skills/skills/quick-spec/SKILL.md +280 -0
  111. package/skills/skills/quick-spec/assets/tech-spec-template.md +74 -0
  112. package/skills/skills/quick-spec/references/step-01-understand.md +189 -0
  113. package/skills/skills/quick-spec/references/step-02-investigate.md +144 -0
  114. package/skills/skills/quick-spec/references/step-03-generate.md +128 -0
  115. package/skills/skills/quick-spec/references/step-04-review.md +173 -0
  116. package/skills/skills/quick-spec/tests/__pycache__/test_skill.cpython-314-pytest-9.0.2.pyc +0 -0
  117. package/skills/skills/quick-spec/tests/test_scenarios.md +83 -0
  118. package/skills/skills/quick-spec/tests/test_skill.py +136 -0
  119. package/skills/skills/readme-expert/SKILL.md +538 -0
  120. package/skills/skills/readme-expert/knowledge/INDEX.md +192 -0
  121. package/skills/skills/readme-expert/knowledge/application/quality-standards.md +470 -0
  122. package/skills/skills/readme-expert/knowledge/application/script-executor.md +604 -0
  123. package/skills/skills/readme-expert/knowledge/application/template-library.md +822 -0
  124. package/skills/skills/readme-expert/knowledge/foundation/codebase-scanner.md +361 -0
  125. package/skills/skills/readme-expert/knowledge/foundation/validation-checklist.md +481 -0
  126. package/skills/skills/red-teaming/SKILL.md +321 -0
  127. package/skills/skills/red-teaming/references/ai-llm-redteam.md +517 -0
  128. package/skills/skills/red-teaming/references/attack-techniques.md +410 -0
  129. package/skills/skills/red-teaming/references/cybersecurity-redteam.md +383 -0
  130. package/skills/skills/red-teaming/references/tools-frameworks.md +446 -0
  131. package/skills/skills/releasing/.skillkit-mode +1 -0
  132. package/skills/skills/releasing/SKILL.md +225 -0
  133. package/skills/skills/releasing/references/version-detection.md +108 -0
  134. package/skills/skills/screenwriter/SKILL.md +273 -0
  135. package/skills/skills/screenwriter/references/advanced-techniques.md +216 -0
  136. package/skills/skills/screenwriter/references/pipeline-integration.md +266 -0
  137. package/skills/skills/skillkit/.claude/settings.local.json +7 -0
  138. package/skills/skills/skillkit/.claude-plugin/plugin.json +27 -0
  139. package/skills/skills/skillkit/CHANGELOG.md +484 -0
  140. package/skills/skills/skillkit/SKILL.md +511 -0
  141. package/skills/skills/skillkit/commands/skillkit.md +6 -0
  142. package/skills/skills/skillkit/commands/validate-plan.md +6 -0
  143. package/skills/skills/skillkit/commands/verify.md +6 -0
  144. package/skills/skills/skillkit/knowledge/INDEX.md +352 -0
  145. package/skills/skills/skillkit/knowledge/application/09-case-studies.md +257 -0
  146. package/skills/skills/skillkit/knowledge/application/10-technical-architecture.md +324 -0
  147. package/skills/skills/skillkit/knowledge/application/11-adoption-strategy.md +267 -0
  148. package/skills/skills/skillkit/knowledge/application/12-testing-and-validation.md +276 -0
  149. package/skills/skills/skillkit/knowledge/application/13-competitive-landscape.md +198 -0
  150. package/skills/skills/skillkit/knowledge/foundation/01-why-skills-exist.md +246 -0
  151. package/skills/skills/skillkit/knowledge/foundation/02-skills-vs-subagents-comparison.md +312 -0
  152. package/skills/skills/skillkit/knowledge/foundation/03-skills-vs-subagents-decision-tree.md +346 -0
  153. package/skills/skills/skillkit/knowledge/foundation/04-hybrid-patterns.md +308 -0
  154. package/skills/skills/skillkit/knowledge/foundation/05-token-economics.md +275 -0
  155. package/skills/skills/skillkit/knowledge/foundation/06-platform-constraints.md +237 -0
  156. package/skills/skills/skillkit/knowledge/foundation/07-security-concerns.md +322 -0
  157. package/skills/skills/skillkit/knowledge/foundation/08-when-not-to-use-skills.md +270 -0
  158. package/skills/skills/skillkit/knowledge/plugin-guide.md +614 -0
  159. package/skills/skills/skillkit/knowledge/tools/14-validation-tools-guide.md +150 -0
  160. package/skills/skills/skillkit/knowledge/tools/15-cost-tools-guide.md +157 -0
  161. package/skills/skills/skillkit/knowledge/tools/16-security-tools-guide.md +122 -0
  162. package/skills/skills/skillkit/knowledge/tools/17-pattern-tools-guide.md +161 -0
  163. package/skills/skills/skillkit/knowledge/tools/18-decision-helper-guide.md +243 -0
  164. package/skills/skills/skillkit/knowledge/tools/19-test-generator-guide.md +275 -0
  165. package/skills/skills/skillkit/knowledge/tools/20-split-skill-guide.md +149 -0
  166. package/skills/skills/skillkit/knowledge/tools/21-quality-scorer-guide.md +226 -0
  167. package/skills/skills/skillkit/knowledge/tools/22-migration-helper-guide.md +356 -0
  168. package/skills/skills/skillkit/knowledge/tools/23-subagent-creation-guide.md +448 -0
  169. package/skills/skills/skillkit/knowledge/tools/24-behavioral-testing-guide.md +122 -0
  170. package/skills/skills/skillkit/references/proposal-generation.md +982 -0
  171. package/skills/skills/skillkit/references/rationalization-catalog.md +75 -0
  172. package/skills/skills/skillkit/references/research-methodology.md +661 -0
  173. package/skills/skills/skillkit/references/section-2-full-creation-workflow.md +452 -0
  174. package/skills/skills/skillkit/references/section-3-validation-workflow-existing-skill.md +63 -0
  175. package/skills/skills/skillkit/references/section-4-decision-workflow-skills-vs-subagents.md +64 -0
  176. package/skills/skills/skillkit/references/section-5-migration-workflow-doc-to-skill.md +58 -0
  177. package/skills/skills/skillkit/references/section-6-subagent-creation-workflow.md +499 -0
  178. package/skills/skills/skillkit/references/section-7-knowledge-reference-map.md +72 -0
  179. package/skills/skills/skillkit/scripts/__pycache__/decision_helper.cpython-314.pyc +0 -0
  180. package/skills/skills/skillkit/scripts/__pycache__/quick_validate.cpython-312.pyc +0 -0
  181. package/skills/skills/skillkit/scripts/__pycache__/quick_validate.cpython-314.pyc +0 -0
  182. package/skills/skills/skillkit/scripts/__pycache__/test_generator.cpython-314-pytest-9.0.2.pyc +0 -0
  183. package/skills/skills/skillkit/scripts/decision_helper.py +799 -0
  184. package/skills/skills/skillkit/scripts/init_skill.py +400 -0
  185. package/skills/skills/skillkit/scripts/init_subagent.py +231 -0
  186. package/skills/skills/skillkit/scripts/migration_helper.py +669 -0
  187. package/skills/skills/skillkit/scripts/package_skill.py +211 -0
  188. package/skills/skills/skillkit/scripts/pattern_detector.py +381 -0
  189. package/skills/skills/skillkit/scripts/pattern_detector_new.py +382 -0
  190. package/skills/skills/skillkit/scripts/pressure_tester.py +157 -0
  191. package/skills/skills/skillkit/scripts/quality_scorer.py +999 -0
  192. package/skills/skills/skillkit/scripts/quick_validate.py +100 -0
  193. package/skills/skills/skillkit/scripts/security_scanner.py +474 -0
  194. package/skills/skills/skillkit/scripts/split_skill.py +540 -0
  195. package/skills/skills/skillkit/scripts/test_generator.py +695 -0
  196. package/skills/skills/skillkit/scripts/token_estimator.py +493 -0
  197. package/skills/skills/skillkit/scripts/utils/__init__.py +49 -0
  198. package/skills/skills/skillkit/scripts/utils/__pycache__/__init__.cpython-312.pyc +0 -0
  199. package/skills/skills/skillkit/scripts/utils/__pycache__/__init__.cpython-314.pyc +0 -0
  200. package/skills/skills/skillkit/scripts/utils/__pycache__/budget_tracker.cpython-312.pyc +0 -0
  201. package/skills/skills/skillkit/scripts/utils/__pycache__/budget_tracker.cpython-314.pyc +0 -0
  202. package/skills/skills/skillkit/scripts/utils/__pycache__/output_formatter.cpython-312.pyc +0 -0
  203. package/skills/skills/skillkit/scripts/utils/__pycache__/output_formatter.cpython-314.pyc +0 -0
  204. package/skills/skills/skillkit/scripts/utils/__pycache__/reference_validator.cpython-312.pyc +0 -0
  205. package/skills/skills/skillkit/scripts/utils/__pycache__/reference_validator.cpython-314.pyc +0 -0
  206. package/skills/skills/skillkit/scripts/utils/budget_tracker.py +388 -0
  207. package/skills/skills/skillkit/scripts/utils/output_formatter.py +263 -0
  208. package/skills/skills/skillkit/scripts/utils/reference_validator.py +401 -0
  209. package/skills/skills/skillkit/scripts/validate_skill.py +594 -0
  210. package/skills/skills/skillkit/tests/test_behavioral.py +39 -0
  211. package/skills/skills/skillkit/tests/test_scenarios.md +83 -0
  212. package/skills/skills/skillkit/tests/test_skill.py +136 -0
  213. package/skills/skills/skillkit-help/SKILL.md +81 -0
  214. package/skills/skills/skillkit-help/knowledge/application/09-case-studies.md +257 -0
  215. package/skills/skills/skillkit-help/knowledge/application/12-testing-and-validation.md +276 -0
  216. package/skills/skills/skillkit-help/knowledge/foundation/01-why-skills-exist.md +246 -0
  217. package/skills/skills/skillkit-help/knowledge/foundation/02-skills-vs-subagents-comparison.md +312 -0
  218. package/skills/skills/skillkit-help/knowledge/foundation/03-skills-vs-subagents-decision-tree.md +346 -0
  219. package/skills/skills/skillkit-help/knowledge/foundation/06-platform-constraints.md +237 -0
  220. package/skills/skills/skillkit-help/knowledge/foundation/08-when-not-to-use-skills.md +270 -0
  221. package/skills/skills/skillkit-help/template/SKILL.md +52 -0
  222. package/skills/skills/social-media-seo/SKILL.md +278 -0
  223. package/skills/skills/social-media-seo/databases/caption-styles.csv +31 -0
  224. package/skills/skills/social-media-seo/databases/engagement-tactics.csv +16 -0
  225. package/skills/skills/social-media-seo/databases/hashtag-strategies.csv +21 -0
  226. package/skills/skills/social-media-seo/databases/hook-formulas.csv +26 -0
  227. package/skills/skills/social-media-seo/databases/keyword-clusters.csv +11 -0
  228. package/skills/skills/social-media-seo/databases/thread-structures.csv +26 -0
  229. package/skills/skills/social-media-seo/databases/viral-patterns.csv +21 -0
  230. package/skills/skills/social-media-seo/references/analytics-guide.md +321 -0
  231. package/skills/skills/social-media-seo/references/instagram-seo.md +235 -0
  232. package/skills/skills/social-media-seo/references/threads-seo.md +305 -0
  233. package/skills/skills/social-media-seo/references/x-twitter-seo.md +337 -0
  234. package/skills/skills/social-media-seo/scripts/query_database.py +191 -0
  235. package/skills/skills/storyteller/SKILL.md +241 -0
  236. package/skills/skills/storyteller/references/transformation-methodology.md +293 -0
  237. package/skills/skills/storyteller/references/visual-vocabulary.md +177 -0
  238. package/skills/skills/thread-pro/SKILL.md +162 -0
  239. package/skills/skills/thread-pro/anti-ai-patterns.md +120 -0
  240. package/skills/skills/thread-pro/hook-formulas.md +138 -0
  241. package/skills/skills/thread-pro/references/anti-ai-patterns.md +120 -0
  242. package/skills/skills/thread-pro/references/hook-formulas.md +138 -0
  243. package/skills/skills/thread-pro/references/thread-structures.md +240 -0
  244. package/skills/skills/thread-pro/references/voice-injection.md +130 -0
  245. package/skills/skills/thread-pro/thread-structures.md +240 -0
  246. package/skills/skills/thread-pro/voice-injection.md +130 -0
  247. package/skills/skills/tinkering/SKILL.md +251 -0
  248. package/skills/skills/tinkering/references/graduation-checklist.md +100 -0
  249. package/skills/skills/validate-plan/.skillkit-mode +1 -0
  250. package/skills/skills/validate-plan/SKILL.md +406 -0
  251. package/skills/skills/validate-plan/references/dry-principles.md +251 -0
  252. package/skills/skills/validate-plan/references/gap-analysis-guide.md +320 -0
  253. package/skills/skills/validate-plan/references/tdd-patterns.md +413 -0
  254. package/skills/skills/validate-plan/references/yagni-checklist.md +330 -0
  255. package/skills/skills/verify-before-ship/.skillkit-mode +1 -0
  256. package/skills/skills/verify-before-ship/SKILL.md +116 -0
  257. package/skills/skills/verify-before-ship/references/anti-rationalization.md +212 -0
  258. package/skills/skills/verify-before-ship/references/verification-gates.md +305 -0
  259. package/skills-manifest.json +8 -2
  260. package/src/banner.js +1 -1
  261. package/src/cli.js +15 -4
  262. package/src/install.js +45 -29
  263. package/src/install.test.js +75 -7
  264. package/src/picker.js +15 -4
  265. package/src/picker.test.js +36 -1
  266. package/src/scope.js +8 -39
  267. package/src/scope.test.js +9 -13
  268. package/src/tools.js +76 -0
  269. package/src/tools.test.js +80 -0
@@ -0,0 +1,276 @@
1
+ ---
2
+ title: "Testing & Validation: Quality Assurance for Skills"
3
+ purpose: "Pre-deployment validation, testing frameworks, debugging workflows"
4
+ token_estimate: "2000"
5
+ read_priority: "high"
6
+ read_when:
7
+ - "Before deploying any skill"
8
+ - "User asking 'How do I test this?'"
9
+ - "Debugging skill issues"
10
+ - "Quality assurance planning"
11
+ - "Creating testing checklist"
12
+ related_files:
13
+ must_read_first:
14
+ - "01-why-skills-exist.md"
15
+ read_together:
16
+ - "11-adoption-strategy.md"
17
+ read_next:
18
+ - "14-validation-best-practices.md"
19
+ - "15-cost-optimization-guide.md"
20
+ - "16-security-scanning-guide.md"
21
+ avoid_reading_when:
22
+ - "Still learning concepts (not implementing yet)"
23
+ - "Using only official Anthropic skills"
24
+ last_updated: "2025-11-03"
25
+ ---
26
+
27
+ # Testing & Validation: Quality Assurance for Skills
28
+
29
+ ## I. INTRODUCTION
30
+
31
+ **Why Testing Critical:** Validation failures waste 2-4 hours debugging post-deployment. Pre-deployment testing catches issues early, ensures quality, prevents user frustration.
32
+
33
+ **Testing Philosophy:** Test BEFORE deployment, test WHAT matters (not everything), automate where possible, iterate based on failures.
34
+
35
+ **Scope:** Pre-deployment validation, functional testing, debugging workflows. **For automated scripts:** `14-validation-best-practices.md`. **For security:** `07-security-concerns.md`.
36
+
37
+ ---
38
+
39
+ ## II. PRE-DEPLOYMENT VALIDATION
40
+
41
+ ### A. Structure Validation
42
+
43
+ | Check | Requirement | Status |
44
+ |-------|-------------|--------|
45
+ | **YAML** | Valid frontmatter, required fields | ☐ |
46
+ | **Name** | Max 64 chars, descriptive | ☐ |
47
+ | **Description** | Max 1,024 chars, has triggers | ☐ |
48
+ | **Files** | SKILL.md present, proper structure | ☐ |
49
+ | **Organization** | Progressive disclosure (main + refs) | ☐ |
50
+
51
+ **File Organization:**
52
+ ```
53
+ skill-name/
54
+ SKILL.md # Required, <500 lines
55
+ reference/ # Optional, Level 3 content
56
+ scripts/ # Optional, executables
57
+ ```
58
+
59
+ **For automated validation:** `validate_skill.py` (see `14-validation-best-practices.md`).
60
+
61
+ ### B. Content Quality
62
+
63
+ | Aspect | Good Example | Bad Example |
64
+ |--------|--------------|-------------|
65
+ | **Description** | "Extract PDFs. Use when..." | "PDF tool" |
66
+ | **Triggers** | "convert PDF", "extract text" | Vague wording |
67
+ | **Instructions** | "1. Run X, 2. Verify Y" | "Handle appropriately" |
68
+ | **Examples** | 2-3 inline, realistic | Too many, unrealistic |
69
+ | **Cross-Refs** | Valid file paths | Broken links |
70
+
71
+ **Description Tips:** Include task verbs ("extract", "convert"), add trigger phrases ("Use when"), be specific ("PDF to Word" NOT "documents").
72
+
73
+ ### C. Token Efficiency
74
+
75
+ | Component | Target | Max | Action if Over |
76
+ |-----------|--------|-----|----------------|
77
+ | SKILL.md | 200-350 lines | 500 | Split to refs |
78
+ | Description | 50-150 chars | 1,024 | Condense |
79
+ | Token estimate | ±10% actual | N/A | Recalculate |
80
+
81
+ **Token Formula:** Tokens ≈ Words × 1.3 to 1.5
82
+
83
+ **Progressive Disclosure:** Core in SKILL.md (<500 lines), advanced in reference files, scripts output-only (don't load), examples inline.
84
+
85
+ **For optimization:** `15-cost-optimization-guide.md`
86
+
87
+ ### D. Security Audit
88
+
89
+ | Risk | Check | Vulnerable | Fixed |
90
+ |------|-------|-----------|--------|
91
+ | **Secrets** | No hardcoded keys | `API_KEY="abc"` | `os.getenv()` |
92
+ | **Injection** | No unchecked input | `os.system(input)` | `subprocess.run()` |
93
+ | **Permissions** | Minimal tools | `allowed-tools: [*]` | Specific list |
94
+ | **Network** | Justified access | Unchecked calls | Validate URLs |
95
+
96
+ **Quick Scan:**
97
+ ```bash
98
+ grep -r "API_KEY\s*=" skill-name/ # Hardcoded secrets
99
+ grep -r "os\.system" skill-name/ # Injection risk
100
+ grep -r "eval\|exec" skill-name/ # Code execution
101
+ ```
102
+
103
+ **For comprehensive security:** `07-security-concerns.md` + `16-security-scanning-guide.md`
104
+
105
+ ---
106
+
107
+ ## III. FUNCTIONAL TESTING
108
+
109
+ ### A. Positive Tests (Should Succeed)
110
+
111
+ | Type | Test Case | Expected |
112
+ |------|-----------|----------|
113
+ | **Direct** | "Use PDF skill to extract" | Activates immediately |
114
+ | **Implicit** | "Extract text from PDF" | Detects relevance, activates |
115
+ | **Multi-Skill** | "Extract PDF, analyze Excel" | Both coordinate |
116
+
117
+ **Examples:**
118
+ 1. Direct: "Use data-analysis skill" → Triggers, processes
119
+ 2. Implicit: "Analyze sales data" → Detects keywords, triggers
120
+ 3. Multi-step: "Convert PDF, create charts" → Both skills activate
121
+
122
+ ### B. Negative Tests (Should NOT Trigger)
123
+
124
+ | Type | Test Case | Expected |
125
+ |------|-----------|----------|
126
+ | **Unrelated** | "What's the weather?" | No activation |
127
+ | **Similar Keywords** | "I like to analyze movies" | No false positive |
128
+ | **Wrong Context** | "Email analysis" (Excel skill) | Correct skill triggers |
129
+
130
+ **Examples:**
131
+ 1. Unrelated: "Tell joke about data" → No trigger
132
+ 2. False positive: "Document this process" → No doc-gen trigger (instruction, not task)
133
+ 3. Edge: "Summarize PDF" → Only PDF triggers, not redundant summarization
134
+
135
+ ### C. Integration Tests
136
+
137
+ | Type | Focus | Validation |
138
+ |------|-------|------------|
139
+ | **Skill + Subagent** | Coordination | Both execute, no conflicts |
140
+ | **Multi-Skill** | Sequential | Correct order, data passing |
141
+ | **Tool Access** | Permissions | Allowed work, blocked fail |
142
+ | **Error Handling** | Graceful failures | Valid error messages |
143
+
144
+ **Example:** "Extract PDF, analyze Excel" → Verify PDF first, Excel receives data, both complete.
145
+
146
+ ### D. Performance Tests
147
+
148
+ | Metric | Target | Alert |
149
+ |--------|--------|-------|
150
+ | **Token Usage** | ±10% estimate | >20% variance |
151
+ | **Response Time** | <30 sec | >60 sec |
152
+ | **File Handling** | Works to limit | Crashes |
153
+ | **Error Rate** | <5% | >10% |
154
+
155
+ ---
156
+
157
+ ## IV. DEBUGGING WORKFLOWS
158
+
159
+ ### A. Common Issues
160
+
161
+ | Issue | Solution |
162
+ |-------|----------|
163
+ | **Not Triggering** | Improve description (add trigger keywords) |
164
+ | **Wrong Skill** | Make description more specific |
165
+ | **Script Fails** | Check permissions, validate inputs |
166
+ | **Permission Error** | Add required tool to allowed-tools |
167
+ | **Slow** | Check SKILL.md size, split files |
168
+
169
+ **Decision Tree:**
170
+ ```
171
+ Not working?
172
+ ├─ Not activating? → Fix description, test explicit mention
173
+ ├─ Fails execution? → Check permissions, validate code
174
+ ├─ Wrong output? → Review instructions, add examples
175
+ └─ Slow? → Optimize token usage, split files
176
+ ```
177
+
178
+ ### B. Diagnostic Techniques
179
+
180
+ **1. Description Analysis:**
181
+ ```
182
+ Bad: "Helps with documents"
183
+ Good: "Convert Word/PDF/Excel. Use when processing documents."
184
+ ```
185
+
186
+ **2. Trigger Testing:**
187
+ ```
188
+ Test: "Convert PDF", "Extract text", "Process document", "Use converter"
189
+ → Track which phrases trigger consistently
190
+ ```
191
+
192
+ **3. Permission Check:**
193
+ ```yaml
194
+ allowed-tools:
195
+ - bash_tool # Script execution
196
+ - view # Read files
197
+ - create_file # Output
198
+ ```
199
+
200
+ ### C. Iterative Improvement
201
+
202
+ **5-Step Loop:**
203
+ 1. **Observe:** Document failure (screenshot, error)
204
+ 2. **Hypothesize:** "Description lacks 'convert' keyword"
205
+ 3. **Fix:** Add one keyword (minimal change)
206
+ 4. **Re-Test:** Same case again
207
+ 5. **Validate:** Test 3-5 times (confirm reliability)
208
+
209
+ **Example:**
210
+ ```
211
+ Iteration 1: Not triggering → Add "process" keyword → Works
212
+ Iteration 2: Workflow unclear → Add steps → Completes
213
+ Iteration 3: Fails Word docs → Add example → Both formats work
214
+ ```
215
+
216
+ ### D. Documentation
217
+
218
+ **Test Log:**
219
+
220
+ | Date | Test | Result | Issue | Resolution |
221
+ |------|------|--------|-------|------------|
222
+ | 11-01 | PDF extract | ✅ | None | - |
223
+ | 11-01 | Excel convert | ❌ | Permission | Added `create_file` |
224
+ | 11-02 | Excel convert | ✅ | None | Fixed |
225
+
226
+ **Known Issues:**
227
+ ```
228
+ Issue #1: Slow with large PDFs (>50MB)
229
+ Status: Open | Workaround: Split files | Target: v1.2.0
230
+
231
+ Issue #2: False trigger "analyze"
232
+ Status: Fixed v1.1.0 | Solution: Specific description
233
+ ```
234
+
235
+ ---
236
+
237
+ ## V. QUALITY ASSURANCE FRAMEWORK
238
+
239
+ **Testing Stages:**
240
+
241
+ | Stage | Focus | Pass Criteria |
242
+ |-------|-------|---------------|
243
+ | **Dev** | Basic functionality | All positive tests pass |
244
+ | **Staging** | Integration + edges | 90% pass, no critical issues |
245
+ | **Production** | Real usage | <5% error, satisfaction ≥7/10 |
246
+
247
+ **Sign-Off Checklist:**
248
+
249
+ | Criteria | Required |
250
+ |----------|----------|
251
+ | Validation checks passed | Yes ☐ |
252
+ | Positive tests ≥95% | Yes ☐ |
253
+ | Negative tests ≥95% | Yes ☐ |
254
+ | Security audit done | Yes ☐ |
255
+ | Documentation current | Yes ☐ |
256
+ | Peer review complete | Yes ☐ |
257
+
258
+ **Regression Testing:** Re-run ALL tests after ANY change to SKILL.md, scripts, or references.
259
+
260
+ **Monitoring:** Usage frequency (daily), error rate (<5%), complaints (<3/week). **For setup:** `11-adoption-strategy.md` IV.D.
261
+
262
+ ---
263
+
264
+ ## VI. KEY TAKEAWAYS
265
+
266
+ **Testing Priorities:** Pre-deployment validation prevents disasters (structure + security). Functional testing ensures core works (positive tests) and avoids false positives (negative tests). Performance optimization follows (token usage + speed).
267
+
268
+ **Quality Gates:** Pilot requires validation + positive tests. Team expansion needs integration + negative tests. Production demands performance metrics + security audit completion.
269
+
270
+ **Debugging Strategy:** Quick fixes—check description keywords, verify tool permissions, test explicit mentions. Deep fixes—review SKILL.md clarity, test edge cases systematically, document failure patterns.
271
+
272
+ **Next Steps:** Automation → `14-validation-best-practices.md`. Optimization → `15-cost-optimization-guide.md`. Security → `16-security-scanning-guide.md`. Adoption → `11-adoption-strategy.md`.
273
+
274
+ ---
275
+
276
+ **End of File 12**
@@ -0,0 +1,246 @@
1
+ ---
2
+ title: "Why Claude Skills Were Created"
3
+ purpose: "Understanding strategic context and foundational problems Skills solve"
4
+ token_estimate: "1800"
5
+ read_priority: "critical"
6
+ read_when:
7
+ - "User asking 'Why should I use Skills?'"
8
+ - "User comparing Skills to other approaches"
9
+ - "Starting Skills adoption decision"
10
+ - "Understanding Anthropic's vision"
11
+ - "Evaluating Skills vs alternatives"
12
+ related_files:
13
+ must_read_first: []
14
+ read_together:
15
+ - "08-when-not-to-use-skills.md"
16
+ read_next:
17
+ - "02-skills-vs-subagents-comparison.md"
18
+ - "case-studies.md"
19
+ avoid_reading_when:
20
+ - "User already decided to use Skills (skip to implementation)"
21
+ - "User asking pure technical how-to questions"
22
+ - "Debugging specific skill issues"
23
+ last_updated: "2025-10-31"
24
+ ---
25
+
26
+ # Why Claude Skills Were Created
27
+
28
+ ## I. ANTHROPIC'S VISION: COMPOSABLE AI FUTURE
29
+
30
+ ### Launch Context
31
+ **Launch Date:** October 16, 2025
32
+ **Product Status:** Beta, available on Claude.ai, Claude Code CLI, and API
33
+ **Target Users:** Organizations and developers who need specialized AI capabilities
34
+
35
+ ### Strategic Vision from Mahesh Murag (Technical Staff, Anthropic)
36
+
37
+ > "Skills are based on our belief that as model intelligence continues to increase, we will move toward general-purpose agents that often have access to their own filesystem and computing environment."
38
+
39
+ **Core Philosophy: Composability Over Fragmentation**
40
+
41
+ Anthropic identified fundamental tension in AI development:
42
+ - **WANTED:** Specialized capabilities for specific domains
43
+ - **NOT WANTED:** Fragmented ecosystem of custom agents for each use case
44
+
45
+ **The Composability Solution:**
46
+ Instead of building isolated custom agents for every task, Skills enable anyone to specialize general-purpose agents with capabilities that can be combined. A single agent can be equipped with multiple Skills, combining them for complex workflows.
47
+
48
+ ### The Agentic Future Vision
49
+
50
+ Anthropic envisions a future where:
51
+ 1. **Agents self-improve** - Agents can create, edit, and evaluate their own Skills
52
+ 2. **Behavior codification** - Agents codify successful patterns into reusable capabilities
53
+ 3. **Ecosystem growth** - Community-driven skill library that continues to grow
54
+ 4. **Universal portability** - Skills work across platforms (web, CLI, API)
55
+
56
+ **Timeline Evolution:**
57
+
58
+ **Pre-Skills (Before Oct 2025):** Manual prompt engineering every conversation, repetitive context-setting, inconsistent outputs.
59
+
60
+ **Skills Era (Oct 2025+):** Package knowledge once, automatic activation, consistent outputs, reduced cognitive load.
61
+
62
+ **Future Vision:** Self-improving agents, marketplace for community skills, enterprise governance, multi-agent orchestration with shared libraries.
63
+
64
+ ---
65
+
66
+ ## II. 4 FUNDAMENTAL PROBLEMS SOLVED BY SKILLS
67
+
68
+ ### A. SPECIALIZATION PROBLEM
69
+
70
+ **Problem:** General-purpose AI models lack domain-specific knowledge and procedures that organizations need for real work.
71
+
72
+ **Without Skills:**
73
+ ```
74
+ User: "Create a financial DCF model"
75
+ Claude: "I'll create a basic DCF. What discount rate?"
76
+ User: "Use our company standard"
77
+ Claude: "What is your company standard?"
78
+ [15 minutes explaining methodology - repeated every conversation]
79
+ ```
80
+
81
+ **With Skills:**
82
+ ```
83
+ User: "Create a financial DCF model"
84
+ [financial-modeling skill automatically loaded]
85
+ [Company WACC methodology applied]
86
+ [Standard assumptions used]
87
+ [Model generated in company format]
88
+ ```
89
+
90
+ **Impact:** Rakuten AI Team reported "What once took a day, we can now accomplish in an hour" for management accounting workflows. Organizations can transform general intelligence into specialized expertise without rebuilding models.
91
+
92
+ ---
93
+
94
+ ### B. REPETITION PROBLEM
95
+
96
+ **Problem:** Teams constantly provide the same guidance repeatedly. Brad Abrams (Product Lead) called this "endless cycle of prompt engineering and context-setting that makes current AI tools feel more like burdens than breakthroughs."
97
+
98
+ **Repetition Pattern Example:**
99
+ - Monday: Engineer A explains code review checklist (15 minutes)
100
+ - Tuesday: Engineer B explains same checklist (15 minutes)
101
+ - Wednesday: Engineer C explains same checklist (15 minutes)
102
+ - Result: 200+ hours/year wasted on repetitive explanations
103
+
104
+ **Skills Solution:** Create code-review skill once. Every engineer automatically gets same guidance. Zero recurring explanation time.
105
+
106
+ **Impact Metrics:**
107
+ - Box users: "Saving hours of effort" in document transformation
108
+ - Notion feedback: "Less prompt wrangling on complex tasks"
109
+ - Estimated team productivity gain: 30-50% on repetitive tasks
110
+
111
+ Skills package institutional knowledge once and distribute automatically, eliminating recurring overhead.
112
+
113
+ ---
114
+
115
+ ### C. CONSISTENCY PROBLEM
116
+
117
+ **Problem:** Output quality varies wildly depending on who uses AI and how they phrase prompts.
118
+
119
+ **Example Scenario:**
120
+ Three marketing managers write product launch emails:
121
+ - Manager A gets casual, short email (250 words)
122
+ - Manager B gets formal, long email (600 words)
123
+ - Manager C gets mixed tone (inconsistent branding)
124
+
125
+ **With Skills:** All managers automatically get consistent tone, length, format, and branding from brand-guidelines skill.
126
+
127
+ **Consistency Dimensions:**
128
+ - Brand identity (logo, colors, typography, voice)
129
+ - Document formatting (PowerPoint, Word, Excel templates)
130
+ - Process compliance (regulatory requirements, approval workflows)
131
+ - Technical standards (code style, architecture patterns)
132
+
133
+ **Impact:**
134
+ - Reduction in rework: 40-60%
135
+ - Brand compliance: Near 100% vs ~60% without Skills
136
+ - Onboarding time: 50% reduction
137
+
138
+ Skills encode organizational standards that are automatically enforced.
139
+
140
+ ---
141
+
142
+ ### D. TOKEN EFFICIENCY PROBLEM
143
+
144
+ **Problem:** Sending comprehensive documentation in every conversation wastes context window and increases costs dramatically.
145
+
146
+ **Traditional RAG Approach:**
147
+ - API documentation: 15,000 tokens (loaded every time)
148
+ - Company procedures: 8,000 tokens
149
+ - Code examples: 12,000 tokens
150
+ - Templates: 5,000 tokens
151
+ - **TOTAL: 40,000 tokens consumed BEFORE any actual work**
152
+
153
+ **Skills Progressive Disclosure:**
154
+ - Level 1 (Always): Metadata only - 50 tokens
155
+ - Level 2 (When triggered): SKILL.md body - 2,800 tokens
156
+ - Level 3 (On-demand): References - loaded only if accessed
157
+
158
+ **Typical Usage:** 50 tokens (99.875% reduction vs traditional approach)
159
+
160
+ **Cost Impact (Claude Sonnet 4.5 @ $3/M input tokens):**
161
+ - Traditional: $0.12 per conversation just to load context
162
+ - Skills: $0.00015 average overhead
163
+ - Savings: $119.85/month per 1000 conversations
164
+
165
+ Context window is a precious resource. Skills maximize efficiency through progressive disclosure, allowing unlimited knowledge with minimal overhead.
166
+
167
+ **For detailed token economics analysis including multi-agent multipliers and optimization strategies, see:** `05-token-economics.md`
168
+
169
+ ---
170
+
171
+ ## III. POSITIONING VS COMPETITORS
172
+
173
+ ### Skills vs GPTs (OpenAI)
174
+ **GPTs:** Consumer marketplace, pre-configured interfaces, limited customization, closed ecosystem.
175
+
176
+ **Skills Advantages:** Developer-centric control, composable (multiple skills simultaneously), filesystem-based unlimited content, code execution, universal across platforms, version controllable.
177
+
178
+ **Best for:** Enterprise deployments, technical teams, complex workflows, full customization needs.
179
+
180
+ ### Skills vs Copilot Studio (Microsoft)
181
+ **Copilot:** Low-code visual builder, enterprise-focused, Microsoft ecosystem, proprietary format.
182
+
183
+ **Skills Advantages:** Code-first transparency, platform-agnostic portability, Git-friendly versioning, extreme token efficiency, portable across Claude platforms.
184
+
185
+ **Best for:** Code-comfortable teams, cross-platform requirements, version control mandatory, cost sensitivity.
186
+
187
+ ### Skills vs Traditional RAG
188
+ **RAG:** Trade-off between breadth vs relevance, all retrieved content consumes tokens, no code execution.
189
+
190
+ **Skills Advantages:** Progressive disclosure eliminates breadth/relevance trade-off, unlimited bundled content with zero penalty until accessed, executable scripts for deterministic operations, structured workflows beyond document retrieval.
191
+
192
+ **Best for:** Procedural knowledge, deterministic operations, token efficiency critical, structured workflows.
193
+
194
+ **For comprehensive competitive analysis including feature comparisons and migration strategies, see:** `competitive-landscape.md`
195
+
196
+ ---
197
+
198
+ ## IV. USE CASE OVERVIEW
199
+
200
+ ### Financial Services
201
+ - DCF models with company WACC methodology
202
+ - Comparable company analysis with valuation multiples
203
+ - Data room processing to Excel
204
+ - **Validated:** Rakuten - "day to hour" productivity
205
+
206
+ ### Life Sciences
207
+ - Single-cell RNA sequencing QC (scverse best practices)
208
+ - Scientific protocol following
209
+ - Literature reviews with domain knowledge
210
+
211
+ ### Enterprise Integrations
212
+ - Document transformation (Box: files to presentations/spreadsheets)
213
+ - Brand compliance automation
214
+ - Organizational standards enforcement
215
+ - **Validated:** Box "hours saved", Notion "faster action"
216
+
217
+ ### Developer Workflows
218
+ - Code style guide application
219
+ - Boilerplate generation with design standards
220
+ - Validation checks with conventions
221
+ - Task creation in JIRA/Asana/Linear
222
+
223
+ **For detailed case studies with metrics and implementation patterns, see:** `case-studies.md`
224
+
225
+ ---
226
+
227
+ ## WHEN TO READ NEXT
228
+
229
+ **Understanding Choices:**
230
+ - Compare Skills vs Subagents: `02-skills-vs-subagents-comparison.md`
231
+ - See real-world validation: `case-studies.md`
232
+ - Understand limitations: `08-when-not-to-use-skills.md`
233
+
234
+ **Making Decisions:**
235
+ - Implementation decision framework: `03-skills-vs-subagents-decision-tree.md`
236
+ - Cost analysis: `05-token-economics.md`
237
+
238
+ **Technical Details:**
239
+ - Architecture deep dive: `technical-architecture.md`
240
+ - Platform constraints: `06-platform-constraints.md`
241
+
242
+ **If ready to build:** Skip to implementation guides directly.
243
+
244
+ ---
245
+
246
+ **FILE END - Estimated Token Count: ~1,800 tokens (~210 lines)**