@rfxlamia/skillkit 1.0.0 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (269) hide show
  1. package/agents/agents/creative-copywriter.md +212 -0
  2. package/agents/agents/dario-amodei.md +135 -0
  3. package/agents/agents/doc-simplifier.md +63 -0
  4. package/agents/agents/kotlin-pro.md +433 -0
  5. package/agents/agents/red-team.md +136 -0
  6. package/agents/agents/sam-altman.md +121 -0
  7. package/agents/agents/seo-manager.md +184 -0
  8. package/package.json +7 -2
  9. package/skills/quick-spec/tests/__pycache__/test_skill.cpython-314-pytest-9.0.2.pyc +0 -0
  10. package/skills/skillkit/.claude/settings.local.json +7 -0
  11. package/skills/skillkit/scripts/__pycache__/decision_helper.cpython-314.pyc +0 -0
  12. package/skills/skillkit/scripts/__pycache__/quick_validate.cpython-312.pyc +0 -0
  13. package/skills/skillkit/scripts/__pycache__/quick_validate.cpython-314.pyc +0 -0
  14. package/skills/skillkit/scripts/__pycache__/test_generator.cpython-314-pytest-9.0.2.pyc +0 -0
  15. package/skills/skillkit/scripts/utils/__pycache__/__init__.cpython-312.pyc +0 -0
  16. package/skills/skillkit/scripts/utils/__pycache__/__init__.cpython-314.pyc +0 -0
  17. package/skills/skillkit/scripts/utils/__pycache__/budget_tracker.cpython-312.pyc +0 -0
  18. package/skills/skillkit/scripts/utils/__pycache__/budget_tracker.cpython-314.pyc +0 -0
  19. package/skills/skillkit/scripts/utils/__pycache__/output_formatter.cpython-312.pyc +0 -0
  20. package/skills/skillkit/scripts/utils/__pycache__/output_formatter.cpython-314.pyc +0 -0
  21. package/skills/skillkit/scripts/utils/__pycache__/reference_validator.cpython-312.pyc +0 -0
  22. package/skills/skillkit/scripts/utils/__pycache__/reference_validator.cpython-314.pyc +0 -0
  23. package/skills/skillkit-help/SKILL.md +81 -0
  24. package/skills/skillkit-help/knowledge/application/09-case-studies.md +257 -0
  25. package/skills/skillkit-help/knowledge/application/12-testing-and-validation.md +276 -0
  26. package/skills/skillkit-help/knowledge/foundation/01-why-skills-exist.md +246 -0
  27. package/skills/skillkit-help/knowledge/foundation/02-skills-vs-subagents-comparison.md +312 -0
  28. package/skills/skillkit-help/knowledge/foundation/03-skills-vs-subagents-decision-tree.md +346 -0
  29. package/skills/skillkit-help/knowledge/foundation/06-platform-constraints.md +237 -0
  30. package/skills/skillkit-help/knowledge/foundation/08-when-not-to-use-skills.md +270 -0
  31. package/skills/skillkit-help/template/SKILL.md +52 -0
  32. package/skills/skills/adversarial-review/SKILL.md +219 -0
  33. package/skills/skills/baby-education/SKILL.md +260 -0
  34. package/skills/skills/baby-education/references/advanced-techniques.md +323 -0
  35. package/skills/skills/baby-education/references/transformations.md +345 -0
  36. package/skills/skills/been-there-done-that/SKILL.md +455 -0
  37. package/skills/skills/been-there-done-that/references/analysis-patterns.md +162 -0
  38. package/skills/skills/been-there-done-that/references/git-commands.md +132 -0
  39. package/skills/skills/been-there-done-that/references/tree-insertion-logic.md +145 -0
  40. package/skills/skills/coolhunter/SKILL.md +270 -0
  41. package/skills/skills/coolhunter/assets/elicitation-methods.csv +51 -0
  42. package/skills/skills/coolhunter/knowledge/elicitation-methods.md +312 -0
  43. package/skills/skills/coolhunter/references/workflow-execution.md +238 -0
  44. package/skills/skills/coolhunter/workflow-plan-coolhunter.md +232 -0
  45. package/skills/skills/creative-copywriting/SKILL.md +324 -0
  46. package/skills/skills/creative-copywriting/databases/README.md +60 -0
  47. package/skills/skills/creative-copywriting/databases/carousel-structures.csv +16 -0
  48. package/skills/skills/creative-copywriting/databases/emotional-arcs.csv +11 -0
  49. package/skills/skills/creative-copywriting/databases/hook-formulas.csv +51 -0
  50. package/skills/skills/creative-copywriting/databases/power-words.csv +201 -0
  51. package/skills/skills/creative-copywriting/databases/psychological-triggers.csv +21 -0
  52. package/skills/skills/creative-copywriting/databases/read-more-patterns.csv +26 -0
  53. package/skills/skills/creative-copywriting/databases/swipe-triggers.csv +31 -0
  54. package/skills/skills/creative-copywriting/references/carousel-psychology.md +223 -0
  55. package/skills/skills/creative-copywriting/references/hook-anatomy.md +169 -0
  56. package/skills/skills/creative-copywriting/references/power-word-science.md +134 -0
  57. package/skills/skills/creative-copywriting/references/storytelling-frameworks.md +157 -0
  58. package/skills/skills/diverse-content-gen/SKILL.md +201 -0
  59. package/skills/skills/diverse-content-gen/references/advanced-techniques.md +320 -0
  60. package/skills/skills/diverse-content-gen/references/research-findings.md +379 -0
  61. package/skills/skills/diverse-content-gen/references/task-workflows.md +241 -0
  62. package/skills/skills/diverse-content-gen/references/tool-integration.md +419 -0
  63. package/skills/skills/diverse-content-gen/references/troubleshooting.md +426 -0
  64. package/skills/skills/diverse-content-gen/references/vs-core-technique.md +240 -0
  65. package/skills/skills/framework-critical-thinking/SKILL.md +220 -0
  66. package/skills/skills/framework-critical-thinking/references/bias_detector.md +375 -0
  67. package/skills/skills/framework-critical-thinking/references/fallback_handler.md +239 -0
  68. package/skills/skills/framework-critical-thinking/references/memory_curator.md +161 -0
  69. package/skills/skills/framework-critical-thinking/references/metacognitive_monitor.md +297 -0
  70. package/skills/skills/framework-critical-thinking/references/producer_critic_orchestrator.md +333 -0
  71. package/skills/skills/framework-critical-thinking/references/reasoning_router.md +235 -0
  72. package/skills/skills/framework-critical-thinking/references/reasoning_validator.md +97 -0
  73. package/skills/skills/framework-critical-thinking/references/reflection_trigger.md +78 -0
  74. package/skills/skills/framework-critical-thinking/references/self_verification.md +388 -0
  75. package/skills/skills/framework-critical-thinking/references/uncertainty_quantifier.md +207 -0
  76. package/skills/skills/framework-initiative/SKILL.md +231 -0
  77. package/skills/skills/framework-initiative/references/examples.md +150 -0
  78. package/skills/skills/framework-initiative/references/impact-analysis.md +157 -0
  79. package/skills/skills/framework-initiative/references/intent-patterns.md +145 -0
  80. package/skills/skills/framework-initiative/references/star-framework.md +165 -0
  81. package/skills/skills/humanize-docs/SKILL.md +203 -0
  82. package/skills/skills/humanize-docs/references/advanced-techniques.md +13 -0
  83. package/skills/skills/humanize-docs/references/core-transformations.md +368 -0
  84. package/skills/skills/humanize-docs/references/detection-patterns.md +400 -0
  85. package/skills/skills/humanize-docs/references/examples-gallery.md +374 -0
  86. package/skills/skills/imagine/SKILL.md +190 -0
  87. package/skills/skills/imagine/references/artstyle-corporate-memphis.md +625 -0
  88. package/skills/skills/imagine/references/artstyle-crewdson-hyperrealism.md +295 -0
  89. package/skills/skills/imagine/references/artstyle-iphone-social-media.md +426 -0
  90. package/skills/skills/imagine/references/artstyle-sciencesaru.md +276 -0
  91. package/skills/skills/pre-deploy-checklist/README.md +26 -0
  92. package/skills/skills/pre-deploy-checklist/SKILL.md +153 -0
  93. package/skills/skills/pre-deploy-checklist/references/checklist-categories.md +174 -0
  94. package/skills/skills/pre-deploy-checklist/references/domain-prompts.md +216 -0
  95. package/skills/skills/prompt-engineering/SKILL.md +209 -0
  96. package/skills/skills/prompt-engineering/references/advanced-combinations.md +444 -0
  97. package/skills/skills/prompt-engineering/references/chain-of-thought.md +140 -0
  98. package/skills/skills/prompt-engineering/references/decision_matrix.md +220 -0
  99. package/skills/skills/prompt-engineering/references/few-shot.md +346 -0
  100. package/skills/skills/prompt-engineering/references/json-format.md +270 -0
  101. package/skills/skills/prompt-engineering/references/natural-language.md +420 -0
  102. package/skills/skills/prompt-engineering/references/pitfalls.md +365 -0
  103. package/skills/skills/prompt-engineering/references/prompt-chaining.md +498 -0
  104. package/skills/skills/prompt-engineering/references/react.md +108 -0
  105. package/skills/skills/prompt-engineering/references/self-consistency.md +322 -0
  106. package/skills/skills/prompt-engineering/references/tree-of-thoughts.md +386 -0
  107. package/skills/skills/prompt-engineering/references/xml-format.md +220 -0
  108. package/skills/skills/prompt-engineering/references/yaml-format.md +488 -0
  109. package/skills/skills/prompt-engineering/references/zero-shot.md +74 -0
  110. package/skills/skills/quick-spec/SKILL.md +280 -0
  111. package/skills/skills/quick-spec/assets/tech-spec-template.md +74 -0
  112. package/skills/skills/quick-spec/references/step-01-understand.md +189 -0
  113. package/skills/skills/quick-spec/references/step-02-investigate.md +144 -0
  114. package/skills/skills/quick-spec/references/step-03-generate.md +128 -0
  115. package/skills/skills/quick-spec/references/step-04-review.md +173 -0
  116. package/skills/skills/quick-spec/tests/__pycache__/test_skill.cpython-314-pytest-9.0.2.pyc +0 -0
  117. package/skills/skills/quick-spec/tests/test_scenarios.md +83 -0
  118. package/skills/skills/quick-spec/tests/test_skill.py +136 -0
  119. package/skills/skills/readme-expert/SKILL.md +538 -0
  120. package/skills/skills/readme-expert/knowledge/INDEX.md +192 -0
  121. package/skills/skills/readme-expert/knowledge/application/quality-standards.md +470 -0
  122. package/skills/skills/readme-expert/knowledge/application/script-executor.md +604 -0
  123. package/skills/skills/readme-expert/knowledge/application/template-library.md +822 -0
  124. package/skills/skills/readme-expert/knowledge/foundation/codebase-scanner.md +361 -0
  125. package/skills/skills/readme-expert/knowledge/foundation/validation-checklist.md +481 -0
  126. package/skills/skills/red-teaming/SKILL.md +321 -0
  127. package/skills/skills/red-teaming/references/ai-llm-redteam.md +517 -0
  128. package/skills/skills/red-teaming/references/attack-techniques.md +410 -0
  129. package/skills/skills/red-teaming/references/cybersecurity-redteam.md +383 -0
  130. package/skills/skills/red-teaming/references/tools-frameworks.md +446 -0
  131. package/skills/skills/releasing/.skillkit-mode +1 -0
  132. package/skills/skills/releasing/SKILL.md +225 -0
  133. package/skills/skills/releasing/references/version-detection.md +108 -0
  134. package/skills/skills/screenwriter/SKILL.md +273 -0
  135. package/skills/skills/screenwriter/references/advanced-techniques.md +216 -0
  136. package/skills/skills/screenwriter/references/pipeline-integration.md +266 -0
  137. package/skills/skills/skillkit/.claude/settings.local.json +7 -0
  138. package/skills/skills/skillkit/.claude-plugin/plugin.json +27 -0
  139. package/skills/skills/skillkit/CHANGELOG.md +484 -0
  140. package/skills/skills/skillkit/SKILL.md +511 -0
  141. package/skills/skills/skillkit/commands/skillkit.md +6 -0
  142. package/skills/skills/skillkit/commands/validate-plan.md +6 -0
  143. package/skills/skills/skillkit/commands/verify.md +6 -0
  144. package/skills/skills/skillkit/knowledge/INDEX.md +352 -0
  145. package/skills/skills/skillkit/knowledge/application/09-case-studies.md +257 -0
  146. package/skills/skills/skillkit/knowledge/application/10-technical-architecture.md +324 -0
  147. package/skills/skills/skillkit/knowledge/application/11-adoption-strategy.md +267 -0
  148. package/skills/skills/skillkit/knowledge/application/12-testing-and-validation.md +276 -0
  149. package/skills/skills/skillkit/knowledge/application/13-competitive-landscape.md +198 -0
  150. package/skills/skills/skillkit/knowledge/foundation/01-why-skills-exist.md +246 -0
  151. package/skills/skills/skillkit/knowledge/foundation/02-skills-vs-subagents-comparison.md +312 -0
  152. package/skills/skills/skillkit/knowledge/foundation/03-skills-vs-subagents-decision-tree.md +346 -0
  153. package/skills/skills/skillkit/knowledge/foundation/04-hybrid-patterns.md +308 -0
  154. package/skills/skills/skillkit/knowledge/foundation/05-token-economics.md +275 -0
  155. package/skills/skills/skillkit/knowledge/foundation/06-platform-constraints.md +237 -0
  156. package/skills/skills/skillkit/knowledge/foundation/07-security-concerns.md +322 -0
  157. package/skills/skills/skillkit/knowledge/foundation/08-when-not-to-use-skills.md +270 -0
  158. package/skills/skills/skillkit/knowledge/plugin-guide.md +614 -0
  159. package/skills/skills/skillkit/knowledge/tools/14-validation-tools-guide.md +150 -0
  160. package/skills/skills/skillkit/knowledge/tools/15-cost-tools-guide.md +157 -0
  161. package/skills/skills/skillkit/knowledge/tools/16-security-tools-guide.md +122 -0
  162. package/skills/skills/skillkit/knowledge/tools/17-pattern-tools-guide.md +161 -0
  163. package/skills/skills/skillkit/knowledge/tools/18-decision-helper-guide.md +243 -0
  164. package/skills/skills/skillkit/knowledge/tools/19-test-generator-guide.md +275 -0
  165. package/skills/skills/skillkit/knowledge/tools/20-split-skill-guide.md +149 -0
  166. package/skills/skills/skillkit/knowledge/tools/21-quality-scorer-guide.md +226 -0
  167. package/skills/skills/skillkit/knowledge/tools/22-migration-helper-guide.md +356 -0
  168. package/skills/skills/skillkit/knowledge/tools/23-subagent-creation-guide.md +448 -0
  169. package/skills/skills/skillkit/knowledge/tools/24-behavioral-testing-guide.md +122 -0
  170. package/skills/skills/skillkit/references/proposal-generation.md +982 -0
  171. package/skills/skills/skillkit/references/rationalization-catalog.md +75 -0
  172. package/skills/skills/skillkit/references/research-methodology.md +661 -0
  173. package/skills/skills/skillkit/references/section-2-full-creation-workflow.md +452 -0
  174. package/skills/skills/skillkit/references/section-3-validation-workflow-existing-skill.md +63 -0
  175. package/skills/skills/skillkit/references/section-4-decision-workflow-skills-vs-subagents.md +64 -0
  176. package/skills/skills/skillkit/references/section-5-migration-workflow-doc-to-skill.md +58 -0
  177. package/skills/skills/skillkit/references/section-6-subagent-creation-workflow.md +499 -0
  178. package/skills/skills/skillkit/references/section-7-knowledge-reference-map.md +72 -0
  179. package/skills/skills/skillkit/scripts/__pycache__/decision_helper.cpython-314.pyc +0 -0
  180. package/skills/skills/skillkit/scripts/__pycache__/quick_validate.cpython-312.pyc +0 -0
  181. package/skills/skills/skillkit/scripts/__pycache__/quick_validate.cpython-314.pyc +0 -0
  182. package/skills/skills/skillkit/scripts/__pycache__/test_generator.cpython-314-pytest-9.0.2.pyc +0 -0
  183. package/skills/skills/skillkit/scripts/decision_helper.py +799 -0
  184. package/skills/skills/skillkit/scripts/init_skill.py +400 -0
  185. package/skills/skills/skillkit/scripts/init_subagent.py +231 -0
  186. package/skills/skills/skillkit/scripts/migration_helper.py +669 -0
  187. package/skills/skills/skillkit/scripts/package_skill.py +211 -0
  188. package/skills/skills/skillkit/scripts/pattern_detector.py +381 -0
  189. package/skills/skills/skillkit/scripts/pattern_detector_new.py +382 -0
  190. package/skills/skills/skillkit/scripts/pressure_tester.py +157 -0
  191. package/skills/skills/skillkit/scripts/quality_scorer.py +999 -0
  192. package/skills/skills/skillkit/scripts/quick_validate.py +100 -0
  193. package/skills/skills/skillkit/scripts/security_scanner.py +474 -0
  194. package/skills/skills/skillkit/scripts/split_skill.py +540 -0
  195. package/skills/skills/skillkit/scripts/test_generator.py +695 -0
  196. package/skills/skills/skillkit/scripts/token_estimator.py +493 -0
  197. package/skills/skills/skillkit/scripts/utils/__init__.py +49 -0
  198. package/skills/skills/skillkit/scripts/utils/__pycache__/__init__.cpython-312.pyc +0 -0
  199. package/skills/skills/skillkit/scripts/utils/__pycache__/__init__.cpython-314.pyc +0 -0
  200. package/skills/skills/skillkit/scripts/utils/__pycache__/budget_tracker.cpython-312.pyc +0 -0
  201. package/skills/skills/skillkit/scripts/utils/__pycache__/budget_tracker.cpython-314.pyc +0 -0
  202. package/skills/skills/skillkit/scripts/utils/__pycache__/output_formatter.cpython-312.pyc +0 -0
  203. package/skills/skills/skillkit/scripts/utils/__pycache__/output_formatter.cpython-314.pyc +0 -0
  204. package/skills/skills/skillkit/scripts/utils/__pycache__/reference_validator.cpython-312.pyc +0 -0
  205. package/skills/skills/skillkit/scripts/utils/__pycache__/reference_validator.cpython-314.pyc +0 -0
  206. package/skills/skills/skillkit/scripts/utils/budget_tracker.py +388 -0
  207. package/skills/skills/skillkit/scripts/utils/output_formatter.py +263 -0
  208. package/skills/skills/skillkit/scripts/utils/reference_validator.py +401 -0
  209. package/skills/skills/skillkit/scripts/validate_skill.py +594 -0
  210. package/skills/skills/skillkit/tests/test_behavioral.py +39 -0
  211. package/skills/skills/skillkit/tests/test_scenarios.md +83 -0
  212. package/skills/skills/skillkit/tests/test_skill.py +136 -0
  213. package/skills/skills/skillkit-help/SKILL.md +81 -0
  214. package/skills/skills/skillkit-help/knowledge/application/09-case-studies.md +257 -0
  215. package/skills/skills/skillkit-help/knowledge/application/12-testing-and-validation.md +276 -0
  216. package/skills/skills/skillkit-help/knowledge/foundation/01-why-skills-exist.md +246 -0
  217. package/skills/skills/skillkit-help/knowledge/foundation/02-skills-vs-subagents-comparison.md +312 -0
  218. package/skills/skills/skillkit-help/knowledge/foundation/03-skills-vs-subagents-decision-tree.md +346 -0
  219. package/skills/skills/skillkit-help/knowledge/foundation/06-platform-constraints.md +237 -0
  220. package/skills/skills/skillkit-help/knowledge/foundation/08-when-not-to-use-skills.md +270 -0
  221. package/skills/skills/skillkit-help/template/SKILL.md +52 -0
  222. package/skills/skills/social-media-seo/SKILL.md +278 -0
  223. package/skills/skills/social-media-seo/databases/caption-styles.csv +31 -0
  224. package/skills/skills/social-media-seo/databases/engagement-tactics.csv +16 -0
  225. package/skills/skills/social-media-seo/databases/hashtag-strategies.csv +21 -0
  226. package/skills/skills/social-media-seo/databases/hook-formulas.csv +26 -0
  227. package/skills/skills/social-media-seo/databases/keyword-clusters.csv +11 -0
  228. package/skills/skills/social-media-seo/databases/thread-structures.csv +26 -0
  229. package/skills/skills/social-media-seo/databases/viral-patterns.csv +21 -0
  230. package/skills/skills/social-media-seo/references/analytics-guide.md +321 -0
  231. package/skills/skills/social-media-seo/references/instagram-seo.md +235 -0
  232. package/skills/skills/social-media-seo/references/threads-seo.md +305 -0
  233. package/skills/skills/social-media-seo/references/x-twitter-seo.md +337 -0
  234. package/skills/skills/social-media-seo/scripts/query_database.py +191 -0
  235. package/skills/skills/storyteller/SKILL.md +241 -0
  236. package/skills/skills/storyteller/references/transformation-methodology.md +293 -0
  237. package/skills/skills/storyteller/references/visual-vocabulary.md +177 -0
  238. package/skills/skills/thread-pro/SKILL.md +162 -0
  239. package/skills/skills/thread-pro/anti-ai-patterns.md +120 -0
  240. package/skills/skills/thread-pro/hook-formulas.md +138 -0
  241. package/skills/skills/thread-pro/references/anti-ai-patterns.md +120 -0
  242. package/skills/skills/thread-pro/references/hook-formulas.md +138 -0
  243. package/skills/skills/thread-pro/references/thread-structures.md +240 -0
  244. package/skills/skills/thread-pro/references/voice-injection.md +130 -0
  245. package/skills/skills/thread-pro/thread-structures.md +240 -0
  246. package/skills/skills/thread-pro/voice-injection.md +130 -0
  247. package/skills/skills/tinkering/SKILL.md +251 -0
  248. package/skills/skills/tinkering/references/graduation-checklist.md +100 -0
  249. package/skills/skills/validate-plan/.skillkit-mode +1 -0
  250. package/skills/skills/validate-plan/SKILL.md +406 -0
  251. package/skills/skills/validate-plan/references/dry-principles.md +251 -0
  252. package/skills/skills/validate-plan/references/gap-analysis-guide.md +320 -0
  253. package/skills/skills/validate-plan/references/tdd-patterns.md +413 -0
  254. package/skills/skills/validate-plan/references/yagni-checklist.md +330 -0
  255. package/skills/skills/verify-before-ship/.skillkit-mode +1 -0
  256. package/skills/skills/verify-before-ship/SKILL.md +116 -0
  257. package/skills/skills/verify-before-ship/references/anti-rationalization.md +212 -0
  258. package/skills/skills/verify-before-ship/references/verification-gates.md +305 -0
  259. package/skills-manifest.json +8 -2
  260. package/src/banner.js +1 -1
  261. package/src/cli.js +15 -4
  262. package/src/install.js +45 -29
  263. package/src/install.test.js +75 -7
  264. package/src/picker.js +15 -4
  265. package/src/picker.test.js +36 -1
  266. package/src/scope.js +8 -39
  267. package/src/scope.test.js +9 -13
  268. package/src/tools.js +76 -0
  269. package/src/tools.test.js +80 -0
@@ -0,0 +1,276 @@
1
+ ---
2
+ title: "Testing & Validation: Quality Assurance for Skills"
3
+ purpose: "Pre-deployment validation, testing frameworks, debugging workflows"
4
+ token_estimate: "2000"
5
+ read_priority: "high"
6
+ read_when:
7
+ - "Before deploying any skill"
8
+ - "User asking 'How do I test this?'"
9
+ - "Debugging skill issues"
10
+ - "Quality assurance planning"
11
+ - "Creating testing checklist"
12
+ related_files:
13
+ must_read_first:
14
+ - "01-why-skills-exist.md"
15
+ read_together:
16
+ - "11-adoption-strategy.md"
17
+ read_next:
18
+ - "14-validation-best-practices.md"
19
+ - "15-cost-optimization-guide.md"
20
+ - "16-security-scanning-guide.md"
21
+ avoid_reading_when:
22
+ - "Still learning concepts (not implementing yet)"
23
+ - "Using only official Anthropic skills"
24
+ last_updated: "2025-11-03"
25
+ ---
26
+
27
+ # Testing & Validation: Quality Assurance for Skills
28
+
29
+ ## I. INTRODUCTION
30
+
31
+ **Why Testing Critical:** Validation failures waste 2-4 hours debugging post-deployment. Pre-deployment testing catches issues early, ensures quality, prevents user frustration.
32
+
33
+ **Testing Philosophy:** Test BEFORE deployment, test WHAT matters (not everything), automate where possible, iterate based on failures.
34
+
35
+ **Scope:** Pre-deployment validation, functional testing, debugging workflows. **For automated scripts:** `14-validation-best-practices.md`. **For security:** `07-security-concerns.md`.
36
+
37
+ ---
38
+
39
+ ## II. PRE-DEPLOYMENT VALIDATION
40
+
41
+ ### A. Structure Validation
42
+
43
+ | Check | Requirement | Status |
44
+ |-------|-------------|--------|
45
+ | **YAML** | Valid frontmatter, required fields | ☐ |
46
+ | **Name** | Max 64 chars, descriptive | ☐ |
47
+ | **Description** | Max 1,024 chars, has triggers | ☐ |
48
+ | **Files** | SKILL.md present, proper structure | ☐ |
49
+ | **Organization** | Progressive disclosure (main + refs) | ☐ |
50
+
51
+ **File Organization:**
52
+ ```
53
+ skill-name/
54
+ SKILL.md # Required, <500 lines
55
+ reference/ # Optional, Level 3 content
56
+ scripts/ # Optional, executables
57
+ ```
58
+
59
+ **For automated validation:** `validate_skill.py` (see `14-validation-best-practices.md`).
60
+
61
+ ### B. Content Quality
62
+
63
+ | Aspect | Good Example | Bad Example |
64
+ |--------|--------------|-------------|
65
+ | **Description** | "Extract PDFs. Use when..." | "PDF tool" |
66
+ | **Triggers** | "convert PDF", "extract text" | Vague wording |
67
+ | **Instructions** | "1. Run X, 2. Verify Y" | "Handle appropriately" |
68
+ | **Examples** | 2-3 inline, realistic | Too many, unrealistic |
69
+ | **Cross-Refs** | Valid file paths | Broken links |
70
+
71
+ **Description Tips:** Include task verbs ("extract", "convert"), add trigger phrases ("Use when"), be specific ("PDF to Word" NOT "documents").
72
+
73
+ ### C. Token Efficiency
74
+
75
+ | Component | Target | Max | Action if Over |
76
+ |-----------|--------|-----|----------------|
77
+ | SKILL.md | 200-350 lines | 500 | Split to refs |
78
+ | Description | 50-150 chars | 1,024 | Condense |
79
+ | Token estimate | ±10% actual | N/A | Recalculate |
80
+
81
+ **Token Formula:** Tokens ≈ Words × 1.3 to 1.5
82
+
83
+ **Progressive Disclosure:** Core in SKILL.md (<500 lines), advanced in reference files, scripts output-only (don't load), examples inline.
84
+
85
+ **For optimization:** `15-cost-optimization-guide.md`
86
+
87
+ ### D. Security Audit
88
+
89
+ | Risk | Check | Vulnerable | Fixed |
90
+ |------|-------|-----------|--------|
91
+ | **Secrets** | No hardcoded keys | `API_KEY="abc"` | `os.getenv()` |
92
+ | **Injection** | No unchecked input | `os.system(input)` | `subprocess.run()` |
93
+ | **Permissions** | Minimal tools | `allowed-tools: [*]` | Specific list |
94
+ | **Network** | Justified access | Unchecked calls | Validate URLs |
95
+
96
+ **Quick Scan:**
97
+ ```bash
98
+ grep -r "API_KEY\s*=" skill-name/ # Hardcoded secrets
99
+ grep -r "os\.system" skill-name/ # Injection risk
100
+ grep -r "eval\|exec" skill-name/ # Code execution
101
+ ```
102
+
103
+ **For comprehensive security:** `07-security-concerns.md` + `16-security-scanning-guide.md`
104
+
105
+ ---
106
+
107
+ ## III. FUNCTIONAL TESTING
108
+
109
+ ### A. Positive Tests (Should Succeed)
110
+
111
+ | Type | Test Case | Expected |
112
+ |------|-----------|----------|
113
+ | **Direct** | "Use PDF skill to extract" | Activates immediately |
114
+ | **Implicit** | "Extract text from PDF" | Detects relevance, activates |
115
+ | **Multi-Skill** | "Extract PDF, analyze Excel" | Both coordinate |
116
+
117
+ **Examples:**
118
+ 1. Direct: "Use data-analysis skill" → Triggers, processes
119
+ 2. Implicit: "Analyze sales data" → Detects keywords, triggers
120
+ 3. Multi-step: "Convert PDF, create charts" → Both skills activate
121
+
122
+ ### B. Negative Tests (Should NOT Trigger)
123
+
124
+ | Type | Test Case | Expected |
125
+ |------|-----------|----------|
126
+ | **Unrelated** | "What's the weather?" | No activation |
127
+ | **Similar Keywords** | "I like to analyze movies" | No false positive |
128
+ | **Wrong Context** | "Email analysis" (Excel skill) | Correct skill triggers |
129
+
130
+ **Examples:**
131
+ 1. Unrelated: "Tell joke about data" → No trigger
132
+ 2. False positive: "Document this process" → No doc-gen trigger (instruction, not task)
133
+ 3. Edge: "Summarize PDF" → Only PDF triggers, not redundant summarization
134
+
135
+ ### C. Integration Tests
136
+
137
+ | Type | Focus | Validation |
138
+ |------|-------|------------|
139
+ | **Skill + Subagent** | Coordination | Both execute, no conflicts |
140
+ | **Multi-Skill** | Sequential | Correct order, data passing |
141
+ | **Tool Access** | Permissions | Allowed work, blocked fail |
142
+ | **Error Handling** | Graceful failures | Valid error messages |
143
+
144
+ **Example:** "Extract PDF, analyze Excel" → Verify PDF first, Excel receives data, both complete.
145
+
146
+ ### D. Performance Tests
147
+
148
+ | Metric | Target | Alert |
149
+ |--------|--------|-------|
150
+ | **Token Usage** | ±10% estimate | >20% variance |
151
+ | **Response Time** | <30 sec | >60 sec |
152
+ | **File Handling** | Works to limit | Crashes |
153
+ | **Error Rate** | <5% | >10% |
154
+
155
+ ---
156
+
157
+ ## IV. DEBUGGING WORKFLOWS
158
+
159
+ ### A. Common Issues
160
+
161
+ | Issue | Solution |
162
+ |-------|----------|
163
+ | **Not Triggering** | Improve description (add trigger keywords) |
164
+ | **Wrong Skill** | Make description more specific |
165
+ | **Script Fails** | Check permissions, validate inputs |
166
+ | **Permission Error** | Add required tool to allowed-tools |
167
+ | **Slow** | Check SKILL.md size, split files |
168
+
169
+ **Decision Tree:**
170
+ ```
171
+ Not working?
172
+ ├─ Not activating? → Fix description, test explicit mention
173
+ ├─ Fails execution? → Check permissions, validate code
174
+ ├─ Wrong output? → Review instructions, add examples
175
+ └─ Slow? → Optimize token usage, split files
176
+ ```
177
+
178
+ ### B. Diagnostic Techniques
179
+
180
+ **1. Description Analysis:**
181
+ ```
182
+ Bad: "Helps with documents"
183
+ Good: "Convert Word/PDF/Excel. Use when processing documents."
184
+ ```
185
+
186
+ **2. Trigger Testing:**
187
+ ```
188
+ Test: "Convert PDF", "Extract text", "Process document", "Use converter"
189
+ → Track which phrases trigger consistently
190
+ ```
191
+
192
+ **3. Permission Check:**
193
+ ```yaml
194
+ allowed-tools:
195
+ - bash_tool # Script execution
196
+ - view # Read files
197
+ - create_file # Output
198
+ ```
199
+
200
+ ### C. Iterative Improvement
201
+
202
+ **5-Step Loop:**
203
+ 1. **Observe:** Document failure (screenshot, error)
204
+ 2. **Hypothesize:** "Description lacks 'convert' keyword"
205
+ 3. **Fix:** Add one keyword (minimal change)
206
+ 4. **Re-Test:** Same case again
207
+ 5. **Validate:** Test 3-5 times (confirm reliability)
208
+
209
+ **Example:**
210
+ ```
211
+ Iteration 1: Not triggering → Add "process" keyword → Works
212
+ Iteration 2: Workflow unclear → Add steps → Completes
213
+ Iteration 3: Fails Word docs → Add example → Both formats work
214
+ ```
215
+
216
+ ### D. Documentation
217
+
218
+ **Test Log:**
219
+
220
+ | Date | Test | Result | Issue | Resolution |
221
+ |------|------|--------|-------|------------|
222
+ | 11-01 | PDF extract | ✅ | None | - |
223
+ | 11-01 | Excel convert | ❌ | Permission | Added `create_file` |
224
+ | 11-02 | Excel convert | ✅ | None | Fixed |
225
+
226
+ **Known Issues:**
227
+ ```
228
+ Issue #1: Slow with large PDFs (>50MB)
229
+ Status: Open | Workaround: Split files | Target: v1.2.0
230
+
231
+ Issue #2: False trigger "analyze"
232
+ Status: Fixed v1.1.0 | Solution: Specific description
233
+ ```
234
+
235
+ ---
236
+
237
+ ## V. QUALITY ASSURANCE FRAMEWORK
238
+
239
+ **Testing Stages:**
240
+
241
+ | Stage | Focus | Pass Criteria |
242
+ |-------|-------|---------------|
243
+ | **Dev** | Basic functionality | All positive tests pass |
244
+ | **Staging** | Integration + edges | 90% pass, no critical issues |
245
+ | **Production** | Real usage | <5% error, satisfaction ≥7/10 |
246
+
247
+ **Sign-Off Checklist:**
248
+
249
+ | Criteria | Required |
250
+ |----------|----------|
251
+ | Validation checks passed | Yes ☐ |
252
+ | Positive tests ≥95% | Yes ☐ |
253
+ | Negative tests ≥95% | Yes ☐ |
254
+ | Security audit done | Yes ☐ |
255
+ | Documentation current | Yes ☐ |
256
+ | Peer review complete | Yes ☐ |
257
+
258
+ **Regression Testing:** Re-run ALL tests after ANY change to SKILL.md, scripts, or references.
259
+
260
+ **Monitoring:** Usage frequency (daily), error rate (<5%), complaints (<3/week). **For setup:** `11-adoption-strategy.md` IV.D.
261
+
262
+ ---
263
+
264
+ ## VI. KEY TAKEAWAYS
265
+
266
+ **Testing Priorities:** Pre-deployment validation prevents disasters (structure + security). Functional testing ensures core works (positive tests) and avoids false positives (negative tests). Performance optimization follows (token usage + speed).
267
+
268
+ **Quality Gates:** Pilot requires validation + positive tests. Team expansion needs integration + negative tests. Production demands performance metrics + security audit completion.
269
+
270
+ **Debugging Strategy:** Quick fixes—check description keywords, verify tool permissions, test explicit mentions. Deep fixes—review SKILL.md clarity, test edge cases systematically, document failure patterns.
271
+
272
+ **Next Steps:** Automation → `14-validation-best-practices.md`. Optimization → `15-cost-optimization-guide.md`. Security → `16-security-scanning-guide.md`. Adoption → `11-adoption-strategy.md`.
273
+
274
+ ---
275
+
276
+ **End of File 12**
@@ -0,0 +1,198 @@
1
+ ---
2
+ title: "Competitive Landscape: Skills vs Alternatives"
3
+ purpose: "Market positioning, feature comparisons, migration guidance"
4
+ token_estimate: "1500"
5
+ read_priority: "low"
6
+ read_when:
7
+ - "User comparing Skills to GPTs/Copilot/MCP"
8
+ - "User evaluating alternatives"
9
+ - "Migrating from other platforms"
10
+ - "Building business case"
11
+ - "Strategic positioning questions"
12
+ related_files:
13
+ must_read_first:
14
+ - "01-why-skills-exist.md"
15
+ read_together:
16
+ - "09-case-studies.md"
17
+ read_next: []
18
+ avoid_reading_when:
19
+ - "Already committed to Skills"
20
+ - "Not evaluating alternatives"
21
+ - "Pure implementation questions"
22
+ last_updated: "2025-11-02"
23
+ ---
24
+
25
+ # Competitive Landscape: Skills vs Alternatives
26
+
27
+ ## I. INTRODUCTION
28
+
29
+ **Objective comparison** for informed decision-making. Feature-by-feature analysis with honest trade-offs, no marketing fluff.
30
+
31
+ **Evaluation framework:** Technical capabilities, use case alignment, migration paths, integration patterns
32
+
33
+ ---
34
+
35
+ ## II. SKILLS VS GPTs (OPENAI)
36
+
37
+ **Core Difference:** Developer-centric control (Skills) vs Consumer marketplace (GPTs)
38
+
39
+ ### Feature Comparison
40
+
41
+ | Dimension | Claude Skills | OpenAI GPTs | Winner |
42
+ |-----------|---------------|-------------|---------|
43
+ | **Composability** | Multiple Skills simultaneously | Single GPT per chat | Skills |
44
+ | **Customization** | Full filesystem, scripts, references | Instructions + files only | Skills |
45
+ | **Version Control** | Git-friendly (files/folders) | Web UI only | Skills |
46
+ | **Token Efficiency** | Progressive disclosure (30-50T overhead) | All instructions loaded | Skills |
47
+ | **Code Execution** | Full bash/Python/npm | Function calling only | Skills |
48
+ | **Marketplace** | Not available (planned) | Active marketplace | GPTs |
49
+ | **Learning Curve** | Steeper (developer skills) | Easier (GUI) | GPTs |
50
+ | **Governance** | Code-based, auditable | Web UI (harder to track) | Skills |
51
+ | **Modularity** | Reusable across projects | Tied to GPT instance | Skills |
52
+
53
+ **Score: Skills 7 | GPTs 2**
54
+
55
+ ### When to Choose
56
+
57
+ **Skills:** Version control, composability, deep customization, token efficiency, governance, developer teams
58
+ **GPTs:** Non-technical users, public distribution, rapid prototyping, ChatGPT ecosystem investment
59
+
60
+ ### Migration: GPTs → Skills
61
+
62
+ 1. Export GPT instructions → `SKILL.md` body
63
+ 2. Add YAML frontmatter (name, description)
64
+ 3. Uploaded files → `references/` folder
65
+ 4. Actions → `scripts/` (bash/Python)
66
+ 5. Test progressive disclosure, implement version control
67
+
68
+ **Effort:** 2-4 hours per GPT
69
+
70
+ ---
71
+
72
+ ## III. SKILLS VS COPILOT STUDIO (MICROSOFT)
73
+
74
+ **Core Difference:** Code-first transparency (Skills) vs Low-code visual builder (Copilot)
75
+
76
+ ### Feature Comparison
77
+
78
+ | Dimension | Claude Skills | Microsoft Copilot Studio | Winner |
79
+ |-----------|---------------|--------------------------|---------|
80
+ | **Development** | Code-first (markdown, files) | Low-code visual builder | Depends |
81
+ | **Microsoft 365** | Not integrated | Deep native integration | Copilot |
82
+ | **Ecosystem Lock-in** | Claude only | Heavy Microsoft dependency | Skills |
83
+ | **Transparency** | Full code visibility | Visual flows (less clear) | Skills |
84
+ | **Version Control** | Native Git support | Limited versioning | Skills |
85
+ | **Cost** | Included in subscription | Separate complex licensing | Skills |
86
+ | **Learning Curve** | Developer skills needed | Accessible to non-devs | Copilot |
87
+ | **Reproducibility** | Code = documentation | Visual flows ambiguous | Skills |
88
+
89
+ **Score: Skills 5 | Copilot 2 | Depends 1**
90
+
91
+ ### When to Choose
92
+
93
+ **Skills:** Code-first control, version control, transparency, avoid lock-in, developer teams, Claude ecosystem
94
+ **Copilot:** Heavy Microsoft 365 investment, non-developer teams, low-code preference, Microsoft AI standardization
95
+
96
+ ### Migration: Copilot → Skills
97
+
98
+ 1. Document Copilot flow logic (no direct export)
99
+ 2. Map flows → markdown instructions
100
+ 3. Recreate triggers → description
101
+ 4. Connectors → scripts (API calls)
102
+ 5. Knowledge sources → reference files
103
+
104
+ **Effort:** 4-8 hours per Copilot (manual reconstruction)
105
+
106
+ ---
107
+
108
+ ## IV. SKILLS VS TRADITIONAL RAG
109
+
110
+ **Core Difference:** Progressive disclosure (Skills) vs Retrieval-based augmentation (RAG)
111
+
112
+ ### Feature Comparison
113
+
114
+ | Dimension | Claude Skills | Traditional RAG | Winner |
115
+ |-----------|---------------|-----------------|---------|
116
+ | **Token Efficiency** | 30-50T overhead, load on-demand | 100+T minimum retrieval | Skills |
117
+ | **Content Structure** | Organized files/folders/levels | Flat vector embeddings | Skills |
118
+ | **Code Execution** | Native (scripts, bash, Python) | Not applicable | Skills |
119
+ | **Deterministic** | Predictable file loading | Semantic search variability | Skills |
120
+ | **Maintenance** | Update files (version control) | Re-embed corpus | Skills |
121
+ | **Cost** | Storage + token usage | Embedding + vector DB + retrieval | Skills |
122
+ | **Query Flexibility** | Predetermined structure | Semantic similarity search | RAG |
123
+ | **Knowledge Freshness** | Manual update | Real-time retrieval possible | RAG |
124
+
125
+ **Score: Skills 6 | RAG 2**
126
+
127
+ ### When to Choose
128
+
129
+ **Skills:** Structured knowledge (procedures, templates), deterministic behavior, code execution, token efficiency, version control
130
+ **RAG:** Large unstructured knowledge bases, semantic search, constantly changing data, unpredictable queries
131
+
132
+ ### Hybrid: Skills + RAG
133
+
134
+ **Pattern:** Skills (procedural how-to) + RAG (factual knowledge)
135
+
136
+ **Example:**
137
+ ```
138
+ User: "Create Q3 report per company template"
139
+ → Skill: Load report procedures + templates
140
+ → RAG: Retrieve Q3 financial data
141
+ → Skill: Apply formatting + validation
142
+ → Output: Formatted report
143
+ ```
144
+
145
+ **Use when:** Structured processes applied to unstructured data
146
+
147
+ ---
148
+
149
+ ## V. SKILLS VS MCP (MODEL CONTEXT PROTOCOL)
150
+
151
+ **Key Insight:** COMPLEMENTARY, not competitive
152
+
153
+ ### Comparison
154
+
155
+ | Dimension | Claude Skills | MCP |
156
+ |-----------|---------------|-----|
157
+ | **Function** | Procedural "how-to" knowledge | External data/tool access |
158
+ | **Content** | Instructions, templates, procedures | Real-time data, API responses |
159
+ | **Storage** | Filesystem (bundled) | External systems (servers) |
160
+ | **Use Case** | "How to do X" | "Get data from Y" |
161
+ | **Offline** | Yes (files bundled) | No (server required) |
162
+ | **Maintenance** | Update skill files | Update server endpoints |
163
+
164
+ ### Integration: Skills + MCP
165
+
166
+ **Architecture:**
167
+ ```
168
+ User Query → Claude loads Skill (procedures) → Skill calls MCP (data) → Apply procedures to data → Output
169
+ ```
170
+
171
+ **Use Cases:**
172
+ 1. **Support Skill + CRM MCP:** Support procedures + real-time customer data
173
+ 2. **Analysis Skill + Market MCP:** Valuation procedures + live stock prices
174
+ 3. **Content Skill + Assets MCP:** Brand guidelines + current logo files
175
+
176
+ ### When to Use
177
+
178
+ **Skills alone:** All knowledge bundled, offline operation, deterministic behavior
179
+ **MCP alone:** Only data retrieval, simple tool access, real-time data
180
+ **Skills + MCP:** Procedural knowledge needs external data, complex workflows, governance for both
181
+
182
+ ---
183
+
184
+ ## VI. KEY TAKEAWAYS
185
+
186
+ **Skills Positioning:** Excel at procedural knowledge encoding with code execution in developer environments. Best for version control, composability, token efficiency, and deterministic behavior. Complementary to GUI tools (GPTs, Copilot), RAG (unstructured knowledge), and MCP (external data).
187
+
188
+ **When Skills Win:** Developer teams, code-first workflows, version control needs, composable multi-skill scenarios, token optimization requirements. Migration from GPTs straightforward (2-4 hours), Copilot requires reconstruction (4-8 hours).
189
+
190
+ **Integration Pattern:** Skills orchestrate procedural knowledge + RAG (unstructured data) + MCP (external sources) + platform features. Hybrid architectures leverage each tool's strengths rather than forcing single-solution approaches.
191
+
192
+ **Adoption Prerequisites:** Developer team comfort with code-first, existing version control workflows, well-structured knowledge, clear procedural use cases, maintenance commitment. Success correlates strongly with prerequisite coverage.
193
+
194
+ **Next Steps:** ROI validation → `09-case-studies.md`. Strategic foundation → `01-why-skills-exist.md`. Implementation → `11-adoption-strategy.md`.
195
+
196
+ ---
197
+
198
+ **File Status:** ✅ Objective analysis, no marketing bias | **Updated:** 2025-11-02