bobo-ai-cli 1.0.2 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (165) hide show
  1. package/bundled-skills/Skill_Seekers/SKILL.md +1722 -0
  2. package/bundled-skills/ab-test-setup/SKILL.md +557 -0
  3. package/bundled-skills/adversarial-verification/SKILL.md +95 -0
  4. package/bundled-skills/agent-sdk-dev/SKILL.md +238 -0
  5. package/bundled-skills/agent-tools/SKILL.md +136 -0
  6. package/bundled-skills/analytics-tracking/SKILL.md +597 -0
  7. package/bundled-skills/artifacts-builder/SKILL.md +89 -0
  8. package/bundled-skills/asana/SKILL.md +13 -0
  9. package/bundled-skills/backend-expert/SKILL.md +97 -0
  10. package/bundled-skills/brand-voice/SKILL.md +481 -0
  11. package/bundled-skills/browser-use/SKILL.md +419 -0
  12. package/bundled-skills/cache-optimization-skill/SKILL.md +179 -0
  13. package/bundled-skills/canvas-design/SKILL.md +147 -0
  14. package/bundled-skills/citation-validator/SKILL.md +203 -0
  15. package/bundled-skills/clangd-lsp/SKILL.md +52 -0
  16. package/bundled-skills/code-review/SKILL.md +280 -0
  17. package/bundled-skills/code-review-expert/SKILL.md +85 -0
  18. package/bundled-skills/code-simplifier/SKILL.md +13 -0
  19. package/bundled-skills/commit-commands/SKILL.md +258 -0
  20. package/bundled-skills/competitor-alternatives/SKILL.md +795 -0
  21. package/bundled-skills/content-atomizer/SKILL.md +910 -0
  22. package/bundled-skills/content-research-writer/SKILL.md +605 -0
  23. package/bundled-skills/context-budget-analyzer/SKILL.md +76 -0
  24. package/bundled-skills/context-compressor/SKILL.md +75 -0
  25. package/bundled-skills/context-optimization-suite/SKILL.md +162 -0
  26. package/bundled-skills/context7/SKILL.md +13 -0
  27. package/bundled-skills/copy-editing/SKILL.md +494 -0
  28. package/bundled-skills/copywriting/SKILL.md +510 -0
  29. package/bundled-skills/csharp-lsp/SKILL.md +40 -0
  30. package/bundled-skills/decision-making-framework/SKILL.md +154 -0
  31. package/bundled-skills/deep-research/SKILL.md +236 -0
  32. package/bundled-skills/developer-growth-analysis/SKILL.md +335 -0
  33. package/bundled-skills/direct-response-copy/SKILL.md +2336 -0
  34. package/bundled-skills/docker-expert/SKILL.md +229 -0
  35. package/bundled-skills/document-skills/SKILL.md +13 -0
  36. package/bundled-skills/documentation-expert/SKILL.md +126 -0
  37. package/bundled-skills/email-sequence/SKILL.md +1061 -0
  38. package/bundled-skills/email-sequences/SKILL.md +910 -0
  39. package/bundled-skills/example-plugin/SKILL.md +72 -0
  40. package/bundled-skills/explanatory-output-style/SKILL.md +82 -0
  41. package/bundled-skills/feature-dev/SKILL.md +458 -0
  42. package/bundled-skills/file-organizer/SKILL.md +466 -0
  43. package/bundled-skills/firebase.disabled/SKILL.md +13 -0
  44. package/bundled-skills/form-cro/SKILL.md +488 -0
  45. package/bundled-skills/free-tool-strategy/SKILL.md +636 -0
  46. package/bundled-skills/frontend-design/SKILL.md +41 -0
  47. package/bundled-skills/frontend-design-offical/SKILL.md +55 -0
  48. package/bundled-skills/frontend-expert/SKILL.md +93 -0
  49. package/bundled-skills/github/SKILL.md +13 -0
  50. package/bundled-skills/gitlab/SKILL.md +13 -0
  51. package/bundled-skills/gopls-lsp/SKILL.md +32 -0
  52. package/bundled-skills/got-controller/SKILL.md +218 -0
  53. package/bundled-skills/greptile/SKILL.md +72 -0
  54. package/bundled-skills/high-agency/SKILL.md +473 -0
  55. package/bundled-skills/high-agency/references/builder-patterns.md +126 -0
  56. package/bundled-skills/high-agency/references/recovery-playbook.md +298 -0
  57. package/bundled-skills/hookify/SKILL.md +376 -0
  58. package/bundled-skills/image-editor/SKILL.md +189 -0
  59. package/bundled-skills/image-enhancer/SKILL.md +109 -0
  60. package/bundled-skills/jdtls-lsp/SKILL.md +49 -0
  61. package/bundled-skills/json-canvas/SKILL.md +654 -0
  62. package/bundled-skills/keyword-research/SKILL.md +559 -0
  63. package/bundled-skills/kotlin-lsp/SKILL.md +28 -0
  64. package/bundled-skills/laravel-boost/SKILL.md +13 -0
  65. package/bundled-skills/launch-strategy/SKILL.md +394 -0
  66. package/bundled-skills/lead-magnet/SKILL.md +393 -0
  67. package/bundled-skills/learning-output-style/SKILL.md +106 -0
  68. package/bundled-skills/linear/SKILL.md +13 -0
  69. package/bundled-skills/lua-lsp/SKILL.md +47 -0
  70. package/bundled-skills/marketing-ideas/SKILL.md +720 -0
  71. package/bundled-skills/marketing-psychology/SKILL.md +534 -0
  72. package/bundled-skills/mcp-builder/SKILL.md +369 -0
  73. package/bundled-skills/meeting-insights-analyzer/SKILL.md +347 -0
  74. package/bundled-skills/memory-evolution-system/SKILL.md +172 -0
  75. package/bundled-skills/memory-manager/SKILL.md +214 -0
  76. package/bundled-skills/memory-manager/references/advanced-config.md +65 -0
  77. package/bundled-skills/multi-lens-thinking/SKILL.md +407 -0
  78. package/bundled-skills/nano-banana-pro/SKILL.md +116 -0
  79. package/bundled-skills/newsletter/SKILL.md +736 -0
  80. package/bundled-skills/notebooklm/SKILL.md +296 -0
  81. package/bundled-skills/obsidian-bases/SKILL.md +634 -0
  82. package/bundled-skills/obsidian-markdown/SKILL.md +651 -0
  83. package/bundled-skills/onboarding-cro/SKILL.md +494 -0
  84. package/bundled-skills/orchestrator/SKILL.md +681 -0
  85. package/bundled-skills/page-cro/SKILL.md +379 -0
  86. package/bundled-skills/paid-ads/SKILL.md +624 -0
  87. package/bundled-skills/paywall-upgrade-cro/SKILL.md +651 -0
  88. package/bundled-skills/php-lsp/SKILL.md +36 -0
  89. package/bundled-skills/planning-with-files/SKILL.md +193 -0
  90. package/bundled-skills/playwright/SKILL.md +13 -0
  91. package/bundled-skills/plugin-dev/SKILL.md +434 -0
  92. package/bundled-skills/popup-cro/SKILL.md +520 -0
  93. package/bundled-skills/positioning-angles/SKILL.md +330 -0
  94. package/bundled-skills/pr-review-toolkit/SKILL.md +359 -0
  95. package/bundled-skills/pricing-strategy/SKILL.md +777 -0
  96. package/bundled-skills/proactive-self-improving/SKILL.md +435 -0
  97. package/bundled-skills/programmatic-seo/SKILL.md +714 -0
  98. package/bundled-skills/pyright-lsp/SKILL.md +43 -0
  99. package/bundled-skills/quality-assurance-framework/SKILL.md +168 -0
  100. package/bundled-skills/question-refiner/SKILL.md +160 -0
  101. package/bundled-skills/ralph-loop/SKILL.md +205 -0
  102. package/bundled-skills/refactoring-expert/SKILL.md +103 -0
  103. package/bundled-skills/referral-program/SKILL.md +668 -0
  104. package/bundled-skills/research-executor/SKILL.md +164 -0
  105. package/bundled-skills/review-with-security/SKILL.md +12 -0
  106. package/bundled-skills/rust-analyzer-lsp/SKILL.md +50 -0
  107. package/bundled-skills/schema-markup/SKILL.md +647 -0
  108. package/bundled-skills/security-audit-expert/SKILL.md +124 -0
  109. package/bundled-skills/security-expert/SKILL.md +140 -0
  110. package/bundled-skills/security-guidance/SKILL.md +13 -0
  111. package/bundled-skills/seedance-prompt/SKILL.md +139 -0
  112. package/bundled-skills/self-evolution/SKILL.md +1160 -0
  113. package/bundled-skills/seo-audit/SKILL.md +432 -0
  114. package/bundled-skills/seo-content/SKILL.md +787 -0
  115. package/bundled-skills/serena/SKILL.md +13 -0
  116. package/bundled-skills/signup-flow-cro/SKILL.md +409 -0
  117. package/bundled-skills/skill-creator/SKILL.md +220 -0
  118. package/bundled-skills/skill-manager/SKILL.md +226 -0
  119. package/bundled-skills/skill-share/SKILL.md +98 -0
  120. package/bundled-skills/slack/SKILL.md +13 -0
  121. package/bundled-skills/social-content/SKILL.md +878 -0
  122. package/bundled-skills/spec-flow-skill/SKILL.md +124 -0
  123. package/bundled-skills/stripe/SKILL.md +13 -0
  124. package/bundled-skills/supabase/SKILL.md +13 -0
  125. package/bundled-skills/swift-lsp/SKILL.md +40 -0
  126. package/bundled-skills/synthesizer/SKILL.md +236 -0
  127. package/bundled-skills/template-skill/SKILL.md +16 -0
  128. package/bundled-skills/testing-expert/SKILL.md +99 -0
  129. package/bundled-skills/theme-factory/SKILL.md +72 -0
  130. package/bundled-skills/tiktok-research/SKILL.md +208 -0
  131. package/bundled-skills/typescript-lsp/SKILL.md +36 -0
  132. package/bundled-skills/ui-ux-pro-max/SKILL.md +247 -0
  133. package/bundled-skills/verify/SKILL.md +15 -0
  134. package/bundled-skills/visual-prompt-engineer/SKILL.md +102 -0
  135. package/bundled-skills/webapp-testing/SKILL.md +111 -0
  136. package/bundled-skills/wide-research/SKILL.md +191 -0
  137. package/dist/agent.d.ts +4 -0
  138. package/dist/agent.js +42 -11
  139. package/dist/agent.js.map +1 -1
  140. package/dist/config.js +10 -1
  141. package/dist/config.js.map +1 -1
  142. package/dist/index.js +340 -10
  143. package/dist/index.js.map +1 -1
  144. package/dist/insight.d.ts +5 -0
  145. package/dist/insight.js +80 -0
  146. package/dist/insight.js.map +1 -0
  147. package/dist/sessions.d.ts +29 -0
  148. package/dist/sessions.js +106 -0
  149. package/dist/sessions.js.map +1 -0
  150. package/dist/skills.d.ts +6 -2
  151. package/dist/skills.js +33 -6
  152. package/dist/skills.js.map +1 -1
  153. package/dist/spinner.d.ts +14 -0
  154. package/dist/spinner.js +38 -0
  155. package/dist/spinner.js.map +1 -0
  156. package/dist/sub-agent-runner.d.ts +6 -0
  157. package/dist/sub-agent-runner.js +37 -0
  158. package/dist/sub-agent-runner.js.map +1 -0
  159. package/dist/sub-agents.d.ts +30 -0
  160. package/dist/sub-agents.js +112 -0
  161. package/dist/sub-agents.js.map +1 -0
  162. package/dist/ui.d.ts +2 -1
  163. package/dist/ui.js +37 -27
  164. package/dist/ui.js.map +1 -1
  165. package/package.json +2 -1
@@ -0,0 +1,1722 @@
1
+ ---
2
+ id: "Skill_Seekers"
3
+ title: "Skill Seeker"
4
+ category: "agent-engineering"
5
+ tags: ["skill seeker", "what is skill seeker?", "why use this?", "key features", "📦 now available on pypi!", "quick start", "use the unified cli", "unified cli - simple commands", "install dependencies", "set api key once"]
6
+ triggers: []
7
+ dependencies: []
8
+ source: "E:/Bobo's Coding cache/.claude/skills/Skill_Seekers"
9
+ ---
10
+
11
+ [![MseeP.ai Security Assessment Badge](https://mseep.net/pr/yusufkaraaslan-skill-seekers-badge.png)](https://mseep.ai/app/yusufkaraaslan-skill-seekers)
12
+
13
+ # Skill Seeker
14
+
15
+ [![Version](https://img.shields.io/badge/version-2.4.0-blue.svg)](https://github.com/yusufkaraaslan/Skill_Seekers/releases/tag/v2.4.0)
16
+ [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
17
+ [![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
18
+ [![MCP Integration](https://img.shields.io/badge/MCP-Integrated-blue.svg)](https://modelcontextprotocol.io)
19
+ [![Tested](https://img.shields.io/badge/Tests-427%20Passing-brightgreen.svg)](tests/)
20
+ [![Project Board](https://img.shields.io/badge/Project-Board-purple.svg)](https://github.com/users/yusufkaraaslan/projects/2)
21
+ [![PyPI version](https://badge.fury.io/py/skill-seekers.svg)](https://pypi.org/project/skill-seekers/)
22
+ [![PyPI - Downloads](https://img.shields.io/pypi/dm/skill-seekers.svg)](https://pypi.org/project/skill-seekers/)
23
+ [![PyPI - Python Version](https://img.shields.io/pypi/pyversions/skill-seekers.svg)](https://pypi.org/project/skill-seekers/)
24
+
25
+ **Automatically convert documentation websites, GitHub repositories, and PDFs into Claude AI skills in minutes.**
26
+
27
+ > 📋 **[View Development Roadmap & Tasks](https://github.com/users/yusufkaraaslan/projects/2)** - 134 tasks across 10 categories, pick any to contribute!
28
+
29
+ ## What is Skill Seeker?
30
+
31
+ Skill Seeker is an automated tool that transforms documentation websites, GitHub repositories, and PDF files into production-ready [Claude AI skills](https://www.anthropic.com/news/skills). Instead of manually reading and summarizing documentation, Skill Seeker:
32
+
33
+ 1. **Scrapes** multiple sources (docs, GitHub repos, PDFs) automatically
34
+ 2. **Analyzes** code repositories with deep AST parsing
35
+ 3. **Detects** conflicts between documentation and code implementation
36
+ 4. **Organizes** content into categorized reference files
37
+ 5. **Enhances** with AI to extract best examples and key concepts
38
+ 6. **Packages** everything into an uploadable `.zip` file for Claude
39
+
40
+ **Result:** Get comprehensive Claude skills for any framework, API, or tool in 20-40 minutes instead of hours of manual work.
41
+
42
+ ## Why Use This?
43
+
44
+ - 🎯 **For Developers**: Create skills from documentation + GitHub repos with conflict detection
45
+ - 🎮 **For Game Devs**: Generate skills for game engines (Godot docs + GitHub, Unity, etc.)
46
+ - 🔧 **For Teams**: Combine internal docs + code repositories into single source of truth
47
+ - 📚 **For Learners**: Build comprehensive skills from docs, code examples, and PDFs
48
+ - 🔍 **For Open Source**: Analyze repos to find documentation gaps and outdated examples
49
+
50
+ ## Key Features
51
+
52
+ ### 🌐 Documentation Scraping
53
+
54
+ - ✅ **llms.txt Support** - Automatically detects and uses LLM-ready documentation files (10x faster)
55
+ - ✅ **Universal Scraper** - Works with ANY documentation website
56
+ - ✅ **Smart Categorization** - Automatically organizes content by topic
57
+ - ✅ **Code Language Detection** - Recognizes Python, JavaScript, C++, GDScript, etc.
58
+ - ✅ **8 Ready-to-Use Presets** - Godot, React, Vue, Django, FastAPI, and more
59
+
60
+ ### 📄 PDF Support (**v1.2.0**)
61
+
62
+ - ✅ **Basic PDF Extraction** - Extract text, code, and images from PDF files
63
+ - ✅ **OCR for Scanned PDFs** - Extract text from scanned documents
64
+ - ✅ **Password-Protected PDFs** - Handle encrypted PDFs
65
+ - ✅ **Table Extraction** - Extract complex tables from PDFs
66
+ - ✅ **Parallel Processing** - 3x faster for large PDFs
67
+ - ✅ **Intelligent Caching** - 50% faster on re-runs
68
+
69
+ ### 🐙 GitHub Repository Scraping (**v2.0.0**)
70
+
71
+ - ✅ **Deep Code Analysis** - AST parsing for Python, JavaScript, TypeScript, Java, C++, Go
72
+ - ✅ **API Extraction** - Functions, classes, methods with parameters and types
73
+ - ✅ **Repository Metadata** - README, file tree, language breakdown, stars/forks
74
+ - ✅ **GitHub Issues & PRs** - Fetch open/closed issues with labels and milestones
75
+ - ✅ **CHANGELOG & Releases** - Automatically extract version history
76
+ - ✅ **Conflict Detection** - Compare documented APIs vs actual code implementation
77
+ - ✅ **MCP Integration** - Natural language: "Scrape GitHub repo facebook/react"
78
+
79
+ ### 🔄 Unified Multi-Source Scraping (**NEW - v2.0.0**)
80
+
81
+ - ✅ **Combine Multiple Sources** - Mix documentation + GitHub + PDF in one skill
82
+ - ✅ **Conflict Detection** - Automatically finds discrepancies between docs and code
83
+ - ✅ **Intelligent Merging** - Rule-based or AI-powered conflict resolution
84
+ - ✅ **Transparent Reporting** - Side-by-side comparison with ⚠️ warnings
85
+ - ✅ **Documentation Gap Analysis** - Identifies outdated docs and undocumented features
86
+ - ✅ **Single Source of Truth** - One skill showing both intent (docs) and reality (code)
87
+ - ✅ **Backward Compatible** - Legacy single-source configs still work
88
+
89
+ ### 🔐 Private Config Repositories (**NEW - v2.2.0**)
90
+
91
+ - ✅ **Git-Based Config Sources** - Fetch configs from private/team git repositories
92
+ - ✅ **Multi-Source Management** - Register unlimited GitHub, GitLab, Bitbucket repos
93
+ - ✅ **Team Collaboration** - Share custom configs across 3-5 person teams
94
+ - ✅ **Enterprise Support** - Scale to 500+ developers with priority-based resolution
95
+ - ✅ **Secure Authentication** - Environment variable tokens (GITHUB_TOKEN, GITLAB_TOKEN)
96
+ - ✅ **Intelligent Caching** - Clone once, pull updates automatically
97
+ - ✅ **Offline Mode** - Work with cached configs when offline
98
+ - ✅ **Backward Compatible** - Existing API-based configs still work
99
+
100
+ ### 🤖 AI & Enhancement
101
+
102
+ - ✅ **AI-Powered Enhancement** - Transforms basic templates into comprehensive guides
103
+ - ✅ **No API Costs** - FREE local enhancement using Claude Code Max
104
+ - ✅ **MCP Server for Claude Code** - Use directly from Claude Code with natural language
105
+
106
+ ### ⚡ Performance & Scale
107
+
108
+ - ✅ **Async Mode** - 2-3x faster scraping with async/await (use `--async` flag)
109
+ - ✅ **Large Documentation Support** - Handle 10K-40K+ page docs with intelligent splitting
110
+ - ✅ **Router/Hub Skills** - Intelligent routing to specialized sub-skills
111
+ - ✅ **Parallel Scraping** - Process multiple skills simultaneously
112
+ - ✅ **Checkpoint/Resume** - Never lose progress on long scrapes
113
+ - ✅ **Caching System** - Scrape once, rebuild instantly
114
+
115
+ ### ✅ Quality Assurance
116
+
117
+ - ✅ **Fully Tested** - 391 tests with comprehensive coverage
118
+
119
+ ---
120
+
121
+ ## 📦 Now Available on PyPI!
122
+
123
+ **Skill Seekers is now published on the Python Package Index!** Install with a single command:
124
+
125
+ ```bash
126
+ pip install skill-seekers
127
+ ```
128
+
129
+ Get started in seconds. No cloning, no setup - just install and run. See installation options below.
130
+
131
+ ---
132
+
133
+ ## Quick Start
134
+
135
+ ### Option 1: Install from PyPI (Recommended)
136
+
137
+ ```bash
138
+ # Install from PyPI (easiest method!)
139
+ pip install skill-seekers
140
+
141
+ # Use the unified CLI
142
+ skill-seekers scrape --config configs/react.json
143
+ skill-seekers github --repo facebook/react
144
+ skill-seekers enhance output/react/
145
+ skill-seekers package output/react/
146
+ ```
147
+
148
+ **Time:** ~25 minutes | **Quality:** Production-ready | **Cost:** Free
149
+
150
+ 📖 **New to Skill Seekers?** Check out our [Quick Start Guide](QUICKSTART.md) or [Bulletproof Guide](BULLETPROOF_QUICKSTART.md)
151
+
152
+ ### Option 2: Install via uv (Modern Python Tool)
153
+
154
+ ```bash
155
+ # Install with uv (fast, modern alternative)
156
+ uv tool install skill-seekers
157
+
158
+ # Or run directly without installing
159
+ uv tool run --from skill-seekers skill-seekers scrape --config https://raw.githubusercontent.com/yusufkaraaslan/Skill_Seekers/main/configs/react.json
160
+
161
+ # Unified CLI - simple commands
162
+ skill-seekers scrape --config configs/react.json
163
+ skill-seekers github --repo facebook/react
164
+ skill-seekers package output/react/
165
+ ```
166
+
167
+ **Time:** ~25 minutes | **Quality:** Production-ready | **Cost:** Free
168
+
169
+ ### Option 3: Development Install (From Source)
170
+
171
+ ```bash
172
+ # Clone and install in editable mode
173
+ git clone https://github.com/yusufkaraaslan/Skill_Seekers.git
174
+ cd Skill_Seekers
175
+ pip install -e .
176
+
177
+ # Use the unified CLI
178
+ skill-seekers scrape --config configs/react.json
179
+ ```
180
+
181
+ ### Option 4: Use from Claude Code & 4 Other AI Agents (MCP Integration)
182
+
183
+ ```bash
184
+ # One-time setup (5 minutes) - Auto-configures 5 AI agents!
185
+ ./setup_mcp.sh
186
+
187
+ # Then in Claude Code, Cursor, Windsurf, VS Code + Cline, or IntelliJ IDEA, just ask:
188
+ "Generate a React skill from https://react.dev/"
189
+ "Scrape PDF at docs/manual.pdf and create skill"
190
+ ```
191
+
192
+ **Time:** Automated | **Quality:** Production-ready | **Cost:** Free
193
+
194
+ **NEW in v2.4.0:** MCP server now supports 5 AI coding agents with automatic configuration!
195
+
196
+ ### Option 5: Legacy CLI (Backwards Compatible)
197
+
198
+ ```bash
199
+ # Install dependencies
200
+ pip3 install requests beautifulsoup4
201
+
202
+ # Run scripts directly (old method)
203
+ python3 src/skill_seekers/cli/doc_scraper.py --config configs/react.json
204
+
205
+ # Upload output/react.zip to Claude - Done!
206
+ ```
207
+
208
+ **Time:** ~25 minutes | **Quality:** Production-ready | **Cost:** Free
209
+
210
+ ---
211
+
212
+ ## 🚀 **NEW!** One-Command Install Workflow (v2.1.1)
213
+
214
+ **The fastest way to go from config to uploaded skill - complete automation:**
215
+
216
+ ```bash
217
+ # Install React skill from official configs (auto-uploads to Claude)
218
+ skill-seekers install --config react
219
+
220
+ # Install from local config file
221
+ skill-seekers install --config configs/custom.json
222
+
223
+ # Install without uploading (package only)
224
+ skill-seekers install --config django --no-upload
225
+
226
+ # Unlimited scraping (no page limits)
227
+ skill-seekers install --config godot --unlimited
228
+
229
+ # Preview workflow without executing
230
+ skill-seekers install --config react --dry-run
231
+ ```
232
+
233
+ **Time:** 20-45 minutes total | **Quality:** Production-ready (9/10) | **Cost:** Free
234
+
235
+ ### What it does automatically:
236
+
237
+ 1. ✅ **Fetches config** from API (if config name provided)
238
+ 2. ✅ **Scrapes documentation** (respects rate limits, handles pagination)
239
+ 3. ✅ **AI Enhancement (MANDATORY)** - 30-60 sec, quality boost from 3/10 → 9/10
240
+ 4. ✅ **Packages skill** to .zip file
241
+ 5. ✅ **Uploads to Claude** (if ANTHROPIC_API_KEY set)
242
+
243
+ ### Why use this?
244
+
245
+ - **Zero friction** - One command instead of 5 separate steps
246
+ - **Quality guaranteed** - Enhancement is mandatory, ensures professional output
247
+ - **Complete automation** - From config name to uploaded skill in Claude
248
+ - **Time savings** - Fully automated end-to-end workflow
249
+
250
+ ### Phases executed:
251
+
252
+ ```
253
+ 📥 PHASE 1: Fetch Config (if config name provided)
254
+ 📖 PHASE 2: Scrape Documentation
255
+ ✨ PHASE 3: AI Enhancement (MANDATORY - no skip option)
256
+ 📦 PHASE 4: Package Skill
257
+ ☁️ PHASE 5: Upload to Claude (optional, requires API key)
258
+ ```
259
+
260
+ **Requirements:**
261
+
262
+ - ANTHROPIC_API_KEY environment variable (for auto-upload)
263
+ - Claude Code Max plan (for local AI enhancement)
264
+
265
+ **Example:**
266
+
267
+ ```bash
268
+ # Set API key once
269
+ export ANTHROPIC_API_KEY=sk-ant-your-key-here
270
+
271
+ # Run one command - sit back and relax!
272
+ skill-seekers install --config react
273
+
274
+ # Result: React skill uploaded to Claude in 20-45 minutes
275
+ ```
276
+
277
+ ---
278
+
279
+ ## Usage Examples
280
+
281
+ ### Documentation Scraping
282
+
283
+ ```bash
284
+ # Scrape documentation website
285
+ skill-seekers scrape --config configs/react.json
286
+
287
+ # Quick scrape without config
288
+ skill-seekers scrape --url https://react.dev --name react
289
+
290
+ # With async mode (3x faster)
291
+ skill-seekers scrape --config configs/godot.json --async --workers 8
292
+ ```
293
+
294
+ ### PDF Extraction
295
+
296
+ ```bash
297
+ # Basic PDF extraction
298
+ skill-seekers pdf --pdf docs/manual.pdf --name myskill
299
+
300
+ # Advanced features
301
+ skill-seekers pdf --pdf docs/manual.pdf --name myskill \
302
+ --extract-tables \ # Extract tables
303
+ --parallel \ # Fast parallel processing
304
+ --workers 8 # Use 8 CPU cores
305
+
306
+ # Scanned PDFs (requires: pip install pytesseract Pillow)
307
+ skill-seekers pdf --pdf docs/scanned.pdf --name myskill --ocr
308
+
309
+ # Password-protected PDFs
310
+ skill-seekers pdf --pdf docs/encrypted.pdf --name myskill --password mypassword
311
+ ```
312
+
313
+ **Time:** ~5-15 minutes (or 2-5 minutes with parallel) | **Quality:** Production-ready | **Cost:** Free
314
+
315
+ ### GitHub Repository Scraping
316
+
317
+ ```bash
318
+ # Basic repository scraping
319
+ skill-seekers github --repo facebook/react
320
+
321
+ # Using a config file
322
+ skill-seekers github --config configs/react_github.json
323
+
324
+ # With authentication (higher rate limits)
325
+ export GITHUB_TOKEN=ghp_your_token_here
326
+ skill-seekers github --repo facebook/react
327
+
328
+ # Customize what to include
329
+ skill-seekers github --repo django/django \
330
+ --include-issues \ # Extract GitHub Issues
331
+ --max-issues 100 \ # Limit issue count
332
+ --include-changelog \ # Extract CHANGELOG.md
333
+ --include-releases # Extract GitHub Releases
334
+ ```
335
+
336
+ **Time:** ~5-10 minutes | **Quality:** Production-ready | **Cost:** Free
337
+
338
+ ### Unified Multi-Source Scraping (**NEW - v2.0.0**)
339
+
340
+ **The Problem:** Documentation and code often drift apart. Docs might be outdated, missing features that exist in code, or documenting features that were removed.
341
+
342
+ **The Solution:** Combine documentation + GitHub + PDF into one unified skill that shows BOTH what's documented AND what actually exists, with clear warnings about discrepancies.
343
+
344
+ ```bash
345
+ # Use existing unified configs
346
+ skill-seekers unified --config configs/react_unified.json
347
+ skill-seekers unified --config configs/django_unified.json
348
+
349
+ # Or create unified config (mix documentation + GitHub)
350
+ cat > configs/myframework_unified.json << 'EOF'
351
+ {
352
+ "name": "myframework",
353
+ "description": "Complete framework knowledge from docs + code",
354
+ "merge_mode": "rule-based",
355
+ "sources": [
356
+ {
357
+ "type": "documentation",
358
+ "base_url": "https://docs.myframework.com/",
359
+ "extract_api": true,
360
+ "max_pages": 200
361
+ },
362
+ {
363
+ "type": "github",
364
+ "repo": "owner/myframework",
365
+ "include_code": true,
366
+ "code_analysis_depth": "surface"
367
+ }
368
+ ]
369
+ }
370
+ EOF
371
+
372
+ # Run unified scraper
373
+ skill-seekers unified --config configs/myframework_unified.json
374
+
375
+ # Package and upload
376
+ skill-seekers package output/myframework/
377
+ # Upload output/myframework.zip to Claude - Done!
378
+ ```
379
+
380
+ **Time:** ~30-45 minutes | **Quality:** Production-ready with conflict detection | **Cost:** Free
381
+
382
+ **What Makes It Special:**
383
+
384
+ ✅ **Conflict Detection** - Automatically finds 4 types of discrepancies:
385
+
386
+ - 🔴 **Missing in code** (high): Documented but not implemented
387
+ - 🟡 **Missing in docs** (medium): Implemented but not documented
388
+ - ⚠️ **Signature mismatch**: Different parameters/types
389
+ - ℹ️ **Description mismatch**: Different explanations
390
+
391
+ ✅ **Transparent Reporting** - Shows both versions side-by-side:
392
+
393
+ ```markdown
394
+ #### `move_local_x(delta: float)`
395
+
396
+ ⚠️ **Conflict**: Documentation signature differs from implementation
397
+
398
+ **Documentation says:**
399
+ ```
400
+
401
+ def move_local_x(delta: float)
402
+
403
+ ````
404
+
405
+ **Code implementation:**
406
+ ```python
407
+ def move_local_x(delta: float, snap: bool = False) -> None
408
+ ````
409
+
410
+ ````
411
+
412
+ ✅ **Advantages:**
413
+ - **Identifies documentation gaps** - Find outdated or missing docs automatically
414
+ - **Catches code changes** - Know when APIs change without docs being updated
415
+ - **Single source of truth** - One skill showing intent (docs) AND reality (code)
416
+ - **Actionable insights** - Get suggestions for fixing each conflict
417
+ - **Development aid** - See what's actually in the codebase vs what's documented
418
+
419
+ **Example Unified Configs:**
420
+ - `configs/react_unified.json` - React docs + GitHub repo
421
+ - `configs/django_unified.json` - Django docs + GitHub repo
422
+ - `configs/fastapi_unified.json` - FastAPI docs + GitHub repo
423
+
424
+ **Full Guide:** See [docs/UNIFIED_SCRAPING.md](docs/UNIFIED_SCRAPING.md) for complete documentation.
425
+
426
+ ### Private Config Repositories (**NEW - v2.2.0**)
427
+
428
+ **The Problem:** Teams need to share custom configs for internal documentation, but don't want to publish them publicly.
429
+
430
+ **The Solution:** Register private git repositories as config sources. Fetch configs from team repos just like the public API, with full authentication support.
431
+
432
+ ```bash
433
+ # Setup: Set your GitHub token (one-time)
434
+ export GITHUB_TOKEN=ghp_your_token_here
435
+
436
+ # Option 1: Using MCP tools (recommended)
437
+ # Register your team's private repo
438
+ add_config_source(
439
+ name="team",
440
+ git_url="https://github.com/mycompany/skill-configs.git",
441
+ token_env="GITHUB_TOKEN"
442
+ )
443
+
444
+ # Fetch config from team repo
445
+ fetch_config(source="team", config_name="internal-api")
446
+
447
+ # List all registered sources
448
+ list_config_sources()
449
+
450
+ # Remove source when no longer needed
451
+ remove_config_source(name="team")
452
+ ````
453
+
454
+ **Direct Git URL mode** (no registration):
455
+
456
+ ```bash
457
+ # Fetch directly from git URL
458
+ fetch_config(
459
+ git_url="https://github.com/mycompany/configs.git",
460
+ config_name="react-custom",
461
+ token="ghp_your_token_here"
462
+ )
463
+ ```
464
+
465
+ **Supported Platforms:**
466
+
467
+ - GitHub (token env: `GITHUB_TOKEN`)
468
+ - GitLab (token env: `GITLAB_TOKEN`)
469
+ - Gitea (token env: `GITEA_TOKEN`)
470
+ - Bitbucket (token env: `BITBUCKET_TOKEN`)
471
+ - Any git server (token env: `GIT_TOKEN`)
472
+
473
+ **Use Cases:**
474
+
475
+ 📋 **Small Teams (3-5 people)**
476
+
477
+ ```bash
478
+ # Team lead creates repo
479
+ gh repo create myteam/skill-configs --private
480
+
481
+ # Add configs to repo
482
+ cd myteam-skill-configs
483
+ cp ../Skill_Seekers/configs/react.json ./react-custom.json
484
+ # Edit selectors, categories for your internal docs...
485
+ git add . && git commit -m "Add custom React config" && git push
486
+
487
+ # Team members register (one-time)
488
+ add_config_source(name="team", git_url="https://github.com/myteam/skill-configs.git")
489
+
490
+ # Everyone can now fetch
491
+ fetch_config(source="team", config_name="react-custom")
492
+ ```
493
+
494
+ 🏢 **Enterprise (500+ developers)**
495
+
496
+ ```bash
497
+ # IT pre-configures sources for everyone
498
+ add_config_source(name="platform", git_url="gitlab.company.com/platform/configs", priority=1)
499
+ add_config_source(name="mobile", git_url="gitlab.company.com/mobile/configs", priority=2)
500
+ add_config_source(name="official", git_url="api.skillseekersweb.com", priority=3)
501
+
502
+ # Developers use transparently
503
+ fetch_config(config_name="internal-platform") # Finds in platform source
504
+ fetch_config(config_name="react") # Falls back to official API
505
+ ```
506
+
507
+ **Storage Locations:**
508
+
509
+ - Registry: `~/.skill-seekers/sources.json`
510
+ - Cache: `$SKILL_SEEKERS_CACHE_DIR` (default: `~/.skill-seekers/cache/`)
511
+
512
+ **Features:**
513
+
514
+ - ✅ **Shallow clone** - 10-50x faster, minimal disk space
515
+ - ✅ **Auto-pull** - Fetches latest changes automatically
516
+ - ✅ **Offline mode** - Works with cached repos when offline
517
+ - ✅ **Priority resolution** - Multiple sources with conflict resolution
518
+ - ✅ **Secure** - Tokens via environment variables only
519
+
520
+ **Example Team Repository:**
521
+
522
+ Try the included example:
523
+
524
+ ```bash
525
+ # Test with file:// URL (no auth needed)
526
+ cd /path/to/Skill_Seekers
527
+
528
+ # Run the E2E test
529
+ python3 configs/example-team/test_e2e.py
530
+
531
+ # Or test manually
532
+ add_config_source(
533
+ name="example",
534
+ git_url="file://$(pwd)/configs/example-team",
535
+ branch="master"
536
+ )
537
+
538
+ fetch_config(source="example", config_name="react-custom")
539
+ ```
540
+
541
+ **Full Guide:** See [docs/GIT_CONFIG_SOURCES.md](docs/GIT_CONFIG_SOURCES.md) for complete documentation.
542
+
543
+ ## How It Works
544
+
545
+ ```mermaid
546
+ graph LR
547
+ A[Documentation Website] --> B[Skill Seeker]
548
+ B --> C[Scraper]
549
+ B --> D[AI Enhancement]
550
+ B --> E[Packager]
551
+ C --> F[Organized References]
552
+ D --> F
553
+ F --> E
554
+ E --> G[Claude Skill .zip]
555
+ G --> H[Upload to Claude AI]
556
+ ```
557
+
558
+ 0. **Detect llms.txt** - Checks for llms-full.txt, llms.txt, llms-small.txt first
559
+ 1. **Scrape**: Extracts all pages from documentation
560
+ 2. **Categorize**: Organizes content into topics (API, guides, tutorials, etc.)
561
+ 3. **Enhance**: AI analyzes docs and creates comprehensive SKILL.md with examples
562
+ 4. **Package**: Bundles everything into a Claude-ready `.zip` file
563
+
564
+ ## 📋 Prerequisites
565
+
566
+ **Before you start, make sure you have:**
567
+
568
+ 1. **Python 3.10 or higher** - [Download](https://www.python.org/downloads/) | Check: `python3 --version`
569
+ 2. **Git** - [Download](https://git-scm.com/) | Check: `git --version`
570
+ 3. **15-30 minutes** for first-time setup
571
+
572
+ **First time user?** → **[Start Here: Bulletproof Quick Start Guide](BULLETPROOF_QUICKSTART.md)** 🎯
573
+
574
+ This guide walks you through EVERYTHING step-by-step (Python install, git clone, first skill creation).
575
+
576
+ ---
577
+
578
+ ## 🚀 Quick Start
579
+
580
+ ### Method 1: MCP Server for 5 AI Agents (Easiest - **NEW v2.4.0!**)
581
+
582
+ Use Skill Seeker directly from **Claude Code, Cursor, Windsurf, VS Code + Cline, or IntelliJ IDEA** with natural language!
583
+
584
+ ```bash
585
+ # Clone repository
586
+ git clone https://github.com/yusufkaraaslan/Skill_Seekers.git
587
+ cd Skill_Seekers
588
+
589
+ # One-time setup (5 minutes) - Auto-configures ALL 5 agents!
590
+ ./setup_mcp.sh
591
+
592
+ # Restart your AI agent, then just ask:
593
+ ```
594
+
595
+ **In Claude Code, Cursor, Windsurf, VS Code + Cline, or IntelliJ IDEA:**
596
+
597
+ ```
598
+ List all available configs
599
+ Generate config for Tailwind at https://tailwindcss.com/docs
600
+ Scrape docs using configs/react.json
601
+ Package skill at output/react/
602
+ ```
603
+
604
+ **Benefits:**
605
+
606
+ - ✅ No manual CLI commands
607
+ - ✅ Natural language interface
608
+ - ✅ Integrated with your workflow
609
+ - ✅ **17 tools** available instantly (up from 9!)
610
+ - ✅ **5 AI agents supported** - auto-configured with one command
611
+ - ✅ **Tested and working** in production
612
+
613
+ **NEW in v2.4.0:**
614
+
615
+ - ✅ **Upgraded to MCP SDK v1.25.0** - Latest features and performance
616
+ - ✅ **FastMCP Framework** - Modern, maintainable MCP implementation
617
+ - ✅ **HTTP + stdio transport** - Works with more AI agents
618
+ - ✅ **17 tools** (up from 9) - More capabilities
619
+ - ✅ **Multi-agent auto-configuration** - Setup all agents with one command
620
+
621
+ **Full guides:**
622
+
623
+ - 📘 [MCP Setup Guide](docs/MCP_SETUP.md) - Complete installation instructions
624
+ - 🧪 [MCP Testing Guide](docs/TEST_MCP_IN_CLAUDE_CODE.md) - Test all 17 tools
625
+ - 📦 [Large Documentation Guide](docs/LARGE_DOCUMENTATION.md) - Handle 10K-40K+ pages
626
+ - 📤 [Upload Guide](docs/UPLOAD_GUIDE.md) - How to upload skills to Claude
627
+
628
+ ### Method 2: CLI (Traditional)
629
+
630
+ #### One-Time Setup: Create Virtual Environment
631
+
632
+ ```bash
633
+ # Clone repository
634
+ git clone https://github.com/yusufkaraaslan/Skill_Seekers.git
635
+ cd Skill_Seekers
636
+
637
+ # Create virtual environment
638
+ python3 -m venv venv
639
+
640
+ # Activate virtual environment
641
+ source venv/bin/activate # macOS/Linux
642
+ # OR on Windows: venv\Scripts\activate
643
+
644
+ # Install dependencies
645
+ pip install requests beautifulsoup4 pytest
646
+
647
+ # Save dependencies
648
+ pip freeze > requirements.txt
649
+
650
+ # Optional: Install anthropic for API-based enhancement (not needed for LOCAL enhancement)
651
+ # pip install anthropic
652
+ ```
653
+
654
+ **Always activate the virtual environment before using Skill Seeker:**
655
+
656
+ ```bash
657
+ source venv/bin/activate # Run this each time you start a new terminal session
658
+ ```
659
+
660
+ #### Easiest: Use a Preset
661
+
662
+ ```bash
663
+ # Make sure venv is activated (you should see (venv) in your prompt)
664
+ source venv/bin/activate
665
+
666
+ # Optional: Estimate pages first (fast, 1-2 minutes)
667
+ skill-seekers estimate configs/godot.json
668
+
669
+ # Use Godot preset
670
+ skill-seekers scrape --config configs/godot.json
671
+
672
+ # Use React preset
673
+ skill-seekers scrape --config configs/react.json
674
+
675
+ # See all presets
676
+ ls configs/
677
+ ```
678
+
679
+ ### Interactive Mode
680
+
681
+ ```bash
682
+ skill-seekers scrape --interactive
683
+ ```
684
+
685
+ ### Quick Mode
686
+
687
+ ```bash
688
+ skill-seekers scrape \
689
+ --name react \
690
+ --url https://react.dev/ \
691
+ --description "React framework for UIs"
692
+ ```
693
+
694
+ ## 📤 Uploading Skills to Claude
695
+
696
+ Once your skill is packaged, you need to upload it to Claude:
697
+
698
+ ### Option 1: Automatic Upload (API-based)
699
+
700
+ ```bash
701
+ # Set your API key (one-time)
702
+ export ANTHROPIC_API_KEY=sk-ant-...
703
+
704
+ # Package and upload automatically
705
+ skill-seekers package output/react/ --upload
706
+
707
+ # OR upload existing .zip
708
+ skill-seekers upload output/react.zip
709
+ ```
710
+
711
+ **Benefits:**
712
+
713
+ - ✅ Fully automatic
714
+ - ✅ No manual steps
715
+ - ✅ Works from command line
716
+
717
+ **Requirements:**
718
+
719
+ - Anthropic API key (get from https://console.anthropic.com/)
720
+
721
+ ### Option 2: Manual Upload (No API Key)
722
+
723
+ ```bash
724
+ # Package skill
725
+ skill-seekers package output/react/
726
+
727
+ # This will:
728
+ # 1. Create output/react.zip
729
+ # 2. Open the output/ folder automatically
730
+ # 3. Show upload instructions
731
+
732
+ # Then manually upload:
733
+ # - Go to https://claude.ai/skills
734
+ # - Click "Upload Skill"
735
+ # - Select output/react.zip
736
+ # - Done!
737
+ ```
738
+
739
+ **Benefits:**
740
+
741
+ - ✅ No API key needed
742
+ - ✅ Works for everyone
743
+ - ✅ Folder opens automatically
744
+
745
+ ### Option 3: Claude Code (MCP) - Smart & Automatic
746
+
747
+ ```
748
+ In Claude Code, just ask:
749
+ "Package and upload the React skill"
750
+
751
+ # With API key set:
752
+ # - Packages the skill
753
+ # - Uploads to Claude automatically
754
+ # - Done! ✅
755
+
756
+ # Without API key:
757
+ # - Packages the skill
758
+ # - Shows where to find the .zip
759
+ # - Provides manual upload instructions
760
+ ```
761
+
762
+ **Benefits:**
763
+
764
+ - ✅ Natural language
765
+ - ✅ Smart auto-detection (uploads if API key available)
766
+ - ✅ Works with or without API key
767
+ - ✅ No errors or failures
768
+
769
+ ---
770
+
771
+ ## 🤖 Installing to AI Agents
772
+
773
+ Skill Seekers can automatically install skills to 10+ AI coding agents.
774
+
775
+ ### Quick Start
776
+
777
+ ```bash
778
+ # Install to specific agent
779
+ skill-seekers install-agent output/react/ --agent cursor
780
+
781
+ # Install to all agents at once
782
+ skill-seekers install-agent output/react/ --agent all
783
+
784
+ # Overwrite existing installation
785
+ skill-seekers install-agent output/react/ --agent claude --force
786
+
787
+ # Preview without installing
788
+ skill-seekers install-agent output/react/ --agent cursor --dry-run
789
+ ```
790
+
791
+ ### Supported Agents
792
+
793
+ | Agent | Path | Type |
794
+ | --------------------- | ------------------------- | ------- |
795
+ | **Claude Code** | `~/.claude/skills/` | Global |
796
+ | **Cursor** | `.cursor/skills/` | Project |
797
+ | **VS Code / Copilot** | `.github/skills/` | Project |
798
+ | **Amp** | `~/.amp/skills/` | Global |
799
+ | **Goose** | `~/.config/goose/skills/` | Global |
800
+ | **OpenCode** | `~/.opencode/skills/` | Global |
801
+ | **Letta** | `~/.letta/skills/` | Global |
802
+ | **Aide** | `~/.aide/skills/` | Global |
803
+ | **Windsurf** | `~/.windsurf/skills/` | Global |
804
+
805
+ **Global paths** install to user's home directory (~/).
806
+ **Project paths** install to current project's root directory.
807
+
808
+ ### Complete Workflow
809
+
810
+ ```bash
811
+ # 1. Scrape documentation
812
+ skill-seekers scrape --config configs/react.json --enhance-local
813
+
814
+ # 2. Package skill
815
+ skill-seekers package output/react/
816
+
817
+ # 3. Install to your agent
818
+ skill-seekers install-agent output/react/ --agent cursor
819
+
820
+ # 4. Restart Cursor to load the skill
821
+ ```
822
+
823
+ ---
824
+
825
+ ## 🤖 Multi-Agent MCP Support (NEW in v2.4.0)
826
+
827
+ **Skill Seekers MCP server now works with 5 leading AI coding agents!**
828
+
829
+ ### Supported AI Agents
830
+
831
+ | Agent | Transport | Setup Difficulty | Auto-Configured |
832
+ | ------------------- | --------- | ---------------- | --------------- |
833
+ | **Claude Code** | stdio | Easy | ✅ Yes |
834
+ | **VS Code + Cline** | stdio | Easy | ✅ Yes |
835
+ | **Cursor** | HTTP | Medium | ✅ Yes |
836
+ | **Windsurf** | HTTP | Medium | ✅ Yes |
837
+ | **IntelliJ IDEA** | HTTP | Medium | ✅ Yes |
838
+
839
+ ### Quick Setup - All Agents at Once
840
+
841
+ ```bash
842
+ # Clone repository
843
+ git clone https://github.com/yusufkaraaslan/Skill_Seekers.git
844
+ cd Skill_Seekers
845
+
846
+ # Run one command - auto-configures ALL 5 agents!
847
+ ./setup_mcp.sh
848
+
849
+ # Restart your AI agent and start using natural language:
850
+ "List all available configs"
851
+ "Generate a React skill from https://react.dev/"
852
+ "Package the skill at output/react/"
853
+ ```
854
+
855
+ **What `setup_mcp.sh` does:**
856
+
857
+ 1. ✅ Installs MCP server dependencies
858
+ 2. ✅ Configures Claude Code (stdio transport)
859
+ 3. ✅ Configures VS Code + Cline (stdio transport)
860
+ 4. ✅ Configures Cursor (HTTP transport)
861
+ 5. ✅ Configures Windsurf (HTTP transport)
862
+ 6. ✅ Configures IntelliJ IDEA (HTTP transport)
863
+ 7. ✅ Shows next steps for each agent
864
+
865
+ **Time:** 5 minutes | **Result:** All agents configured and ready to use
866
+
867
+ ### Transport Modes
868
+
869
+ Skill Seekers MCP server supports 2 transport modes:
870
+
871
+ #### stdio Transport (Claude Code, VS Code + Cline)
872
+
873
+ **How it works:** Agent launches MCP server as subprocess and communicates via stdin/stdout
874
+
875
+ **Benefits:**
876
+
877
+ - ✅ More secure (no network ports)
878
+ - ✅ Automatic lifecycle management
879
+ - ✅ Simpler configuration
880
+ - ✅ Better for single-user development
881
+
882
+ **Configuration example (Claude Code):**
883
+
884
+ ```json
885
+ {
886
+ "mcpServers": {
887
+ "skill-seeker": {
888
+ "command": "python3",
889
+ "args": ["-m", "skill_seekers.mcp.server"],
890
+ "cwd": "/path/to/Skill_Seekers"
891
+ }
892
+ }
893
+ }
894
+ ```
895
+
896
+ #### HTTP Transport (Cursor, Windsurf, IntelliJ IDEA)
897
+
898
+ **How it works:** MCP server runs as HTTP service, agents connect as clients
899
+
900
+ **Benefits:**
901
+
902
+ - ✅ Multi-agent support (one server, multiple clients)
903
+ - ✅ Server can run independently
904
+ - ✅ Better for team collaboration
905
+ - ✅ Easier debugging and monitoring
906
+
907
+ **Configuration example (Cursor):**
908
+
909
+ ```json
910
+ {
911
+ "mcpServers": {
912
+ "skill-seeker": {
913
+ "url": "http://localhost:8765/sse"
914
+ }
915
+ }
916
+ }
917
+ ```
918
+
919
+ **Starting HTTP server:**
920
+
921
+ ```bash
922
+ # Start server manually (runs in background)
923
+ cd /path/to/Skill_Seekers
924
+ python3 -m skill_seekers.mcp.server --transport http --port 8765
925
+
926
+ # Or use auto-start script
927
+ ./scripts/start_mcp_server.sh
928
+ ```
929
+
930
+ ### Agent-Specific Instructions
931
+
932
+ #### Claude Code (stdio)
933
+
934
+ ```bash
935
+ # Already configured by setup_mcp.sh!
936
+ # Just restart Claude Code
937
+
938
+ # Config location: ~/.claude/claude_code_config.json
939
+ ```
940
+
941
+ **Usage:**
942
+
943
+ ```
944
+ In Claude Code:
945
+ "List all available configs"
946
+ "Scrape React docs at https://react.dev/"
947
+ ```
948
+
949
+ #### VS Code + Cline Extension (stdio)
950
+
951
+ ```bash
952
+ # Already configured by setup_mcp.sh!
953
+ # Just restart VS Code
954
+
955
+ # Config location: ~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json
956
+ ```
957
+
958
+ **Usage:**
959
+
960
+ ```
961
+ In Cline:
962
+ "Generate config for Tailwind"
963
+ "Package skill at output/tailwind/"
964
+ ```
965
+
966
+ #### Cursor (HTTP)
967
+
968
+ ```bash
969
+ # 1. Setup already configured HTTP settings
970
+ # Config location: ~/.cursor/mcp_settings.json
971
+
972
+ # 2. Start HTTP server (one-time per session)
973
+ ./scripts/start_mcp_server.sh
974
+
975
+ # 3. Restart Cursor
976
+ ```
977
+
978
+ **Usage:**
979
+
980
+ ```
981
+ In Cursor:
982
+ "Show me all skill-seeker configs"
983
+ "Create Django skill from docs"
984
+ ```
985
+
986
+ #### Windsurf (HTTP)
987
+
988
+ ```bash
989
+ # 1. Setup already configured HTTP settings
990
+ # Config location: ~/.windsurf/mcp_settings.json
991
+
992
+ # 2. Start HTTP server (one-time per session)
993
+ ./scripts/start_mcp_server.sh
994
+
995
+ # 3. Restart Windsurf
996
+ ```
997
+
998
+ **Usage:**
999
+
1000
+ ```
1001
+ In Windsurf:
1002
+ "Estimate pages for Godot config"
1003
+ "Build unified skill for FastAPI"
1004
+ ```
1005
+
1006
+ #### IntelliJ IDEA (HTTP)
1007
+
1008
+ ```bash
1009
+ # 1. Setup already configured HTTP settings
1010
+ # Config location: ~/.intellij/mcp_settings.json
1011
+
1012
+ # 2. Start HTTP server (one-time per session)
1013
+ ./scripts/start_mcp_server.sh
1014
+
1015
+ # 3. Restart IntelliJ IDEA
1016
+ ```
1017
+
1018
+ **Usage:**
1019
+
1020
+ ```
1021
+ In IntelliJ IDEA:
1022
+ "Validate my config file"
1023
+ "Split large Godot config"
1024
+ ```
1025
+
1026
+ ### Available MCP Tools (17 Total)
1027
+
1028
+ All agents have access to these 17 tools:
1029
+
1030
+ **Core Tools (9):**
1031
+
1032
+ 1. `list_configs` - List all available preset configurations
1033
+ 2. `generate_config` - Generate new config for any docs site
1034
+ 3. `validate_config` - Validate config structure
1035
+ 4. `estimate_pages` - Estimate page count before scraping
1036
+ 5. `scrape_docs` - Scrape and build skill
1037
+ 6. `package_skill` - Package skill into .zip
1038
+ 7. `upload_skill` - Upload .zip to Claude
1039
+ 8. `split_config` - Split large documentation configs
1040
+ 9. `generate_router` - Generate router/hub skills
1041
+
1042
+ **Extended Tools (8 - NEW!):** 10. `scrape_github` - Scrape GitHub repositories 11. `scrape_pdf` - Extract content from PDFs 12. `unified_scrape` - Combine multiple sources 13. `merge_sources` - Merge documentation + code 14. `detect_conflicts` - Find doc/code discrepancies 15. `add_config_source` - Register private git repos 16. `fetch_config` - Fetch configs from git 17. `list_config_sources` - List registered sources
1043
+
1044
+ ### What's New in v2.4.0
1045
+
1046
+ **MCP Infrastructure:**
1047
+
1048
+ - ✅ **Upgraded to MCP SDK v1.25.0** - Latest stable version
1049
+ - ✅ **FastMCP Framework** - Modern, maintainable implementation
1050
+ - ✅ **Dual Transport** - stdio + HTTP support
1051
+ - ✅ **17 Tools** - Up from 9 (almost 2x!)
1052
+ - ✅ **Auto-Configuration** - One script configures all agents
1053
+
1054
+ **Agent Support:**
1055
+
1056
+ - ✅ **5 Agents Supported** - Claude Code, VS Code + Cline, Cursor, Windsurf, IntelliJ IDEA
1057
+ - ✅ **Automatic Setup** - `./setup_mcp.sh` configures everything
1058
+ - ✅ **Transport Detection** - Auto-selects stdio vs HTTP per agent
1059
+ - ✅ **Config Management** - Handles all agent-specific config formats
1060
+
1061
+ **Developer Experience:**
1062
+
1063
+ - ✅ **One Setup Command** - Works for all agents
1064
+ - ✅ **Natural Language** - Use plain English in any agent
1065
+ - ✅ **No CLI Required** - All features via MCP tools
1066
+ - ✅ **Full Testing** - All 17 tools tested and working
1067
+
1068
+ ### Troubleshooting Multi-Agent Setup
1069
+
1070
+ **HTTP server not starting?**
1071
+
1072
+ ```bash
1073
+ # Check if port 8765 is in use
1074
+ lsof -i :8765
1075
+
1076
+ # Use different port
1077
+ python3 -m skill_seekers.mcp.server --transport http --port 9000
1078
+
1079
+ # Update agent config with new port
1080
+ ```
1081
+
1082
+ **Agent not finding MCP server?**
1083
+
1084
+ ```bash
1085
+ # Verify config file exists
1086
+ cat ~/.claude/claude_code_config.json
1087
+ cat ~/.cursor/mcp_settings.json
1088
+
1089
+ # Re-run setup
1090
+ ./setup_mcp.sh
1091
+
1092
+ # Check server logs
1093
+ tail -f logs/mcp_server.log
1094
+ ```
1095
+
1096
+ **Tools not appearing in agent?**
1097
+
1098
+ ```bash
1099
+ # Restart agent completely (quit and relaunch)
1100
+ # For HTTP transport, ensure server is running:
1101
+ ps aux | grep "skill_seekers.mcp.server"
1102
+
1103
+ # Test server directly
1104
+ curl http://localhost:8765/health
1105
+ ```
1106
+
1107
+ ### Complete Multi-Agent Workflow
1108
+
1109
+ ```bash
1110
+ # 1. One-time setup (5 minutes)
1111
+ git clone https://github.com/yusufkaraaslan/Skill_Seekers.git
1112
+ cd Skill_Seekers
1113
+ ./setup_mcp.sh
1114
+
1115
+ # 2. For HTTP agents (Cursor/Windsurf/IntelliJ), start server
1116
+ ./scripts/start_mcp_server.sh
1117
+
1118
+ # 3. Restart your AI agent
1119
+
1120
+ # 4. Use natural language in ANY agent:
1121
+ "List all available configs"
1122
+ "Generate React skill from https://react.dev/"
1123
+ "Estimate pages for Godot config"
1124
+ "Package and upload skill at output/react/"
1125
+
1126
+ # 5. Result: Skills created without touching CLI!
1127
+ ```
1128
+
1129
+ **Full Guide:** See [docs/MCP_SETUP.md](docs/MCP_SETUP.md) for detailed multi-agent setup instructions.
1130
+
1131
+ ---
1132
+
1133
+ ## 📁 Simple Structure
1134
+
1135
+ ```
1136
+ doc-to-skill/
1137
+ ├── cli/
1138
+ │ ├── doc_scraper.py # Main scraping tool
1139
+ │ ├── package_skill.py # Package to .zip
1140
+ │ ├── upload_skill.py # Auto-upload (API)
1141
+ │ └── enhance_skill.py # AI enhancement
1142
+ ├── mcp/ # MCP server for 5 AI agents
1143
+ │ └── server.py # 17 MCP tools (v2.4.0)
1144
+ ├── configs/ # Preset configurations
1145
+ │ ├── godot.json # Godot Engine
1146
+ │ ├── react.json # React
1147
+ │ ├── vue.json # Vue.js
1148
+ │ ├── django.json # Django
1149
+ │ └── fastapi.json # FastAPI
1150
+ └── output/ # All output (auto-created)
1151
+ ├── godot_data/ # Scraped data
1152
+ ├── godot/ # Built skill
1153
+ └── godot.zip # Packaged skill
1154
+ ```
1155
+
1156
+ ## ✨ Features
1157
+
1158
+ ### 1. Fast Page Estimation (NEW!)
1159
+
1160
+ ```bash
1161
+ skill-seekers estimate configs/react.json
1162
+
1163
+ # Output:
1164
+ 📊 ESTIMATION RESULTS
1165
+ ✅ Pages Discovered: 180
1166
+ 📈 Estimated Total: 230
1167
+ ⏱️ Time Elapsed: 1.2 minutes
1168
+ 💡 Recommended max_pages: 280
1169
+ ```
1170
+
1171
+ **Benefits:**
1172
+
1173
+ - Know page count BEFORE scraping (saves time)
1174
+ - Validates URL patterns work correctly
1175
+ - Estimates total scraping time
1176
+ - Recommends optimal `max_pages` setting
1177
+ - Fast (1-2 minutes vs 20-40 minutes full scrape)
1178
+
1179
+ ### 2. Auto-Detect Existing Data
1180
+
1181
+ ```bash
1182
+ skill-seekers scrape --config configs/godot.json
1183
+
1184
+ # If data exists:
1185
+ ✓ Found existing data: 245 pages
1186
+ Use existing data? (y/n): y
1187
+ ⏭️ Skipping scrape, using existing data
1188
+ ```
1189
+
1190
+ ### 3. Knowledge Generation
1191
+
1192
+ **Automatic pattern extraction:**
1193
+
1194
+ - Extracts common code patterns from docs
1195
+ - Detects programming language
1196
+ - Creates quick reference with real examples
1197
+ - Smarter categorization with scoring
1198
+
1199
+ **Enhanced SKILL.md:**
1200
+
1201
+ - Real code examples from documentation
1202
+ - Language-annotated code blocks
1203
+ - Common patterns section
1204
+ - Quick reference from actual usage examples
1205
+
1206
+ ### 4. Smart Categorization
1207
+
1208
+ Automatically infers categories from:
1209
+
1210
+ - URL structure
1211
+ - Page titles
1212
+ - Content keywords
1213
+ - With scoring for better accuracy
1214
+
1215
+ ### 5. Code Language Detection
1216
+
1217
+ ```python
1218
+ # Automatically detects:
1219
+ - Python (def, import, from)
1220
+ - JavaScript (const, let, =>)
1221
+ - GDScript (func, var, extends)
1222
+ - C++ (#include, int main)
1223
+ - And more...
1224
+ ```
1225
+
1226
+ ### 5. Skip Scraping
1227
+
1228
+ ```bash
1229
+ # Scrape once
1230
+ skill-seekers scrape --config configs/react.json
1231
+
1232
+ # Later, just rebuild (instant)
1233
+ skill-seekers scrape --config configs/react.json --skip-scrape
1234
+ ```
1235
+
1236
+ ### 6. Async Mode for Faster Scraping (2-3x Speed!)
1237
+
1238
+ ```bash
1239
+ # Enable async mode with 8 workers (recommended for large docs)
1240
+ skill-seekers scrape --config configs/react.json --async --workers 8
1241
+
1242
+ # Small docs (~100-500 pages)
1243
+ skill-seekers scrape --config configs/mydocs.json --async --workers 4
1244
+
1245
+ # Large docs (2000+ pages) with no rate limiting
1246
+ skill-seekers scrape --config configs/largedocs.json --async --workers 8 --no-rate-limit
1247
+ ```
1248
+
1249
+ **Performance Comparison:**
1250
+
1251
+ - **Sync mode (threads):** ~18 pages/sec, 120 MB memory
1252
+ - **Async mode:** ~55 pages/sec, 40 MB memory
1253
+ - **Result:** 3x faster, 66% less memory!
1254
+
1255
+ **When to use:**
1256
+
1257
+ - ✅ Large documentation (500+ pages)
1258
+ - ✅ Network latency is high
1259
+ - ✅ Memory is constrained
1260
+ - ❌ Small docs (< 100 pages) - overhead not worth it
1261
+
1262
+ **See full guide:** [ASYNC_SUPPORT.md](ASYNC_SUPPORT.md)
1263
+
1264
+ ### 7. AI-Powered SKILL.md Enhancement
1265
+
1266
+ ```bash
1267
+ # Option 1: During scraping (API-based, requires API key)
1268
+ pip3 install anthropic
1269
+ export ANTHROPIC_API_KEY=sk-ant-...
1270
+ skill-seekers scrape --config configs/react.json --enhance
1271
+
1272
+ # Option 2: During scraping (LOCAL, no API key - uses Claude Code Max)
1273
+ skill-seekers scrape --config configs/react.json --enhance-local
1274
+
1275
+ # Option 3: After scraping (API-based, standalone)
1276
+ skill-seekers enhance output/react/
1277
+
1278
+ # Option 4: After scraping (LOCAL, no API key, standalone)
1279
+ skill-seekers enhance output/react/
1280
+ ```
1281
+
1282
+ **What it does:**
1283
+
1284
+ - Reads your reference documentation
1285
+ - Uses Claude to generate an excellent SKILL.md
1286
+ - Extracts best code examples (5-10 practical examples)
1287
+ - Creates comprehensive quick reference
1288
+ - Adds domain-specific key concepts
1289
+ - Provides navigation guidance for different skill levels
1290
+ - Automatically backs up original
1291
+ - **Quality:** Transforms 75-line templates into 500+ line comprehensive guides
1292
+
1293
+ **LOCAL Enhancement (Recommended):**
1294
+
1295
+ - Uses your Claude Code Max plan (no API costs)
1296
+ - Opens new terminal with Claude Code
1297
+ - Analyzes reference files automatically
1298
+ - Takes 30-60 seconds
1299
+ - Quality: 9/10 (comparable to API version)
1300
+
1301
+ ### 7. Large Documentation Support (10K-40K+ Pages)
1302
+
1303
+ **For massive documentation sites like Godot (40K pages), AWS, or Microsoft Docs:**
1304
+
1305
+ ```bash
1306
+ # 1. Estimate first (discover page count)
1307
+ skill-seekers estimate configs/godot.json
1308
+
1309
+ # 2. Auto-split into focused sub-skills
1310
+ python3 -m skill_seekers.cli.split_config configs/godot.json --strategy router
1311
+
1312
+ # Creates:
1313
+ # - godot-scripting.json (5K pages)
1314
+ # - godot-2d.json (8K pages)
1315
+ # - godot-3d.json (10K pages)
1316
+ # - godot-physics.json (6K pages)
1317
+ # - godot-shaders.json (11K pages)
1318
+
1319
+ # 3. Scrape all in parallel (4-8 hours instead of 20-40!)
1320
+ for config in configs/godot-*.json; do
1321
+ skill-seekers scrape --config $config &
1322
+ done
1323
+ wait
1324
+
1325
+ # 4. Generate intelligent router/hub skill
1326
+ python3 -m skill_seekers.cli.generate_router configs/godot-*.json
1327
+
1328
+ # 5. Package all skills
1329
+ python3 -m skill_seekers.cli.package_multi output/godot*/
1330
+
1331
+ # 6. Upload all .zip files to Claude
1332
+ # Users just ask questions naturally!
1333
+ # Router automatically directs to the right sub-skill!
1334
+ ```
1335
+
1336
+ **Split Strategies:**
1337
+
1338
+ - **auto** - Intelligently detects best strategy based on page count
1339
+ - **category** - Split by documentation categories (scripting, 2d, 3d, etc.)
1340
+ - **router** - Create hub skill + specialized sub-skills (RECOMMENDED)
1341
+ - **size** - Split every N pages (for docs without clear categories)
1342
+
1343
+ **Benefits:**
1344
+
1345
+ - ✅ Faster scraping (parallel execution)
1346
+ - ✅ More focused skills (better Claude performance)
1347
+ - ✅ Easier maintenance (update one topic at a time)
1348
+ - ✅ Natural user experience (router handles routing)
1349
+ - ✅ Avoids context window limits
1350
+
1351
+ **Configuration:**
1352
+
1353
+ ```json
1354
+ {
1355
+ "name": "godot",
1356
+ "max_pages": 40000,
1357
+ "split_strategy": "router",
1358
+ "split_config": {
1359
+ "target_pages_per_skill": 5000,
1360
+ "create_router": true,
1361
+ "split_by_categories": ["scripting", "2d", "3d", "physics"]
1362
+ }
1363
+ }
1364
+ ```
1365
+
1366
+ **Full Guide:** [Large Documentation Guide](docs/LARGE_DOCUMENTATION.md)
1367
+
1368
+ ### 8. Checkpoint/Resume for Long Scrapes
1369
+
1370
+ **Never lose progress on long-running scrapes:**
1371
+
1372
+ ```bash
1373
+ # Enable in config
1374
+ {
1375
+ "checkpoint": {
1376
+ "enabled": true,
1377
+ "interval": 1000 // Save every 1000 pages
1378
+ }
1379
+ }
1380
+
1381
+ # If scrape is interrupted (Ctrl+C or crash)
1382
+ skill-seekers scrape --config configs/godot.json --resume
1383
+
1384
+ # Resume from last checkpoint
1385
+ ✅ Resuming from checkpoint (12,450 pages scraped)
1386
+ ⏭️ Skipping 12,450 already-scraped pages
1387
+ 🔄 Continuing from where we left off...
1388
+
1389
+ # Start fresh (clear checkpoint)
1390
+ skill-seekers scrape --config configs/godot.json --fresh
1391
+ ```
1392
+
1393
+ **Benefits:**
1394
+
1395
+ - ✅ Auto-saves every 1000 pages (configurable)
1396
+ - ✅ Saves on interruption (Ctrl+C)
1397
+ - ✅ Resume with `--resume` flag
1398
+ - ✅ Never lose hours of scraping progress
1399
+
1400
+ ## 🎯 Complete Workflows
1401
+
1402
+ ### First Time (With Scraping + Enhancement)
1403
+
1404
+ ```bash
1405
+ # 1. Scrape + Build + AI Enhancement (LOCAL, no API key)
1406
+ skill-seekers scrape --config configs/godot.json --enhance-local
1407
+
1408
+ # 2. Wait for new terminal to close (enhancement completes)
1409
+ # Check the enhanced SKILL.md:
1410
+ cat output/godot/SKILL.md
1411
+
1412
+ # 3. Package
1413
+ skill-seekers package output/godot/
1414
+
1415
+ # 4. Done! You have godot.zip with excellent SKILL.md
1416
+ ```
1417
+
1418
+ **Time:** 20-40 minutes (scraping) + 60 seconds (enhancement) = ~21-41 minutes
1419
+
1420
+ ### Using Existing Data (Fast!)
1421
+
1422
+ ```bash
1423
+ # 1. Use cached data + Local Enhancement
1424
+ skill-seekers scrape --config configs/godot.json --skip-scrape
1425
+ skill-seekers enhance output/godot/
1426
+
1427
+ # 2. Package
1428
+ skill-seekers package output/godot/
1429
+
1430
+ # 3. Done!
1431
+ ```
1432
+
1433
+ **Time:** 1-3 minutes (build) + 60 seconds (enhancement) = ~2-4 minutes total
1434
+
1435
+ ### Without Enhancement (Basic)
1436
+
1437
+ ```bash
1438
+ # 1. Scrape + Build (no enhancement)
1439
+ skill-seekers scrape --config configs/godot.json
1440
+
1441
+ # 2. Package
1442
+ skill-seekers package output/godot/
1443
+
1444
+ # 3. Done! (SKILL.md will be basic template)
1445
+ ```
1446
+
1447
+ **Time:** 20-40 minutes
1448
+ **Note:** SKILL.md will be generic - enhancement strongly recommended!
1449
+
1450
+ ## 📋 Available Presets
1451
+
1452
+ | Config | Framework | Description |
1453
+ | ------------------- | ----------------- | -------------------------- |
1454
+ | `godot.json` | Godot Engine | Game development |
1455
+ | `react.json` | React | UI framework |
1456
+ | `vue.json` | Vue.js | Progressive framework |
1457
+ | `django.json` | Django | Python web framework |
1458
+ | `fastapi.json` | FastAPI | Modern Python API |
1459
+ | `ansible-core.json` | Ansible Core 2.19 | Automation & configuration |
1460
+
1461
+ ### Using Presets
1462
+
1463
+ ```bash
1464
+ # Godot
1465
+ skill-seekers scrape --config configs/godot.json
1466
+
1467
+ # React
1468
+ skill-seekers scrape --config configs/react.json
1469
+
1470
+ # Vue
1471
+ skill-seekers scrape --config configs/vue.json
1472
+
1473
+ # Django
1474
+ skill-seekers scrape --config configs/django.json
1475
+
1476
+ # FastAPI
1477
+ skill-seekers scrape --config configs/fastapi.json
1478
+
1479
+ # Ansible
1480
+ skill-seekers scrape --config configs/ansible-core.json
1481
+ ```
1482
+
1483
+ ## 🎨 Creating Your Own Config
1484
+
1485
+ ### Option 1: Interactive
1486
+
1487
+ ```bash
1488
+ skill-seekers scrape --interactive
1489
+ # Follow prompts, it will create the config for you
1490
+ ```
1491
+
1492
+ ### Option 2: Copy and Edit
1493
+
1494
+ ```bash
1495
+ # Copy a preset
1496
+ cp configs/react.json configs/myframework.json
1497
+
1498
+ # Edit it
1499
+ nano configs/myframework.json
1500
+
1501
+ # Use it
1502
+ skill-seekers scrape --config configs/myframework.json
1503
+ ```
1504
+
1505
+ ### Config Structure
1506
+
1507
+ ```json
1508
+ {
1509
+ "name": "myframework",
1510
+ "description": "When to use this skill",
1511
+ "base_url": "https://docs.myframework.com/",
1512
+ "selectors": {
1513
+ "main_content": "article",
1514
+ "title": "h1",
1515
+ "code_blocks": "pre code"
1516
+ },
1517
+ "url_patterns": {
1518
+ "include": ["/docs", "/guide"],
1519
+ "exclude": ["/blog", "/about"]
1520
+ },
1521
+ "categories": {
1522
+ "getting_started": ["intro", "quickstart"],
1523
+ "api": ["api", "reference"]
1524
+ },
1525
+ "rate_limit": 0.5,
1526
+ "max_pages": 500
1527
+ }
1528
+ ```
1529
+
1530
+ ## 📊 What Gets Created
1531
+
1532
+ ```
1533
+ output/
1534
+ ├── godot_data/ # Scraped raw data
1535
+ │ ├── pages/ # JSON files (one per page)
1536
+ │ └── summary.json # Overview
1537
+
1538
+ └── godot/ # The skill
1539
+ ├── SKILL.md # Enhanced with real examples
1540
+ ├── references/ # Categorized docs
1541
+ │ ├── index.md
1542
+ │ ├── getting_started.md
1543
+ │ ├── scripting.md
1544
+ │ └── ...
1545
+ ├── scripts/ # Empty (add your own)
1546
+ └── assets/ # Empty (add your own)
1547
+ ```
1548
+
1549
+ ## 🎯 Command Line Options
1550
+
1551
+ ```bash
1552
+ # Interactive mode
1553
+ skill-seekers scrape --interactive
1554
+
1555
+ # Use config file
1556
+ skill-seekers scrape --config configs/godot.json
1557
+
1558
+ # Quick mode
1559
+ skill-seekers scrape --name react --url https://react.dev/
1560
+
1561
+ # Skip scraping (use existing data)
1562
+ skill-seekers scrape --config configs/godot.json --skip-scrape
1563
+
1564
+ # With description
1565
+ skill-seekers scrape \
1566
+ --name react \
1567
+ --url https://react.dev/ \
1568
+ --description "React framework for building UIs"
1569
+ ```
1570
+
1571
+ ## 💡 Tips
1572
+
1573
+ ### 1. Test Small First
1574
+
1575
+ Edit `max_pages` in config to test:
1576
+
1577
+ ```json
1578
+ {
1579
+ "max_pages": 20 // Test with just 20 pages
1580
+ }
1581
+ ```
1582
+
1583
+ ### 2. Reuse Scraped Data
1584
+
1585
+ ```bash
1586
+ # Scrape once
1587
+ skill-seekers scrape --config configs/react.json
1588
+
1589
+ # Rebuild multiple times (instant)
1590
+ skill-seekers scrape --config configs/react.json --skip-scrape
1591
+ skill-seekers scrape --config configs/react.json --skip-scrape
1592
+ ```
1593
+
1594
+ ### 3. Finding Selectors
1595
+
1596
+ ```python
1597
+ # Test in Python
1598
+ from bs4 import BeautifulSoup
1599
+ import requests
1600
+
1601
+ url = "https://docs.example.com/page"
1602
+ soup = BeautifulSoup(requests.get(url).content, 'html.parser')
1603
+
1604
+ # Try different selectors
1605
+ print(soup.select_one('article'))
1606
+ print(soup.select_one('main'))
1607
+ print(soup.select_one('div[role="main"]'))
1608
+ ```
1609
+
1610
+ ### 4. Check Output Quality
1611
+
1612
+ ```bash
1613
+ # After building, check:
1614
+ cat output/godot/SKILL.md # Should have real examples
1615
+ cat output/godot/references/index.md # Categories
1616
+ ```
1617
+
1618
+ ## 🐛 Troubleshooting
1619
+
1620
+ ### No Content Extracted?
1621
+
1622
+ - Check your `main_content` selector
1623
+ - Try: `article`, `main`, `div[role="main"]`
1624
+
1625
+ ### Data Exists But Won't Use It?
1626
+
1627
+ ```bash
1628
+ # Force re-scrape
1629
+ rm -rf output/myframework_data/
1630
+ skill-seekers scrape --config configs/myframework.json
1631
+ ```
1632
+
1633
+ ### Categories Not Good?
1634
+
1635
+ Edit the config `categories` section with better keywords.
1636
+
1637
+ ### Want to Update Docs?
1638
+
1639
+ ```bash
1640
+ # Delete old data
1641
+ rm -rf output/godot_data/
1642
+
1643
+ # Re-scrape
1644
+ skill-seekers scrape --config configs/godot.json
1645
+ ```
1646
+
1647
+ ## 📈 Performance
1648
+
1649
+ | Task | Time | Notes |
1650
+ | ---------------- | --------- | ----------------------------- |
1651
+ | Scraping (sync) | 15-45 min | First time only, thread-based |
1652
+ | Scraping (async) | 5-15 min | 2-3x faster with --async flag |
1653
+ | Building | 1-3 min | Fast! |
1654
+ | Re-building | <1 min | With --skip-scrape |
1655
+ | Packaging | 5-10 sec | Final zip |
1656
+
1657
+ ## ✅ Summary
1658
+
1659
+ **One tool does everything:**
1660
+
1661
+ 1. ✅ Scrapes documentation
1662
+ 2. ✅ Auto-detects existing data
1663
+ 3. ✅ Generates better knowledge
1664
+ 4. ✅ Creates enhanced skills
1665
+ 5. ✅ Works with presets or custom configs
1666
+ 6. ✅ Supports skip-scraping for fast iteration
1667
+
1668
+ **Simple structure:**
1669
+
1670
+ - `doc_scraper.py` - The tool
1671
+ - `configs/` - Presets
1672
+ - `output/` - Everything else
1673
+
1674
+ **Better output:**
1675
+
1676
+ - Real code examples with language detection
1677
+ - Common patterns extracted from docs
1678
+ - Smart categorization
1679
+ - Enhanced SKILL.md with actual examples
1680
+
1681
+ ## 📚 Documentation
1682
+
1683
+ ### Getting Started
1684
+
1685
+ - **[BULLETPROOF_QUICKSTART.md](BULLETPROOF_QUICKSTART.md)** - 🎯 **START HERE** if you're new!
1686
+ - **[QUICKSTART.md](QUICKSTART.md)** - Quick start for experienced users
1687
+ - **[TROUBLESHOOTING.md](TROUBLESHOOTING.md)** - Common issues and solutions
1688
+
1689
+ ### Guides
1690
+
1691
+ - **[docs/LARGE_DOCUMENTATION.md](docs/LARGE_DOCUMENTATION.md)** - Handle 10K-40K+ page docs
1692
+ - **[ASYNC_SUPPORT.md](ASYNC_SUPPORT.md)** - Async mode guide (2-3x faster scraping)
1693
+ - **[docs/ENHANCEMENT.md](docs/ENHANCEMENT.md)** - AI enhancement guide
1694
+ - **[docs/TERMINAL_SELECTION.md](docs/TERMINAL_SELECTION.md)** - Configure terminal app for local enhancement
1695
+ - **[docs/UPLOAD_GUIDE.md](docs/UPLOAD_GUIDE.md)** - How to upload skills to Claude
1696
+ - **[docs/MCP_SETUP.md](docs/MCP_SETUP.md)** - MCP integration setup
1697
+
1698
+ ### Technical
1699
+
1700
+ - **[docs/CLAUDE.md](docs/CLAUDE.md)** - Technical architecture
1701
+ - **[STRUCTURE.md](STRUCTURE.md)** - Repository structure
1702
+
1703
+ ## 🎮 Ready?
1704
+
1705
+ ```bash
1706
+ # Try Godot
1707
+ skill-seekers scrape --config configs/godot.json
1708
+
1709
+ # Try React
1710
+ skill-seekers scrape --config configs/react.json
1711
+
1712
+ # Or go interactive
1713
+ skill-seekers scrape --interactive
1714
+ ```
1715
+
1716
+ ## 📝 License
1717
+
1718
+ MIT License - see [LICENSE](LICENSE) file for details
1719
+
1720
+ ---
1721
+
1722
+ Happy skill building! 🚀