javi-forge 1.2.0 → 1.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (228) hide show
  1. package/ci-local/ci-local.sh +20 -8
  2. package/package.json +1 -1
  3. package/ai-config/.skillignore +0 -15
  4. package/ai-config/AUTO_INVOKE.md +0 -300
  5. package/ai-config/agents/_TEMPLATE.md +0 -93
  6. package/ai-config/agents/business/api-designer.md +0 -1657
  7. package/ai-config/agents/business/business-analyst.md +0 -1331
  8. package/ai-config/agents/business/product-strategist.md +0 -206
  9. package/ai-config/agents/business/project-manager.md +0 -178
  10. package/ai-config/agents/business/requirements-analyst.md +0 -1277
  11. package/ai-config/agents/business/technical-writer.md +0 -1679
  12. package/ai-config/agents/creative/ux-designer.md +0 -205
  13. package/ai-config/agents/data-ai/ai-engineer.md +0 -487
  14. package/ai-config/agents/data-ai/analytics-engineer.md +0 -953
  15. package/ai-config/agents/data-ai/data-engineer.md +0 -173
  16. package/ai-config/agents/data-ai/data-scientist.md +0 -672
  17. package/ai-config/agents/data-ai/mlops-engineer.md +0 -814
  18. package/ai-config/agents/data-ai/prompt-engineer.md +0 -772
  19. package/ai-config/agents/development/angular-expert.md +0 -620
  20. package/ai-config/agents/development/backend-architect.md +0 -795
  21. package/ai-config/agents/development/database-specialist.md +0 -212
  22. package/ai-config/agents/development/frontend-specialist.md +0 -686
  23. package/ai-config/agents/development/fullstack-engineer.md +0 -668
  24. package/ai-config/agents/development/golang-pro.md +0 -338
  25. package/ai-config/agents/development/java-enterprise.md +0 -400
  26. package/ai-config/agents/development/javascript-pro.md +0 -422
  27. package/ai-config/agents/development/nextjs-pro.md +0 -474
  28. package/ai-config/agents/development/python-pro.md +0 -570
  29. package/ai-config/agents/development/react-pro.md +0 -487
  30. package/ai-config/agents/development/rust-pro.md +0 -246
  31. package/ai-config/agents/development/spring-boot-4-expert.md +0 -326
  32. package/ai-config/agents/development/typescript-pro.md +0 -336
  33. package/ai-config/agents/development/vue-specialist.md +0 -605
  34. package/ai-config/agents/infrastructure/cloud-architect.md +0 -472
  35. package/ai-config/agents/infrastructure/deployment-manager.md +0 -358
  36. package/ai-config/agents/infrastructure/devops-engineer.md +0 -455
  37. package/ai-config/agents/infrastructure/incident-responder.md +0 -519
  38. package/ai-config/agents/infrastructure/kubernetes-expert.md +0 -705
  39. package/ai-config/agents/infrastructure/monitoring-specialist.md +0 -674
  40. package/ai-config/agents/infrastructure/performance-engineer.md +0 -658
  41. package/ai-config/agents/orchestrator.md +0 -241
  42. package/ai-config/agents/quality/accessibility-auditor.md +0 -1204
  43. package/ai-config/agents/quality/code-reviewer-compact.md +0 -123
  44. package/ai-config/agents/quality/code-reviewer.md +0 -363
  45. package/ai-config/agents/quality/dependency-manager.md +0 -743
  46. package/ai-config/agents/quality/e2e-test-specialist.md +0 -1005
  47. package/ai-config/agents/quality/performance-tester.md +0 -1086
  48. package/ai-config/agents/quality/security-auditor.md +0 -133
  49. package/ai-config/agents/quality/test-engineer.md +0 -453
  50. package/ai-config/agents/specialists/api-designer.md +0 -87
  51. package/ai-config/agents/specialists/backend-architect.md +0 -73
  52. package/ai-config/agents/specialists/code-reviewer.md +0 -77
  53. package/ai-config/agents/specialists/db-optimizer.md +0 -75
  54. package/ai-config/agents/specialists/devops-engineer.md +0 -83
  55. package/ai-config/agents/specialists/documentation-writer.md +0 -78
  56. package/ai-config/agents/specialists/frontend-developer.md +0 -75
  57. package/ai-config/agents/specialists/performance-analyst.md +0 -82
  58. package/ai-config/agents/specialists/refactor-specialist.md +0 -74
  59. package/ai-config/agents/specialists/security-auditor.md +0 -74
  60. package/ai-config/agents/specialists/test-engineer.md +0 -81
  61. package/ai-config/agents/specialists/ux-consultant.md +0 -76
  62. package/ai-config/agents/specialized/agent-generator.md +0 -1190
  63. package/ai-config/agents/specialized/blockchain-developer.md +0 -149
  64. package/ai-config/agents/specialized/code-migrator.md +0 -892
  65. package/ai-config/agents/specialized/context-manager.md +0 -978
  66. package/ai-config/agents/specialized/documentation-writer.md +0 -1078
  67. package/ai-config/agents/specialized/ecommerce-expert.md +0 -1756
  68. package/ai-config/agents/specialized/embedded-engineer.md +0 -1714
  69. package/ai-config/agents/specialized/error-detective.md +0 -1034
  70. package/ai-config/agents/specialized/fintech-specialist.md +0 -1659
  71. package/ai-config/agents/specialized/freelance-project-planner-v2.md +0 -1988
  72. package/ai-config/agents/specialized/freelance-project-planner-v3.md +0 -2136
  73. package/ai-config/agents/specialized/freelance-project-planner-v4.md +0 -4503
  74. package/ai-config/agents/specialized/freelance-project-planner.md +0 -722
  75. package/ai-config/agents/specialized/game-developer.md +0 -1963
  76. package/ai-config/agents/specialized/healthcare-dev.md +0 -1620
  77. package/ai-config/agents/specialized/mobile-developer.md +0 -188
  78. package/ai-config/agents/specialized/parallel-plan-executor.md +0 -506
  79. package/ai-config/agents/specialized/plan-executor.md +0 -485
  80. package/ai-config/agents/specialized/solo-dev-planner-modular/00-INDEX.md +0 -485
  81. package/ai-config/agents/specialized/solo-dev-planner-modular/01-CORE.md +0 -3493
  82. package/ai-config/agents/specialized/solo-dev-planner-modular/02-SELF-CORRECTION.md +0 -778
  83. package/ai-config/agents/specialized/solo-dev-planner-modular/03-PROGRESSIVE-SETUP.md +0 -918
  84. package/ai-config/agents/specialized/solo-dev-planner-modular/04-DEPLOYMENT.md +0 -1537
  85. package/ai-config/agents/specialized/solo-dev-planner-modular/05-TESTING.md +0 -2633
  86. package/ai-config/agents/specialized/solo-dev-planner-modular/06-OPERATIONS.md +0 -5610
  87. package/ai-config/agents/specialized/solo-dev-planner-modular/INSTALL.md +0 -335
  88. package/ai-config/agents/specialized/solo-dev-planner-modular/QUICK-REFERENCE.txt +0 -215
  89. package/ai-config/agents/specialized/solo-dev-planner-modular/README.md +0 -260
  90. package/ai-config/agents/specialized/solo-dev-planner-modular/START-HERE.md +0 -379
  91. package/ai-config/agents/specialized/solo-dev-planner-modular/WORKFLOW-DIAGRAM.md +0 -355
  92. package/ai-config/agents/specialized/solo-dev-planner-modular/solo-dev-planner.md +0 -279
  93. package/ai-config/agents/specialized/template-writer.md +0 -347
  94. package/ai-config/agents/specialized/test-runner.md +0 -99
  95. package/ai-config/agents/specialized/vibekanban-smart-worker.md +0 -244
  96. package/ai-config/agents/specialized/wave-executor.md +0 -138
  97. package/ai-config/agents/specialized/workflow-optimizer.md +0 -1114
  98. package/ai-config/commands/git/changelog.md +0 -32
  99. package/ai-config/commands/git/ci-local.md +0 -70
  100. package/ai-config/commands/git/commit.md +0 -35
  101. package/ai-config/commands/git/fix-issue.md +0 -23
  102. package/ai-config/commands/git/pr-create.md +0 -42
  103. package/ai-config/commands/git/pr-review.md +0 -50
  104. package/ai-config/commands/git/worktree.md +0 -39
  105. package/ai-config/commands/refactoring/cleanup.md +0 -24
  106. package/ai-config/commands/refactoring/dead-code.md +0 -40
  107. package/ai-config/commands/refactoring/extract.md +0 -31
  108. package/ai-config/commands/testing/e2e.md +0 -30
  109. package/ai-config/commands/testing/tdd.md +0 -36
  110. package/ai-config/commands/testing/test-coverage.md +0 -30
  111. package/ai-config/commands/testing/test-fix.md +0 -24
  112. package/ai-config/commands/workflow/generate-agents-md.md +0 -85
  113. package/ai-config/commands/workflow/planning.md +0 -47
  114. package/ai-config/commands/workflows/compound.md +0 -89
  115. package/ai-config/commands/workflows/diagnose.md +0 -70
  116. package/ai-config/commands/workflows/discover.md +0 -86
  117. package/ai-config/commands/workflows/plan.md +0 -77
  118. package/ai-config/commands/workflows/review.md +0 -78
  119. package/ai-config/commands/workflows/work.md +0 -75
  120. package/ai-config/config.yaml +0 -18
  121. package/ai-config/hooks/_TEMPLATE.md +0 -96
  122. package/ai-config/hooks/block-dangerous-commands.md +0 -75
  123. package/ai-config/hooks/commit-guard.md +0 -90
  124. package/ai-config/hooks/context-loader.md +0 -73
  125. package/ai-config/hooks/improve-prompt.md +0 -91
  126. package/ai-config/hooks/learning-log.md +0 -72
  127. package/ai-config/hooks/model-router.md +0 -86
  128. package/ai-config/hooks/secret-scanner.md +0 -64
  129. package/ai-config/hooks/skill-validator.md +0 -102
  130. package/ai-config/hooks/task-artifact.md +0 -114
  131. package/ai-config/hooks/validate-workflow.md +0 -100
  132. package/ai-config/prompts/base.md +0 -71
  133. package/ai-config/prompts/modes/debug.md +0 -34
  134. package/ai-config/prompts/modes/deploy.md +0 -40
  135. package/ai-config/prompts/modes/research.md +0 -32
  136. package/ai-config/prompts/modes/review.md +0 -33
  137. package/ai-config/prompts/review-policy.md +0 -79
  138. package/ai-config/skills/_TEMPLATE.md +0 -157
  139. package/ai-config/skills/backend/api-gateway/SKILL.md +0 -254
  140. package/ai-config/skills/backend/bff-concepts/SKILL.md +0 -239
  141. package/ai-config/skills/backend/bff-spring/SKILL.md +0 -364
  142. package/ai-config/skills/backend/chi-router/SKILL.md +0 -396
  143. package/ai-config/skills/backend/error-handling/SKILL.md +0 -255
  144. package/ai-config/skills/backend/exceptions-spring/SKILL.md +0 -323
  145. package/ai-config/skills/backend/fastapi/SKILL.md +0 -302
  146. package/ai-config/skills/backend/gateway-spring/SKILL.md +0 -390
  147. package/ai-config/skills/backend/go-backend/SKILL.md +0 -457
  148. package/ai-config/skills/backend/gradle-multimodule/SKILL.md +0 -274
  149. package/ai-config/skills/backend/graphql-concepts/SKILL.md +0 -352
  150. package/ai-config/skills/backend/graphql-spring/SKILL.md +0 -398
  151. package/ai-config/skills/backend/grpc-concepts/SKILL.md +0 -283
  152. package/ai-config/skills/backend/grpc-spring/SKILL.md +0 -445
  153. package/ai-config/skills/backend/jwt-auth/SKILL.md +0 -412
  154. package/ai-config/skills/backend/notifications-concepts/SKILL.md +0 -259
  155. package/ai-config/skills/backend/recommendations-concepts/SKILL.md +0 -261
  156. package/ai-config/skills/backend/search-concepts/SKILL.md +0 -263
  157. package/ai-config/skills/backend/search-spring/SKILL.md +0 -375
  158. package/ai-config/skills/backend/spring-boot-4/SKILL.md +0 -172
  159. package/ai-config/skills/backend/websockets/SKILL.md +0 -532
  160. package/ai-config/skills/data-ai/ai-ml/SKILL.md +0 -423
  161. package/ai-config/skills/data-ai/analytics-concepts/SKILL.md +0 -195
  162. package/ai-config/skills/data-ai/analytics-spring/SKILL.md +0 -340
  163. package/ai-config/skills/data-ai/duckdb-analytics/SKILL.md +0 -440
  164. package/ai-config/skills/data-ai/langchain/SKILL.md +0 -238
  165. package/ai-config/skills/data-ai/mlflow/SKILL.md +0 -302
  166. package/ai-config/skills/data-ai/onnx-inference/SKILL.md +0 -290
  167. package/ai-config/skills/data-ai/powerbi/SKILL.md +0 -352
  168. package/ai-config/skills/data-ai/pytorch/SKILL.md +0 -274
  169. package/ai-config/skills/data-ai/scikit-learn/SKILL.md +0 -321
  170. package/ai-config/skills/data-ai/vector-db/SKILL.md +0 -301
  171. package/ai-config/skills/database/graph-databases/SKILL.md +0 -218
  172. package/ai-config/skills/database/graph-spring/SKILL.md +0 -361
  173. package/ai-config/skills/database/pgx-postgres/SKILL.md +0 -512
  174. package/ai-config/skills/database/redis-cache/SKILL.md +0 -343
  175. package/ai-config/skills/database/sqlite-embedded/SKILL.md +0 -388
  176. package/ai-config/skills/database/timescaledb/SKILL.md +0 -320
  177. package/ai-config/skills/docs/api-documentation/SKILL.md +0 -293
  178. package/ai-config/skills/docs/docs-spring/SKILL.md +0 -377
  179. package/ai-config/skills/docs/mustache-templates/SKILL.md +0 -190
  180. package/ai-config/skills/docs/technical-docs/SKILL.md +0 -447
  181. package/ai-config/skills/frontend/astro-ssr/SKILL.md +0 -441
  182. package/ai-config/skills/frontend/frontend-design/SKILL.md +0 -54
  183. package/ai-config/skills/frontend/frontend-web/SKILL.md +0 -368
  184. package/ai-config/skills/frontend/mantine-ui/SKILL.md +0 -396
  185. package/ai-config/skills/frontend/tanstack-query/SKILL.md +0 -439
  186. package/ai-config/skills/frontend/zod-validation/SKILL.md +0 -417
  187. package/ai-config/skills/frontend/zustand-state/SKILL.md +0 -350
  188. package/ai-config/skills/infrastructure/chaos-engineering/SKILL.md +0 -244
  189. package/ai-config/skills/infrastructure/chaos-spring/SKILL.md +0 -378
  190. package/ai-config/skills/infrastructure/devops-infra/SKILL.md +0 -435
  191. package/ai-config/skills/infrastructure/docker-containers/SKILL.md +0 -420
  192. package/ai-config/skills/infrastructure/kubernetes/SKILL.md +0 -456
  193. package/ai-config/skills/infrastructure/opentelemetry/SKILL.md +0 -546
  194. package/ai-config/skills/infrastructure/traefik-proxy/SKILL.md +0 -474
  195. package/ai-config/skills/infrastructure/woodpecker-ci/SKILL.md +0 -315
  196. package/ai-config/skills/mobile/ionic-capacitor/SKILL.md +0 -504
  197. package/ai-config/skills/mobile/mobile-ionic/SKILL.md +0 -448
  198. package/ai-config/skills/prompt-improver/SKILL.md +0 -125
  199. package/ai-config/skills/quality/ghagga-review/SKILL.md +0 -216
  200. package/ai-config/skills/references/hooks-patterns/SKILL.md +0 -238
  201. package/ai-config/skills/references/mcp-servers/SKILL.md +0 -275
  202. package/ai-config/skills/references/plugins-reference/SKILL.md +0 -110
  203. package/ai-config/skills/references/skills-reference/SKILL.md +0 -420
  204. package/ai-config/skills/references/subagent-templates/SKILL.md +0 -193
  205. package/ai-config/skills/systems-iot/modbus-protocol/SKILL.md +0 -410
  206. package/ai-config/skills/systems-iot/mqtt-rumqttc/SKILL.md +0 -408
  207. package/ai-config/skills/systems-iot/rust-systems/SKILL.md +0 -386
  208. package/ai-config/skills/systems-iot/tokio-async/SKILL.md +0 -324
  209. package/ai-config/skills/testing/playwright-e2e/SKILL.md +0 -289
  210. package/ai-config/skills/testing/testcontainers/SKILL.md +0 -299
  211. package/ai-config/skills/testing/vitest-testing/SKILL.md +0 -381
  212. package/ai-config/skills/workflow/ci-local-guide/SKILL.md +0 -118
  213. package/ai-config/skills/workflow/claude-automation-recommender/SKILL.md +0 -299
  214. package/ai-config/skills/workflow/claude-md-improver/SKILL.md +0 -158
  215. package/ai-config/skills/workflow/finishing-a-development-branch/SKILL.md +0 -117
  216. package/ai-config/skills/workflow/git-github/SKILL.md +0 -334
  217. package/ai-config/skills/workflow/git-github/references/examples.md +0 -160
  218. package/ai-config/skills/workflow/git-workflow/SKILL.md +0 -214
  219. package/ai-config/skills/workflow/ide-plugins/SKILL.md +0 -277
  220. package/ai-config/skills/workflow/ide-plugins-intellij/SKILL.md +0 -401
  221. package/ai-config/skills/workflow/obsidian-brain-workflow/SKILL.md +0 -199
  222. package/ai-config/skills/workflow/using-git-worktrees/SKILL.md +0 -100
  223. package/ai-config/skills/workflow/verification-before-completion/SKILL.md +0 -73
  224. package/ai-config/skills/workflow/wave-workflow/SKILL.md +0 -178
  225. package/schemas/agent.schema.json +0 -34
  226. package/schemas/ai-config.schema.json +0 -28
  227. package/schemas/plugin.schema.json +0 -62
  228. package/schemas/skill.schema.json +0 -44
@@ -1,261 +0,0 @@
1
- ---
2
- name: recommendations-concepts
3
- description: >
4
- Recommendation engine concepts. Collaborative filtering, content-based, hybrid systems.
5
- Trigger: recommendations, collaborative filtering, content-based, ML, personalization
6
- tools:
7
- - Read
8
- - Write
9
- - Edit
10
- - Grep
11
- metadata:
12
- author: apigen-team
13
- version: "1.0"
14
- tags: [recommendations, ml, personalization, algorithms]
15
- scope: ["**/recommendation/**"]
16
- ---
17
-
18
- # Recommendation System Concepts
19
-
20
- ## Types of Recommendation Systems
21
-
22
- ### Collaborative Filtering
23
- ```
24
- "Users who liked X also liked Y"
25
-
26
- User-based CF:
27
- 1. Find users similar to target user
28
- 2. Recommend items those users liked
29
- 3. Weight by similarity score
30
-
31
- Item-based CF:
32
- 1. Find items similar to items user liked
33
- 2. Recommend similar items
34
- 3. More scalable than user-based
35
-
36
- Matrix Factorization:
37
- - Decompose user-item matrix
38
- - Learn latent factors
39
- - SVD, ALS algorithms
40
- ```
41
-
42
- ### Content-Based Filtering
43
- ```
44
- "Items similar to what you've liked"
45
-
46
- Process:
47
- 1. Extract item features (TF-IDF, embeddings)
48
- 2. Build user profile from liked items
49
- 3. Match new items to user profile
50
-
51
- Features:
52
- - Text (descriptions, tags)
53
- - Categories
54
- - Attributes (color, size, brand)
55
- - Embeddings (deep learning)
56
- ```
57
-
58
- ### Hybrid Systems
59
- ```
60
- Combine multiple approaches:
61
-
62
- Weighted:
63
- score = α * CF_score + β * CB_score
64
-
65
- Switching:
66
- IF cold_start THEN content_based
67
- ELSE collaborative
68
-
69
- Feature Combination:
70
- Use CF scores as features in ML model
71
-
72
- Cascade:
73
- 1. Filter with content-based
74
- 2. Rank with collaborative
75
- ```
76
-
77
- ## Cold Start Problem
78
-
79
- ```
80
- New User Cold Start:
81
- - No interaction history
82
- - Solutions:
83
- • Ask preferences on signup
84
- • Use demographic data
85
- • Popular items fallback
86
- • Content-based until enough data
87
-
88
- New Item Cold Start:
89
- - No user interactions
90
- - Solutions:
91
- • Content-based similarity
92
- • Boost in exploration
93
- • Editorial placement
94
- ```
95
-
96
- ## Recommendation Scenarios
97
-
98
- ### E-commerce
99
- ```
100
- Homepage:
101
- - Trending products
102
- - Personalized picks
103
- - Recently viewed
104
-
105
- Product Page:
106
- - "Frequently bought together"
107
- - "Customers also viewed"
108
- - "Complete the look"
109
-
110
- Cart:
111
- - Cross-sell recommendations
112
- - Bundle suggestions
113
- ```
114
-
115
- ### Content Platforms
116
- ```
117
- Feed:
118
- - Personalized content stream
119
- - Explore/discover section
120
-
121
- After consumption:
122
- - "Up next"
123
- - "More like this"
124
- - Related content
125
-
126
- Search:
127
- - Personalized search results
128
- - "You might also search for"
129
- ```
130
-
131
- ### Social Networks
132
- ```
133
- People:
134
- - "People you may know"
135
- - "Follow suggestions"
136
-
137
- Content:
138
- - Personalized feed ranking
139
- - Trending in your network
140
-
141
- Groups:
142
- - "Groups you might like"
143
- - Activity suggestions
144
- ```
145
-
146
- ## Evaluation Metrics
147
-
148
- ### Offline Metrics
149
- ```
150
- Accuracy:
151
- - Precision@K: % of recommended items that are relevant
152
- - Recall@K: % of relevant items that are recommended
153
- - NDCG: Normalized discounted cumulative gain
154
-
155
- Error:
156
- - RMSE: Root mean square error (ratings)
157
- - MAE: Mean absolute error
158
-
159
- Ranking:
160
- - MRR: Mean reciprocal rank
161
- - AUC: Area under ROC curve
162
- ```
163
-
164
- ### Online Metrics
165
- ```
166
- Engagement:
167
- - Click-through rate (CTR)
168
- - Conversion rate
169
- - Time spent
170
-
171
- Business:
172
- - Revenue per user
173
- - Items per order
174
- - Return rate
175
-
176
- Long-term:
177
- - User retention
178
- - Diversity of consumption
179
- - Filter bubble effects
180
- ```
181
-
182
- ## A/B Testing Recommendations
183
-
184
- ```
185
- Test design:
186
- - Control: Existing algorithm
187
- - Treatment: New algorithm
188
- - Split: Random user assignment
189
-
190
- Metrics to track:
191
- - Primary: CTR or conversion
192
- - Secondary: Revenue, engagement
193
- - Guardrails: Page load time
194
-
195
- Statistical considerations:
196
- - Sample size for power
197
- - Duration for effects
198
- - Novelty effects
199
- ```
200
-
201
- ## Architecture Patterns
202
-
203
- ### Real-time vs Batch
204
- ```
205
- Batch (pre-computed):
206
- - Generate recommendations offline
207
- - Store in cache/database
208
- - Fast serving
209
- - Updated periodically
210
-
211
- Real-time:
212
- - Compute on request
213
- - Uses latest interactions
214
- - More resource intensive
215
- - Better personalization
216
-
217
- Hybrid:
218
- - Batch for base recommendations
219
- - Real-time for re-ranking
220
- - Periodic refresh
221
- ```
222
-
223
- ### Feature Store
224
- ```
225
- Store and serve ML features:
226
- - User features (preferences, history)
227
- - Item features (attributes, embeddings)
228
- - Context features (time, location)
229
-
230
- Benefits:
231
- - Consistency between training/serving
232
- - Feature reuse across models
233
- - Point-in-time correctness
234
- ```
235
-
236
- ## Data Requirements
237
-
238
- ```
239
- Implicit feedback:
240
- - Views, clicks, purchases
241
- - Time spent
242
- - Saves/bookmarks
243
-
244
- Explicit feedback:
245
- - Ratings
246
- - Reviews
247
- - Likes/dislikes
248
-
249
- Context:
250
- - Timestamp
251
- - Device
252
- - Location
253
- - Session data
254
- ```
255
-
256
- ## Related Skills
257
-
258
- - `recommendations-spring`: Spring Boot recommendation implementation
259
- - `analytics-concepts`: Analytics for tracking interactions
260
-
261
-
@@ -1,263 +0,0 @@
1
- ---
2
- name: search-concepts
3
- description: >
4
- Search engine concepts. Full-text search, indexing, relevance, facets, autocomplete.
5
- Trigger: search, Elasticsearch, Meilisearch, Algolia, full-text, indexing
6
- tools:
7
- - Read
8
- - Write
9
- - Edit
10
- - Grep
11
- metadata:
12
- author: apigen-team
13
- version: "1.0"
14
- tags: [search, elasticsearch, full-text, indexing]
15
- scope: ["**/search/**"]
16
- ---
17
-
18
- # Search Engine Concepts
19
-
20
- ## Core Concepts
21
-
22
- ### Indexing
23
- ```
24
- Document: Unit of data (product, article, user)
25
- Field: Attribute within document (title, description)
26
- Index: Collection of documents with same structure
27
- Mapping: Schema defining field types and analyzers
28
-
29
- Process:
30
- 1. Extract text content
31
- 2. Tokenize (split into terms)
32
- 3. Normalize (lowercase, stemming)
33
- 4. Build inverted index
34
- 5. Store for retrieval
35
- ```
36
-
37
- ### Inverted Index
38
- ```
39
- Forward index:
40
- Doc1 → ["quick", "brown", "fox"]
41
- Doc2 → ["lazy", "brown", "dog"]
42
-
43
- Inverted index:
44
- "brown" → [Doc1, Doc2]
45
- "quick" → [Doc1]
46
- "fox" → [Doc1]
47
- "lazy" → [Doc2]
48
- "dog" → [Doc2]
49
-
50
- Enables O(1) term lookup
51
- ```
52
-
53
- ### Text Analysis
54
- ```
55
- Analyzer pipeline:
56
- Input → Character Filter → Tokenizer → Token Filter → Tokens
57
-
58
- Character filters:
59
- - Strip HTML tags
60
- - Replace patterns
61
-
62
- Tokenizers:
63
- - Standard (split on whitespace/punctuation)
64
- - Whitespace (split on whitespace only)
65
- - N-gram (sliding window)
66
-
67
- Token filters:
68
- - Lowercase
69
- - Stemming (running → run)
70
- - Synonyms (couch → sofa)
71
- - Stop words removal
72
- ```
73
-
74
- ## Search Types
75
-
76
- ### Full-Text Search
77
- ```
78
- Query: "quick brown fox"
79
-
80
- Match:
81
- - Documents containing any term
82
- - Scored by relevance
83
-
84
- Phrase match:
85
- - Documents with exact phrase
86
- - Term order matters
87
-
88
- Multi-match:
89
- - Search across multiple fields
90
- - Weighted by field importance
91
- ```
92
-
93
- ### Fuzzy Search
94
- ```
95
- Query: "qiuck" (typo)
96
- Match: "quick" (edit distance 1)
97
-
98
- Edit distance:
99
- - Insert: quic → quick
100
- - Delete: quiick → quick
101
- - Replace: quack → quick
102
- - Transpose: qiuck → quick
103
-
104
- Useful for typo tolerance
105
- ```
106
-
107
- ### Semantic Search
108
- ```
109
- Traditional: Keyword matching
110
- Semantic: Meaning understanding
111
-
112
- Uses embeddings (vectors):
113
- 1. Convert query to embedding
114
- 2. Find similar document embeddings
115
- 3. Return nearest neighbors
116
-
117
- Requires ML model (BERT, etc.)
118
- ```
119
-
120
- ## Relevance Scoring
121
-
122
- ### TF-IDF
123
- ```
124
- TF (Term Frequency):
125
- How often term appears in document
126
- tf(t,d) = count(t in d) / count(all terms in d)
127
-
128
- IDF (Inverse Document Frequency):
129
- How rare term is across all documents
130
- idf(t) = log(N / df(t))
131
- N = total documents
132
- df(t) = documents containing term
133
-
134
- Score = TF × IDF
135
- ```
136
-
137
- ### BM25
138
- ```
139
- Improved TF-IDF:
140
- - Saturation: diminishing returns for high TF
141
- - Document length normalization
142
-
143
- BM25(t,d) = IDF(t) × (TF(t,d) × (k1 + 1)) /
144
- (TF(t,d) + k1 × (1 - b + b × |d|/avgdl))
145
-
146
- k1, b: tuning parameters
147
- |d|: document length
148
- avgdl: average document length
149
- ```
150
-
151
- ### Boosting
152
- ```
153
- Field boosting:
154
- title^3 description^1
155
- (title matches worth 3x)
156
-
157
- Query boosting:
158
- "laptop"^2 OR "computer"
159
- (laptop matches worth 2x)
160
-
161
- Function scoring:
162
- base_score * popularity_boost * recency_boost
163
- ```
164
-
165
- ## Search Features
166
-
167
- ### Faceted Search
168
- ```
169
- Facets = aggregations for filtering
170
-
171
- Example (e-commerce):
172
- Category: Electronics (150), Clothing (89)
173
- Price: $0-50 (45), $50-100 (78), $100+ (116)
174
- Brand: Apple (34), Samsung (28), Sony (21)
175
- Rating: 4+ stars (89), 3+ stars (156)
176
-
177
- User clicks "Electronics" → refines results
178
- ```
179
-
180
- ### Autocomplete
181
- ```
182
- Types:
183
- - Prefix matching: "app" → "apple", "application"
184
- - Fuzzy prefix: "apl" → "apple"
185
- - Query suggestions: based on popular searches
186
- - Completion: "new york" → "new york city"
187
-
188
- Implementation:
189
- - Edge n-grams at index time
190
- - Completion suggester
191
- - Separate autocomplete index
192
- ```
193
-
194
- ### Highlighting
195
- ```
196
- Query: "quick brown fox"
197
-
198
- Result with highlighting:
199
- "The <em>quick</em> <em>brown</em> <em>fox</em> jumps..."
200
-
201
- Options:
202
- - Fragment size
203
- - Number of fragments
204
- - Pre/post tags
205
- ```
206
-
207
- ### Pagination
208
- ```
209
- Offset-based:
210
- from=0, size=10 (page 1)
211
- from=10, size=10 (page 2)
212
- Problem: deep pagination expensive
213
-
214
- Search-after (cursor):
215
- Use sort values from last result
216
- Better for deep pagination
217
-
218
- Scroll:
219
- For large exports
220
- Not for real-time search
221
- ```
222
-
223
- ## Search Provider Comparison
224
-
225
- ```
226
- | Feature | Elasticsearch | Meilisearch | Algolia | Typesense |
227
- |---------|--------------|-------------|---------|-----------|
228
- | Hosting | Self/Cloud | Self/Cloud | SaaS | Self/Cloud |
229
- | Speed | Fast | Very fast | Very fast | Very fast |
230
- | Typo tolerance | Config | Built-in | Built-in | Built-in |
231
- | Facets | Yes | Yes | Yes | Yes |
232
- | Vectors | Yes (8.x) | Yes | No | Yes |
233
- | Pricing | Open source | Open source | Per search | Open source |
234
- ```
235
-
236
- ## Best Practices
237
-
238
- ```
239
- Indexing:
240
- ✅ Define explicit mappings
241
- ✅ Use appropriate analyzers per field
242
- ✅ Denormalize for search performance
243
- ✅ Index only searchable fields
244
-
245
- Querying:
246
- ✅ Use filters for exact matches (cached)
247
- ✅ Limit returned fields
248
- ✅ Implement pagination properly
249
- ✅ Track search analytics
250
-
251
- Operations:
252
- ✅ Monitor index size
253
- ✅ Plan for reindexing
254
- ✅ Set up aliases for zero-downtime
255
- ✅ Test relevance with query sets
256
- ```
257
-
258
- ## Related Skills
259
-
260
- - `search-spring`: Spring Boot search implementation
261
- - `apigen-architecture`: Overall system architecture
262
-
263
-