@mytechtoday/augment-extensions 0.7.0 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (483) hide show
  1. package/AGENTS.md +265 -232
  2. package/README.md +956 -771
  3. package/augment-extensions/coding-standards/bash/README.md +196 -196
  4. package/augment-extensions/coding-standards/bash/module.json +163 -163
  5. package/augment-extensions/coding-standards/bash/rules/naming-conventions.md +336 -336
  6. package/augment-extensions/coding-standards/bash/rules/universal-standards.md +289 -289
  7. package/augment-extensions/coding-standards/css/README.md +40 -40
  8. package/augment-extensions/coding-standards/css/examples/css-examples.css +550 -550
  9. package/augment-extensions/coding-standards/css/module.json +44 -44
  10. package/augment-extensions/coding-standards/css/rules/css-modern-features.md +448 -448
  11. package/augment-extensions/coding-standards/css/rules/css-standards.md +492 -492
  12. package/augment-extensions/coding-standards/html/README.md +40 -40
  13. package/augment-extensions/coding-standards/html/examples/html-examples.html +267 -267
  14. package/augment-extensions/coding-standards/html/examples/responsive-layout.html +505 -505
  15. package/augment-extensions/coding-standards/html/module.json +44 -44
  16. package/augment-extensions/coding-standards/html/rules/html-standards.md +349 -349
  17. package/augment-extensions/coding-standards/html-css-js/README.md +194 -194
  18. package/augment-extensions/coding-standards/html-css-js/examples/async-examples.js +487 -487
  19. package/augment-extensions/coding-standards/html-css-js/examples/css-examples.css +550 -550
  20. package/augment-extensions/coding-standards/html-css-js/examples/dom-examples.js +667 -667
  21. package/augment-extensions/coding-standards/html-css-js/examples/html-examples.html +267 -267
  22. package/augment-extensions/coding-standards/html-css-js/examples/javascript-examples.js +612 -612
  23. package/augment-extensions/coding-standards/html-css-js/examples/responsive-layout.html +505 -505
  24. package/augment-extensions/coding-standards/html-css-js/module.json +48 -48
  25. package/augment-extensions/coding-standards/html-css-js/rules/async-patterns.md +515 -515
  26. package/augment-extensions/coding-standards/html-css-js/rules/css-modern-features.md +448 -448
  27. package/augment-extensions/coding-standards/html-css-js/rules/css-standards.md +492 -492
  28. package/augment-extensions/coding-standards/html-css-js/rules/dom-manipulation.md +439 -439
  29. package/augment-extensions/coding-standards/html-css-js/rules/html-standards.md +349 -349
  30. package/augment-extensions/coding-standards/html-css-js/rules/javascript-standards.md +486 -486
  31. package/augment-extensions/coding-standards/html-css-js/rules/performance.md +463 -463
  32. package/augment-extensions/coding-standards/html-css-js/rules/tooling.md +543 -543
  33. package/augment-extensions/coding-standards/js/README.md +46 -46
  34. package/augment-extensions/coding-standards/js/examples/async-examples.js +487 -487
  35. package/augment-extensions/coding-standards/js/examples/dom-examples.js +667 -667
  36. package/augment-extensions/coding-standards/js/examples/javascript-examples.js +612 -612
  37. package/augment-extensions/coding-standards/js/module.json +49 -49
  38. package/augment-extensions/coding-standards/js/rules/async-patterns.md +515 -515
  39. package/augment-extensions/coding-standards/js/rules/dom-manipulation.md +439 -439
  40. package/augment-extensions/coding-standards/js/rules/javascript-standards.md +486 -486
  41. package/augment-extensions/coding-standards/js/rules/performance.md +463 -463
  42. package/augment-extensions/coding-standards/js/rules/tooling.md +543 -543
  43. package/augment-extensions/coding-standards/php/README.md +248 -248
  44. package/augment-extensions/coding-standards/php/examples/api-endpoint-example.php +204 -204
  45. package/augment-extensions/coding-standards/php/examples/cli-command-example.php +206 -206
  46. package/augment-extensions/coding-standards/php/examples/legacy-refactoring-example.php +234 -234
  47. package/augment-extensions/coding-standards/php/examples/web-application-example.php +211 -211
  48. package/augment-extensions/coding-standards/php/examples/woocommerce-extension-example.php +215 -215
  49. package/augment-extensions/coding-standards/php/examples/wordpress-plugin-example.php +189 -189
  50. package/augment-extensions/coding-standards/php/module.json +166 -166
  51. package/augment-extensions/coding-standards/php/rules/api-development.md +480 -480
  52. package/augment-extensions/coding-standards/php/rules/category-configuration.md +332 -332
  53. package/augment-extensions/coding-standards/php/rules/cli-tools.md +472 -472
  54. package/augment-extensions/coding-standards/php/rules/cms-integration.md +561 -561
  55. package/augment-extensions/coding-standards/php/rules/code-quality.md +402 -402
  56. package/augment-extensions/coding-standards/php/rules/documentation.md +425 -425
  57. package/augment-extensions/coding-standards/php/rules/ecommerce.md +627 -627
  58. package/augment-extensions/coding-standards/php/rules/error-handling.md +336 -336
  59. package/augment-extensions/coding-standards/php/rules/legacy-migration.md +677 -677
  60. package/augment-extensions/coding-standards/php/rules/naming-conventions.md +279 -279
  61. package/augment-extensions/coding-standards/php/rules/performance.md +392 -392
  62. package/augment-extensions/coding-standards/php/rules/psr-standards.md +186 -186
  63. package/augment-extensions/coding-standards/php/rules/security.md +358 -358
  64. package/augment-extensions/coding-standards/php/rules/testing.md +403 -403
  65. package/augment-extensions/coding-standards/php/rules/type-declarations.md +331 -331
  66. package/augment-extensions/coding-standards/php/rules/web-applications.md +426 -426
  67. package/augment-extensions/coding-standards/powershell/README.md +154 -154
  68. package/augment-extensions/coding-standards/powershell/examples/admin-example.ps1 +272 -272
  69. package/augment-extensions/coding-standards/powershell/examples/automation-example.ps1 +173 -173
  70. package/augment-extensions/coding-standards/powershell/examples/cloud-example.ps1 +243 -243
  71. package/augment-extensions/coding-standards/powershell/examples/cross-platform-example.ps1 +297 -297
  72. package/augment-extensions/coding-standards/powershell/examples/dsc-example.ps1 +224 -224
  73. package/augment-extensions/coding-standards/powershell/examples/legacy-migration-example.ps1 +340 -340
  74. package/augment-extensions/coding-standards/powershell/examples/module-example.psm1 +255 -255
  75. package/augment-extensions/coding-standards/powershell/module.json +165 -165
  76. package/augment-extensions/coding-standards/powershell/rules/administrative-tools.md +439 -439
  77. package/augment-extensions/coding-standards/powershell/rules/automation-scripts.md +240 -240
  78. package/augment-extensions/coding-standards/powershell/rules/cloud-orchestration.md +384 -384
  79. package/augment-extensions/coding-standards/powershell/rules/configuration-schema.md +383 -383
  80. package/augment-extensions/coding-standards/powershell/rules/cross-platform-scripts.md +482 -482
  81. package/augment-extensions/coding-standards/powershell/rules/dsc-configurations.md +296 -296
  82. package/augment-extensions/coding-standards/powershell/rules/error-handling.md +314 -314
  83. package/augment-extensions/coding-standards/powershell/rules/legacy-migrations.md +466 -466
  84. package/augment-extensions/coding-standards/powershell/rules/modules-functions.md +244 -244
  85. package/augment-extensions/coding-standards/powershell/rules/naming-conventions.md +266 -266
  86. package/augment-extensions/coding-standards/powershell/rules/performance-optimization.md +209 -209
  87. package/augment-extensions/coding-standards/powershell/rules/security-practices.md +314 -314
  88. package/augment-extensions/coding-standards/powershell/rules/testing-guidelines.md +268 -268
  89. package/augment-extensions/coding-standards/powershell/rules/universal-standards.md +197 -197
  90. package/augment-extensions/coding-standards/python/README.md +48 -48
  91. package/augment-extensions/coding-standards/python/examples/best-practices.py +373 -373
  92. package/augment-extensions/coding-standards/python/module.json +30 -30
  93. package/augment-extensions/coding-standards/python/rules/async-patterns.md +884 -884
  94. package/augment-extensions/coding-standards/python/rules/best-practices.md +232 -232
  95. package/augment-extensions/coding-standards/python/rules/code-organization.md +220 -220
  96. package/augment-extensions/coding-standards/python/rules/documentation.md +831 -831
  97. package/augment-extensions/coding-standards/python/rules/error-handling.md +1008 -1008
  98. package/augment-extensions/coding-standards/python/rules/naming-conventions.md +172 -172
  99. package/augment-extensions/coding-standards/python/rules/testing.md +409 -409
  100. package/augment-extensions/coding-standards/python/rules/tooling.md +446 -446
  101. package/augment-extensions/coding-standards/python/rules/type-hints.md +253 -253
  102. package/augment-extensions/coding-standards/react/README.md +45 -45
  103. package/augment-extensions/coding-standards/react/module.json +27 -27
  104. package/augment-extensions/coding-standards/react/rules/component-patterns.md +214 -214
  105. package/augment-extensions/coding-standards/react/rules/hooks-best-practices.md +235 -235
  106. package/augment-extensions/coding-standards/react/rules/performance.md +300 -300
  107. package/augment-extensions/coding-standards/react/rules/state-management.md +265 -265
  108. package/augment-extensions/coding-standards/react/rules/typescript-react.md +271 -271
  109. package/augment-extensions/coding-standards/typescript/README.md +45 -45
  110. package/augment-extensions/coding-standards/typescript/module.json +27 -27
  111. package/augment-extensions/coding-standards/typescript/rules/naming-conventions.md +225 -225
  112. package/augment-extensions/collections/html-css-js/README.md +82 -82
  113. package/augment-extensions/collections/html-css-js/collection.json +41 -41
  114. package/augment-extensions/domain-rules/api-design/README.md +41 -41
  115. package/augment-extensions/domain-rules/api-design/module.json +27 -27
  116. package/augment-extensions/domain-rules/api-design/rules/authentication.md +263 -263
  117. package/augment-extensions/domain-rules/api-design/rules/documentation.md +395 -395
  118. package/augment-extensions/domain-rules/api-design/rules/error-handling.md +290 -290
  119. package/augment-extensions/domain-rules/api-design/rules/graphql-api.md +313 -313
  120. package/augment-extensions/domain-rules/api-design/rules/rest-api.md +214 -214
  121. package/augment-extensions/domain-rules/api-design/rules/versioning.md +268 -268
  122. package/augment-extensions/domain-rules/database/README.md +161 -161
  123. package/augment-extensions/domain-rules/database/examples/flat-database-example.md +793 -793
  124. package/augment-extensions/domain-rules/database/examples/hybrid-database-example.md +1132 -1132
  125. package/augment-extensions/domain-rules/database/examples/nosql-document-example.md +868 -868
  126. package/augment-extensions/domain-rules/database/examples/nosql-graph-example.md +805 -805
  127. package/augment-extensions/domain-rules/database/examples/relational-schema-example.md +621 -621
  128. package/augment-extensions/domain-rules/database/examples/vector-database-example.md +965 -965
  129. package/augment-extensions/domain-rules/database/module.json +28 -28
  130. package/augment-extensions/domain-rules/database/rules/flat-databases.md +624 -624
  131. package/augment-extensions/domain-rules/database/rules/nosql-databases.md +588 -588
  132. package/augment-extensions/domain-rules/database/rules/nosql-document-stores.md +856 -856
  133. package/augment-extensions/domain-rules/database/rules/nosql-graph-databases.md +778 -778
  134. package/augment-extensions/domain-rules/database/rules/nosql-key-value-stores.md +963 -963
  135. package/augment-extensions/domain-rules/database/rules/performance-optimization.md +1076 -1076
  136. package/augment-extensions/domain-rules/database/rules/relational-databases.md +697 -697
  137. package/augment-extensions/domain-rules/database/rules/relational-indexing.md +671 -671
  138. package/augment-extensions/domain-rules/database/rules/relational-query-optimization.md +607 -607
  139. package/augment-extensions/domain-rules/database/rules/relational-schema-design.md +907 -907
  140. package/augment-extensions/domain-rules/database/rules/relational-transactions.md +783 -783
  141. package/augment-extensions/domain-rules/database/rules/security-standards.md +980 -980
  142. package/augment-extensions/domain-rules/database/rules/universal-best-practices.md +485 -485
  143. package/augment-extensions/domain-rules/database/rules/vector-databases.md +521 -521
  144. package/augment-extensions/domain-rules/database/rules/vector-embeddings.md +858 -858
  145. package/augment-extensions/domain-rules/database/rules/vector-indexing.md +934 -934
  146. package/augment-extensions/domain-rules/design/color/themes/catppuccin-latte/README.md +23 -23
  147. package/augment-extensions/domain-rules/design/color/themes/catppuccin-latte/module.json +26 -26
  148. package/augment-extensions/domain-rules/design/color/themes/catppuccin-mocha/README.md +23 -23
  149. package/augment-extensions/domain-rules/design/color/themes/catppuccin-mocha/module.json +26 -26
  150. package/augment-extensions/domain-rules/design/color/themes/dracula/README.md +23 -23
  151. package/augment-extensions/domain-rules/design/color/themes/dracula/module.json +26 -26
  152. package/augment-extensions/domain-rules/design/color/themes/gruvbox-dark/README.md +23 -23
  153. package/augment-extensions/domain-rules/design/color/themes/gruvbox-dark/module.json +26 -26
  154. package/augment-extensions/domain-rules/design/color/themes/gruvbox-light/README.md +23 -23
  155. package/augment-extensions/domain-rules/design/color/themes/gruvbox-light/module.json +26 -26
  156. package/augment-extensions/domain-rules/design/color/themes/high-contrast/README.md +27 -27
  157. package/augment-extensions/domain-rules/design/color/themes/high-contrast/module.json +26 -26
  158. package/augment-extensions/domain-rules/design/color/themes/monokai/README.md +23 -23
  159. package/augment-extensions/domain-rules/design/color/themes/monokai/module.json +26 -26
  160. package/augment-extensions/domain-rules/design/color/themes/nord/README.md +23 -23
  161. package/augment-extensions/domain-rules/design/color/themes/nord/module.json +26 -26
  162. package/augment-extensions/domain-rules/design/color/themes/one-dark/README.md +23 -23
  163. package/augment-extensions/domain-rules/design/color/themes/one-dark/module.json +26 -26
  164. package/augment-extensions/domain-rules/design/color/themes/one-light/README.md +23 -23
  165. package/augment-extensions/domain-rules/design/color/themes/one-light/module.json +26 -26
  166. package/augment-extensions/domain-rules/design/color/themes/solarized-dark/README.md +23 -23
  167. package/augment-extensions/domain-rules/design/color/themes/solarized-dark/module.json +26 -26
  168. package/augment-extensions/domain-rules/design/color/themes/solarized-light/README.md +23 -23
  169. package/augment-extensions/domain-rules/design/color/themes/solarized-light/module.json +26 -26
  170. package/augment-extensions/domain-rules/design/color/themes/tokyo-night/README.md +23 -23
  171. package/augment-extensions/domain-rules/design/color/themes/tokyo-night/module.json +26 -26
  172. package/augment-extensions/domain-rules/mcp/README.md +150 -150
  173. package/augment-extensions/domain-rules/mcp/examples/compressed-example.md +522 -522
  174. package/augment-extensions/domain-rules/mcp/examples/graph-augmented-example.md +520 -520
  175. package/augment-extensions/domain-rules/mcp/examples/hybrid-example.md +570 -570
  176. package/augment-extensions/domain-rules/mcp/examples/state-based-example.md +427 -427
  177. package/augment-extensions/domain-rules/mcp/examples/token-based-example.md +435 -435
  178. package/augment-extensions/domain-rules/mcp/examples/vector-based-example.md +502 -502
  179. package/augment-extensions/domain-rules/mcp/module.json +49 -49
  180. package/augment-extensions/domain-rules/mcp/rules/compressed-mcp.md +595 -595
  181. package/augment-extensions/domain-rules/mcp/rules/configuration.md +345 -345
  182. package/augment-extensions/domain-rules/mcp/rules/graph-augmented-mcp.md +687 -687
  183. package/augment-extensions/domain-rules/mcp/rules/hybrid-mcp.md +636 -636
  184. package/augment-extensions/domain-rules/mcp/rules/state-based-mcp.md +484 -484
  185. package/augment-extensions/domain-rules/mcp/rules/testing-validation.md +360 -360
  186. package/augment-extensions/domain-rules/mcp/rules/token-based-mcp.md +393 -393
  187. package/augment-extensions/domain-rules/mcp/rules/universal-rules.md +194 -194
  188. package/augment-extensions/domain-rules/mcp/rules/vector-based-mcp.md +625 -625
  189. package/augment-extensions/domain-rules/security/README.md +41 -41
  190. package/augment-extensions/domain-rules/security/module.json +28 -28
  191. package/augment-extensions/domain-rules/security/rules/authentication-security.md +361 -361
  192. package/augment-extensions/domain-rules/security/rules/encryption.md +208 -208
  193. package/augment-extensions/domain-rules/security/rules/input-validation.md +294 -294
  194. package/augment-extensions/domain-rules/security/rules/owasp-top-10.md +339 -339
  195. package/augment-extensions/domain-rules/security/rules/secure-coding.md +293 -293
  196. package/augment-extensions/domain-rules/security/rules/web-security.md +268 -268
  197. package/augment-extensions/domain-rules/seo-sales-marketing/ANNOUNCEMENT.md +143 -0
  198. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/README.md +140 -136
  199. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/SCHEMA-VALIDATION-REPORT.md +216 -216
  200. package/augment-extensions/domain-rules/seo-sales-marketing/TEST-VALIDATION.md +129 -0
  201. package/augment-extensions/domain-rules/seo-sales-marketing/USAGE-GUIDES.md +254 -0
  202. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/examples/brand-kit-example.yaml +292 -292
  203. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/examples/campaign-brief-example.yaml +389 -389
  204. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/examples/content-calendar-example.yaml +643 -643
  205. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/examples/email-newsletter-example.md +376 -376
  206. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/examples/landing-page-example.md +934 -934
  207. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/examples/ppc-ad-copy-example.md +301 -301
  208. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/examples/seo-blog-post-example.md +347 -347
  209. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/examples/social-media-campaign-example.md +606 -606
  210. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/module.json +50 -50
  211. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/affiliate-influencer-marketing.md +593 -593
  212. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/asset-management.md +418 -418
  213. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/brand-consistency.md +210 -210
  214. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/content-marketing.md +337 -337
  215. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/conversion-optimization.md +455 -455
  216. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/direct-sales.md +499 -499
  217. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/email-marketing.md +439 -439
  218. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/legal-compliance.md +227 -227
  219. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/ppc-advertising.md +569 -569
  220. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/seo-optimization.md +470 -470
  221. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/social-media-marketing.md +414 -414
  222. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/rules/universal-marketing.md +177 -177
  223. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/schemas/asset-inventory.schema.json +247 -247
  224. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/schemas/brand-kit.schema.json +326 -326
  225. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/schemas/campaign-brief.schema.json +342 -342
  226. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/schemas/color-palette.schema.json +223 -223
  227. package/augment-extensions/domain-rules/{marketing-standards/seo-sales-marketing → seo-sales-marketing}/schemas/content-template.schema.json +383 -383
  228. package/augment-extensions/domain-rules/wordpress/README.md +163 -163
  229. package/augment-extensions/domain-rules/wordpress/module.json +32 -32
  230. package/augment-extensions/domain-rules/wordpress/rules/coding-standards.md +617 -617
  231. package/augment-extensions/domain-rules/wordpress/rules/directory-structure.md +270 -270
  232. package/augment-extensions/domain-rules/wordpress/rules/file-patterns.md +423 -423
  233. package/augment-extensions/domain-rules/wordpress/rules/gutenberg-blocks.md +493 -493
  234. package/augment-extensions/domain-rules/wordpress/rules/performance.md +568 -568
  235. package/augment-extensions/domain-rules/wordpress/rules/plugin-development.md +510 -510
  236. package/augment-extensions/domain-rules/wordpress/rules/project-detection.md +251 -251
  237. package/augment-extensions/domain-rules/wordpress/rules/rest-api.md +501 -501
  238. package/augment-extensions/domain-rules/wordpress/rules/security.md +564 -564
  239. package/augment-extensions/domain-rules/wordpress/rules/theme-development.md +388 -388
  240. package/augment-extensions/domain-rules/wordpress/rules/woocommerce.md +441 -441
  241. package/augment-extensions/domain-rules/wordpress-plugin/README.md +139 -139
  242. package/augment-extensions/domain-rules/wordpress-plugin/examples/ajax-plugin.md +1599 -1599
  243. package/augment-extensions/domain-rules/wordpress-plugin/examples/custom-post-type-plugin.md +1727 -1727
  244. package/augment-extensions/domain-rules/wordpress-plugin/examples/gutenberg-block-plugin.md +428 -428
  245. package/augment-extensions/domain-rules/wordpress-plugin/examples/gutenberg-block.md +422 -422
  246. package/augment-extensions/domain-rules/wordpress-plugin/examples/mvc-plugin.md +1623 -1623
  247. package/augment-extensions/domain-rules/wordpress-plugin/examples/object-oriented-plugin.md +1343 -1343
  248. package/augment-extensions/domain-rules/wordpress-plugin/examples/rest-endpoint.md +734 -734
  249. package/augment-extensions/domain-rules/wordpress-plugin/examples/settings-page-plugin.md +1350 -1350
  250. package/augment-extensions/domain-rules/wordpress-plugin/examples/simple-procedural-plugin.md +503 -503
  251. package/augment-extensions/domain-rules/wordpress-plugin/examples/singleton-plugin.md +971 -971
  252. package/augment-extensions/domain-rules/wordpress-plugin/module.json +53 -53
  253. package/augment-extensions/domain-rules/wordpress-plugin/rules/activation-hooks.md +770 -770
  254. package/augment-extensions/domain-rules/wordpress-plugin/rules/admin-interface.md +874 -874
  255. package/augment-extensions/domain-rules/wordpress-plugin/rules/ajax-handlers.md +629 -629
  256. package/augment-extensions/domain-rules/wordpress-plugin/rules/asset-management.md +559 -559
  257. package/augment-extensions/domain-rules/wordpress-plugin/rules/context-providers.md +709 -709
  258. package/augment-extensions/domain-rules/wordpress-plugin/rules/cron-jobs.md +736 -736
  259. package/augment-extensions/domain-rules/wordpress-plugin/rules/database-management.md +1057 -1057
  260. package/augment-extensions/domain-rules/wordpress-plugin/rules/documentation-standards.md +463 -463
  261. package/augment-extensions/domain-rules/wordpress-plugin/rules/frontend-functionality.md +478 -478
  262. package/augment-extensions/domain-rules/wordpress-plugin/rules/gutenberg-blocks.md +818 -818
  263. package/augment-extensions/domain-rules/wordpress-plugin/rules/internationalization.md +416 -416
  264. package/augment-extensions/domain-rules/wordpress-plugin/rules/migration.md +667 -667
  265. package/augment-extensions/domain-rules/wordpress-plugin/rules/performance-optimization.md +878 -878
  266. package/augment-extensions/domain-rules/wordpress-plugin/rules/plugin-architecture.md +693 -693
  267. package/augment-extensions/domain-rules/wordpress-plugin/rules/plugin-structure.md +352 -352
  268. package/augment-extensions/domain-rules/wordpress-plugin/rules/rest-api.md +818 -818
  269. package/augment-extensions/domain-rules/wordpress-plugin/rules/scaffolding-workflow.md +624 -624
  270. package/augment-extensions/domain-rules/wordpress-plugin/rules/security-best-practices.md +866 -866
  271. package/augment-extensions/domain-rules/wordpress-plugin/rules/testing-patterns.md +1165 -1165
  272. package/augment-extensions/domain-rules/wordpress-plugin/rules/testing.md +414 -414
  273. package/augment-extensions/domain-rules/wordpress-plugin/rules/vscode-integration.md +751 -751
  274. package/augment-extensions/domain-rules/wordpress-plugin/rules/woocommerce-integration.md +949 -949
  275. package/augment-extensions/domain-rules/wordpress-plugin/rules/wordpress-org-submission.md +458 -458
  276. package/augment-extensions/examples/design-patterns/README.md +37 -37
  277. package/augment-extensions/examples/design-patterns/examples/behavioral-patterns.md +370 -370
  278. package/augment-extensions/examples/design-patterns/examples/creational-patterns.md +250 -250
  279. package/augment-extensions/examples/design-patterns/examples/structural-patterns.md +264 -264
  280. package/augment-extensions/examples/design-patterns/module.json +27 -27
  281. package/augment-extensions/examples/gutenberg-block-plugin/README.md +101 -101
  282. package/augment-extensions/examples/gutenberg-block-plugin/examples/testimonial-block.md +428 -428
  283. package/augment-extensions/examples/gutenberg-block-plugin/module.json +40 -40
  284. package/augment-extensions/examples/rest-api-plugin/README.md +98 -98
  285. package/augment-extensions/examples/rest-api-plugin/examples/task-manager-api.md +1299 -1299
  286. package/augment-extensions/examples/rest-api-plugin/module.json +40 -40
  287. package/augment-extensions/examples/woocommerce-extension/README.md +98 -98
  288. package/augment-extensions/examples/woocommerce-extension/examples/product-customizer.md +763 -763
  289. package/augment-extensions/examples/woocommerce-extension/module.json +40 -40
  290. package/augment-extensions/workflows/beads/README.md +135 -135
  291. package/augment-extensions/workflows/beads/examples/complete-workflow-example.md +278 -278
  292. package/augment-extensions/workflows/beads/module.json +55 -55
  293. package/augment-extensions/workflows/beads/rules/best-practices.md +398 -398
  294. package/augment-extensions/workflows/beads/rules/file-format.md +327 -327
  295. package/augment-extensions/workflows/beads/rules/manual-setup.md +315 -315
  296. package/augment-extensions/workflows/beads/rules/workflow.md +326 -326
  297. package/augment-extensions/workflows/beads-integration/IMPLEMENTATION-STATUS.md +145 -145
  298. package/augment-extensions/workflows/beads-integration/README.md +143 -143
  299. package/augment-extensions/workflows/beads-integration/config/defaults.json +32 -32
  300. package/augment-extensions/workflows/beads-integration/config/schema.json +140 -140
  301. package/augment-extensions/workflows/beads-integration/examples/basic-task-generation.md +293 -293
  302. package/augment-extensions/workflows/beads-integration/module.json +75 -75
  303. package/augment-extensions/workflows/beads-integration/rules/core-rules.md +219 -219
  304. package/augment-extensions/workflows/beads-integration/rules/effectiveness-standards.md +256 -256
  305. package/augment-extensions/workflows/beads-integration/rules/task-generation.md +607 -607
  306. package/augment-extensions/workflows/database/README.md +195 -195
  307. package/augment-extensions/workflows/database/ai-prompt-testing.md +295 -295
  308. package/augment-extensions/workflows/database/examples/migration-example.md +498 -498
  309. package/augment-extensions/workflows/database/examples/optimization-example.md +496 -496
  310. package/augment-extensions/workflows/database/examples/schema-design-example.md +444 -444
  311. package/augment-extensions/workflows/database/module.json +42 -42
  312. package/augment-extensions/workflows/database/rules/data-migration.md +249 -249
  313. package/augment-extensions/workflows/database/rules/documentation-standards.md +339 -339
  314. package/augment-extensions/workflows/database/rules/migration-workflow.md +352 -352
  315. package/augment-extensions/workflows/database/rules/optimization-workflow.md +435 -435
  316. package/augment-extensions/workflows/database/rules/schema-design-workflow.md +535 -535
  317. package/augment-extensions/workflows/database/rules/testing-patterns.md +305 -305
  318. package/augment-extensions/workflows/database/rules/workflow.md +458 -458
  319. package/augment-extensions/workflows/wordpress-plugin/README.md +232 -232
  320. package/augment-extensions/workflows/wordpress-plugin/ai-prompts.md +839 -839
  321. package/augment-extensions/workflows/wordpress-plugin/bead-decomposition-patterns.md +854 -854
  322. package/augment-extensions/workflows/wordpress-plugin/examples/complete-plugin-example.md +540 -540
  323. package/augment-extensions/workflows/wordpress-plugin/examples/custom-post-type-example.md +1083 -1083
  324. package/augment-extensions/workflows/wordpress-plugin/examples/feature-addition-workflow.md +669 -669
  325. package/augment-extensions/workflows/wordpress-plugin/examples/plugin-creation-workflow.md +597 -597
  326. package/augment-extensions/workflows/wordpress-plugin/examples/secure-form-handler-example.md +925 -925
  327. package/augment-extensions/workflows/wordpress-plugin/examples/security-audit-workflow.md +752 -752
  328. package/augment-extensions/workflows/wordpress-plugin/examples/wordpress-org-submission-workflow.md +773 -773
  329. package/augment-extensions/workflows/wordpress-plugin/module.json +49 -49
  330. package/augment-extensions/workflows/wordpress-plugin/rules/best-practices.md +942 -942
  331. package/augment-extensions/workflows/wordpress-plugin/rules/development-workflow.md +702 -702
  332. package/augment-extensions/workflows/wordpress-plugin/rules/submission-workflow.md +728 -728
  333. package/augment-extensions/workflows/wordpress-plugin/rules/testing-workflow.md +775 -775
  334. package/augment-extensions/writing-standards/screenplay/README.md +339 -300
  335. package/augment-extensions/writing-standards/screenplay/_templates/README.md +121 -121
  336. package/augment-extensions/writing-standards/screenplay/_templates/genre-template.md +153 -153
  337. package/augment-extensions/writing-standards/screenplay/_templates/style-template.md +243 -243
  338. package/augment-extensions/writing-standards/screenplay/_templates/theme-template.md +213 -213
  339. package/augment-extensions/writing-standards/screenplay/examples/aaa-hollywood-scene.fountain +164 -164
  340. package/augment-extensions/writing-standards/screenplay/examples/beat-sheet-example.yaml +95 -95
  341. package/augment-extensions/writing-standards/screenplay/examples/character-profile-example.yaml +116 -116
  342. package/augment-extensions/writing-standards/screenplay/examples/commercial-30sec.fountain +151 -151
  343. package/augment-extensions/writing-standards/screenplay/examples/independent-monologue.fountain +67 -67
  344. package/augment-extensions/writing-standards/screenplay/examples/news-segment.fountain +142 -142
  345. package/augment-extensions/writing-standards/screenplay/examples/plot-outline-example.yaml +184 -184
  346. package/augment-extensions/writing-standards/screenplay/examples/tv-episode-teaser.fountain +204 -204
  347. package/augment-extensions/writing-standards/screenplay/genres/README.md +181 -181
  348. package/augment-extensions/writing-standards/screenplay/genres/examples/.gitkeep +2 -2
  349. package/augment-extensions/writing-standards/screenplay/genres/module.json +70 -70
  350. package/augment-extensions/writing-standards/screenplay/genres/rules/.gitkeep +2 -2
  351. package/augment-extensions/writing-standards/screenplay/genres/rules/action.md +399 -399
  352. package/augment-extensions/writing-standards/screenplay/genres/rules/adventure.md +407 -407
  353. package/augment-extensions/writing-standards/screenplay/genres/rules/animation.md +293 -293
  354. package/augment-extensions/writing-standards/screenplay/genres/rules/biographical.md +293 -293
  355. package/augment-extensions/writing-standards/screenplay/genres/rules/comedy.md +401 -401
  356. package/augment-extensions/writing-standards/screenplay/genres/rules/documentary.md +293 -293
  357. package/augment-extensions/writing-standards/screenplay/genres/rules/drama.md +409 -409
  358. package/augment-extensions/writing-standards/screenplay/genres/rules/fantasy.md +293 -293
  359. package/augment-extensions/writing-standards/screenplay/genres/rules/historical.md +293 -293
  360. package/augment-extensions/writing-standards/screenplay/genres/rules/horror.md +268 -268
  361. package/augment-extensions/writing-standards/screenplay/genres/rules/musical.md +294 -294
  362. package/augment-extensions/writing-standards/screenplay/genres/rules/mystery.md +293 -293
  363. package/augment-extensions/writing-standards/screenplay/genres/rules/noir.md +294 -294
  364. package/augment-extensions/writing-standards/screenplay/genres/rules/romance.md +293 -293
  365. package/augment-extensions/writing-standards/screenplay/genres/rules/sci-fi.md +289 -289
  366. package/augment-extensions/writing-standards/screenplay/genres/rules/superhero.md +293 -293
  367. package/augment-extensions/writing-standards/screenplay/genres/rules/thriller.md +294 -294
  368. package/augment-extensions/writing-standards/screenplay/genres/rules/western.md +293 -293
  369. package/augment-extensions/writing-standards/screenplay/module.json +124 -124
  370. package/augment-extensions/writing-standards/screenplay/rules/aaa-hollywood-films.md +339 -339
  371. package/augment-extensions/writing-standards/screenplay/rules/ai-integration-testing.md +329 -329
  372. package/augment-extensions/writing-standards/screenplay/rules/character-development.md +169 -169
  373. package/augment-extensions/writing-standards/screenplay/rules/commercials.md +437 -437
  374. package/augment-extensions/writing-standards/screenplay/rules/dialogue-writing.md +263 -263
  375. package/augment-extensions/writing-standards/screenplay/rules/diversity-inclusion.md +261 -261
  376. package/augment-extensions/writing-standards/screenplay/rules/examples-guide.md +315 -315
  377. package/augment-extensions/writing-standards/screenplay/rules/file-organization.md +213 -0
  378. package/augment-extensions/writing-standards/screenplay/rules/formatting-validation.md +413 -413
  379. package/augment-extensions/writing-standards/screenplay/rules/fountain-format.md +372 -372
  380. package/augment-extensions/writing-standards/screenplay/rules/independent-films.md +374 -374
  381. package/augment-extensions/writing-standards/screenplay/rules/live-tv-productions.md +443 -443
  382. package/augment-extensions/writing-standards/screenplay/rules/narrative-structures.md +207 -207
  383. package/augment-extensions/writing-standards/screenplay/rules/news-broadcasts.md +444 -444
  384. package/augment-extensions/writing-standards/screenplay/rules/pacing-timing.md +331 -331
  385. package/augment-extensions/writing-standards/screenplay/rules/quality-review-checklist.md +334 -334
  386. package/augment-extensions/writing-standards/screenplay/rules/quick-reference.md +299 -299
  387. package/augment-extensions/writing-standards/screenplay/rules/screen-continuity.md +263 -263
  388. package/augment-extensions/writing-standards/screenplay/rules/streaming-content.md +412 -412
  389. package/augment-extensions/writing-standards/screenplay/rules/trope-management.md +370 -370
  390. package/augment-extensions/writing-standards/screenplay/rules/tv-series.md +374 -374
  391. package/augment-extensions/writing-standards/screenplay/rules/universal-formatting.md +339 -339
  392. package/augment-extensions/writing-standards/screenplay/rules/vscode-integration.md +277 -277
  393. package/augment-extensions/writing-standards/screenplay/rules/web-content.md +393 -393
  394. package/augment-extensions/writing-standards/screenplay/schemas/beat-sheet.json +332 -332
  395. package/augment-extensions/writing-standards/screenplay/schemas/character-profile.json +247 -247
  396. package/augment-extensions/writing-standards/screenplay/schemas/feature-selection.json +200 -200
  397. package/augment-extensions/writing-standards/screenplay/schemas/plot-outline.json +233 -233
  398. package/augment-extensions/writing-standards/screenplay/schemas/screenplay-config.json +245 -245
  399. package/augment-extensions/writing-standards/screenplay/schemas/trope-inventory.json +221 -221
  400. package/augment-extensions/writing-standards/screenplay/styles/README.md +159 -159
  401. package/augment-extensions/writing-standards/screenplay/styles/examples/.gitkeep +2 -2
  402. package/augment-extensions/writing-standards/screenplay/styles/examples/style-applications.md +1449 -1449
  403. package/augment-extensions/writing-standards/screenplay/styles/module.json +64 -64
  404. package/augment-extensions/writing-standards/screenplay/styles/rules/.gitkeep +2 -2
  405. package/augment-extensions/writing-standards/screenplay/styles/rules/dialogue-centric.md +520 -520
  406. package/augment-extensions/writing-standards/screenplay/styles/rules/ensemble.md +499 -499
  407. package/augment-extensions/writing-standards/screenplay/styles/rules/epic.md +497 -497
  408. package/augment-extensions/writing-standards/screenplay/styles/rules/experimental.md +492 -492
  409. package/augment-extensions/writing-standards/screenplay/styles/rules/flashback.md +509 -509
  410. package/augment-extensions/writing-standards/screenplay/styles/rules/linear.md +490 -490
  411. package/augment-extensions/writing-standards/screenplay/styles/rules/minimalist.md +499 -499
  412. package/augment-extensions/writing-standards/screenplay/styles/rules/non-linear.md +501 -501
  413. package/augment-extensions/writing-standards/screenplay/styles/rules/poetic.md +499 -499
  414. package/augment-extensions/writing-standards/screenplay/styles/rules/realistic.md +498 -498
  415. package/augment-extensions/writing-standards/screenplay/styles/rules/satirical.md +499 -499
  416. package/augment-extensions/writing-standards/screenplay/styles/rules/surreal.md +508 -508
  417. package/augment-extensions/writing-standards/screenplay/styles/rules/voice-over.md +500 -500
  418. package/augment-extensions/writing-standards/screenplay/themes/README.md +158 -158
  419. package/augment-extensions/writing-standards/screenplay/themes/examples/.gitkeep +2 -2
  420. package/augment-extensions/writing-standards/screenplay/themes/examples/common-mistakes-and-fixes.md +643 -643
  421. package/augment-extensions/writing-standards/screenplay/themes/examples/complete-scene-example.md +311 -311
  422. package/augment-extensions/writing-standards/screenplay/themes/examples/individual-theme-examples.md +562 -562
  423. package/augment-extensions/writing-standards/screenplay/themes/examples/multi-theme-weaving.md +538 -538
  424. package/augment-extensions/writing-standards/screenplay/themes/examples/theme-application-guide.md +432 -432
  425. package/augment-extensions/writing-standards/screenplay/themes/examples/theme-integration-across-acts.md +637 -637
  426. package/augment-extensions/writing-standards/screenplay/themes/module.json +66 -66
  427. package/augment-extensions/writing-standards/screenplay/themes/rules/.gitkeep +2 -2
  428. package/augment-extensions/writing-standards/screenplay/themes/rules/ambition.md +458 -458
  429. package/augment-extensions/writing-standards/screenplay/themes/rules/betrayal.md +490 -490
  430. package/augment-extensions/writing-standards/screenplay/themes/rules/environment.md +458 -458
  431. package/augment-extensions/writing-standards/screenplay/themes/rules/fate.md +459 -459
  432. package/augment-extensions/writing-standards/screenplay/themes/rules/friendship.md +491 -491
  433. package/augment-extensions/writing-standards/screenplay/themes/rules/growth.md +491 -491
  434. package/augment-extensions/writing-standards/screenplay/themes/rules/identity.md +490 -490
  435. package/augment-extensions/writing-standards/screenplay/themes/rules/isolation.md +464 -464
  436. package/augment-extensions/writing-standards/screenplay/themes/rules/justice.md +461 -461
  437. package/augment-extensions/writing-standards/screenplay/themes/rules/love.md +489 -489
  438. package/augment-extensions/writing-standards/screenplay/themes/rules/power.md +494 -494
  439. package/augment-extensions/writing-standards/screenplay/themes/rules/redemption.md +483 -483
  440. package/augment-extensions/writing-standards/screenplay/themes/rules/revenge.md +489 -489
  441. package/augment-extensions/writing-standards/screenplay/themes/rules/survival.md +496 -496
  442. package/augment-extensions/writing-standards/screenplay/themes/rules/technology.md +463 -463
  443. package/augment-extensions/writing-standards/screenplay/utils/__tests__/file-organization.test.ts +169 -0
  444. package/augment-extensions/writing-standards/screenplay/utils/file-organization.ts +165 -0
  445. package/cli/MODULES.md +302 -302
  446. package/cli/dist/cli.js +109 -22
  447. package/cli/dist/cli.js.map +1 -1
  448. package/cli/dist/commands/gui.d.ts.map +1 -1
  449. package/cli/dist/commands/gui.js +54 -6
  450. package/cli/dist/commands/gui.js.map +1 -1
  451. package/cli/dist/commands/init.d.ts.map +1 -1
  452. package/cli/dist/commands/init.js +76 -23
  453. package/cli/dist/commands/init.js.map +1 -1
  454. package/cli/dist/commands/self-remove.d.ts.map +1 -1
  455. package/cli/dist/commands/self-remove.js +48 -74
  456. package/cli/dist/commands/self-remove.js.map +1 -1
  457. package/cli/dist/commands/show.d.ts +11 -0
  458. package/cli/dist/commands/show.d.ts.map +1 -1
  459. package/cli/dist/commands/show.js +120 -0
  460. package/cli/dist/commands/show.js.map +1 -1
  461. package/cli/dist/commands/showCompleted.d.ts +21 -0
  462. package/cli/dist/commands/showCompleted.d.ts.map +1 -0
  463. package/cli/dist/commands/showCompleted.js +225 -0
  464. package/cli/dist/commands/showCompleted.js.map +1 -0
  465. package/cli/dist/commands/skill.js +88 -88
  466. package/cli/dist/commands/update.d.ts +2 -0
  467. package/cli/dist/commands/update.d.ts.map +1 -1
  468. package/cli/dist/commands/update.js +67 -1
  469. package/cli/dist/commands/update.js.map +1 -1
  470. package/cli/dist/utils/beadsCompletedChecker.d.ts +72 -0
  471. package/cli/dist/utils/beadsCompletedChecker.d.ts.map +1 -0
  472. package/cli/dist/utils/beadsCompletedChecker.js +198 -0
  473. package/cli/dist/utils/beadsCompletedChecker.js.map +1 -0
  474. package/cli/dist/utils/catalog-sync.js +13 -13
  475. package/cli/dist/utils/extractCommandHelp.d.ts +51 -0
  476. package/cli/dist/utils/extractCommandHelp.d.ts.map +1 -0
  477. package/cli/dist/utils/extractCommandHelp.js +250 -0
  478. package/cli/dist/utils/extractCommandHelp.js.map +1 -0
  479. package/cli/dist/utils/install-rules.js +55 -55
  480. package/cli/dist/utils/mcp-integration.js +44 -44
  481. package/cli/dist/utils/rule-install-hooks.js +8 -8
  482. package/modules.md +667 -630
  483. package/package.json +85 -85
@@ -1,502 +1,502 @@
1
- # Vector-Based MCP Example: Knowledge Base Q&A
2
-
3
- ## Use Case
4
-
5
- A knowledge base Q&A system that retrieves relevant documentation using semantic search (RAG - Retrieval Augmented Generation).
6
-
7
- **Challenges**:
8
- - Large knowledge base (10,000+ documents)
9
- - Semantic similarity search
10
- - Relevance ranking
11
- - Context window management
12
-
13
- ---
14
-
15
- ## Configuration
16
-
17
- ```json
18
- {
19
- "mcp": {
20
- "type": "vector",
21
- "embedding": {
22
- "model": "voyage-3-large",
23
- "dimensions": 1024,
24
- "batchSize": 100
25
- },
26
- "vectorStore": {
27
- "backend": "pinecone",
28
- "index": "knowledge-base",
29
- "metric": "cosine"
30
- },
31
- "retrieval": {
32
- "topK": 10,
33
- "minScore": 0.7,
34
- "rerankingEnabled": true,
35
- "rerankingModel": "cohere-rerank-v3"
36
- },
37
- "chunking": {
38
- "strategy": "semantic",
39
- "chunkSize": 512,
40
- "overlap": 50
41
- }
42
- }
43
- }
44
- ```
45
-
46
- ---
47
-
48
- ## Architecture
49
-
50
- ```
51
- ┌─────────────────────────────────────────────────────────────┐
52
- │ Knowledge Base (10,000 documents) │
53
- └─────────────────────────────────────────────────────────────┘
54
-
55
-
56
- ┌─────────────────────────────────────────────────────────────┐
57
- │ Document Processing Pipeline │
58
- │ ┌──────────────────────────────────────────────────────┐ │
59
- │ │ 1. Chunking (semantic, 512 tokens, 50 overlap) │ │
60
- │ └──────────────────────────────────────────────────────┘ │
61
- │ ┌──────────────────────────────────────────────────────┐ │
62
- │ │ 2. Embedding (voyage-3-large, 1024 dims) │ │
63
- │ └──────────────────────────────────────────────────────┘ │
64
- │ ┌──────────────────────────────────────────────────────┐ │
65
- │ │ 3. Indexing (Pinecone, cosine similarity) │ │
66
- │ └──────────────────────────────────────────────────────┘ │
67
- └─────────────────────────────────────────────────────────────┘
68
-
69
-
70
- ┌─────────────────────────────────────────────────────────────┐
71
- │ Query Processing │
72
- │ User Query → Embed → Vector Search → Rerank → Context │
73
- └─────────────────────────────────────────────────────────────┘
74
-
75
-
76
- ┌─────────────────────────────────────────────────────────────┐
77
- │ LLM Generation │
78
- │ Context + Query → GPT-4o → Answer │
79
- └─────────────────────────────────────────────────────────────┘
80
- ```
81
-
82
- ---
83
-
84
- ## Implementation
85
-
86
- ### Step 1: Document Processor
87
-
88
- ```python
89
- import voyageai
90
- from pinecone import Pinecone
91
- from typing import List, Dict, Any
92
- import hashlib
93
-
94
- class DocumentProcessor:
95
- def __init__(self, voyage_api_key: str, pinecone_api_key: str, index_name: str):
96
- self.voyage_client = voyageai.Client(api_key=voyage_api_key)
97
- self.pc = Pinecone(api_key=pinecone_api_key)
98
- self.index = self.pc.Index(index_name)
99
- self.chunk_size = 512
100
- self.overlap = 50
101
-
102
- def chunk_document(self, text: str, metadata: Dict[str, Any]) -> List[Dict[str, Any]]:
103
- """Chunk document using semantic chunking"""
104
- # Simple sentence-based chunking (use semantic chunking in production)
105
- sentences = text.split('. ')
106
- chunks = []
107
- current_chunk = []
108
- current_length = 0
109
-
110
- for sentence in sentences:
111
- sentence_length = len(sentence.split())
112
-
113
- if current_length + sentence_length > self.chunk_size:
114
- # Save current chunk
115
- chunk_text = '. '.join(current_chunk) + '.'
116
- chunk_id = hashlib.md5(chunk_text.encode()).hexdigest()
117
-
118
- chunks.append({
119
- "id": chunk_id,
120
- "text": chunk_text,
121
- "metadata": {
122
- **metadata,
123
- "chunk_index": len(chunks),
124
- "chunk_length": current_length
125
- }
126
- })
127
-
128
- # Start new chunk with overlap
129
- overlap_sentences = current_chunk[-self.overlap:] if len(current_chunk) > self.overlap else current_chunk
130
- current_chunk = overlap_sentences + [sentence]
131
- current_length = sum(len(s.split()) for s in current_chunk)
132
- else:
133
- current_chunk.append(sentence)
134
- current_length += sentence_length
135
-
136
- # Add final chunk
137
- if current_chunk:
138
- chunk_text = '. '.join(current_chunk) + '.'
139
- chunk_id = hashlib.md5(chunk_text.encode()).hexdigest()
140
- chunks.append({
141
- "id": chunk_id,
142
- "text": chunk_text,
143
- "metadata": {
144
- **metadata,
145
- "chunk_index": len(chunks),
146
- "chunk_length": current_length
147
- }
148
- })
149
-
150
- return chunks
151
-
152
- def embed_chunks(self, chunks: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
153
- """Generate embeddings for chunks"""
154
- texts = [chunk["text"] for chunk in chunks]
155
-
156
- # Batch embedding
157
- embeddings = self.voyage_client.embed(
158
- texts=texts,
159
- model="voyage-3-large",
160
- input_type="document"
161
- ).embeddings
162
-
163
- for chunk, embedding in zip(chunks, embeddings):
164
- chunk["embedding"] = embedding
165
-
166
- return chunks
167
-
168
- def index_chunks(self, chunks: List[Dict[str, Any]]) -> None:
169
- """Index chunks in Pinecone"""
170
- vectors = [
171
- {
172
- "id": chunk["id"],
173
- "values": chunk["embedding"],
174
- "metadata": {
175
- "text": chunk["text"],
176
- **chunk["metadata"]
177
- }
178
- }
179
- for chunk in chunks
180
- ]
181
-
182
- # Upsert in batches
183
- batch_size = 100
184
- for i in range(0, len(vectors), batch_size):
185
- batch = vectors[i:i+batch_size]
186
- self.index.upsert(vectors=batch)
187
-
188
- def process_document(self, text: str, metadata: Dict[str, Any]) -> None:
189
- """Full pipeline: chunk → embed → index"""
190
- chunks = self.chunk_document(text, metadata)
191
- chunks_with_embeddings = self.embed_chunks(chunks)
192
- self.index_chunks(chunks_with_embeddings)
193
- ```
194
-
195
- ### Step 2: Retrieval System
196
-
197
- ```python
198
- import cohere
199
- import openai
200
- from typing import List, Dict, Any
201
-
202
- class KnowledgeBaseQA:
203
- def __init__(
204
- self,
205
- voyage_api_key: str,
206
- pinecone_api_key: str,
207
- cohere_api_key: str,
208
- openai_api_key: str,
209
- index_name: str
210
- ):
211
- self.voyage_client = voyageai.Client(api_key=voyage_api_key)
212
- self.pc = Pinecone(api_key=pinecone_api_key)
213
- self.index = self.pc.Index(index_name)
214
- self.cohere_client = cohere.Client(api_key=cohere_api_key)
215
- self.openai_client = openai.OpenAI(api_key=openai_api_key)
216
- self.top_k = 10
217
- self.min_score = 0.7
218
-
219
- def retrieve(self, query: str) -> List[Dict[str, Any]]:
220
- """Retrieve relevant chunks using vector search"""
221
- # Embed query
222
- query_embedding = self.voyage_client.embed(
223
- texts=[query],
224
- model="voyage-3-large",
225
- input_type="query"
226
- ).embeddings[0]
227
-
228
- # Vector search
229
- results = self.index.query(
230
- vector=query_embedding,
231
- top_k=self.top_k,
232
- include_metadata=True
233
- )
234
-
235
- # Filter by score
236
- filtered_results = [
237
- {
238
- "id": match.id,
239
- "score": match.score,
240
- "text": match.metadata["text"],
241
- "metadata": {k: v for k, v in match.metadata.items() if k != "text"}
242
- }
243
- for match in results.matches
244
- if match.score >= self.min_score
245
- ]
246
-
247
- return filtered_results
248
-
249
- def rerank(self, query: str, documents: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
250
- """Rerank documents using Cohere Rerank"""
251
- if not documents:
252
- return []
253
-
254
- # Prepare documents for reranking
255
- texts = [doc["text"] for doc in documents]
256
-
257
- # Rerank
258
- rerank_results = self.cohere_client.rerank(
259
- model="rerank-english-v3.0",
260
- query=query,
261
- documents=texts,
262
- top_n=5
263
- )
264
-
265
- # Map back to original documents
266
- reranked_docs = []
267
- for result in rerank_results.results:
268
- doc = documents[result.index]
269
- doc["rerank_score"] = result.relevance_score
270
- reranked_docs.append(doc)
271
-
272
- return reranked_docs
273
-
274
- def answer_question(self, query: str) -> Dict[str, Any]:
275
- """Answer question using RAG"""
276
- # Retrieve relevant documents
277
- retrieved_docs = self.retrieve(query)
278
-
279
- if not retrieved_docs:
280
- return {
281
- "answer": "I couldn't find relevant information to answer your question.",
282
- "sources": []
283
- }
284
-
285
- # Rerank documents
286
- reranked_docs = self.rerank(query, retrieved_docs)
287
-
288
- # Build context from top documents
289
- context_parts = []
290
- for i, doc in enumerate(reranked_docs[:5], 1):
291
- source = doc["metadata"].get("source", "Unknown")
292
- context_parts.append(f"[{i}] {doc['text']}\nSource: {source}")
293
-
294
- context = "\n\n".join(context_parts)
295
-
296
- # Generate answer
297
- messages = [
298
- {
299
- "role": "system",
300
- "content": """You are a helpful assistant that answers questions based on the provided context.
301
-
302
- Rules:
303
- 1. Only use information from the provided context
304
- 2. If the context doesn't contain the answer, say so
305
- 3. Cite sources using [1], [2], etc.
306
- 4. Be concise and accurate"""
307
- },
308
- {
309
- "role": "user",
310
- "content": f"""Context:
311
- {context}
312
-
313
- Question: {query}
314
-
315
- Answer:"""
316
- }
317
- ]
318
-
319
- response = self.openai_client.chat.completions.create(
320
- model="gpt-4o",
321
- messages=messages,
322
- temperature=0.3
323
- )
324
-
325
- answer = response.choices[0].message.content
326
-
327
- return {
328
- "answer": answer,
329
- "sources": [
330
- {
331
- "text": doc["text"],
332
- "score": doc["score"],
333
- "rerank_score": doc["rerank_score"],
334
- "metadata": doc["metadata"]
335
- }
336
- for doc in reranked_docs
337
- ]
338
- }
339
- ```
340
-
341
- ---
342
-
343
- ## Usage Example
344
-
345
- ```python
346
- # Initialize processor
347
- processor = DocumentProcessor(
348
- voyage_api_key="your-voyage-key",
349
- pinecone_api_key="your-pinecone-key",
350
- index_name="knowledge-base"
351
- )
352
-
353
- # Process documents
354
- documents = [
355
- {
356
- "text": "Python is a high-level programming language...",
357
- "metadata": {"source": "python-docs.pdf", "category": "programming"}
358
- },
359
- {
360
- "text": "Machine learning is a subset of artificial intelligence...",
361
- "metadata": {"source": "ml-guide.pdf", "category": "ai"}
362
- }
363
- ]
364
-
365
- for doc in documents:
366
- processor.process_document(doc["text"], doc["metadata"])
367
-
368
- # Initialize Q&A system
369
- qa_system = KnowledgeBaseQA(
370
- voyage_api_key="your-voyage-key",
371
- pinecone_api_key="your-pinecone-key",
372
- cohere_api_key="your-cohere-key",
373
- openai_api_key="your-openai-key",
374
- index_name="knowledge-base"
375
- )
376
-
377
- # Ask questions
378
- result = qa_system.answer_question("What is Python?")
379
- print(f"Answer: {result['answer']}")
380
- print(f"\nSources:")
381
- for i, source in enumerate(result['sources'], 1):
382
- print(f"[{i}] Score: {source['score']:.3f}, Rerank: {source['rerank_score']:.3f}")
383
- print(f" {source['metadata']['source']}")
384
- ```
385
-
386
- ---
387
-
388
- ## Key Features
389
-
390
- ### 1. Semantic Chunking
391
- - Sentence-based chunking with overlap
392
- - Preserves semantic coherence
393
- - Configurable chunk size
394
-
395
- ### 2. Vector Embeddings
396
- - Voyage AI embeddings (1024 dimensions)
397
- - Batch processing for efficiency
398
- - Separate embeddings for documents and queries
399
-
400
- ### 3. Vector Search
401
- - Pinecone vector database
402
- - Cosine similarity metric
403
- - Top-K retrieval with score filtering
404
-
405
- ### 4. Reranking
406
- - Cohere Rerank for improved relevance
407
- - Cross-encoder architecture
408
- - Top-N selection after reranking
409
-
410
- ### 5. RAG Generation
411
- - Context-aware answer generation
412
- - Source citation
413
- - Hallucination prevention
414
-
415
- ---
416
-
417
- ## Testing
418
-
419
- ```python
420
- import pytest
421
- from unittest.mock import Mock, patch
422
-
423
- def test_chunking():
424
- processor = DocumentProcessor("key", "key", "index")
425
-
426
- text = "Sentence one. Sentence two. Sentence three."
427
- metadata = {"source": "test.txt"}
428
-
429
- chunks = processor.chunk_document(text, metadata)
430
-
431
- assert len(chunks) > 0
432
- assert all("text" in chunk for chunk in chunks)
433
- assert all("metadata" in chunk for chunk in chunks)
434
-
435
- def test_retrieval():
436
- qa_system = KnowledgeBaseQA("key", "key", "key", "key", "index")
437
-
438
- with patch.object(qa_system.index, 'query') as mock_query:
439
- mock_query.return_value = Mock(matches=[
440
- Mock(id="1", score=0.9, metadata={"text": "Test document"})
441
- ])
442
-
443
- results = qa_system.retrieve("test query")
444
-
445
- assert len(results) > 0
446
- assert results[0]["score"] >= qa_system.min_score
447
-
448
- def test_answer_generation():
449
- qa_system = KnowledgeBaseQA("key", "key", "key", "key", "index")
450
-
451
- with patch.object(qa_system, 'retrieve') as mock_retrieve:
452
- mock_retrieve.return_value = [
453
- {"id": "1", "score": 0.9, "text": "Python is a programming language", "metadata": {}}
454
- ]
455
-
456
- with patch.object(qa_system, 'rerank') as mock_rerank:
457
- mock_rerank.return_value = [
458
- {"id": "1", "score": 0.9, "rerank_score": 0.95, "text": "Python is a programming language", "metadata": {}}
459
- ]
460
-
461
- result = qa_system.answer_question("What is Python?")
462
-
463
- assert "answer" in result
464
- assert "sources" in result
465
- ```
466
-
467
- ---
468
-
469
- ## Best Practices
470
-
471
- 1. **Chunking Strategy**: Use semantic chunking for better coherence
472
- 2. **Embedding Model**: Choose model based on domain (general vs. specialized)
473
- 3. **Vector Store**: Select based on scale (Pinecone, Weaviate, Qdrant)
474
- 4. **Reranking**: Always rerank for improved relevance
475
- 5. **Metadata**: Include rich metadata for filtering and attribution
476
- 6. **Monitoring**: Track retrieval quality metrics (MRR, NDCG)
477
- 7. **Caching**: Cache embeddings for frequently queried documents
478
- 8. **Hybrid Search**: Combine vector search with keyword search
479
-
480
- ---
481
-
482
- ## Performance Optimization
483
-
484
- - **Batch Embedding**: Process documents in batches (100-1000)
485
- - **Async Processing**: Use async I/O for embedding and indexing
486
- - **Index Optimization**: Use appropriate index type (HNSW, IVF)
487
- - **Caching**: Cache query embeddings for repeated queries
488
- - **Compression**: Use quantization for reduced storage
489
- - **Sharding**: Distribute index across multiple shards
490
-
491
- ---
492
-
493
- ## Security
494
-
495
- - **Access Control**: Implement document-level access control
496
- - **Data Encryption**: Encrypt vectors at rest
497
- - **Query Sanitization**: Sanitize user queries
498
- - **Rate Limiting**: Limit queries per user
499
- - **Audit Logging**: Log all retrieval operations
500
- - **PII Filtering**: Remove PII before indexing
501
- ```
502
-
1
+ # Vector-Based MCP Example: Knowledge Base Q&A
2
+
3
+ ## Use Case
4
+
5
+ A knowledge base Q&A system that retrieves relevant documentation using semantic search (RAG - Retrieval Augmented Generation).
6
+
7
+ **Challenges**:
8
+ - Large knowledge base (10,000+ documents)
9
+ - Semantic similarity search
10
+ - Relevance ranking
11
+ - Context window management
12
+
13
+ ---
14
+
15
+ ## Configuration
16
+
17
+ ```json
18
+ {
19
+ "mcp": {
20
+ "type": "vector",
21
+ "embedding": {
22
+ "model": "voyage-3-large",
23
+ "dimensions": 1024,
24
+ "batchSize": 100
25
+ },
26
+ "vectorStore": {
27
+ "backend": "pinecone",
28
+ "index": "knowledge-base",
29
+ "metric": "cosine"
30
+ },
31
+ "retrieval": {
32
+ "topK": 10,
33
+ "minScore": 0.7,
34
+ "rerankingEnabled": true,
35
+ "rerankingModel": "cohere-rerank-v3"
36
+ },
37
+ "chunking": {
38
+ "strategy": "semantic",
39
+ "chunkSize": 512,
40
+ "overlap": 50
41
+ }
42
+ }
43
+ }
44
+ ```
45
+
46
+ ---
47
+
48
+ ## Architecture
49
+
50
+ ```
51
+ ┌─────────────────────────────────────────────────────────────┐
52
+ │ Knowledge Base (10,000 documents) │
53
+ └─────────────────────────────────────────────────────────────┘
54
+
55
+
56
+ ┌─────────────────────────────────────────────────────────────┐
57
+ │ Document Processing Pipeline │
58
+ │ ┌──────────────────────────────────────────────────────┐ │
59
+ │ │ 1. Chunking (semantic, 512 tokens, 50 overlap) │ │
60
+ │ └──────────────────────────────────────────────────────┘ │
61
+ │ ┌──────────────────────────────────────────────────────┐ │
62
+ │ │ 2. Embedding (voyage-3-large, 1024 dims) │ │
63
+ │ └──────────────────────────────────────────────────────┘ │
64
+ │ ┌──────────────────────────────────────────────────────┐ │
65
+ │ │ 3. Indexing (Pinecone, cosine similarity) │ │
66
+ │ └──────────────────────────────────────────────────────┘ │
67
+ └─────────────────────────────────────────────────────────────┘
68
+
69
+
70
+ ┌─────────────────────────────────────────────────────────────┐
71
+ │ Query Processing │
72
+ │ User Query → Embed → Vector Search → Rerank → Context │
73
+ └─────────────────────────────────────────────────────────────┘
74
+
75
+
76
+ ┌─────────────────────────────────────────────────────────────┐
77
+ │ LLM Generation │
78
+ │ Context + Query → GPT-4o → Answer │
79
+ └─────────────────────────────────────────────────────────────┘
80
+ ```
81
+
82
+ ---
83
+
84
+ ## Implementation
85
+
86
+ ### Step 1: Document Processor
87
+
88
+ ```python
89
+ import voyageai
90
+ from pinecone import Pinecone
91
+ from typing import List, Dict, Any
92
+ import hashlib
93
+
94
+ class DocumentProcessor:
95
+ def __init__(self, voyage_api_key: str, pinecone_api_key: str, index_name: str):
96
+ self.voyage_client = voyageai.Client(api_key=voyage_api_key)
97
+ self.pc = Pinecone(api_key=pinecone_api_key)
98
+ self.index = self.pc.Index(index_name)
99
+ self.chunk_size = 512
100
+ self.overlap = 50
101
+
102
+ def chunk_document(self, text: str, metadata: Dict[str, Any]) -> List[Dict[str, Any]]:
103
+ """Chunk document using semantic chunking"""
104
+ # Simple sentence-based chunking (use semantic chunking in production)
105
+ sentences = text.split('. ')
106
+ chunks = []
107
+ current_chunk = []
108
+ current_length = 0
109
+
110
+ for sentence in sentences:
111
+ sentence_length = len(sentence.split())
112
+
113
+ if current_length + sentence_length > self.chunk_size:
114
+ # Save current chunk
115
+ chunk_text = '. '.join(current_chunk) + '.'
116
+ chunk_id = hashlib.md5(chunk_text.encode()).hexdigest()
117
+
118
+ chunks.append({
119
+ "id": chunk_id,
120
+ "text": chunk_text,
121
+ "metadata": {
122
+ **metadata,
123
+ "chunk_index": len(chunks),
124
+ "chunk_length": current_length
125
+ }
126
+ })
127
+
128
+ # Start new chunk with overlap
129
+ overlap_sentences = current_chunk[-self.overlap:] if len(current_chunk) > self.overlap else current_chunk
130
+ current_chunk = overlap_sentences + [sentence]
131
+ current_length = sum(len(s.split()) for s in current_chunk)
132
+ else:
133
+ current_chunk.append(sentence)
134
+ current_length += sentence_length
135
+
136
+ # Add final chunk
137
+ if current_chunk:
138
+ chunk_text = '. '.join(current_chunk) + '.'
139
+ chunk_id = hashlib.md5(chunk_text.encode()).hexdigest()
140
+ chunks.append({
141
+ "id": chunk_id,
142
+ "text": chunk_text,
143
+ "metadata": {
144
+ **metadata,
145
+ "chunk_index": len(chunks),
146
+ "chunk_length": current_length
147
+ }
148
+ })
149
+
150
+ return chunks
151
+
152
+ def embed_chunks(self, chunks: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
153
+ """Generate embeddings for chunks"""
154
+ texts = [chunk["text"] for chunk in chunks]
155
+
156
+ # Batch embedding
157
+ embeddings = self.voyage_client.embed(
158
+ texts=texts,
159
+ model="voyage-3-large",
160
+ input_type="document"
161
+ ).embeddings
162
+
163
+ for chunk, embedding in zip(chunks, embeddings):
164
+ chunk["embedding"] = embedding
165
+
166
+ return chunks
167
+
168
+ def index_chunks(self, chunks: List[Dict[str, Any]]) -> None:
169
+ """Index chunks in Pinecone"""
170
+ vectors = [
171
+ {
172
+ "id": chunk["id"],
173
+ "values": chunk["embedding"],
174
+ "metadata": {
175
+ "text": chunk["text"],
176
+ **chunk["metadata"]
177
+ }
178
+ }
179
+ for chunk in chunks
180
+ ]
181
+
182
+ # Upsert in batches
183
+ batch_size = 100
184
+ for i in range(0, len(vectors), batch_size):
185
+ batch = vectors[i:i+batch_size]
186
+ self.index.upsert(vectors=batch)
187
+
188
+ def process_document(self, text: str, metadata: Dict[str, Any]) -> None:
189
+ """Full pipeline: chunk → embed → index"""
190
+ chunks = self.chunk_document(text, metadata)
191
+ chunks_with_embeddings = self.embed_chunks(chunks)
192
+ self.index_chunks(chunks_with_embeddings)
193
+ ```
194
+
195
+ ### Step 2: Retrieval System
196
+
197
+ ```python
198
+ import cohere
199
+ import openai
200
+ from typing import List, Dict, Any
201
+
202
+ class KnowledgeBaseQA:
203
+ def __init__(
204
+ self,
205
+ voyage_api_key: str,
206
+ pinecone_api_key: str,
207
+ cohere_api_key: str,
208
+ openai_api_key: str,
209
+ index_name: str
210
+ ):
211
+ self.voyage_client = voyageai.Client(api_key=voyage_api_key)
212
+ self.pc = Pinecone(api_key=pinecone_api_key)
213
+ self.index = self.pc.Index(index_name)
214
+ self.cohere_client = cohere.Client(api_key=cohere_api_key)
215
+ self.openai_client = openai.OpenAI(api_key=openai_api_key)
216
+ self.top_k = 10
217
+ self.min_score = 0.7
218
+
219
+ def retrieve(self, query: str) -> List[Dict[str, Any]]:
220
+ """Retrieve relevant chunks using vector search"""
221
+ # Embed query
222
+ query_embedding = self.voyage_client.embed(
223
+ texts=[query],
224
+ model="voyage-3-large",
225
+ input_type="query"
226
+ ).embeddings[0]
227
+
228
+ # Vector search
229
+ results = self.index.query(
230
+ vector=query_embedding,
231
+ top_k=self.top_k,
232
+ include_metadata=True
233
+ )
234
+
235
+ # Filter by score
236
+ filtered_results = [
237
+ {
238
+ "id": match.id,
239
+ "score": match.score,
240
+ "text": match.metadata["text"],
241
+ "metadata": {k: v for k, v in match.metadata.items() if k != "text"}
242
+ }
243
+ for match in results.matches
244
+ if match.score >= self.min_score
245
+ ]
246
+
247
+ return filtered_results
248
+
249
+ def rerank(self, query: str, documents: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
250
+ """Rerank documents using Cohere Rerank"""
251
+ if not documents:
252
+ return []
253
+
254
+ # Prepare documents for reranking
255
+ texts = [doc["text"] for doc in documents]
256
+
257
+ # Rerank
258
+ rerank_results = self.cohere_client.rerank(
259
+ model="rerank-english-v3.0",
260
+ query=query,
261
+ documents=texts,
262
+ top_n=5
263
+ )
264
+
265
+ # Map back to original documents
266
+ reranked_docs = []
267
+ for result in rerank_results.results:
268
+ doc = documents[result.index]
269
+ doc["rerank_score"] = result.relevance_score
270
+ reranked_docs.append(doc)
271
+
272
+ return reranked_docs
273
+
274
+ def answer_question(self, query: str) -> Dict[str, Any]:
275
+ """Answer question using RAG"""
276
+ # Retrieve relevant documents
277
+ retrieved_docs = self.retrieve(query)
278
+
279
+ if not retrieved_docs:
280
+ return {
281
+ "answer": "I couldn't find relevant information to answer your question.",
282
+ "sources": []
283
+ }
284
+
285
+ # Rerank documents
286
+ reranked_docs = self.rerank(query, retrieved_docs)
287
+
288
+ # Build context from top documents
289
+ context_parts = []
290
+ for i, doc in enumerate(reranked_docs[:5], 1):
291
+ source = doc["metadata"].get("source", "Unknown")
292
+ context_parts.append(f"[{i}] {doc['text']}\nSource: {source}")
293
+
294
+ context = "\n\n".join(context_parts)
295
+
296
+ # Generate answer
297
+ messages = [
298
+ {
299
+ "role": "system",
300
+ "content": """You are a helpful assistant that answers questions based on the provided context.
301
+
302
+ Rules:
303
+ 1. Only use information from the provided context
304
+ 2. If the context doesn't contain the answer, say so
305
+ 3. Cite sources using [1], [2], etc.
306
+ 4. Be concise and accurate"""
307
+ },
308
+ {
309
+ "role": "user",
310
+ "content": f"""Context:
311
+ {context}
312
+
313
+ Question: {query}
314
+
315
+ Answer:"""
316
+ }
317
+ ]
318
+
319
+ response = self.openai_client.chat.completions.create(
320
+ model="gpt-4o",
321
+ messages=messages,
322
+ temperature=0.3
323
+ )
324
+
325
+ answer = response.choices[0].message.content
326
+
327
+ return {
328
+ "answer": answer,
329
+ "sources": [
330
+ {
331
+ "text": doc["text"],
332
+ "score": doc["score"],
333
+ "rerank_score": doc["rerank_score"],
334
+ "metadata": doc["metadata"]
335
+ }
336
+ for doc in reranked_docs
337
+ ]
338
+ }
339
+ ```
340
+
341
+ ---
342
+
343
+ ## Usage Example
344
+
345
+ ```python
346
+ # Initialize processor
347
+ processor = DocumentProcessor(
348
+ voyage_api_key="your-voyage-key",
349
+ pinecone_api_key="your-pinecone-key",
350
+ index_name="knowledge-base"
351
+ )
352
+
353
+ # Process documents
354
+ documents = [
355
+ {
356
+ "text": "Python is a high-level programming language...",
357
+ "metadata": {"source": "python-docs.pdf", "category": "programming"}
358
+ },
359
+ {
360
+ "text": "Machine learning is a subset of artificial intelligence...",
361
+ "metadata": {"source": "ml-guide.pdf", "category": "ai"}
362
+ }
363
+ ]
364
+
365
+ for doc in documents:
366
+ processor.process_document(doc["text"], doc["metadata"])
367
+
368
+ # Initialize Q&A system
369
+ qa_system = KnowledgeBaseQA(
370
+ voyage_api_key="your-voyage-key",
371
+ pinecone_api_key="your-pinecone-key",
372
+ cohere_api_key="your-cohere-key",
373
+ openai_api_key="your-openai-key",
374
+ index_name="knowledge-base"
375
+ )
376
+
377
+ # Ask questions
378
+ result = qa_system.answer_question("What is Python?")
379
+ print(f"Answer: {result['answer']}")
380
+ print(f"\nSources:")
381
+ for i, source in enumerate(result['sources'], 1):
382
+ print(f"[{i}] Score: {source['score']:.3f}, Rerank: {source['rerank_score']:.3f}")
383
+ print(f" {source['metadata']['source']}")
384
+ ```
385
+
386
+ ---
387
+
388
+ ## Key Features
389
+
390
+ ### 1. Semantic Chunking
391
+ - Sentence-based chunking with overlap
392
+ - Preserves semantic coherence
393
+ - Configurable chunk size
394
+
395
+ ### 2. Vector Embeddings
396
+ - Voyage AI embeddings (1024 dimensions)
397
+ - Batch processing for efficiency
398
+ - Separate embeddings for documents and queries
399
+
400
+ ### 3. Vector Search
401
+ - Pinecone vector database
402
+ - Cosine similarity metric
403
+ - Top-K retrieval with score filtering
404
+
405
+ ### 4. Reranking
406
+ - Cohere Rerank for improved relevance
407
+ - Cross-encoder architecture
408
+ - Top-N selection after reranking
409
+
410
+ ### 5. RAG Generation
411
+ - Context-aware answer generation
412
+ - Source citation
413
+ - Hallucination prevention
414
+
415
+ ---
416
+
417
+ ## Testing
418
+
419
+ ```python
420
+ import pytest
421
+ from unittest.mock import Mock, patch
422
+
423
+ def test_chunking():
424
+ processor = DocumentProcessor("key", "key", "index")
425
+
426
+ text = "Sentence one. Sentence two. Sentence three."
427
+ metadata = {"source": "test.txt"}
428
+
429
+ chunks = processor.chunk_document(text, metadata)
430
+
431
+ assert len(chunks) > 0
432
+ assert all("text" in chunk for chunk in chunks)
433
+ assert all("metadata" in chunk for chunk in chunks)
434
+
435
+ def test_retrieval():
436
+ qa_system = KnowledgeBaseQA("key", "key", "key", "key", "index")
437
+
438
+ with patch.object(qa_system.index, 'query') as mock_query:
439
+ mock_query.return_value = Mock(matches=[
440
+ Mock(id="1", score=0.9, metadata={"text": "Test document"})
441
+ ])
442
+
443
+ results = qa_system.retrieve("test query")
444
+
445
+ assert len(results) > 0
446
+ assert results[0]["score"] >= qa_system.min_score
447
+
448
+ def test_answer_generation():
449
+ qa_system = KnowledgeBaseQA("key", "key", "key", "key", "index")
450
+
451
+ with patch.object(qa_system, 'retrieve') as mock_retrieve:
452
+ mock_retrieve.return_value = [
453
+ {"id": "1", "score": 0.9, "text": "Python is a programming language", "metadata": {}}
454
+ ]
455
+
456
+ with patch.object(qa_system, 'rerank') as mock_rerank:
457
+ mock_rerank.return_value = [
458
+ {"id": "1", "score": 0.9, "rerank_score": 0.95, "text": "Python is a programming language", "metadata": {}}
459
+ ]
460
+
461
+ result = qa_system.answer_question("What is Python?")
462
+
463
+ assert "answer" in result
464
+ assert "sources" in result
465
+ ```
466
+
467
+ ---
468
+
469
+ ## Best Practices
470
+
471
+ 1. **Chunking Strategy**: Use semantic chunking for better coherence
472
+ 2. **Embedding Model**: Choose model based on domain (general vs. specialized)
473
+ 3. **Vector Store**: Select based on scale (Pinecone, Weaviate, Qdrant)
474
+ 4. **Reranking**: Always rerank for improved relevance
475
+ 5. **Metadata**: Include rich metadata for filtering and attribution
476
+ 6. **Monitoring**: Track retrieval quality metrics (MRR, NDCG)
477
+ 7. **Caching**: Cache embeddings for frequently queried documents
478
+ 8. **Hybrid Search**: Combine vector search with keyword search
479
+
480
+ ---
481
+
482
+ ## Performance Optimization
483
+
484
+ - **Batch Embedding**: Process documents in batches (100-1000)
485
+ - **Async Processing**: Use async I/O for embedding and indexing
486
+ - **Index Optimization**: Use appropriate index type (HNSW, IVF)
487
+ - **Caching**: Cache query embeddings for repeated queries
488
+ - **Compression**: Use quantization for reduced storage
489
+ - **Sharding**: Distribute index across multiple shards
490
+
491
+ ---
492
+
493
+ ## Security
494
+
495
+ - **Access Control**: Implement document-level access control
496
+ - **Data Encryption**: Encrypt vectors at rest
497
+ - **Query Sanitization**: Sanitize user queries
498
+ - **Rate Limiting**: Limit queries per user
499
+ - **Audit Logging**: Log all retrieval operations
500
+ - **PII Filtering**: Remove PII before indexing
501
+ ```
502
+