bmad-plus 0.7.4 → 0.8.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (294) hide show
  1. package/CHANGELOG.md +450 -407
  2. package/LICENSE +21 -0
  3. package/README.md +555 -446
  4. package/osint-agent-package/README.md +88 -88
  5. package/osint-agent-package/SETUP_KEYS.md +108 -108
  6. package/osint-agent-package/agents/osint-investigator.md +80 -80
  7. package/osint-agent-package/install.ps1 +87 -87
  8. package/osint-agent-package/install.sh +76 -76
  9. package/osint-agent-package/skills/bmad-osint-investigate/SKILL.md +147 -147
  10. package/osint-agent-package/skills/bmad-osint-investigate/osint/SKILL.md +452 -452
  11. package/osint-agent-package/skills/bmad-osint-investigate/osint/assets/dossier-template.md +116 -116
  12. package/osint-agent-package/skills/bmad-osint-investigate/osint/references/content-extraction.md +100 -100
  13. package/osint-agent-package/skills/bmad-osint-investigate/osint/references/enrichment-databases-fr.md +148 -148
  14. package/osint-agent-package/skills/bmad-osint-investigate/osint/references/platforms.md +130 -130
  15. package/osint-agent-package/skills/bmad-osint-investigate/osint/references/psychoprofile.md +69 -69
  16. package/osint-agent-package/skills/bmad-osint-investigate/osint/references/tools.md +281 -281
  17. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/_http.py +101 -101
  18. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/apify.py +266 -260
  19. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/brightdata.py +101 -101
  20. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/diagnose.py +141 -141
  21. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/exa.py +79 -79
  22. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/jina.py +71 -71
  23. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/mcp-client.py +136 -136
  24. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/parallel.py +85 -85
  25. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/perplexity.py +102 -102
  26. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/tavily.py +72 -72
  27. package/osint-agent-package/skills/bmad-osint-investigate/osint/scripts/volley.py +208 -208
  28. package/osint-agent-package/skills/bmad-osint-investigator/SKILL.md +15 -15
  29. package/package.json +62 -57
  30. package/readme-international/README.de.md +576 -426
  31. package/readme-international/README.es.md +578 -518
  32. package/readme-international/README.fr.md +576 -516
  33. package/src/bmad-plus/agents/agent-architect-dev/SKILL.md +96 -96
  34. package/src/bmad-plus/agents/agent-architect-dev/bmad-skill-manifest.yaml +13 -13
  35. package/src/bmad-plus/agents/agent-maker/SKILL.md +201 -201
  36. package/src/bmad-plus/agents/agent-maker/bmad-skill-manifest.yaml +13 -13
  37. package/src/bmad-plus/agents/agent-orchestrator/SKILL.md +137 -137
  38. package/src/bmad-plus/agents/agent-orchestrator/bmad-skill-manifest.yaml +13 -13
  39. package/src/bmad-plus/agents/agent-quality/SKILL.md +83 -83
  40. package/src/bmad-plus/agents/agent-quality/bmad-skill-manifest.yaml +13 -13
  41. package/src/bmad-plus/agents/agent-shadow/SKILL.md +71 -71
  42. package/src/bmad-plus/agents/agent-shadow/bmad-skill-manifest.yaml +13 -13
  43. package/src/bmad-plus/agents/agent-strategist/SKILL.md +80 -80
  44. package/src/bmad-plus/agents/agent-strategist/bmad-skill-manifest.yaml +13 -13
  45. package/src/bmad-plus/agents/pack-animated/animated-website-agent.md +325 -325
  46. package/src/bmad-plus/agents/pack-animated/templates/animated-website-workflow.md +55 -55
  47. package/src/bmad-plus/agents/pack-backup/backup-agent.md +71 -71
  48. package/src/bmad-plus/agents/pack-backup/templates/backup-workflow.md +51 -51
  49. package/src/bmad-plus/agents/pack-seo/SKILL.md +171 -171
  50. package/src/bmad-plus/agents/pack-seo/checklist.md +140 -140
  51. package/src/bmad-plus/agents/pack-seo/pagespeed-playbook.md +320 -320
  52. package/src/bmad-plus/agents/pack-seo/ref/audit-schema.json +187 -187
  53. package/src/bmad-plus/agents/pack-seo/ref/cwv-thresholds.md +87 -87
  54. package/src/bmad-plus/agents/pack-seo/ref/eeat-criteria.md +123 -123
  55. package/src/bmad-plus/agents/pack-seo/ref/geo-signals.md +167 -167
  56. package/src/bmad-plus/agents/pack-seo/ref/hreflang-rules.md +153 -153
  57. package/src/bmad-plus/agents/pack-seo/ref/quality-gates.md +133 -133
  58. package/src/bmad-plus/agents/pack-seo/ref/schema-catalog.md +91 -91
  59. package/src/bmad-plus/agents/pack-seo/ref/schema-templates.json +356 -356
  60. package/src/bmad-plus/agents/pack-seo/seo-chief.md +294 -294
  61. package/src/bmad-plus/agents/pack-seo/seo-judge.md +241 -241
  62. package/src/bmad-plus/agents/pack-seo/seo-scout.md +171 -171
  63. package/src/bmad-plus/agents/pack-seo/templates/seo-audit-workflow.md +241 -241
  64. package/src/bmad-plus/data/role-triggers.yaml +209 -209
  65. package/src/bmad-plus/module-help.csv +10 -10
  66. package/src/bmad-plus/module.yaml +283 -280
  67. package/src/bmad-plus/packs/pack-animated/animated-website-agent.md +325 -0
  68. package/src/bmad-plus/packs/pack-animated/templates/animated-website-workflow.md +55 -0
  69. package/src/bmad-plus/packs/pack-backup/backup-agent.md +71 -0
  70. package/src/bmad-plus/packs/pack-backup/templates/backup-workflow.md +51 -0
  71. package/src/bmad-plus/packs/pack-dev-studio/README.md +162 -162
  72. package/src/bmad-plus/packs/pack-dev-studio/categories/analysis/analyst-agent.md +73 -73
  73. package/src/bmad-plus/packs/pack-dev-studio/categories/analysis/document-project.md +61 -61
  74. package/src/bmad-plus/packs/pack-dev-studio/categories/analysis/domain-research.md +95 -95
  75. package/src/bmad-plus/packs/pack-dev-studio/categories/analysis/market-research.md +95 -95
  76. package/src/bmad-plus/packs/pack-dev-studio/categories/analysis/prfaq.md +134 -134
  77. package/src/bmad-plus/packs/pack-dev-studio/categories/analysis/product-brief.md +80 -80
  78. package/src/bmad-plus/packs/pack-dev-studio/categories/analysis/tech-writer-agent.md +73 -73
  79. package/src/bmad-plus/packs/pack-dev-studio/categories/analysis/technical-research.md +95 -95
  80. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/architect-agent.md +73 -73
  81. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/create-architecture.md +73 -73
  82. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/create-epics-stories.md +92 -92
  83. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/generate-project-context.md +80 -80
  84. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/implementation-readiness.md +90 -90
  85. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-01-init.md +153 -153
  86. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-01b-continue.md +173 -173
  87. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-02-context.md +224 -224
  88. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-03-starter.md +329 -329
  89. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-04-decisions.md +318 -318
  90. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-05-patterns.md +359 -359
  91. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-06-structure.md +379 -379
  92. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-07-validation.md +361 -361
  93. package/src/bmad-plus/packs/pack-dev-studio/categories/architecture/steps/step-08-complete.md +81 -81
  94. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/checkpoint-preview.md +67 -67
  95. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/code-review-steps/step-01-gather-context.md +85 -85
  96. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/code-review-steps/step-02-review.md +35 -35
  97. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/code-review-steps/step-03-triage.md +49 -49
  98. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/code-review-steps/step-04-present.md +131 -131
  99. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/code-review.md +89 -89
  100. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/correct-course.md +300 -300
  101. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/create-story.md +428 -428
  102. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/dev-agent.md +73 -73
  103. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/dev-story-checklist.md +80 -80
  104. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/dev-story.md +484 -484
  105. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/investigate.md +193 -193
  106. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/qa-e2e-tests.md +175 -175
  107. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/quick-dev.md +110 -110
  108. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/retrospective.md +1511 -1511
  109. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/sprint-planning.md +298 -298
  110. package/src/bmad-plus/packs/pack-dev-studio/categories/implementation/sprint-status.md +296 -296
  111. package/src/bmad-plus/packs/pack-dev-studio/categories/planning/create-prd.md +29 -29
  112. package/src/bmad-plus/packs/pack-dev-studio/categories/planning/create-ux-design.md +74 -74
  113. package/src/bmad-plus/packs/pack-dev-studio/categories/planning/edit-prd.md +29 -29
  114. package/src/bmad-plus/packs/pack-dev-studio/categories/planning/pm-agent.md +73 -73
  115. package/src/bmad-plus/packs/pack-dev-studio/categories/planning/prd.md +89 -89
  116. package/src/bmad-plus/packs/pack-dev-studio/categories/planning/ux-designer-agent.md +73 -73
  117. package/src/bmad-plus/packs/pack-dev-studio/categories/planning/validate-prd.md +29 -29
  118. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/advanced-elicitation.md +141 -141
  119. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/adversarial-review.md +37 -37
  120. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/bmad-help.md +75 -75
  121. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/brainstorming.md +6 -6
  122. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/customize.md +110 -110
  123. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/distillator.md +176 -176
  124. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/edge-case-hunter.md +67 -67
  125. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/editorial-review-prose.md +86 -86
  126. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/editorial-review-structure.md +179 -179
  127. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/index-docs.md +66 -66
  128. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/party-mode.md +127 -127
  129. package/src/bmad-plus/packs/pack-dev-studio/categories/utilities/shard-doc.md +105 -105
  130. package/src/bmad-plus/packs/pack-dev-studio/dev-studio-orchestrator.md +120 -120
  131. package/src/bmad-plus/packs/pack-dev-studio/shared/architecture-decision-template.md +12 -12
  132. package/src/bmad-plus/packs/pack-dev-studio/shared/bwml-spec.md +328 -328
  133. package/src/bmad-plus/packs/pack-dev-studio/shared/module-help.csv +32 -32
  134. package/src/bmad-plus/packs/pack-dev-studio/upstream-sync.yaml +81 -81
  135. package/src/bmad-plus/packs/pack-memory/README.md +106 -106
  136. package/src/bmad-plus/packs/pack-memory/memory-orchestrator.md +79 -79
  137. package/src/bmad-plus/packs/pack-memory/shared/karpathy-guardrails.md +86 -86
  138. package/src/bmad-plus/packs/pack-memory/shared/memory-protocol.md +143 -143
  139. package/src/bmad-plus/packs/pack-memory/templates/context.md +39 -39
  140. package/src/bmad-plus/packs/pack-memory/templates/decisions.md +25 -25
  141. package/src/bmad-plus/packs/pack-memory/templates/identity.yaml +39 -39
  142. package/src/bmad-plus/packs/pack-memory/templates/lessons.md +31 -31
  143. package/src/bmad-plus/packs/pack-memory/templates/patterns.md +24 -24
  144. package/src/bmad-plus/packs/pack-memory/templates/session-handoff.md +25 -25
  145. package/src/bmad-plus/packs/pack-memory/zecher-agent.md +157 -157
  146. package/src/bmad-plus/packs/pack-seo/SKILL.md +171 -0
  147. package/src/bmad-plus/packs/pack-seo/checklist.md +140 -0
  148. package/src/bmad-plus/packs/pack-seo/pagespeed-playbook.md +320 -0
  149. package/src/bmad-plus/packs/pack-seo/ref/audit-schema.json +187 -0
  150. package/src/bmad-plus/packs/pack-seo/ref/cwv-thresholds.md +87 -0
  151. package/src/bmad-plus/packs/pack-seo/ref/eeat-criteria.md +123 -0
  152. package/src/bmad-plus/packs/pack-seo/ref/geo-signals.md +167 -0
  153. package/src/bmad-plus/packs/pack-seo/ref/hreflang-rules.md +153 -0
  154. package/src/bmad-plus/packs/pack-seo/ref/quality-gates.md +133 -0
  155. package/src/bmad-plus/packs/pack-seo/ref/schema-catalog.md +91 -0
  156. package/src/bmad-plus/packs/pack-seo/ref/schema-templates.json +356 -0
  157. package/src/bmad-plus/packs/pack-seo/seo-chief.md +294 -0
  158. package/src/bmad-plus/packs/pack-seo/seo-judge.md +241 -0
  159. package/src/bmad-plus/packs/pack-seo/seo-scout.md +171 -0
  160. package/src/bmad-plus/packs/pack-seo/templates/seo-audit-workflow.md +241 -0
  161. package/src/bmad-plus/packs/pack-shield/README.md +110 -110
  162. package/src/bmad-plus/packs/pack-shield/categories/accessibility-esg/csrd-agent.md +262 -262
  163. package/src/bmad-plus/packs/pack-shield/categories/accessibility-esg/section508-agent.md +179 -179
  164. package/src/bmad-plus/packs/pack-shield/categories/accessibility-esg/wcag-agent.md +201 -201
  165. package/src/bmad-plus/packs/pack-shield/categories/ai-governance/eu-ai-act-agent.md +97 -97
  166. package/src/bmad-plus/packs/pack-shield/categories/ai-governance/iso42001-agent.md +251 -251
  167. package/src/bmad-plus/packs/pack-shield/categories/ai-governance/nist-ai-rmf-agent.md +133 -133
  168. package/src/bmad-plus/packs/pack-shield/categories/cybersecurity/cis-controls-agent.md +221 -221
  169. package/src/bmad-plus/packs/pack-shield/categories/cybersecurity/ism-agent.md +150 -150
  170. package/src/bmad-plus/packs/pack-shield/categories/cybersecurity/iso27001-agent.md +167 -167
  171. package/src/bmad-plus/packs/pack-shield/categories/cybersecurity/nis2-agent.md +83 -83
  172. package/src/bmad-plus/packs/pack-shield/categories/cybersecurity/nist-800-53-agent.md +250 -250
  173. package/src/bmad-plus/packs/pack-shield/categories/cybersecurity/nist-csf-agent.md +218 -218
  174. package/src/bmad-plus/packs/pack-shield/categories/data-privacy/ccpa-agent.md +94 -94
  175. package/src/bmad-plus/packs/pack-shield/categories/data-privacy/dpdpa-agent.md +136 -136
  176. package/src/bmad-plus/packs/pack-shield/categories/data-privacy/gdpr-agent.md +296 -296
  177. package/src/bmad-plus/packs/pack-shield/categories/data-privacy/iso27701-agent.md +134 -134
  178. package/src/bmad-plus/packs/pack-shield/categories/data-privacy/lgpd-agent.md +129 -129
  179. package/src/bmad-plus/packs/pack-shield/categories/defense-export/cmmc-agent.md +127 -127
  180. package/src/bmad-plus/packs/pack-shield/categories/defense-export/ear-agent.md +272 -272
  181. package/src/bmad-plus/packs/pack-shield/categories/defense-export/itar-agent.md +202 -202
  182. package/src/bmad-plus/packs/pack-shield/categories/defense-export/tsa-agent.md +367 -367
  183. package/src/bmad-plus/packs/pack-shield/categories/industry-compliance/dora-agent.md +510 -510
  184. package/src/bmad-plus/packs/pack-shield/categories/industry-compliance/fedramp-agent.md +247 -247
  185. package/src/bmad-plus/packs/pack-shield/categories/industry-compliance/hipaa-agent.md +173 -173
  186. package/src/bmad-plus/packs/pack-shield/categories/industry-compliance/pci-dss-agent.md +239 -239
  187. package/src/bmad-plus/packs/pack-shield/categories/industry-compliance/soc2-agent.md +266 -266
  188. package/src/bmad-plus/packs/pack-shield/categories/industry-compliance/swift-csp-agent.md +164 -164
  189. package/src/bmad-plus/packs/pack-shield/categories/workflows/ai-act-classifier.md +131 -131
  190. package/src/bmad-plus/packs/pack-shield/categories/workflows/ai-act-fria.md +155 -155
  191. package/src/bmad-plus/packs/pack-shield/categories/workflows/ai-act-incidents.md +187 -187
  192. package/src/bmad-plus/packs/pack-shield/categories/workflows/ai-act-roles.md +113 -113
  193. package/src/bmad-plus/packs/pack-shield/categories/workflows/breach-sentinel.md +197 -197
  194. package/src/bmad-plus/packs/pack-shield/categories/workflows/cookie-policy-gen.md +180 -180
  195. package/src/bmad-plus/packs/pack-shield/categories/workflows/dpia-sentinel.md +235 -235
  196. package/src/bmad-plus/packs/pack-shield/categories/workflows/legitimate-interest.md +159 -159
  197. package/src/bmad-plus/packs/pack-shield/categories/workflows/privacy-advisor.md +133 -133
  198. package/src/bmad-plus/packs/pack-shield/categories/workflows/privacy-notice-gen.md +160 -160
  199. package/src/bmad-plus/packs/pack-shield/categories/workflows/privacy-policy-gen.md +135 -135
  200. package/src/bmad-plus/packs/pack-shield/references/ccpa/ccpa-gdpr-comparison.md +117 -117
  201. package/src/bmad-plus/packs/pack-shield/references/ccpa/consumer-rights-workflows.md +177 -177
  202. package/src/bmad-plus/packs/pack-shield/references/cis-controls/framework-mappings.md +162 -162
  203. package/src/bmad-plus/packs/pack-shield/references/cis-controls/implementation-guidance.md +235 -235
  204. package/src/bmad-plus/packs/pack-shield/references/cis-controls/safeguards-detail.md +252 -252
  205. package/src/bmad-plus/packs/pack-shield/references/cmmc/cmmc-assessment.md +170 -170
  206. package/src/bmad-plus/packs/pack-shield/references/cmmc/cmmc-levels.md +113 -113
  207. package/src/bmad-plus/packs/pack-shield/references/cmmc/cmmc-practices.md +211 -211
  208. package/src/bmad-plus/packs/pack-shield/references/csrd/compliance-program.md +281 -281
  209. package/src/bmad-plus/packs/pack-shield/references/csrd/double-materiality.md +253 -253
  210. package/src/bmad-plus/packs/pack-shield/references/csrd/esrs-standards.md +401 -401
  211. package/src/bmad-plus/packs/pack-shield/references/dora/article-reference.md +441 -441
  212. package/src/bmad-plus/packs/pack-shield/references/dora/incident-classification.md +297 -297
  213. package/src/bmad-plus/packs/pack-shield/references/dora/rts-its-guide.md +306 -306
  214. package/src/bmad-plus/packs/pack-shield/references/dora/third-party-risk.md +349 -349
  215. package/src/bmad-plus/packs/pack-shield/references/dpdpa/gdpr-comparison.md +173 -173
  216. package/src/bmad-plus/packs/pack-shield/references/dpdpa/rights-and-obligations.md +426 -426
  217. package/src/bmad-plus/packs/pack-shield/references/dpdpa/rules-2025.md +599 -599
  218. package/src/bmad-plus/packs/pack-shield/references/dpdpa/sections-reference.md +319 -319
  219. package/src/bmad-plus/packs/pack-shield/references/ear/ccl-eccn-guide.md +250 -250
  220. package/src/bmad-plus/packs/pack-shield/references/ear/compliance-program.md +280 -280
  221. package/src/bmad-plus/packs/pack-shield/references/ear/license-exceptions.md +207 -207
  222. package/src/bmad-plus/packs/pack-shield/references/eu-ai-act/gpai-governance.md +267 -267
  223. package/src/bmad-plus/packs/pack-shield/references/eu-ai-act/obligations-high-risk.md +287 -287
  224. package/src/bmad-plus/packs/pack-shield/references/eu-ai-act/risk-classification.md +182 -182
  225. package/src/bmad-plus/packs/pack-shield/references/fedramp/appendices-guide.md +209 -209
  226. package/src/bmad-plus/packs/pack-shield/references/fedramp/control-families.md +281 -281
  227. package/src/bmad-plus/packs/pack-shield/references/fedramp/poam-guide.md +93 -93
  228. package/src/bmad-plus/packs/pack-shield/references/fedramp/readiness-checklist.md +134 -134
  229. package/src/bmad-plus/packs/pack-shield/references/fedramp/sap-sar-guide.md +86 -86
  230. package/src/bmad-plus/packs/pack-shield/references/fedramp/ssp-guide.md +129 -129
  231. package/src/bmad-plus/packs/pack-shield/references/gdpr-compliance/documents.md +192 -192
  232. package/src/bmad-plus/packs/pack-shield/references/gdpr-compliance/dpa-template.md +121 -121
  233. package/src/bmad-plus/packs/pack-shield/references/gdpr-compliance/privacy-notice.md +87 -87
  234. package/src/bmad-plus/packs/pack-shield/references/hipaa-compliance/breach-notification.md +293 -293
  235. package/src/bmad-plus/packs/pack-shield/references/hipaa-compliance/privacy-rule.md +276 -276
  236. package/src/bmad-plus/packs/pack-shield/references/hipaa-compliance/security-rule.md +299 -299
  237. package/src/bmad-plus/packs/pack-shield/references/hipaa-compliance/templates.md +568 -568
  238. package/src/bmad-plus/packs/pack-shield/references/ism/control-applicability.md +181 -181
  239. package/src/bmad-plus/packs/pack-shield/references/ism/guidelines-overview.md +183 -183
  240. package/src/bmad-plus/packs/pack-shield/references/iso27001/annex-a-2013.md +203 -203
  241. package/src/bmad-plus/packs/pack-shield/references/iso27001/annex-a-2022.md +132 -132
  242. package/src/bmad-plus/packs/pack-shield/references/iso27001/control-mapping.md +153 -153
  243. package/src/bmad-plus/packs/pack-shield/references/iso27701/annex-a-controls.md +195 -195
  244. package/src/bmad-plus/packs/pack-shield/references/iso27701/regulatory-mapping.md +229 -229
  245. package/src/bmad-plus/packs/pack-shield/references/iso27701/transition-guide.md +219 -219
  246. package/src/bmad-plus/packs/pack-shield/references/iso42001/iso42001-ai-risk-assessment.md +258 -258
  247. package/src/bmad-plus/packs/pack-shield/references/iso42001/iso42001-clauses-requirements.md +279 -279
  248. package/src/bmad-plus/packs/pack-shield/references/iso42001/iso42001-controls-annex-a.md +155 -155
  249. package/src/bmad-plus/packs/pack-shield/references/itar/compliance-program.md +174 -174
  250. package/src/bmad-plus/packs/pack-shield/references/itar/licensing-guide.md +146 -146
  251. package/src/bmad-plus/packs/pack-shield/references/itar/usml-categories.md +93 -93
  252. package/src/bmad-plus/packs/pack-shield/references/lgpd/anpd-enforcement.md +147 -147
  253. package/src/bmad-plus/packs/pack-shield/references/lgpd/compliance-program.md +272 -272
  254. package/src/bmad-plus/packs/pack-shield/references/lgpd/lgpd-articles.md +271 -271
  255. package/src/bmad-plus/packs/pack-shield/references/nis2/article-21-measures.md +153 -153
  256. package/src/bmad-plus/packs/pack-shield/references/nis2/iso27001-nis2-mapping.md +68 -68
  257. package/src/bmad-plus/packs/pack-shield/references/nist-800-53/assessment-rmf.md +349 -349
  258. package/src/bmad-plus/packs/pack-shield/references/nist-800-53/baselines-tailoring.md +277 -277
  259. package/src/bmad-plus/packs/pack-shield/references/nist-800-53/control-families.md +450 -450
  260. package/src/bmad-plus/packs/pack-shield/references/nist-ai-rmf/rmf-core.md +361 -361
  261. package/src/bmad-plus/packs/pack-shield/references/nist-ai-rmf/rmf-profiles.md +192 -192
  262. package/src/bmad-plus/packs/pack-shield/references/nist-csf/csf-10-to-20-mapping.md +143 -143
  263. package/src/bmad-plus/packs/pack-shield/references/nist-csf/csf-20-functions-categories.md +278 -278
  264. package/src/bmad-plus/packs/pack-shield/references/nist-csf/csf-implementation-tiers.md +135 -135
  265. package/src/bmad-plus/packs/pack-shield/references/pci-compliance/pci-dss-requirements.md +366 -366
  266. package/src/bmad-plus/packs/pack-shield/references/pci-compliance/pci-dss-saq-guide.md +217 -217
  267. package/src/bmad-plus/packs/pack-shield/references/pci-compliance/pci-dss-v4-changes.md +190 -190
  268. package/src/bmad-plus/packs/pack-shield/references/section-508/wcag-mapping.md +160 -160
  269. package/src/bmad-plus/packs/pack-shield/references/soc2/controls.md +241 -241
  270. package/src/bmad-plus/packs/pack-shield/references/soc2/evidence.md +236 -236
  271. package/src/bmad-plus/packs/pack-shield/references/soc2/policies.md +254 -254
  272. package/src/bmad-plus/packs/pack-shield/references/soc2/vendor.md +276 -276
  273. package/src/bmad-plus/packs/pack-shield/references/swift-csp/swift-assessment.md +202 -202
  274. package/src/bmad-plus/packs/pack-shield/references/swift-csp/swift-controls.md +545 -545
  275. package/src/bmad-plus/packs/pack-shield/references/tsa-compliance/tsa-crmp-requirements.md +359 -359
  276. package/src/bmad-plus/packs/pack-shield/references/tsa-compliance/tsa-directives-overview.md +187 -187
  277. package/src/bmad-plus/packs/pack-shield/references/tsa-compliance/tsa-incident-reporting.md +187 -187
  278. package/src/bmad-plus/packs/pack-shield/references/wcag/criteria-detail.md +510 -510
  279. package/src/bmad-plus/packs/pack-shield/shared/audit-report-template.md +103 -103
  280. package/src/bmad-plus/packs/pack-shield/shared/cross-framework-mapper.md +103 -103
  281. package/src/bmad-plus/packs/pack-shield/shared/gap-analysis-template.md +83 -83
  282. package/src/bmad-plus/packs/pack-shield/shield-orchestrator.md +229 -229
  283. package/src/bmad-plus/packs/pack-shield/upstream-sync.yaml +68 -68
  284. package/src/bmad-plus/skills/bmad-plus-autopilot/SKILL.md +99 -99
  285. package/src/bmad-plus/skills/bmad-plus-parallel/SKILL.md +93 -93
  286. package/src/bmad-plus/skills/bmad-plus-sync/SKILL.md +69 -69
  287. package/tools/cli/commands/autoconfig.js +498 -489
  288. package/tools/cli/commands/doctor.js +222 -175
  289. package/tools/cli/commands/install.js +739 -739
  290. package/tools/cli/commands/memory.js +194 -194
  291. package/tools/cli/commands/scan.js +360 -350
  292. package/tools/cli/commands/uninstall.js +96 -96
  293. package/tools/cli/commands/update.js +174 -174
  294. package/tools/cli/i18n.js +763 -763
@@ -1,452 +1,452 @@
1
- ---
2
- name: osint
3
- description: >
4
- Conduct deep OSINT research on individuals. Build full digital footprint, psychoprofile
5
- (MBTI/Big Five), career history, social graph with confidence scores. Recursive
6
- self-evaluation until completeness threshold is met. Includes internal intelligence
7
- (Telegram history, email, vault contacts) before going external.
8
- Use when: "osint", "досье", "research person", "find everything about", "пробей",
9
- "разведка", "due diligence", "background check", "digital footprint",
10
- "найди всё про", "собери информацию", "кто это", "профиль человека".
11
- NOT for: company/product research without a named person, competitive analysis,
12
- market research, content generation, or general web scraping tasks.
13
- ---
14
-
15
- # OSINT Skill v3.2
16
-
17
- Systematic intelligence gathering on individuals. From a name or handle to a scored
18
- dossier with psychoprofile, career map, and entry points.
19
-
20
- ## Phase Router
21
-
22
- Determine entry point from context:
23
-
24
- - New name/handle/URL, "пробей", "find out about" → Phase 0 (full cycle)
25
- - "Add LinkedIn/Instagram data" to existing dossier → Phase 2 (extraction)
26
- - "Build psychoprofile" from existing data → Phase 4
27
- - "Rate completeness" of existing dossier → Phase 5
28
- - "Reformat" or "present" findings → Phase 6
29
-
30
- Default (full research request): Phase 0 → 1 → 1.5 → 2 → 3 → 4 → 5 → 6.
31
-
32
- ## Environment
33
-
34
- All API keys via environment variables. Never hardcode tokens.
35
-
36
- - `PERPLEXITY_API_KEY` — Perplexity Sonar (fast answers + deep research)
37
- - `EXA_API_KEY` — Exa AI (semantic search, company/people research, deep research)
38
- - `TAVILY_API_KEY` — Tavily (agent-optimized search + extract, $0.005/req basic)
39
- - `APIFY_API_TOKEN` — Apify scraping (LinkedIn, Instagram, Facebook)
40
- - `JINA_API_KEY` — Jina reader/search/deepsearch
41
- - `PARALLEL_API_KEY` — Parallel AI search
42
- - `BRIGHTDATA_MCP_URL` — Bright Data MCP endpoint (full URL with token)
43
- - `MCPORTER_CONFIG` — mcporter config path
44
-
45
- ## Scripts
46
-
47
- Run from skill dir: `bash scripts/<name>.sh`.
48
- Each validates env vars, exits with descriptive error + URL to get the key.
49
-
50
- **Search & Research:**
51
- - `diagnose.sh` — run FIRST. Capability map of all tools.
52
- - `perplexity.sh` — `search <query>` | `sonar <query>` (AI answer) | `deep <query>` (deep research)
53
- - `tavily.sh` — `search <query>` (basic $0.005) | `deep <query>` (advanced) | `extract <url>`
54
- - `exa.sh` — `search <query>` | `company <name>` | `people <name>` | `crawl <url>` | `deep <prompt>`
55
- - `first-volley.sh "Name" "context"` — parallel search, all engines at once.
56
- - `merge-volley.sh <outdir>` — deduplicate and merge first-volley results.
57
-
58
- **Scraping:**
59
- - `apify.sh` — `linkedin <url>` | `instagram <handle>` | `run` | `results` | `store-search`
60
- - `run-actor.sh` — **universal Apify runner (55+ actors).** Embedded from [apify/agent-skills](https://github.com/apify/agent-skills).
61
- Quick answer: `bash scripts/run-actor.sh "actor/id" '{"input":"json"}'`
62
- Export: `bash scripts/run-actor.sh "actor/id" '{"input":"json"}' --output /tmp/out.csv`
63
- - `jina.sh` — `read <url>` | `search <query>` | `deepsearch <query>`
64
- - `parallel.sh` — `search <query>` | `extract <url>`
65
- - `brightdata.sh` — `scrape <url>` | `scrape-batch` | `search` | `search-geo <cc>` | `search-yandex`
66
-
67
- ## Research Escalation Flow
68
-
69
- **Принцип: от дешёвого к дорогому, от быстрого к глубокому.**
70
-
71
- ### Level 1: Quick Answers (секунды, ~$0.00)
72
- Начни ВСЕГДА с этого. Получи быстрый контекст прежде чем копать.
73
- Запускай ВСЕ параллельно:
74
- ```bash
75
- # Perplexity Sonar — AI ответ с цитатами
76
- bash skills/osint/scripts/perplexity.sh sonar "Who is <Name>, <context>"
77
- # Brave Search — классический поиск
78
- web_search "<Name> <company> <role>"
79
- # Tavily — agent-optimized search с AI answer
80
- bash skills/osint/scripts/tavily.sh search "<Name> <context>"
81
- # Exa — семантический поиск + company/people research
82
- bash skills/osint/scripts/exa.sh search "<Name> <context>"
83
- bash skills/osint/scripts/exa.sh people "<Name>"
84
- ```
85
- → Получаешь: быстрые факты, ссылки, контекст.
86
- → Решение: достаточно? → Phase 6. Нужно больше? → Level 2.
87
-
88
- ### Level 2: Source Verification (секунды-минуты, ~$0.01)
89
- Проверяй источники из Level 1 через fetch:
90
- ```bash
91
- # Читай найденные URL
92
- web_fetch "<url_from_perplexity>"
93
- bash skills/osint/scripts/jina.sh read "<url>"
94
- bash skills/osint/scripts/parallel.sh extract "<url>"
95
- ```
96
- → Получаешь: подтверждённые факты, cross-reference.
97
- → Совпадает? → дополняй досье. Нужно глубже? → Level 3.
98
-
99
- ### Level 3: Social Media Deep Dive (~$0.01-0.10)
100
- Подключай scraping для соцсетей:
101
- ```bash
102
- # LinkedIn
103
- bash skills/osint/scripts/apify.sh linkedin "<url>"
104
- # Instagram
105
- bash skills/osint/scripts/apify.sh instagram "<handle>"
106
- # Facebook, заблокированные сайты
107
- bash skills/osint/scripts/brightdata.sh scrape "<url>"
108
- ```
109
- → Получаешь: структурированные профили, фото, связи.
110
-
111
- ### Level 4: Deep Research (~$0.05-0.50)
112
- Если нужно копать ещё глубже — формируй развёрнутый промпт и отправляй в deep research.
113
- Запускай ВСЕ параллельно (30-60 сек каждый):
114
- ```bash
115
- # Perplexity Deep Research
116
- bash skills/osint/scripts/perplexity.sh deep "<detailed research prompt about Name>"
117
- # Exa Deep Research
118
- bash skills/osint/scripts/exa.sh deep "<detailed prompt>"
119
- # Parallel AI Deep Search
120
- bash skills/osint/scripts/parallel.sh search "<detailed query>"
121
- # Jina DeepSearch
122
- bash skills/osint/scripts/jina.sh deepsearch "<query>"
123
- ```
124
-
125
- **Правило:** Level 4 промпт должен быть РАЗВЁРНУТЫМ — включай всё что уже знаешь
126
- из Level 1-3, чтобы deep research не повторял базовые факты, а копал дальше.
127
-
128
- ## Swarm Mode (DEFAULT)
129
-
130
- OSINT research runs as a **swarm of parallel sub-agents on Sonnet**.
131
- The main agent is the coordinator — it does NOT scrape itself.
132
-
133
- ### How it works:
134
- 1. Main agent runs Phase 0 (tooling check) and Phase 1 (seed collection) to get initial context
135
- 2. Main agent spawns 3-5 sub-agents via `sessions_spawn` with `model: sonnet`, `mode: run`
136
- 3. Each sub-agent gets a focused task + all known data from Phase 1
137
- 4. Sub-agents return results → main agent merges into dossier
138
-
139
- ### Task split pattern:
140
- - **Agent 1: YouTube/Content** — extract transcripts via Apify (NOT yt-dlp, NOT BrightData — YouTube blocks them). 3-5 videos, speech style, topics. Use `streamers/youtube-channel-scraper` for channel data
141
- - **Agent 2: Facebook deep** — BrightData scrape: profile, posts, about, photos, friends (use m.facebook.com for more data). For public Pages: `apify/facebook-pages-scraper` + `apify/facebook-page-contact-information`
142
- - **Agent 3: Social platforms** — Instagram (Apify + tagged/comments scrapers), DOU, company websites, LinkedIn (BrightData). Contact enrichment: `vdrmota/contact-info-scraper` on found websites
143
- - **Agent 4: TikTok + Regional** — TikTok profile/videos (`clockworks/tiktok-profile-scraper`), local registries, press, university records, Yandex search, Google Maps (`compass/crawler-google-places` if business owner)
144
- - **Agent 5: Deep research** — Perplexity deep, Exa deep, Parallel deep (if needed)
145
-
146
- ### Rules:
147
- - Always pass ALL known data to each sub-agent (names, URLs, emails, phones, context)
148
- - Each sub-agent saves results to `/tmp/osint-<subject>-<task>.md`
149
- - Main agent waits for all results, then runs Phase 3-6 (cross-reference, psychoprofile, dossier)
150
- - Budget: each sub-agent ≤$0.15, total swarm ≤$0.50
151
- - YouTube transcripts: use **Apify** actors, NOT BrightData or yt-dlp (both blocked by YouTube)
152
-
153
- ### Why swarm:
154
- - 5 agents × 5 min = 10 min total (vs 30+ min sequential)
155
- - Sonnet is 5x cheaper than Opus
156
- - Parallel scraping avoids rate limit stacking on single IP
157
-
158
- ---
159
-
160
- ## Phase 0: Tooling Self-Check
161
-
162
- 1. Execute `bash skills/osint/scripts/diagnose.sh`.
163
- 2. Log available vs missing tools.
164
- 3. Check internal tools: `tg.py` (Telegram history), `himalaya` (email), vault contacts.
165
- 4. If Bright Data unavailable → Facebook and LinkedIn deep scrape limited. Inform user.
166
- 5. If Apify unavailable → Instagram and LinkedIn structured data limited.
167
- 6. Proceed with available toolset.
168
-
169
- ## Phase 1: Seed Collection
170
-
171
- **Start with Level 1 (quick answers) ALWAYS before heavy scraping.**
172
-
173
- 1. Parse user input. Extract identifiers: names, handles, URLs, companies, locations.
174
- 2. **Perplexity fast pass:**
175
- ```bash
176
- bash skills/osint/scripts/perplexity.sh search "Who is <Name>, <context>"
177
- ```
178
- 3. **Brave + Parallel in parallel:**
179
- ```bash
180
- web_search "<Name> <company>"
181
- bash skills/osint/scripts/first-volley.sh "Full Name" "context"
182
- ```
183
- 4. **Review Perplexity citations** — fetch and verify top sources:
184
- ```bash
185
- web_fetch "<citation_url_1>"
186
- web_fetch "<citation_url_2>"
187
- ```
188
- 5. Parse & merge: `bash skills/osint/scripts/merge-volley.sh /tmp/osint-<timestamp>`.
189
- 6. Collect all identifiers into seed list. Deduplicate.
190
- 7. Flag name collisions (common names → verify with company/location cross-reference).
191
- 8. **Decision point:** enough context? → skip to Phase 4. Need social media? → Phase 2. Need deep dive? → Level 4 (deep research).
192
-
193
- **Rate limiting:** wait 1s between Brave queries, 2s between Jina calls.
194
- Do NOT hammer APIs in tight loops — stagger parallel launches.
195
-
196
- ## Phase 1.5: Internal Intelligence
197
-
198
- **Before going external, check what we already know.** This phase mines local sources
199
- that may contain gold — prior conversations, emails, vault contacts.
200
-
201
- ### Telegram History
202
- If `tg.py` is available (check Phase 0):
203
- ```bash
204
- # Search by name/handle in Telegram
205
- python3 skills/telegram/scripts/tg.py search "Name" 20
206
- # If we have their username/id — read conversation history
207
- python3 skills/telegram/scripts/tg.py history <username_or_id> 50
208
- ```
209
-
210
- **What to extract from Telegram history:**
211
- - Communication style (formal/informal, language, emoji patterns)
212
- - Topics discussed — what they care about, what they ask for
213
- - Response patterns — reply speed, active hours → timezone
214
- - Shared links/files — projects they work on
215
- - How they address the user — relationship dynamics
216
- - Mentioned colleagues, partners, competitors → social graph seeds
217
- - Pricing discussions, deal terms (if business contact)
218
-
219
- ⚠️ **Telegram history is Grade A intelligence** — unfiltered, real-time, authentic.
220
- Weight it higher than curated LinkedIn/Instagram profiles.
221
- ⚠️ **Privacy:** internal intelligence stays in the dossier. Never quote DMs in public outputs.
222
-
223
- ### Email History
224
- If `himalaya` is available:
225
- ```bash
226
- # Search emails by name or domain
227
- ~/.local/bin/himalaya search "from:name@domain.com OR to:name@domain.com" -f INBOX
228
- # Or by name
229
- ~/.local/bin/himalaya search "Name Surname" -f INBOX
230
- ~/.local/bin/himalaya search "Name Surname" -f Sent
231
- ```
232
-
233
- **What to extract from email:**
234
- - Formal communication style vs Telegram style (contrast = insight)
235
- - Business proposals, invoices → financial relationship
236
- - CC'd people → organizational map
237
- - Signature block → title, phone, company, social links (often richer than LinkedIn)
238
-
239
- ### Vault / CRM Check
240
- ```bash
241
- # Check if we already have a card
242
- grep -rl "Name" vault/crm/ vault/contacts/ 2>/dev/null
243
- # Check MOC indexes (adjust paths to your vault structure)
244
- grep -i "name" vault/MOC/*.md 2>/dev/null
245
- ```
246
-
247
- **If vault card exists:** read it, note last_accessed, existing tags, prior interactions.
248
- Don't duplicate — enrich the existing card after research completes.
249
-
250
- ### Node Camera/Location (if paired device available)
251
- If meeting in person and node is available, `nodes camera_snap` can capture context.
252
- Only with explicit user permission.
253
-
254
- ### Internal Intelligence Summary
255
- After Phase 1.5, you should know:
256
- - Do we have prior relationship? (cold/warm/hot contact)
257
- - What language do they prefer?
258
- - What's their communication style?
259
- - Any existing business context?
260
- - Social graph seeds from conversations
261
-
262
- This context shapes Phase 2 priorities — if we already know their career from emails,
263
- focus external research on psychoprofile and social media instead.
264
-
265
- ## Phase 2: Platform Extraction
266
-
267
- Read `references/platforms.md` ONLY when needing URL patterns or extraction signals.
268
-
269
- Tool priority (primary → fallback). **If primary fails, switch immediately. Never retry same tool.**
270
-
271
- - LinkedIn: `apify.sh linkedin` → `brightdata.sh scrape` → `jina.sh read`
272
- - Instagram: `apify.sh instagram` → `brightdata.sh scrape`
273
- - Instagram deep: `run-actor.sh "apify/instagram-tagged-scraper"` (who tags them), `apify/instagram-comment-scraper` (sentiment)
274
- - Facebook personal: `brightdata.sh scrape` → none (only Bright Data works)
275
- - Facebook pages/groups: `run-actor.sh "apify/facebook-pages-scraper"` → `brightdata.sh scrape`
276
- - TikTok: `run-actor.sh "clockworks/tiktok-profile-scraper"` → `clockworks/tiktok-scraper` (comprehensive)
277
- - TikTok discovery: `run-actor.sh "clockworks/tiktok-user-search-scraper"` (find by keywords)
278
- - YouTube: `run-actor.sh "streamers/youtube-channel-scraper"` → `jina.sh read` → `brightdata.sh scrape`
279
- - Telegram channels: `web_fetch t.me/s/{channel}` → `jina.sh read`
280
- - Twitter/X: `python3 scripts/twitter.py tweet <url>` → `jina.sh read`
281
- - Google Maps (businesses): `run-actor.sh "compass/crawler-google-places"`
282
- - Contact enrichment: `run-actor.sh "vdrmota/contact-info-scraper"` (extract emails/phones from any URL)
283
- - Any site: `jina.sh read` → `brightdata.sh scrape`
284
-
285
- **run-actor.sh** = universal Apify runner (embedded, 55+ actors). See `references/tools.md` for full actor catalog.
286
-
287
- Read `references/tools.md` ONLY when troubleshooting a failed tool.
288
-
289
- ### ⚠️ Content Platform Rule (CRITICAL)
290
-
291
- When you find YouTube, podcast, blog, or conference talks — read `references/content-extraction.md` **immediately** and extract 3-5 pieces of content on the spot.
292
-
293
- Do NOT just note the URL. Extract transcripts/text NOW.
294
- A 20-minute YouTube video reveals more about a person than their entire LinkedIn.
295
- Content platforms are the #1 source for psychoprofile — skipping them = shallow dossier.
296
-
297
- ### OpSec-Aware Targets
298
-
299
- If initial searches return unusually little for someone who should have a footprint:
300
-
301
- 1. **Wayback Machine:** `web_fetch "https://web.archive.org/web/2024*/target-url"` — deleted profiles, old bios
302
- 2. **Google Cache:** `web_search "cache:domain.com/path"` — recently removed pages
303
- 3. **Yandex Cache:** `brightdata.sh search-yandex "Name"` — Yandex indexes CIS deeper and caches longer
304
- 4. **Username variations:** try transliteration (Иванов → ivanov, ivanoff), birth year suffixes, company abbreviations
305
- 5. **Reverse image search:** if photo found, check for other profiles using same avatar
306
- 6. **Conference archives:** speaker bios often survive after profiles are deleted
307
-
308
- ## Phase 3: Cross-Reference & Confidence Scoring
309
-
310
- ### Step 1: Fact Table
311
- List every claim as a row: fact | source 1 | source 2 | grade.
312
-
313
- ### Step 2: Cross-check key facts
314
- For each critical fact (employer, role, location, education):
315
- - Compare LinkedIn title vs Telegram signature vs email signature vs company website
316
- - If 2+ match → Grade A
317
- - If only 1 source → Grade B
318
- - If inferred (timezone from messages, geotag) → Grade C
319
- - If single unverified mention → Grade D
320
-
321
- ### Step 3: Resolve contradictions
322
- If LinkedIn says "CEO" but company site says "Co-founder" — flag explicitly. Include both with sources. Do NOT silently pick one.
323
-
324
- ### Step 4: Name collision check
325
- If common name — verify at least 2 facts (company + city, or photo + company) link to same person. If unsure, split into separate entities.
326
-
327
- ### Confidence grades:
328
-
329
- - **A (confirmed)**: 2+ independent sources, or official/verified profile, or direct Telegram/email conversation
330
- - **B (probable)**: 1 credible source (LinkedIn, official media, company site)
331
- - **C (inferred)**: indirect evidence (photo geotag, timezone from message patterns, connections)
332
- - **D (unverified)**: single mention, could be wrong
333
-
334
- Internal intelligence (Phase 1.5) counts as an independent source.
335
-
336
- ## Phase 4: Psychoprofile
337
-
338
- Read `references/psychoprofile.md` ONLY at this phase.
339
-
340
- 1. Collect text samples: posts, bios, interviews, channel content, **Telegram messages** (highest signal).
341
- 2. Assess MBTI per dimension with cited behavioral evidence and confidence (high/medium/low).
342
- 3. Quantify writing style: sentence length, emoji density, self-reference rate.
343
- 4. **Compare formal (LinkedIn/email) vs informal (Telegram/Instagram) voice** — the delta reveals the real person.
344
- 5. Deduce values from actions, not self-reported claims.
345
- 6. Zodiac ONLY if DOB confirmed (Grade A or B).
346
-
347
- ## Phase 5: Completeness Evaluation (Recursive)
348
-
349
- ### Axis 1: Data Coverage (pass/fail per dimension)
350
-
351
- 9 mandatory checks. If any fail, flag as critical gap:
352
-
353
- 1. Subject correctly identified? (not a namesake)
354
- 2. Current role/company confirmed?
355
- 3. At least 2 social platforms found?
356
- 4. At least 1 contact method (email/phone/messenger)?
357
- 5. Career history has 2+ verifiable positions?
358
- 6. Location (current) established?
359
- 7. At least 1 photo found?
360
- 8. No unresolved contradictions between sources?
361
- 9. Internal intelligence checked? (Telegram/email/vault — even if empty)
362
-
363
- ### Axis 2: Depth Score (8 weighted criteria)
364
-
365
- | Dimension | Weight | What to score (1-10) |
366
- |-----------|--------|---------------------|
367
- | Identity | 0.15 | Full name, DOB, location, education, photo |
368
- | Career | 0.20 | Completeness of work history, current role clarity |
369
- | Digital footprint | 0.15 | Number of platforms found, account activity level |
370
- | Psychoprofile | 0.15 | MBTI confidence, writing style quantified, values deduced |
371
- | Internal intel | 0.10 | Telegram/email history depth, vault data |
372
- | Personal life | 0.05 | Family, hobbies, lifestyle, pets |
373
- | Cross-reference | 0.10 | How many facts are A-grade, contradiction count |
374
- | Actionability | 0.10 | Entry points identified, approach strategy clear |
375
-
376
- Weighted sum (1-10) = **Depth Score**.
377
-
378
- ### Axis 3: Source Diversity
379
-
380
- Count unique source types used (max 12):
381
- LinkedIn, Instagram, Facebook, Telegram DM, Telegram channel, VK, Twitter/X,
382
- company website, press/media articles, conference profiles, government/business registries,
383
- email correspondence.
384
-
385
- - 8+ source types = Excellent
386
- - 5-7 = Good
387
- - 2-4 = Shallow
388
- - 1 = Insufficient
389
-
390
- ### Gap Analysis
391
-
392
- | Depth Score | Coverage | Diagnosis | Action |
393
- |------------|----------|-----------|--------|
394
- | 8+ | All pass | Strong dossier | Proceed to Phase 6 |
395
- | 8+ | Some fail | Deep but blind spots | Target failed checks, 1 more cycle |
396
- | <7 | All pass | Wide but shallow | Deepen via interviews/articles/deepsearch |
397
- | <7 | Some fail | Restart needed | Different search angle, new tool combination |
398
-
399
- ### Stopping Criteria
400
-
401
- **(a)** Depth Score ≥ 8.0 AND all coverage checks pass → exit to Phase 6
402
- **(b)** 3 cycles completed → deliver best available with honest assessment
403
- **(c)** Two cycles with delta < 0.5 → plateau reached, deliver with note
404
-
405
- ### Calibration Benchmarks
406
-
407
- - **9-10**: full career timeline, 5+ platforms, confirmed DOB, psychoprofile with high confidence, family/hobbies known, multiple entry points, Telegram history analyzed. Equivalent to a professional PI report.
408
- - **7-8**: career outline, 3+ platforms, most facts B-grade or above, psychoprofile with medium confidence. Solid due diligence.
409
- - **5-6**: basic bio, 1-2 platforms, some gaps. Quick background check level.
410
- - **<5**: minimal data found. Name + current role at best. Flag as insufficient.
411
-
412
- ## Phase 6: Dossier Output
413
-
414
- Read `assets/dossier-template.md` before rendering. Follow the template structure exactly.
415
- No markdown tables in output (Telegram cannot render). Bullet lists only.
416
- Report Depth Score, source count, source types, and total API spend.
417
-
418
- If internal intelligence was used, add a separate **"из переписки"** section
419
- (marked as internal/confidential, not for sharing outside).
420
-
421
- ## Budget
422
-
423
- - ≤$0.50 per target: spend without asking.
424
- - >$0.50: ask user before proceeding.
425
- - Track cumulative spend per research session.
426
-
427
- ## Troubleshooting
428
-
429
- - **All tools return empty**: target has minimal digital presence. Try Bright Data Yandex search (better for CIS region), search by company + role instead of name.
430
- - **Wrong person keeps appearing**: add company name, city, or role to all queries. Use quotes around full name.
431
- - **LinkedIn blocked**: use `brightdata.sh scrape` as primary instead of Apify.
432
- - **Apify actor dead/changed**: check `apify.sh store-search "linkedin scraper"` for alternatives. Actors on Apify are volatile — always have a Bright Data fallback.
433
- - **Depth Score stuck at 6-7**: likely missing press/media articles or internal intel. Search industry publications (AdIndex, Sostav, Forbes, Kommersant for Russian market). Try `jina.sh deepsearch`. Check Telegram history.
434
- - **No social media found**: person may use pseudonyms. Search by email, phone, or company employee page. Search Apify store: `bash scripts/apify.sh store-search "people search"`. If `mcpc` installed: `APIFY_TOKEN=$APIFY_API_TOKEN mcpc --json mcp.apify.com --header "Authorization: Bearer $APIFY_TOKEN" tools-call search-actors keywords:="people search" limit:=10`. Check Telegram contacts by phone.
435
- - **TikTok scraper fails**: try `clockworks/free-tiktok-scraper` (free tier) as fallback. TikTok usernames often differ from other platforms — search by real name via `clockworks/tiktok-user-search-scraper`.
436
- - **Need emails from website**: use `vdrmota/contact-info-scraper` — it crawls the site and extracts all contact info.
437
- - **Rate limited (429)**: back off 5s, then 15s. Switch to fallback tool. Never retry immediately.
438
-
439
- ## Anti-Patterns
440
-
441
- 1. Never start with a single tool. Launch all available in parallel.
442
- 2. Never retry a failed tool more than once. Switch to fallback.
443
- 3. Never guess DOB, family, or zodiac.
444
- 4. Never attribute data without cross-referencing against namesakes.
445
- 5. Never include unsourced facts.
446
- 6. Never reveal OSINT methods in public messages.
447
- 7. Never exceed 3 recursive cycles. Diminishing returns.
448
- 8. Never rate Depth Score 9+ without justification.
449
- 9. Never skip psychoprofile. Without it, dossier = Wikipedia article.
450
- 10. Never skip Phase 1.5 (internal intel). Telegram history is often the richest source.
451
- 11. Never quote DMs verbatim in shareable outputs. Summarize and cite.
452
- 12. Never hammer APIs without rate limiting. Stagger requests.
1
+ ---
2
+ name: osint
3
+ description: >
4
+ Conduct deep OSINT research on individuals. Build full digital footprint, psychoprofile
5
+ (MBTI/Big Five), career history, social graph with confidence scores. Recursive
6
+ self-evaluation until completeness threshold is met. Includes internal intelligence
7
+ (Telegram history, email, vault contacts) before going external.
8
+ Use when: "osint", "досье", "research person", "find everything about", "пробей",
9
+ "разведка", "due diligence", "background check", "digital footprint",
10
+ "найди всё про", "собери информацию", "кто это", "профиль человека".
11
+ NOT for: company/product research without a named person, competitive analysis,
12
+ market research, content generation, or general web scraping tasks.
13
+ ---
14
+
15
+ # OSINT Skill v3.2
16
+
17
+ Systematic intelligence gathering on individuals. From a name or handle to a scored
18
+ dossier with psychoprofile, career map, and entry points.
19
+
20
+ ## Phase Router
21
+
22
+ Determine entry point from context:
23
+
24
+ - New name/handle/URL, "пробей", "find out about" → Phase 0 (full cycle)
25
+ - "Add LinkedIn/Instagram data" to existing dossier → Phase 2 (extraction)
26
+ - "Build psychoprofile" from existing data → Phase 4
27
+ - "Rate completeness" of existing dossier → Phase 5
28
+ - "Reformat" or "present" findings → Phase 6
29
+
30
+ Default (full research request): Phase 0 → 1 → 1.5 → 2 → 3 → 4 → 5 → 6.
31
+
32
+ ## Environment
33
+
34
+ All API keys via environment variables. Never hardcode tokens.
35
+
36
+ - `PERPLEXITY_API_KEY` — Perplexity Sonar (fast answers + deep research)
37
+ - `EXA_API_KEY` — Exa AI (semantic search, company/people research, deep research)
38
+ - `TAVILY_API_KEY` — Tavily (agent-optimized search + extract, $0.005/req basic)
39
+ - `APIFY_API_TOKEN` — Apify scraping (LinkedIn, Instagram, Facebook)
40
+ - `JINA_API_KEY` — Jina reader/search/deepsearch
41
+ - `PARALLEL_API_KEY` — Parallel AI search
42
+ - `BRIGHTDATA_MCP_URL` — Bright Data MCP endpoint (full URL with token)
43
+ - `MCPORTER_CONFIG` — mcporter config path
44
+
45
+ ## Scripts
46
+
47
+ Run from skill dir: `bash scripts/<name>.sh`.
48
+ Each validates env vars, exits with descriptive error + URL to get the key.
49
+
50
+ **Search & Research:**
51
+ - `diagnose.sh` — run FIRST. Capability map of all tools.
52
+ - `perplexity.sh` — `search <query>` | `sonar <query>` (AI answer) | `deep <query>` (deep research)
53
+ - `tavily.sh` — `search <query>` (basic $0.005) | `deep <query>` (advanced) | `extract <url>`
54
+ - `exa.sh` — `search <query>` | `company <name>` | `people <name>` | `crawl <url>` | `deep <prompt>`
55
+ - `first-volley.sh "Name" "context"` — parallel search, all engines at once.
56
+ - `merge-volley.sh <outdir>` — deduplicate and merge first-volley results.
57
+
58
+ **Scraping:**
59
+ - `apify.sh` — `linkedin <url>` | `instagram <handle>` | `run` | `results` | `store-search`
60
+ - `run-actor.sh` — **universal Apify runner (55+ actors).** Embedded from [apify/agent-skills](https://github.com/apify/agent-skills).
61
+ Quick answer: `bash scripts/run-actor.sh "actor/id" '{"input":"json"}'`
62
+ Export: `bash scripts/run-actor.sh "actor/id" '{"input":"json"}' --output /tmp/out.csv`
63
+ - `jina.sh` — `read <url>` | `search <query>` | `deepsearch <query>`
64
+ - `parallel.sh` — `search <query>` | `extract <url>`
65
+ - `brightdata.sh` — `scrape <url>` | `scrape-batch` | `search` | `search-geo <cc>` | `search-yandex`
66
+
67
+ ## Research Escalation Flow
68
+
69
+ **Принцип: от дешёвого к дорогому, от быстрого к глубокому.**
70
+
71
+ ### Level 1: Quick Answers (секунды, ~$0.00)
72
+ Начни ВСЕГДА с этого. Получи быстрый контекст прежде чем копать.
73
+ Запускай ВСЕ параллельно:
74
+ ```bash
75
+ # Perplexity Sonar — AI ответ с цитатами
76
+ bash skills/osint/scripts/perplexity.sh sonar "Who is <Name>, <context>"
77
+ # Brave Search — классический поиск
78
+ web_search "<Name> <company> <role>"
79
+ # Tavily — agent-optimized search с AI answer
80
+ bash skills/osint/scripts/tavily.sh search "<Name> <context>"
81
+ # Exa — семантический поиск + company/people research
82
+ bash skills/osint/scripts/exa.sh search "<Name> <context>"
83
+ bash skills/osint/scripts/exa.sh people "<Name>"
84
+ ```
85
+ → Получаешь: быстрые факты, ссылки, контекст.
86
+ → Решение: достаточно? → Phase 6. Нужно больше? → Level 2.
87
+
88
+ ### Level 2: Source Verification (секунды-минуты, ~$0.01)
89
+ Проверяй источники из Level 1 через fetch:
90
+ ```bash
91
+ # Читай найденные URL
92
+ web_fetch "<url_from_perplexity>"
93
+ bash skills/osint/scripts/jina.sh read "<url>"
94
+ bash skills/osint/scripts/parallel.sh extract "<url>"
95
+ ```
96
+ → Получаешь: подтверждённые факты, cross-reference.
97
+ → Совпадает? → дополняй досье. Нужно глубже? → Level 3.
98
+
99
+ ### Level 3: Social Media Deep Dive (~$0.01-0.10)
100
+ Подключай scraping для соцсетей:
101
+ ```bash
102
+ # LinkedIn
103
+ bash skills/osint/scripts/apify.sh linkedin "<url>"
104
+ # Instagram
105
+ bash skills/osint/scripts/apify.sh instagram "<handle>"
106
+ # Facebook, заблокированные сайты
107
+ bash skills/osint/scripts/brightdata.sh scrape "<url>"
108
+ ```
109
+ → Получаешь: структурированные профили, фото, связи.
110
+
111
+ ### Level 4: Deep Research (~$0.05-0.50)
112
+ Если нужно копать ещё глубже — формируй развёрнутый промпт и отправляй в deep research.
113
+ Запускай ВСЕ параллельно (30-60 сек каждый):
114
+ ```bash
115
+ # Perplexity Deep Research
116
+ bash skills/osint/scripts/perplexity.sh deep "<detailed research prompt about Name>"
117
+ # Exa Deep Research
118
+ bash skills/osint/scripts/exa.sh deep "<detailed prompt>"
119
+ # Parallel AI Deep Search
120
+ bash skills/osint/scripts/parallel.sh search "<detailed query>"
121
+ # Jina DeepSearch
122
+ bash skills/osint/scripts/jina.sh deepsearch "<query>"
123
+ ```
124
+
125
+ **Правило:** Level 4 промпт должен быть РАЗВЁРНУТЫМ — включай всё что уже знаешь
126
+ из Level 1-3, чтобы deep research не повторял базовые факты, а копал дальше.
127
+
128
+ ## Swarm Mode (DEFAULT)
129
+
130
+ OSINT research runs as a **swarm of parallel sub-agents on Sonnet**.
131
+ The main agent is the coordinator — it does NOT scrape itself.
132
+
133
+ ### How it works:
134
+ 1. Main agent runs Phase 0 (tooling check) and Phase 1 (seed collection) to get initial context
135
+ 2. Main agent spawns 3-5 sub-agents via `sessions_spawn` with `model: sonnet`, `mode: run`
136
+ 3. Each sub-agent gets a focused task + all known data from Phase 1
137
+ 4. Sub-agents return results → main agent merges into dossier
138
+
139
+ ### Task split pattern:
140
+ - **Agent 1: YouTube/Content** — extract transcripts via Apify (NOT yt-dlp, NOT BrightData — YouTube blocks them). 3-5 videos, speech style, topics. Use `streamers/youtube-channel-scraper` for channel data
141
+ - **Agent 2: Facebook deep** — BrightData scrape: profile, posts, about, photos, friends (use m.facebook.com for more data). For public Pages: `apify/facebook-pages-scraper` + `apify/facebook-page-contact-information`
142
+ - **Agent 3: Social platforms** — Instagram (Apify + tagged/comments scrapers), DOU, company websites, LinkedIn (BrightData). Contact enrichment: `vdrmota/contact-info-scraper` on found websites
143
+ - **Agent 4: TikTok + Regional** — TikTok profile/videos (`clockworks/tiktok-profile-scraper`), local registries, press, university records, Yandex search, Google Maps (`compass/crawler-google-places` if business owner)
144
+ - **Agent 5: Deep research** — Perplexity deep, Exa deep, Parallel deep (if needed)
145
+
146
+ ### Rules:
147
+ - Always pass ALL known data to each sub-agent (names, URLs, emails, phones, context)
148
+ - Each sub-agent saves results to `/tmp/osint-<subject>-<task>.md`
149
+ - Main agent waits for all results, then runs Phase 3-6 (cross-reference, psychoprofile, dossier)
150
+ - Budget: each sub-agent ≤$0.15, total swarm ≤$0.50
151
+ - YouTube transcripts: use **Apify** actors, NOT BrightData or yt-dlp (both blocked by YouTube)
152
+
153
+ ### Why swarm:
154
+ - 5 agents × 5 min = 10 min total (vs 30+ min sequential)
155
+ - Sonnet is 5x cheaper than Opus
156
+ - Parallel scraping avoids rate limit stacking on single IP
157
+
158
+ ---
159
+
160
+ ## Phase 0: Tooling Self-Check
161
+
162
+ 1. Execute `bash skills/osint/scripts/diagnose.sh`.
163
+ 2. Log available vs missing tools.
164
+ 3. Check internal tools: `tg.py` (Telegram history), `himalaya` (email), vault contacts.
165
+ 4. If Bright Data unavailable → Facebook and LinkedIn deep scrape limited. Inform user.
166
+ 5. If Apify unavailable → Instagram and LinkedIn structured data limited.
167
+ 6. Proceed with available toolset.
168
+
169
+ ## Phase 1: Seed Collection
170
+
171
+ **Start with Level 1 (quick answers) ALWAYS before heavy scraping.**
172
+
173
+ 1. Parse user input. Extract identifiers: names, handles, URLs, companies, locations.
174
+ 2. **Perplexity fast pass:**
175
+ ```bash
176
+ bash skills/osint/scripts/perplexity.sh search "Who is <Name>, <context>"
177
+ ```
178
+ 3. **Brave + Parallel in parallel:**
179
+ ```bash
180
+ web_search "<Name> <company>"
181
+ bash skills/osint/scripts/first-volley.sh "Full Name" "context"
182
+ ```
183
+ 4. **Review Perplexity citations** — fetch and verify top sources:
184
+ ```bash
185
+ web_fetch "<citation_url_1>"
186
+ web_fetch "<citation_url_2>"
187
+ ```
188
+ 5. Parse & merge: `bash skills/osint/scripts/merge-volley.sh /tmp/osint-<timestamp>`.
189
+ 6. Collect all identifiers into seed list. Deduplicate.
190
+ 7. Flag name collisions (common names → verify with company/location cross-reference).
191
+ 8. **Decision point:** enough context? → skip to Phase 4. Need social media? → Phase 2. Need deep dive? → Level 4 (deep research).
192
+
193
+ **Rate limiting:** wait 1s between Brave queries, 2s between Jina calls.
194
+ Do NOT hammer APIs in tight loops — stagger parallel launches.
195
+
196
+ ## Phase 1.5: Internal Intelligence
197
+
198
+ **Before going external, check what we already know.** This phase mines local sources
199
+ that may contain gold — prior conversations, emails, vault contacts.
200
+
201
+ ### Telegram History
202
+ If `tg.py` is available (check Phase 0):
203
+ ```bash
204
+ # Search by name/handle in Telegram
205
+ python3 skills/telegram/scripts/tg.py search "Name" 20
206
+ # If we have their username/id — read conversation history
207
+ python3 skills/telegram/scripts/tg.py history <username_or_id> 50
208
+ ```
209
+
210
+ **What to extract from Telegram history:**
211
+ - Communication style (formal/informal, language, emoji patterns)
212
+ - Topics discussed — what they care about, what they ask for
213
+ - Response patterns — reply speed, active hours → timezone
214
+ - Shared links/files — projects they work on
215
+ - How they address the user — relationship dynamics
216
+ - Mentioned colleagues, partners, competitors → social graph seeds
217
+ - Pricing discussions, deal terms (if business contact)
218
+
219
+ ⚠️ **Telegram history is Grade A intelligence** — unfiltered, real-time, authentic.
220
+ Weight it higher than curated LinkedIn/Instagram profiles.
221
+ ⚠️ **Privacy:** internal intelligence stays in the dossier. Never quote DMs in public outputs.
222
+
223
+ ### Email History
224
+ If `himalaya` is available:
225
+ ```bash
226
+ # Search emails by name or domain
227
+ ~/.local/bin/himalaya search "from:name@domain.com OR to:name@domain.com" -f INBOX
228
+ # Or by name
229
+ ~/.local/bin/himalaya search "Name Surname" -f INBOX
230
+ ~/.local/bin/himalaya search "Name Surname" -f Sent
231
+ ```
232
+
233
+ **What to extract from email:**
234
+ - Formal communication style vs Telegram style (contrast = insight)
235
+ - Business proposals, invoices → financial relationship
236
+ - CC'd people → organizational map
237
+ - Signature block → title, phone, company, social links (often richer than LinkedIn)
238
+
239
+ ### Vault / CRM Check
240
+ ```bash
241
+ # Check if we already have a card
242
+ grep -rl "Name" vault/crm/ vault/contacts/ 2>/dev/null
243
+ # Check MOC indexes (adjust paths to your vault structure)
244
+ grep -i "name" vault/MOC/*.md 2>/dev/null
245
+ ```
246
+
247
+ **If vault card exists:** read it, note last_accessed, existing tags, prior interactions.
248
+ Don't duplicate — enrich the existing card after research completes.
249
+
250
+ ### Node Camera/Location (if paired device available)
251
+ If meeting in person and node is available, `nodes camera_snap` can capture context.
252
+ Only with explicit user permission.
253
+
254
+ ### Internal Intelligence Summary
255
+ After Phase 1.5, you should know:
256
+ - Do we have prior relationship? (cold/warm/hot contact)
257
+ - What language do they prefer?
258
+ - What's their communication style?
259
+ - Any existing business context?
260
+ - Social graph seeds from conversations
261
+
262
+ This context shapes Phase 2 priorities — if we already know their career from emails,
263
+ focus external research on psychoprofile and social media instead.
264
+
265
+ ## Phase 2: Platform Extraction
266
+
267
+ Read `references/platforms.md` ONLY when needing URL patterns or extraction signals.
268
+
269
+ Tool priority (primary → fallback). **If primary fails, switch immediately. Never retry same tool.**
270
+
271
+ - LinkedIn: `apify.sh linkedin` → `brightdata.sh scrape` → `jina.sh read`
272
+ - Instagram: `apify.sh instagram` → `brightdata.sh scrape`
273
+ - Instagram deep: `run-actor.sh "apify/instagram-tagged-scraper"` (who tags them), `apify/instagram-comment-scraper` (sentiment)
274
+ - Facebook personal: `brightdata.sh scrape` → none (only Bright Data works)
275
+ - Facebook pages/groups: `run-actor.sh "apify/facebook-pages-scraper"` → `brightdata.sh scrape`
276
+ - TikTok: `run-actor.sh "clockworks/tiktok-profile-scraper"` → `clockworks/tiktok-scraper` (comprehensive)
277
+ - TikTok discovery: `run-actor.sh "clockworks/tiktok-user-search-scraper"` (find by keywords)
278
+ - YouTube: `run-actor.sh "streamers/youtube-channel-scraper"` → `jina.sh read` → `brightdata.sh scrape`
279
+ - Telegram channels: `web_fetch t.me/s/{channel}` → `jina.sh read`
280
+ - Twitter/X: `python3 scripts/twitter.py tweet <url>` → `jina.sh read`
281
+ - Google Maps (businesses): `run-actor.sh "compass/crawler-google-places"`
282
+ - Contact enrichment: `run-actor.sh "vdrmota/contact-info-scraper"` (extract emails/phones from any URL)
283
+ - Any site: `jina.sh read` → `brightdata.sh scrape`
284
+
285
+ **run-actor.sh** = universal Apify runner (embedded, 55+ actors). See `references/tools.md` for full actor catalog.
286
+
287
+ Read `references/tools.md` ONLY when troubleshooting a failed tool.
288
+
289
+ ### ⚠️ Content Platform Rule (CRITICAL)
290
+
291
+ When you find YouTube, podcast, blog, or conference talks — read `references/content-extraction.md` **immediately** and extract 3-5 pieces of content on the spot.
292
+
293
+ Do NOT just note the URL. Extract transcripts/text NOW.
294
+ A 20-minute YouTube video reveals more about a person than their entire LinkedIn.
295
+ Content platforms are the #1 source for psychoprofile — skipping them = shallow dossier.
296
+
297
+ ### OpSec-Aware Targets
298
+
299
+ If initial searches return unusually little for someone who should have a footprint:
300
+
301
+ 1. **Wayback Machine:** `web_fetch "https://web.archive.org/web/2024*/target-url"` — deleted profiles, old bios
302
+ 2. **Google Cache:** `web_search "cache:domain.com/path"` — recently removed pages
303
+ 3. **Yandex Cache:** `brightdata.sh search-yandex "Name"` — Yandex indexes CIS deeper and caches longer
304
+ 4. **Username variations:** try transliteration (Иванов → ivanov, ivanoff), birth year suffixes, company abbreviations
305
+ 5. **Reverse image search:** if photo found, check for other profiles using same avatar
306
+ 6. **Conference archives:** speaker bios often survive after profiles are deleted
307
+
308
+ ## Phase 3: Cross-Reference & Confidence Scoring
309
+
310
+ ### Step 1: Fact Table
311
+ List every claim as a row: fact | source 1 | source 2 | grade.
312
+
313
+ ### Step 2: Cross-check key facts
314
+ For each critical fact (employer, role, location, education):
315
+ - Compare LinkedIn title vs Telegram signature vs email signature vs company website
316
+ - If 2+ match → Grade A
317
+ - If only 1 source → Grade B
318
+ - If inferred (timezone from messages, geotag) → Grade C
319
+ - If single unverified mention → Grade D
320
+
321
+ ### Step 3: Resolve contradictions
322
+ If LinkedIn says "CEO" but company site says "Co-founder" — flag explicitly. Include both with sources. Do NOT silently pick one.
323
+
324
+ ### Step 4: Name collision check
325
+ If common name — verify at least 2 facts (company + city, or photo + company) link to same person. If unsure, split into separate entities.
326
+
327
+ ### Confidence grades:
328
+
329
+ - **A (confirmed)**: 2+ independent sources, or official/verified profile, or direct Telegram/email conversation
330
+ - **B (probable)**: 1 credible source (LinkedIn, official media, company site)
331
+ - **C (inferred)**: indirect evidence (photo geotag, timezone from message patterns, connections)
332
+ - **D (unverified)**: single mention, could be wrong
333
+
334
+ Internal intelligence (Phase 1.5) counts as an independent source.
335
+
336
+ ## Phase 4: Psychoprofile
337
+
338
+ Read `references/psychoprofile.md` ONLY at this phase.
339
+
340
+ 1. Collect text samples: posts, bios, interviews, channel content, **Telegram messages** (highest signal).
341
+ 2. Assess MBTI per dimension with cited behavioral evidence and confidence (high/medium/low).
342
+ 3. Quantify writing style: sentence length, emoji density, self-reference rate.
343
+ 4. **Compare formal (LinkedIn/email) vs informal (Telegram/Instagram) voice** — the delta reveals the real person.
344
+ 5. Deduce values from actions, not self-reported claims.
345
+ 6. Zodiac ONLY if DOB confirmed (Grade A or B).
346
+
347
+ ## Phase 5: Completeness Evaluation (Recursive)
348
+
349
+ ### Axis 1: Data Coverage (pass/fail per dimension)
350
+
351
+ 9 mandatory checks. If any fail, flag as critical gap:
352
+
353
+ 1. Subject correctly identified? (not a namesake)
354
+ 2. Current role/company confirmed?
355
+ 3. At least 2 social platforms found?
356
+ 4. At least 1 contact method (email/phone/messenger)?
357
+ 5. Career history has 2+ verifiable positions?
358
+ 6. Location (current) established?
359
+ 7. At least 1 photo found?
360
+ 8. No unresolved contradictions between sources?
361
+ 9. Internal intelligence checked? (Telegram/email/vault — even if empty)
362
+
363
+ ### Axis 2: Depth Score (8 weighted criteria)
364
+
365
+ | Dimension | Weight | What to score (1-10) |
366
+ |-----------|--------|---------------------|
367
+ | Identity | 0.15 | Full name, DOB, location, education, photo |
368
+ | Career | 0.20 | Completeness of work history, current role clarity |
369
+ | Digital footprint | 0.15 | Number of platforms found, account activity level |
370
+ | Psychoprofile | 0.15 | MBTI confidence, writing style quantified, values deduced |
371
+ | Internal intel | 0.10 | Telegram/email history depth, vault data |
372
+ | Personal life | 0.05 | Family, hobbies, lifestyle, pets |
373
+ | Cross-reference | 0.10 | How many facts are A-grade, contradiction count |
374
+ | Actionability | 0.10 | Entry points identified, approach strategy clear |
375
+
376
+ Weighted sum (1-10) = **Depth Score**.
377
+
378
+ ### Axis 3: Source Diversity
379
+
380
+ Count unique source types used (max 12):
381
+ LinkedIn, Instagram, Facebook, Telegram DM, Telegram channel, VK, Twitter/X,
382
+ company website, press/media articles, conference profiles, government/business registries,
383
+ email correspondence.
384
+
385
+ - 8+ source types = Excellent
386
+ - 5-7 = Good
387
+ - 2-4 = Shallow
388
+ - 1 = Insufficient
389
+
390
+ ### Gap Analysis
391
+
392
+ | Depth Score | Coverage | Diagnosis | Action |
393
+ |------------|----------|-----------|--------|
394
+ | 8+ | All pass | Strong dossier | Proceed to Phase 6 |
395
+ | 8+ | Some fail | Deep but blind spots | Target failed checks, 1 more cycle |
396
+ | <7 | All pass | Wide but shallow | Deepen via interviews/articles/deepsearch |
397
+ | <7 | Some fail | Restart needed | Different search angle, new tool combination |
398
+
399
+ ### Stopping Criteria
400
+
401
+ **(a)** Depth Score ≥ 8.0 AND all coverage checks pass → exit to Phase 6
402
+ **(b)** 3 cycles completed → deliver best available with honest assessment
403
+ **(c)** Two cycles with delta < 0.5 → plateau reached, deliver with note
404
+
405
+ ### Calibration Benchmarks
406
+
407
+ - **9-10**: full career timeline, 5+ platforms, confirmed DOB, psychoprofile with high confidence, family/hobbies known, multiple entry points, Telegram history analyzed. Equivalent to a professional PI report.
408
+ - **7-8**: career outline, 3+ platforms, most facts B-grade or above, psychoprofile with medium confidence. Solid due diligence.
409
+ - **5-6**: basic bio, 1-2 platforms, some gaps. Quick background check level.
410
+ - **<5**: minimal data found. Name + current role at best. Flag as insufficient.
411
+
412
+ ## Phase 6: Dossier Output
413
+
414
+ Read `assets/dossier-template.md` before rendering. Follow the template structure exactly.
415
+ No markdown tables in output (Telegram cannot render). Bullet lists only.
416
+ Report Depth Score, source count, source types, and total API spend.
417
+
418
+ If internal intelligence was used, add a separate **"из переписки"** section
419
+ (marked as internal/confidential, not for sharing outside).
420
+
421
+ ## Budget
422
+
423
+ - ≤$0.50 per target: spend without asking.
424
+ - >$0.50: ask user before proceeding.
425
+ - Track cumulative spend per research session.
426
+
427
+ ## Troubleshooting
428
+
429
+ - **All tools return empty**: target has minimal digital presence. Try Bright Data Yandex search (better for CIS region), search by company + role instead of name.
430
+ - **Wrong person keeps appearing**: add company name, city, or role to all queries. Use quotes around full name.
431
+ - **LinkedIn blocked**: use `brightdata.sh scrape` as primary instead of Apify.
432
+ - **Apify actor dead/changed**: check `apify.sh store-search "linkedin scraper"` for alternatives. Actors on Apify are volatile — always have a Bright Data fallback.
433
+ - **Depth Score stuck at 6-7**: likely missing press/media articles or internal intel. Search industry publications (AdIndex, Sostav, Forbes, Kommersant for Russian market). Try `jina.sh deepsearch`. Check Telegram history.
434
+ - **No social media found**: person may use pseudonyms. Search by email, phone, or company employee page. Search Apify store: `bash scripts/apify.sh store-search "people search"`. If `mcpc` installed: `APIFY_TOKEN=$APIFY_API_TOKEN mcpc --json mcp.apify.com --header "Authorization: Bearer $APIFY_TOKEN" tools-call search-actors keywords:="people search" limit:=10`. Check Telegram contacts by phone.
435
+ - **TikTok scraper fails**: try `clockworks/free-tiktok-scraper` (free tier) as fallback. TikTok usernames often differ from other platforms — search by real name via `clockworks/tiktok-user-search-scraper`.
436
+ - **Need emails from website**: use `vdrmota/contact-info-scraper` — it crawls the site and extracts all contact info.
437
+ - **Rate limited (429)**: back off 5s, then 15s. Switch to fallback tool. Never retry immediately.
438
+
439
+ ## Anti-Patterns
440
+
441
+ 1. Never start with a single tool. Launch all available in parallel.
442
+ 2. Never retry a failed tool more than once. Switch to fallback.
443
+ 3. Never guess DOB, family, or zodiac.
444
+ 4. Never attribute data without cross-referencing against namesakes.
445
+ 5. Never include unsourced facts.
446
+ 6. Never reveal OSINT methods in public messages.
447
+ 7. Never exceed 3 recursive cycles. Diminishing returns.
448
+ 8. Never rate Depth Score 9+ without justification.
449
+ 9. Never skip psychoprofile. Without it, dossier = Wikipedia article.
450
+ 10. Never skip Phase 1.5 (internal intel). Telegram history is often the richest source.
451
+ 11. Never quote DMs verbatim in shareable outputs. Summarize and cite.
452
+ 12. Never hammer APIs without rate limiting. Stagger requests.