@clawos-dev/clawd 0.2.50-beta.77.3a9364e → 0.2.51-beta.78.2024c11
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/persona-defaults/persona-clawd-helper/CLAUDE.md +1 -1
- package/dist/persona-defaults/persona-knowledge-base/CLAUDE.md +19 -0
- package/dist/persona-defaults/persona-researcher/CLAUDE.md +20 -1
- package/package.json +1 -1
- package/dist/persona-defaults/persona-knowledge-base/.claude/skills/karpathy-llm-wiki/SKILL.md +0 -187
- package/dist/persona-defaults/persona-knowledge-base/.claude/skills/karpathy-llm-wiki/references/archive-template.md +0 -21
- package/dist/persona-defaults/persona-knowledge-base/.claude/skills/karpathy-llm-wiki/references/article-template.md +0 -20
- package/dist/persona-defaults/persona-knowledge-base/.claude/skills/karpathy-llm-wiki/references/index-template.md +0 -18
- package/dist/persona-defaults/persona-knowledge-base/.claude/skills/karpathy-llm-wiki/references/raw-template.md +0 -7
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/README.md +0 -119
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/SKILL.md +0 -108
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/reference/continuation.md +0 -167
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/reference/html-generation.md +0 -103
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/reference/methodology.md +0 -421
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/reference/quality-gates.md +0 -192
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/reference/report-assembly.md +0 -130
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/reference/weasyprint_guidelines.md +0 -324
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/requirements.txt +0 -14
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/schemas/claim.schema.json +0 -49
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/schemas/evidence.schema.json +0 -43
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/schemas/run_manifest.schema.json +0 -97
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/schemas/source.schema.json +0 -49
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/citation_manager.py +0 -300
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/evidence_store.py +0 -205
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/extract_claims.py +0 -358
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/md_to_html.py +0 -330
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/research_engine.py +0 -584
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/source_evaluator.py +0 -292
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/validate_report.py +0 -354
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/verify_citations.py +0 -426
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/verify_claim_support.py +0 -344
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/scripts/verify_html.py +0 -220
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/templates/mckinsey_report_template.html +0 -443
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/templates/report_template.md +0 -414
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/tests/fixtures/invalid_report.md +0 -27
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/tests/fixtures/valid_report.md +0 -114
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/tests/test_citation_manager.py +0 -195
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/tests/test_evidence_store.py +0 -166
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/tests/test_extract_claims.py +0 -213
- package/dist/persona-defaults/persona-researcher/.claude/skills/deep-research/tests/test_verify_claim_support.py +0 -230
- package/dist/persona-defaults/persona-researcher/skills-lock.json +0 -11
|
@@ -1,167 +0,0 @@
|
|
|
1
|
-
# Auto-Continuation Protocol
|
|
2
|
-
|
|
3
|
-
## When to Use
|
|
4
|
-
|
|
5
|
-
Trigger auto-continuation when report exceeds 18,000 words in single run.
|
|
6
|
-
|
|
7
|
-
---
|
|
8
|
-
|
|
9
|
-
## Strategy Overview
|
|
10
|
-
|
|
11
|
-
1. Generate sections 1-10 (stay under 18K words)
|
|
12
|
-
2. Save continuation state file with context preservation
|
|
13
|
-
3. Spawn continuation agent via Task tool
|
|
14
|
-
4. Continuation agent: Reads state -> Generates next batch -> Spawns next if needed
|
|
15
|
-
5. Chain continues recursively until complete
|
|
16
|
-
|
|
17
|
-
---
|
|
18
|
-
|
|
19
|
-
## Continuation State File
|
|
20
|
-
|
|
21
|
-
**Location:** `~/.claude/research_output/continuation_state_[report_id].json`
|
|
22
|
-
|
|
23
|
-
```json
|
|
24
|
-
{
|
|
25
|
-
"version": "3.0.0",
|
|
26
|
-
"report_id": "[unique_id]",
|
|
27
|
-
"file_path": "[absolute_path_to_report.md]",
|
|
28
|
-
"mode": "[quick|standard|deep|ultradeep]",
|
|
29
|
-
|
|
30
|
-
"progress": {
|
|
31
|
-
"sections_completed": ["list of section IDs"],
|
|
32
|
-
"total_planned_sections": 15,
|
|
33
|
-
"word_count_so_far": 12000,
|
|
34
|
-
"continuation_count": 1
|
|
35
|
-
},
|
|
36
|
-
|
|
37
|
-
"artifacts": {
|
|
38
|
-
"sources_path": "[folder]/sources.jsonl",
|
|
39
|
-
"evidence_path": "[folder]/evidence.jsonl",
|
|
40
|
-
"claims_path": "[folder]/claims.jsonl",
|
|
41
|
-
"run_manifest_path": "[folder]/run_manifest.json"
|
|
42
|
-
},
|
|
43
|
-
|
|
44
|
-
"research_context": {
|
|
45
|
-
"research_question": "[original question]",
|
|
46
|
-
"key_themes": ["theme1", "theme2"],
|
|
47
|
-
"main_findings_summary": [
|
|
48
|
-
"Finding 1: [100-word summary]",
|
|
49
|
-
"Finding 2: [100-word summary]"
|
|
50
|
-
],
|
|
51
|
-
"narrative_arc": "middle"
|
|
52
|
-
},
|
|
53
|
-
|
|
54
|
-
"quality_metrics": {
|
|
55
|
-
"avg_words_per_finding": 1500,
|
|
56
|
-
"citation_density": 5.2,
|
|
57
|
-
"prose_vs_bullets_ratio": "85% prose",
|
|
58
|
-
"writing_style": "technical-precise-data-driven"
|
|
59
|
-
},
|
|
60
|
-
|
|
61
|
-
"next_sections": [
|
|
62
|
-
{"id": 11, "type": "finding", "title": "Finding X", "target_words": 1500},
|
|
63
|
-
{"id": 12, "type": "synthesis", "title": "Synthesis", "target_words": 1000}
|
|
64
|
-
]
|
|
65
|
-
}
|
|
66
|
-
```
|
|
67
|
-
|
|
68
|
-
---
|
|
69
|
-
|
|
70
|
-
## Spawning Continuation Agent
|
|
71
|
-
|
|
72
|
-
Use Task tool:
|
|
73
|
-
|
|
74
|
-
```
|
|
75
|
-
Task(
|
|
76
|
-
subagent_type="general-purpose",
|
|
77
|
-
description="Continue deep-research report generation",
|
|
78
|
-
prompt="""
|
|
79
|
-
CONTINUATION TASK: Continue existing deep-research report.
|
|
80
|
-
|
|
81
|
-
CRITICAL INSTRUCTIONS:
|
|
82
|
-
1. Read continuation state: ~/.claude/research_output/continuation_state_[report_id].json
|
|
83
|
-
2. Read existing report: [file_path from state]
|
|
84
|
-
3. Read LAST 3 completed sections for flow/style
|
|
85
|
-
4. Load research context: themes, narrative arc, writing style
|
|
86
|
-
5. Load source registry from state.artifacts.sources_path — use stable source_ids, assign display numbers via citation_manager.py
|
|
87
|
-
6. Maintain quality metrics (avg words, citation density, prose ratio)
|
|
88
|
-
|
|
89
|
-
YOUR TASK:
|
|
90
|
-
Generate next batch (stay under 18,000 words):
|
|
91
|
-
[List next_sections from state]
|
|
92
|
-
|
|
93
|
-
Use Write/Edit to append to: [file_path]
|
|
94
|
-
|
|
95
|
-
QUALITY GATES:
|
|
96
|
-
- Words per section: Within +/-20% of avg_words_per_finding
|
|
97
|
-
- Citation density: Match +/-0.5 per 1K words
|
|
98
|
-
- Prose ratio: Maintain >=80%
|
|
99
|
-
- Theme alignment: Section ties to key_themes
|
|
100
|
-
|
|
101
|
-
After generating:
|
|
102
|
-
- If more sections remain: Update state, spawn next agent
|
|
103
|
-
- If final sections: Generate bibliography, verify report, cleanup state
|
|
104
|
-
"""
|
|
105
|
-
)
|
|
106
|
-
```
|
|
107
|
-
|
|
108
|
-
---
|
|
109
|
-
|
|
110
|
-
## Continuation Agent Quality Protocol
|
|
111
|
-
|
|
112
|
-
### Context Loading (CRITICAL)
|
|
113
|
-
|
|
114
|
-
1. Read continuation_state.json -> Load ALL context
|
|
115
|
-
2. Read existing report file -> Review last 3 sections
|
|
116
|
-
3. Extract patterns:
|
|
117
|
-
- Sentence structure complexity
|
|
118
|
-
- Technical terminology used
|
|
119
|
-
- Citation placement patterns
|
|
120
|
-
- Paragraph transition style
|
|
121
|
-
|
|
122
|
-
### Pre-Generation Checklist
|
|
123
|
-
|
|
124
|
-
- [ ] Loaded research context (themes, question, narrative arc)
|
|
125
|
-
- [ ] Reviewed previous sections for flow
|
|
126
|
-
- [ ] Loaded source registry from artifacts (stable source_ids, not citation numbers)
|
|
127
|
-
- [ ] Loaded quality targets (words, density, style)
|
|
128
|
-
- [ ] Understand narrative position (beginning/middle/end)
|
|
129
|
-
|
|
130
|
-
### Per-Section Generation
|
|
131
|
-
|
|
132
|
-
1. Generate section content
|
|
133
|
-
2. Quality checks:
|
|
134
|
-
- Word count within +/-20%
|
|
135
|
-
- Citation density matches
|
|
136
|
-
- Prose ratio >=80%
|
|
137
|
-
- Theme connection verified
|
|
138
|
-
- Style consistent
|
|
139
|
-
3. If ANY fails: Regenerate
|
|
140
|
-
4. If passes: Write to file, update state
|
|
141
|
-
|
|
142
|
-
### Handoff Decision
|
|
143
|
-
|
|
144
|
-
Calculate: Current words + remaining sections x avg_words_per_section
|
|
145
|
-
- If total < 18K: Generate all + finish
|
|
146
|
-
- If total > 18K: Generate partial, update state, spawn next agent
|
|
147
|
-
|
|
148
|
-
### Final Agent Responsibilities
|
|
149
|
-
|
|
150
|
-
- Generate final content sections
|
|
151
|
-
- Generate COMPLETE bibliography from state.citations.bibliography_entries
|
|
152
|
-
- Read entire assembled report
|
|
153
|
-
- Run validation: `python scripts/validate_report.py --report [path]`
|
|
154
|
-
- Delete continuation_state.json (cleanup)
|
|
155
|
-
- Report complete to user
|
|
156
|
-
|
|
157
|
-
---
|
|
158
|
-
|
|
159
|
-
## User Communication
|
|
160
|
-
|
|
161
|
-
After spawning continuation:
|
|
162
|
-
```
|
|
163
|
-
Report Generation: Part 1 Complete (N sections, X words)
|
|
164
|
-
Auto-continuing via spawned agent...
|
|
165
|
-
Next batch: [section list]
|
|
166
|
-
Progress: [X%] complete
|
|
167
|
-
```
|
|
@@ -1,103 +0,0 @@
|
|
|
1
|
-
# HTML Generation: McKinsey Style Report
|
|
2
|
-
|
|
3
|
-
## Design Principles
|
|
4
|
-
|
|
5
|
-
- Sharp corners (NO border-radius)
|
|
6
|
-
- Muted corporate colors (navy #003d5c, gray #f8f9fa)
|
|
7
|
-
- Ultra-compact layout
|
|
8
|
-
- Info-first structure
|
|
9
|
-
- 14px base font, compact spacing
|
|
10
|
-
- No decorative gradients or colors
|
|
11
|
-
- NO EMOJIS in final HTML
|
|
12
|
-
|
|
13
|
-
---
|
|
14
|
-
|
|
15
|
-
## Generation Steps
|
|
16
|
-
|
|
17
|
-
### Step 1: Read McKinsey Template
|
|
18
|
-
Load template from: `./templates/mckinsey_report_template.html`
|
|
19
|
-
|
|
20
|
-
### Step 2: Extract Key Metrics
|
|
21
|
-
Extract 3-4 key quantitative findings for dashboard display at top.
|
|
22
|
-
|
|
23
|
-
### Step 3: Convert MD to HTML
|
|
24
|
-
|
|
25
|
-
Use Python script:
|
|
26
|
-
```bash
|
|
27
|
-
cd ~/.claude/skills/deep-research
|
|
28
|
-
python scripts/md_to_html.py [markdown_report_path]
|
|
29
|
-
```
|
|
30
|
-
|
|
31
|
-
**Script outputs two parts:**
|
|
32
|
-
- **Part A ({{CONTENT}}):** All sections except Bibliography
|
|
33
|
-
- **Part B ({{BIBLIOGRAPHY}}):** Bibliography section only
|
|
34
|
-
|
|
35
|
-
**Script handles all conversion:**
|
|
36
|
-
- Headers: `##` -> `<div class="section"><h2 class="section-title">`
|
|
37
|
-
- Headers: `###` -> `<h3 class="subsection-title">`
|
|
38
|
-
- Lists: Markdown bullets -> `<ul><li>` with nesting
|
|
39
|
-
- Tables: Markdown tables -> `<table>` with thead/tbody
|
|
40
|
-
- Paragraphs: Text wrapped in `<p>` tags
|
|
41
|
-
- Bold/italic: `**text**` -> `<strong>`, `*text*` -> `<em>`
|
|
42
|
-
- Citations: [N] preserved for tooltip conversion
|
|
43
|
-
|
|
44
|
-
### Step 4: Add Citation Tooltips (Optional)
|
|
45
|
-
|
|
46
|
-
Attribution Gradients - wrap each [N] citation:
|
|
47
|
-
```html
|
|
48
|
-
<span class="citation">[N]
|
|
49
|
-
<span class="citation-tooltip">
|
|
50
|
-
<div class="tooltip-title">[Source Title]</div>
|
|
51
|
-
<div class="tooltip-source">[Author/Publisher]</div>
|
|
52
|
-
<div class="tooltip-claim">
|
|
53
|
-
<div class="tooltip-claim-label">Supports Claim:</div>
|
|
54
|
-
[Extract sentence with this citation]
|
|
55
|
-
</div>
|
|
56
|
-
</span>
|
|
57
|
-
</span>
|
|
58
|
-
```
|
|
59
|
-
NOTE: This step is optional for speed. Basic [N] citations are sufficient.
|
|
60
|
-
|
|
61
|
-
### Step 5: Replace Template Placeholders
|
|
62
|
-
|
|
63
|
-
| Placeholder | Content |
|
|
64
|
-
|-------------|---------|
|
|
65
|
-
| {{TITLE}} | Report title (from first ## heading) |
|
|
66
|
-
| {{DATE}} | Generation date (YYYY-MM-DD) |
|
|
67
|
-
| {{SOURCE_COUNT}} | Number of unique sources |
|
|
68
|
-
| {{METRICS_DASHBOARD}} | Metrics HTML from step 2 |
|
|
69
|
-
| {{CONTENT}} | HTML from Part A |
|
|
70
|
-
| {{BIBLIOGRAPHY}} | HTML from Part B |
|
|
71
|
-
|
|
72
|
-
### Step 6: Verify HTML
|
|
73
|
-
|
|
74
|
-
```bash
|
|
75
|
-
python scripts/verify_html.py --html [html_path] --md [md_path]
|
|
76
|
-
```
|
|
77
|
-
- Pass: Proceed to open
|
|
78
|
-
- Fail: Fix errors and re-run
|
|
79
|
-
|
|
80
|
-
### Step 7: Open in Browser
|
|
81
|
-
```bash
|
|
82
|
-
open [html_path]
|
|
83
|
-
```
|
|
84
|
-
|
|
85
|
-
---
|
|
86
|
-
|
|
87
|
-
## PDF Generation
|
|
88
|
-
|
|
89
|
-
**Option A: WeasyPrint Direct (Preferred)**
|
|
90
|
-
|
|
91
|
-
1. Create print-optimized HTML following `./reference/weasyprint_guidelines.md`
|
|
92
|
-
2. Critical CSS:
|
|
93
|
-
- `page-break-inside: avoid` on tables, boxes
|
|
94
|
-
- `page-break-after: avoid` on headings
|
|
95
|
-
- `orphans: 3; widows: 3` on paragraphs
|
|
96
|
-
- Use `display: table` not Flexbox/Grid
|
|
97
|
-
- Font sizes in pt (10pt body, 8pt citations)
|
|
98
|
-
3. Generate: `weasyprint [html_path] [pdf_path]`
|
|
99
|
-
4. Open: `open [pdf_path]`
|
|
100
|
-
|
|
101
|
-
**Option B: generating-pdf Skill**
|
|
102
|
-
|
|
103
|
-
Use Task tool with general-purpose agent, invoke generating-pdf skill.
|
|
@@ -1,421 +0,0 @@
|
|
|
1
|
-
# Deep Research Methodology: 8-Phase Pipeline
|
|
2
|
-
|
|
3
|
-
## Overview
|
|
4
|
-
|
|
5
|
-
This document contains the detailed methodology for conducting deep research. The 8 phases represent a comprehensive approach to gathering, verifying, and synthesizing information from multiple sources.
|
|
6
|
-
|
|
7
|
-
---
|
|
8
|
-
|
|
9
|
-
## Phase 1: SCOPE - Research Framing
|
|
10
|
-
|
|
11
|
-
**Objective:** Define research boundaries and success criteria
|
|
12
|
-
|
|
13
|
-
**Activities:**
|
|
14
|
-
1. Decompose the question into core components
|
|
15
|
-
2. Identify stakeholder perspectives
|
|
16
|
-
3. Define scope boundaries (what's in/out)
|
|
17
|
-
4. Establish success criteria
|
|
18
|
-
5. List key assumptions to validate
|
|
19
|
-
|
|
20
|
-
**Ultrathink Application:** Use extended reasoning to explore multiple framings of the question before committing to scope.
|
|
21
|
-
|
|
22
|
-
**Output:** Structured scope document with research boundaries
|
|
23
|
-
|
|
24
|
-
---
|
|
25
|
-
|
|
26
|
-
## Phase 2: PLAN - Strategy Formulation
|
|
27
|
-
|
|
28
|
-
**Objective:** Create an intelligent research roadmap
|
|
29
|
-
|
|
30
|
-
**Activities:**
|
|
31
|
-
1. Identify primary and secondary sources
|
|
32
|
-
2. Map knowledge dependencies (what must be understood first)
|
|
33
|
-
3. Create search query strategy with variants
|
|
34
|
-
4. Plan triangulation approach
|
|
35
|
-
5. Estimate time/effort per phase
|
|
36
|
-
6. Define quality gates
|
|
37
|
-
|
|
38
|
-
**Graph-of-Thoughts:** Branch into multiple potential research paths, then converge on optimal strategy.
|
|
39
|
-
|
|
40
|
-
**Output:** Research plan with prioritized investigation paths
|
|
41
|
-
|
|
42
|
-
---
|
|
43
|
-
|
|
44
|
-
## Phase 3: RETRIEVE - Parallel Information Gathering
|
|
45
|
-
|
|
46
|
-
**Objective:** Systematically collect information from multiple sources using parallel execution for maximum speed
|
|
47
|
-
|
|
48
|
-
**CRITICAL: Execute ALL searches in parallel using a single message with multiple tool calls**
|
|
49
|
-
|
|
50
|
-
### Query Decomposition Strategy
|
|
51
|
-
|
|
52
|
-
Before launching searches, decompose the research question into 5-10 independent search angles:
|
|
53
|
-
|
|
54
|
-
1. **Core topic (semantic search)** - Meaning-based exploration of main concept
|
|
55
|
-
2. **Technical details (keyword search)** - Specific terms, APIs, implementations
|
|
56
|
-
3. **Recent developments (date-filtered)** - What's new in last 12-18 months (use current date from Step 0)
|
|
57
|
-
4. **Academic sources (domain-specific)** - Papers, research, formal analysis
|
|
58
|
-
5. **Alternative perspectives (comparison)** - Competing approaches, criticisms
|
|
59
|
-
6. **Statistical/data sources** - Quantitative evidence, metrics, benchmarks
|
|
60
|
-
7. **Industry analysis** - Commercial applications, market trends
|
|
61
|
-
8. **Critical analysis/limitations** - Known problems, failure modes, edge cases
|
|
62
|
-
|
|
63
|
-
### Parallel Execution Protocol
|
|
64
|
-
|
|
65
|
-
**Step 0: Get the current date**
|
|
66
|
-
|
|
67
|
-
Before ANY searches, retrieve today's date using Bash: `date +%Y-%m-%d`
|
|
68
|
-
Use the returned year for all date-filtered queries and recency checks. Do NOT assume a year from training data.
|
|
69
|
-
|
|
70
|
-
**Step 1: Launch ALL searches concurrently (single message)**
|
|
71
|
-
|
|
72
|
-
**CRITICAL: Use correct tool and parameters to avoid errors**
|
|
73
|
-
|
|
74
|
-
**Primary: search-cli (multi-provider, always use first)**
|
|
75
|
-
- Unified CLI aggregating Brave, Serper, Exa, Jina, and Firecrawl
|
|
76
|
-
- Auto-detects best provider per query type (academic, news, general, people)
|
|
77
|
-
- JSON output for structured processing: `search "query" --json`
|
|
78
|
-
- Modes: general, news, academic, scholar, patents, people, images, extract, scrape
|
|
79
|
-
- Example: `search "quantum computing 2025" -m academic --json -c 15`
|
|
80
|
-
- For page content extraction: `search "URL" -m extract --json`
|
|
81
|
-
- For scraping: `search "URL" -m scrape --json`
|
|
82
|
-
- Run via Bash tool: `search "query" --json -c 10`
|
|
83
|
-
|
|
84
|
-
**Fallback: WebSearch (if search-cli fails or is unavailable)**
|
|
85
|
-
- Built-in Claude web search, no setup required
|
|
86
|
-
- Parameters: `query` (required), optional `allowed_domains`, `blocked_domains`
|
|
87
|
-
- Use when: search-cli returns errors, rate-limited, or for domain-restricted queries
|
|
88
|
-
|
|
89
|
-
**Optional: Exa MCP (if configured, for semantic/neural search)**
|
|
90
|
-
- Tool name: `mcp__Exa__exa_search`
|
|
91
|
-
- Use for semantic exploration alongside search-cli keyword results
|
|
92
|
-
|
|
93
|
-
|
|
94
|
-
**NEVER mix parameter styles** - this causes "Invalid tool parameters" errors.
|
|
95
|
-
|
|
96
|
-
**Step 2: Spawn parallel deep-dive agents**
|
|
97
|
-
|
|
98
|
-
Use Task tool with general-purpose agents (3-5 agents) for:
|
|
99
|
-
- Academic paper analysis (PDFs, detailed extraction)
|
|
100
|
-
- Documentation deep dives (technical specs, API docs)
|
|
101
|
-
- Repository analysis (code examples, implementations)
|
|
102
|
-
- Specialized domain research (requires multi-step investigation)
|
|
103
|
-
|
|
104
|
-
**Sub-agent output format:** Require all sub-agents to return structured evidence, not free text:
|
|
105
|
-
```json
|
|
106
|
-
{"claim": "specific claim text", "evidence_quote": "exact quote from source", "source_url": "https://...", "source_title": "...", "confidence": 0.85}
|
|
107
|
-
```
|
|
108
|
-
This prevents synthesis fatigue when merging results from 3-5 agents.
|
|
109
|
-
|
|
110
|
-
**Evidence persistence (v3.0):** After each retrieval batch, persist evidence immediately:
|
|
111
|
-
```bash
|
|
112
|
-
# Register the source first (returns stable source_id)
|
|
113
|
-
python scripts/citation_manager.py register-source --json '{"raw_url": "...", "title": "..."}' --dir [folder]
|
|
114
|
-
|
|
115
|
-
# Then persist each evidence span from that source
|
|
116
|
-
python scripts/evidence_store.py add --json '{"source_id": "...", "quote": "exact text", "evidence_type": "direct_quote", "locator": "page 5"}' --dir [folder]
|
|
117
|
-
```
|
|
118
|
-
Evidence must not live only in model context — it must be persisted to `evidence.jsonl` before synthesis begins. This ensures continuation agents and claim-support verification can access the full evidence trail.
|
|
119
|
-
|
|
120
|
-
**Example parallel execution (using search-cli via Bash):**
|
|
121
|
-
```
|
|
122
|
-
[Single message with multiple Bash tool calls]
|
|
123
|
-
- Bash: search "quantum computing 2026 state of the art" --json -c 10
|
|
124
|
-
- Bash: search "quantum computing limitations challenges" --json -c 10
|
|
125
|
-
- Bash: search "quantum computing commercial applications 2026" -m news --json -c 10
|
|
126
|
-
- Bash: search "quantum computing vs classical comparison" --json -c 10
|
|
127
|
-
- Bash: search "quantum error correction research" -m academic --json -c 10
|
|
128
|
-
- Task(subagent_type="general-purpose", description="Analyze quantum computing papers", prompt="Deep dive into quantum computing academic papers from [CURRENT_YEAR], extract key findings and methodologies")
|
|
129
|
-
- Task(subagent_type="general-purpose", description="Industry analysis", prompt="Analyze quantum computing industry reports and market data, identify commercial applications")
|
|
130
|
-
- Task(subagent_type="general-purpose", description="Technical challenges", prompt="Extract technical limitations and challenges from quantum computing research")
|
|
131
|
-
```
|
|
132
|
-
|
|
133
|
-
**Example parallel execution (using Exa MCP - if available):**
|
|
134
|
-
```
|
|
135
|
-
[Single message with multiple tool calls]
|
|
136
|
-
- mcp__Exa__exa_search(query="quantum computing state of the art", type="neural", num_results=10, start_published_date="[use current year from Step 0]")
|
|
137
|
-
- mcp__Exa__exa_search(query="quantum computing limitations", type="keyword", num_results=10)
|
|
138
|
-
- mcp__Exa__exa_search(query="quantum computing commercial", type="auto", num_results=10, start_published_date="[use current year from Step 0]")
|
|
139
|
-
- mcp__Exa__exa_search(query="quantum error correction", type="neural", num_results=10, include_domains=["arxiv.org"])
|
|
140
|
-
- Task(subagent_type="general-purpose", description="Academic analysis", prompt="Analyze quantum computing academic papers")
|
|
141
|
-
```
|
|
142
|
-
|
|
143
|
-
**Step 3: Collect and organize results**
|
|
144
|
-
|
|
145
|
-
As results arrive:
|
|
146
|
-
1. Extract key passages with source metadata (title, URL, date, credibility)
|
|
147
|
-
2. Track information gaps that emerge
|
|
148
|
-
3. Follow promising tangents with additional targeted searches
|
|
149
|
-
4. Maintain source diversity (mix academic, industry, news, technical docs)
|
|
150
|
-
5. Monitor for quality threshold (see FFS pattern below)
|
|
151
|
-
|
|
152
|
-
### First Finish Search (FFS) Pattern
|
|
153
|
-
|
|
154
|
-
**Adaptive completion based on quality threshold:**
|
|
155
|
-
|
|
156
|
-
**Quality gate:** Proceed to Phase 4 when FIRST threshold reached:
|
|
157
|
-
- **Quick mode:** 10+ sources with avg credibility >60/100 OR 2 minutes elapsed
|
|
158
|
-
- **Standard mode:** 15+ sources with avg credibility >60/100 OR 5 minutes elapsed
|
|
159
|
-
- **Deep mode:** 25+ sources with avg credibility >70/100 OR 10 minutes elapsed
|
|
160
|
-
- **UltraDeep mode:** 30+ sources with avg credibility >75/100 OR 15 minutes elapsed
|
|
161
|
-
|
|
162
|
-
**Continue background searches:**
|
|
163
|
-
- If threshold reached early, continue remaining parallel searches in background
|
|
164
|
-
- Additional sources used in Phase 5 (SYNTHESIZE) for depth and diversity
|
|
165
|
-
- Allows fast progression without sacrificing thoroughness
|
|
166
|
-
|
|
167
|
-
### Quality Standards
|
|
168
|
-
|
|
169
|
-
**Source diversity requirements:**
|
|
170
|
-
- Minimum 3 source types (academic, industry, news, technical docs)
|
|
171
|
-
- Temporal diversity (mix of recent 12-18 months + foundational older sources)
|
|
172
|
-
- Perspective diversity (proponents + critics + neutral analysis)
|
|
173
|
-
- Geographic diversity (not just US sources)
|
|
174
|
-
|
|
175
|
-
**Credibility tracking:**
|
|
176
|
-
- Score each source 0-100 using source_evaluator.py
|
|
177
|
-
- Flag low-credibility sources (<40) for additional verification
|
|
178
|
-
- Prioritize high-credibility sources (>80) for core claims
|
|
179
|
-
|
|
180
|
-
**Techniques:**
|
|
181
|
-
- Use search-cli for all searches (primary tool, multi-provider)
|
|
182
|
-
- Fall back to WebSearch if search-cli fails or is rate-limited
|
|
183
|
-
- Use WebFetch for deep dives into specific sources (secondary)
|
|
184
|
-
- Use Exa search (via WebSearch with type="neural") for semantic exploration
|
|
185
|
-
- Use Grep/Read for local documentation
|
|
186
|
-
- Execute code for computational analysis (when needed)
|
|
187
|
-
- Use Task tool to spawn parallel retrieval agents (3-5 agents)
|
|
188
|
-
|
|
189
|
-
**Output:** Organized information repository with source tracking, credibility scores, and coverage map
|
|
190
|
-
|
|
191
|
-
---
|
|
192
|
-
|
|
193
|
-
## Phase 4: TRIANGULATE - Cross-Reference Verification
|
|
194
|
-
|
|
195
|
-
**Objective:** Validate information across multiple independent sources
|
|
196
|
-
|
|
197
|
-
**Activities:**
|
|
198
|
-
1. Identify claims requiring verification
|
|
199
|
-
2. Cross-reference facts across 3+ sources
|
|
200
|
-
3. Flag contradictions or uncertainties
|
|
201
|
-
4. Assess source credibility
|
|
202
|
-
5. Note consensus vs. debate areas
|
|
203
|
-
6. Document verification status per claim
|
|
204
|
-
|
|
205
|
-
**Quality Standards:**
|
|
206
|
-
- Core claims must have 3+ independent sources
|
|
207
|
-
- Flag any single-source information
|
|
208
|
-
- Note recency of information
|
|
209
|
-
- Identify potential biases
|
|
210
|
-
|
|
211
|
-
**Output:** Verified fact base with confidence levels
|
|
212
|
-
|
|
213
|
-
---
|
|
214
|
-
|
|
215
|
-
## Phase 4.5: OUTLINE REFINEMENT - Dynamic Evolution (WebWeaver 2025)
|
|
216
|
-
|
|
217
|
-
**Objective:** Adapt research direction based on evidence discovered
|
|
218
|
-
|
|
219
|
-
**Problem Solved:** Prevents "locked-in" research when evidence points to different conclusions or uncovers more important angles than initially planned.
|
|
220
|
-
|
|
221
|
-
**When to Execute:**
|
|
222
|
-
- **Standard/Deep/UltraDeep modes only** (Quick mode skips this)
|
|
223
|
-
- After Phase 4 (TRIANGULATE) completes
|
|
224
|
-
- Before Phase 5 (SYNTHESIZE)
|
|
225
|
-
|
|
226
|
-
**Activities:**
|
|
227
|
-
|
|
228
|
-
1. **Review Initial Scope vs. Actual Findings**
|
|
229
|
-
- Compare Phase 1 scope with Phase 3-4 discoveries
|
|
230
|
-
- Identify unexpected patterns or contradictions
|
|
231
|
-
- Note underexplored angles that emerged as critical
|
|
232
|
-
- Flag overexplored areas that proved less important
|
|
233
|
-
|
|
234
|
-
2. **Evaluate Outline Adaptation Need**
|
|
235
|
-
|
|
236
|
-
**Signals for adaptation (ANY triggers refinement):**
|
|
237
|
-
- Major findings contradict initial assumptions
|
|
238
|
-
- Evidence reveals more important angle than originally scoped
|
|
239
|
-
- Critical subtopic emerged that wasn't in original plan
|
|
240
|
-
- Original research question was too broad/narrow based on evidence
|
|
241
|
-
- Sources consistently discuss aspects not in initial outline
|
|
242
|
-
|
|
243
|
-
**Signals to keep current outline:**
|
|
244
|
-
- Evidence aligns with initial scope
|
|
245
|
-
- All key angles adequately covered
|
|
246
|
-
- No major gaps or surprises
|
|
247
|
-
|
|
248
|
-
3. **Refine Outline (if needed)**
|
|
249
|
-
|
|
250
|
-
**Update structure to reflect evidence:**
|
|
251
|
-
- Add sections for unexpected but important findings
|
|
252
|
-
- Demote/remove sections with insufficient evidence
|
|
253
|
-
- Reorder sections based on evidence strength and importance
|
|
254
|
-
- Adjust scope boundaries based on what's actually discoverable
|
|
255
|
-
|
|
256
|
-
**Example adaptation:**
|
|
257
|
-
```
|
|
258
|
-
Original outline:
|
|
259
|
-
1. Introduction
|
|
260
|
-
2. Technical Architecture
|
|
261
|
-
3. Performance Benchmarks
|
|
262
|
-
4. Conclusion
|
|
263
|
-
|
|
264
|
-
Refined after Phase 4 (evidence revealed security as critical):
|
|
265
|
-
1. Introduction
|
|
266
|
-
2. Technical Architecture
|
|
267
|
-
3. **Security Vulnerabilities (NEW - major finding)**
|
|
268
|
-
4. Performance Benchmarks (demoted - less critical than expected)
|
|
269
|
-
5. **Real-World Failure Modes (NEW - pattern emerged)**
|
|
270
|
-
6. Synthesis & Recommendations
|
|
271
|
-
```
|
|
272
|
-
|
|
273
|
-
4. **Targeted Gap Filling (if major gaps found)**
|
|
274
|
-
|
|
275
|
-
If outline refinement reveals critical knowledge gaps:
|
|
276
|
-
- Launch 2-3 targeted searches for newly identified angles
|
|
277
|
-
- Quick retrieval only (don't restart full Phase 3)
|
|
278
|
-
- Time-box to 2-5 minutes
|
|
279
|
-
- Update triangulation for new evidence only
|
|
280
|
-
|
|
281
|
-
5. **Document Adaptation Rationale**
|
|
282
|
-
|
|
283
|
-
Record in methodology appendix:
|
|
284
|
-
- What changed in outline
|
|
285
|
-
- Why it changed (evidence-driven reasons)
|
|
286
|
-
- What additional research was conducted (if any)
|
|
287
|
-
|
|
288
|
-
**Quality Standards:**
|
|
289
|
-
- Adaptation must be evidence-driven (cite specific sources that prompted change)
|
|
290
|
-
- No more than 50% outline restructuring (if more needed, scope was severely mis scoped)
|
|
291
|
-
- Retain original research question core (don't drift into different topic entirely)
|
|
292
|
-
- New sections must have supporting evidence already gathered
|
|
293
|
-
|
|
294
|
-
**Output:** Refined outline that accurately reflects evidence landscape, ready for synthesis
|
|
295
|
-
|
|
296
|
-
**Anti-Pattern Warning:**
|
|
297
|
-
- ❌ DON'T adapt outline based on speculation or "what would be interesting"
|
|
298
|
-
- ❌ DON'T add sections without supporting evidence already in hand
|
|
299
|
-
- ❌ DON'T completely abandon original research question
|
|
300
|
-
- ✅ DO adapt when evidence clearly indicates better structure
|
|
301
|
-
- ✅ DO document rationale for changes
|
|
302
|
-
- ✅ DO stay within original topic scope
|
|
303
|
-
|
|
304
|
-
---
|
|
305
|
-
|
|
306
|
-
## Phase 5: SYNTHESIZE - Deep Analysis
|
|
307
|
-
|
|
308
|
-
**Objective:** Connect insights and generate novel understanding
|
|
309
|
-
|
|
310
|
-
**Activities:**
|
|
311
|
-
1. Identify patterns across sources
|
|
312
|
-
2. Map relationships between concepts
|
|
313
|
-
3. Generate insights beyond source material
|
|
314
|
-
4. Create conceptual frameworks
|
|
315
|
-
5. Build argument structures
|
|
316
|
-
6. Develop evidence hierarchies
|
|
317
|
-
|
|
318
|
-
**Ultrathink Integration:** Use extended reasoning to explore non-obvious connections and second-order implications.
|
|
319
|
-
|
|
320
|
-
**Output:** Synthesized understanding with insight generation
|
|
321
|
-
|
|
322
|
-
---
|
|
323
|
-
|
|
324
|
-
## Phase 6: CRITIQUE - Quality Assurance
|
|
325
|
-
|
|
326
|
-
**Objective:** Rigorously evaluate research quality
|
|
327
|
-
|
|
328
|
-
**Activities:**
|
|
329
|
-
1. Review for logical consistency
|
|
330
|
-
2. Check citation completeness
|
|
331
|
-
3. Identify gaps or weaknesses
|
|
332
|
-
4. Assess balance and objectivity
|
|
333
|
-
5. Verify claims against sources
|
|
334
|
-
6. Test alternative interpretations
|
|
335
|
-
|
|
336
|
-
**Red Team Questions:**
|
|
337
|
-
- What's missing?
|
|
338
|
-
- What could be wrong?
|
|
339
|
-
- What alternative explanations exist?
|
|
340
|
-
- What biases might be present?
|
|
341
|
-
- What counterfactuals should be considered?
|
|
342
|
-
|
|
343
|
-
**Persona-Based Critique (Deep/UltraDeep only):**
|
|
344
|
-
Simulate 2-3 specific critic personas relevant to the topic:
|
|
345
|
-
- "Skeptical Practitioner" — Would someone doing this daily trust these findings?
|
|
346
|
-
- "Adversarial Reviewer" — What would a peer reviewer reject?
|
|
347
|
-
- "Implementation Engineer" — Can these recommendations actually be executed?
|
|
348
|
-
|
|
349
|
-
**Critical Gap Loop-Back:**
|
|
350
|
-
If critique identifies a critical knowledge gap (not just a writing issue), return to Phase 3 with targeted "delta-queries" before proceeding to Phase 7. Time-box to 3-5 minutes. This prevents publishing reports with known blind spots.
|
|
351
|
-
|
|
352
|
-
**Output:** Critique report with improvement recommendations
|
|
353
|
-
|
|
354
|
-
---
|
|
355
|
-
|
|
356
|
-
## Phase 7: REFINE - Iterative Improvement
|
|
357
|
-
|
|
358
|
-
**Objective:** Address gaps and strengthen weak areas
|
|
359
|
-
|
|
360
|
-
**Activities:**
|
|
361
|
-
1. Conduct additional research for gaps
|
|
362
|
-
2. Strengthen weak arguments
|
|
363
|
-
3. Add missing perspectives
|
|
364
|
-
4. Resolve contradictions
|
|
365
|
-
5. Enhance clarity
|
|
366
|
-
6. Verify revised content
|
|
367
|
-
|
|
368
|
-
**Output:** Strengthened research with addressed deficiencies
|
|
369
|
-
|
|
370
|
-
---
|
|
371
|
-
|
|
372
|
-
## Phase 8: PACKAGE - Report Generation
|
|
373
|
-
|
|
374
|
-
**Objective:** Deliver professional, actionable research
|
|
375
|
-
|
|
376
|
-
**Activities:**
|
|
377
|
-
1. Structure report with clear hierarchy
|
|
378
|
-
2. Write executive summary
|
|
379
|
-
3. Develop detailed sections
|
|
380
|
-
4. Create visualizations (tables, diagrams)
|
|
381
|
-
5. Compile full bibliography
|
|
382
|
-
6. Add methodology appendix
|
|
383
|
-
|
|
384
|
-
**Output:** Complete research report ready for use
|
|
385
|
-
|
|
386
|
-
---
|
|
387
|
-
|
|
388
|
-
## Advanced Features
|
|
389
|
-
|
|
390
|
-
### Graph-of-Thoughts Reasoning
|
|
391
|
-
|
|
392
|
-
Rather than linear thinking, branch into multiple reasoning paths:
|
|
393
|
-
- Explore alternative framings in parallel
|
|
394
|
-
- Pursue tangential leads that might be relevant
|
|
395
|
-
- Merge insights from different branches
|
|
396
|
-
- Backtrack and revise as new information emerges
|
|
397
|
-
|
|
398
|
-
### Parallel Agent Deployment
|
|
399
|
-
|
|
400
|
-
Use Task tool to spawn sub-agents for:
|
|
401
|
-
- Parallel source retrieval
|
|
402
|
-
- Independent verification paths
|
|
403
|
-
- Competing hypothesis evaluation
|
|
404
|
-
- Specialized domain analysis
|
|
405
|
-
|
|
406
|
-
### Adaptive Depth Control
|
|
407
|
-
|
|
408
|
-
Automatically adjust research depth based on:
|
|
409
|
-
- Information complexity
|
|
410
|
-
- Source availability
|
|
411
|
-
- Time constraints
|
|
412
|
-
- Confidence levels
|
|
413
|
-
|
|
414
|
-
### Citation Intelligence
|
|
415
|
-
|
|
416
|
-
Smart citation management:
|
|
417
|
-
- Track provenance of every claim
|
|
418
|
-
- Link to original sources
|
|
419
|
-
- Assess source credibility
|
|
420
|
-
- Handle conflicting sources
|
|
421
|
-
- Generate proper bibliographies
|