flonat-research 0.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude/agents/domain-reviewer.md +336 -0
- package/.claude/agents/fixer.md +226 -0
- package/.claude/agents/paper-critic.md +370 -0
- package/.claude/agents/peer-reviewer.md +289 -0
- package/.claude/agents/proposal-reviewer.md +215 -0
- package/.claude/agents/referee2-reviewer.md +367 -0
- package/.claude/agents/references/journal-referee-profiles.md +354 -0
- package/.claude/agents/references/paper-critic/council-personas.md +77 -0
- package/.claude/agents/references/paper-critic/council-prompts.md +198 -0
- package/.claude/agents/references/peer-reviewer/report-template.md +199 -0
- package/.claude/agents/references/peer-reviewer/sa-prompts.md +260 -0
- package/.claude/agents/references/peer-reviewer/security-scan.md +188 -0
- package/.claude/agents/references/proposal-reviewer/report-template.md +144 -0
- package/.claude/agents/references/proposal-reviewer/sa-prompts.md +149 -0
- package/.claude/agents/references/referee-config.md +114 -0
- package/.claude/agents/references/referee2-reviewer/audit-checklists.md +287 -0
- package/.claude/agents/references/referee2-reviewer/report-template.md +334 -0
- package/.claude/rules/design-before-results.md +52 -0
- package/.claude/rules/ignore-agents-md.md +17 -0
- package/.claude/rules/ignore-gemini-md.md +17 -0
- package/.claude/rules/lean-claude-md.md +45 -0
- package/.claude/rules/learn-tags.md +99 -0
- package/.claude/rules/overleaf-separation.md +67 -0
- package/.claude/rules/plan-first.md +175 -0
- package/.claude/rules/read-docs-first.md +50 -0
- package/.claude/rules/scope-discipline.md +28 -0
- package/.claude/settings.json +125 -0
- package/.context/current-focus.md +33 -0
- package/.context/preferences/priorities.md +36 -0
- package/.context/preferences/task-naming.md +28 -0
- package/.context/profile.md +29 -0
- package/.context/projects/_index.md +41 -0
- package/.context/projects/papers/nudge-exp.md +22 -0
- package/.context/projects/papers/uncertainty.md +31 -0
- package/.context/resources/claude-scientific-writer-review.md +48 -0
- package/.context/resources/cunningham-multi-analyst-agents.md +104 -0
- package/.context/resources/cunningham-multilang-code-audit.md +62 -0
- package/.context/resources/google-ai-co-scientist-review.md +72 -0
- package/.context/resources/karpathy-llm-council-review.md +58 -0
- package/.context/resources/multi-coder-reliability-protocol.md +175 -0
- package/.context/resources/pedro-santanna-takeaways.md +96 -0
- package/.context/resources/venue-rankings/abs_ajg_2024.csv +1823 -0
- package/.context/resources/venue-rankings/abs_ajg_2024_econ.csv +356 -0
- package/.context/resources/venue-rankings/cabs_4_4star_theory.csv +40 -0
- package/.context/resources/venue-rankings/core_2026.csv +801 -0
- package/.context/resources/venue-rankings.md +147 -0
- package/.context/workflows/README.md +69 -0
- package/.context/workflows/daily-review.md +91 -0
- package/.context/workflows/meeting-actions.md +108 -0
- package/.context/workflows/replication-protocol.md +155 -0
- package/.context/workflows/weekly-review.md +113 -0
- package/.mcp-server-biblio/formatters.py +158 -0
- package/.mcp-server-biblio/pyproject.toml +11 -0
- package/.mcp-server-biblio/server.py +678 -0
- package/.mcp-server-biblio/sources/__init__.py +14 -0
- package/.mcp-server-biblio/sources/base.py +73 -0
- package/.mcp-server-biblio/sources/formatters.py +83 -0
- package/.mcp-server-biblio/sources/models.py +22 -0
- package/.mcp-server-biblio/sources/multi_source.py +243 -0
- package/.mcp-server-biblio/sources/openalex_source.py +183 -0
- package/.mcp-server-biblio/sources/scopus_source.py +309 -0
- package/.mcp-server-biblio/sources/wos_source.py +508 -0
- package/.mcp-server-biblio/uv.lock +896 -0
- package/.scripts/README.md +161 -0
- package/.scripts/ai_pattern_density.py +446 -0
- package/.scripts/conf +445 -0
- package/.scripts/config.py +122 -0
- package/.scripts/count_inventory.py +275 -0
- package/.scripts/daily_digest.py +288 -0
- package/.scripts/done +177 -0
- package/.scripts/extract_meeting_actions.py +223 -0
- package/.scripts/focus +176 -0
- package/.scripts/generate-codex-agents-md.py +217 -0
- package/.scripts/inbox +194 -0
- package/.scripts/notion_helpers.py +325 -0
- package/.scripts/openalex/query_helpers.py +306 -0
- package/.scripts/papers +227 -0
- package/.scripts/query +223 -0
- package/.scripts/session-history.py +201 -0
- package/.scripts/skill-health.py +516 -0
- package/.scripts/skill-log-miner.py +273 -0
- package/.scripts/sync-to-codex.sh +252 -0
- package/.scripts/task +213 -0
- package/.scripts/tasks +190 -0
- package/.scripts/week +206 -0
- package/CLAUDE.md +197 -0
- package/LICENSE +21 -0
- package/MEMORY.md +38 -0
- package/README.md +269 -0
- package/docs/agents.md +44 -0
- package/docs/bibliography-setup.md +55 -0
- package/docs/council-mode.md +36 -0
- package/docs/getting-started.md +245 -0
- package/docs/hooks.md +38 -0
- package/docs/mcp-servers.md +82 -0
- package/docs/notion-setup.md +109 -0
- package/docs/rules.md +33 -0
- package/docs/scripts.md +303 -0
- package/docs/setup-overview/setup-overview.pdf +0 -0
- package/docs/skills.md +70 -0
- package/docs/system.md +159 -0
- package/hooks/block-destructive-git.sh +66 -0
- package/hooks/context-monitor.py +114 -0
- package/hooks/postcompact-restore.py +157 -0
- package/hooks/precompact-autosave.py +181 -0
- package/hooks/promise-checker.sh +124 -0
- package/hooks/protect-source-files.sh +81 -0
- package/hooks/resume-context-loader.sh +53 -0
- package/hooks/startup-context-loader.sh +102 -0
- package/package.json +51 -0
- package/packages/cli-council/.github/workflows/claude-code-review.yml +44 -0
- package/packages/cli-council/.github/workflows/claude.yml +50 -0
- package/packages/cli-council/README.md +100 -0
- package/packages/cli-council/pyproject.toml +43 -0
- package/packages/cli-council/src/cli_council/__init__.py +19 -0
- package/packages/cli-council/src/cli_council/__main__.py +185 -0
- package/packages/cli-council/src/cli_council/backends/__init__.py +8 -0
- package/packages/cli-council/src/cli_council/backends/base.py +81 -0
- package/packages/cli-council/src/cli_council/backends/claude.py +25 -0
- package/packages/cli-council/src/cli_council/backends/codex.py +27 -0
- package/packages/cli-council/src/cli_council/backends/gemini.py +26 -0
- package/packages/cli-council/src/cli_council/checkpoint.py +212 -0
- package/packages/cli-council/src/cli_council/config.py +51 -0
- package/packages/cli-council/src/cli_council/council.py +391 -0
- package/packages/cli-council/src/cli_council/models.py +46 -0
- package/packages/llm-council/.github/workflows/claude-code-review.yml +44 -0
- package/packages/llm-council/.github/workflows/claude.yml +50 -0
- package/packages/llm-council/README.md +453 -0
- package/packages/llm-council/pyproject.toml +42 -0
- package/packages/llm-council/src/llm_council/__init__.py +23 -0
- package/packages/llm-council/src/llm_council/__main__.py +259 -0
- package/packages/llm-council/src/llm_council/checkpoint.py +193 -0
- package/packages/llm-council/src/llm_council/client.py +253 -0
- package/packages/llm-council/src/llm_council/config.py +232 -0
- package/packages/llm-council/src/llm_council/council.py +482 -0
- package/packages/llm-council/src/llm_council/models.py +46 -0
- package/packages/mcp-bibliography/MEMORY.md +31 -0
- package/packages/mcp-bibliography/_app.py +226 -0
- package/packages/mcp-bibliography/formatters.py +158 -0
- package/packages/mcp-bibliography/log/2026-03-13-2100.md +35 -0
- package/packages/mcp-bibliography/pyproject.toml +15 -0
- package/packages/mcp-bibliography/run.sh +20 -0
- package/packages/mcp-bibliography/scholarly_formatters.py +83 -0
- package/packages/mcp-bibliography/server.py +1857 -0
- package/packages/mcp-bibliography/tools/__init__.py +28 -0
- package/packages/mcp-bibliography/tools/_registry.py +19 -0
- package/packages/mcp-bibliography/tools/altmetric.py +107 -0
- package/packages/mcp-bibliography/tools/core.py +92 -0
- package/packages/mcp-bibliography/tools/dblp.py +52 -0
- package/packages/mcp-bibliography/tools/openalex.py +296 -0
- package/packages/mcp-bibliography/tools/opencitations.py +102 -0
- package/packages/mcp-bibliography/tools/openreview.py +179 -0
- package/packages/mcp-bibliography/tools/orcid.py +131 -0
- package/packages/mcp-bibliography/tools/scholarly.py +575 -0
- package/packages/mcp-bibliography/tools/unpaywall.py +63 -0
- package/packages/mcp-bibliography/tools/zenodo.py +123 -0
- package/packages/mcp-bibliography/uv.lock +711 -0
- package/scripts/setup.sh +143 -0
- package/skills/beamer-deck/SKILL.md +199 -0
- package/skills/beamer-deck/references/quality-rubric.md +54 -0
- package/skills/beamer-deck/references/review-prompts.md +106 -0
- package/skills/bib-validate/SKILL.md +261 -0
- package/skills/bib-validate/references/council-mode.md +34 -0
- package/skills/bib-validate/references/deep-verify.md +79 -0
- package/skills/bib-validate/references/fix-mode.md +36 -0
- package/skills/bib-validate/references/openalex-verification.md +45 -0
- package/skills/bib-validate/references/preprint-check.md +31 -0
- package/skills/bib-validate/references/ref-manager-crossref.md +41 -0
- package/skills/bib-validate/references/report-template.md +82 -0
- package/skills/code-archaeology/SKILL.md +141 -0
- package/skills/code-review/SKILL.md +265 -0
- package/skills/code-review/references/quality-rubric.md +67 -0
- package/skills/consolidate-memory/SKILL.md +208 -0
- package/skills/context-status/SKILL.md +126 -0
- package/skills/creation-guard/SKILL.md +230 -0
- package/skills/devils-advocate/SKILL.md +130 -0
- package/skills/devils-advocate/references/competing-hypotheses.md +83 -0
- package/skills/init-project/SKILL.md +115 -0
- package/skills/init-project-course/references/memory-and-settings.md +92 -0
- package/skills/init-project-course/references/organise-templates.md +94 -0
- package/skills/init-project-course/skill.md +147 -0
- package/skills/init-project-light/skill.md +139 -0
- package/skills/init-project-research/SKILL.md +368 -0
- package/skills/init-project-research/references/atlas-pipeline-sync.md +70 -0
- package/skills/init-project-research/references/atlas-schema.md +81 -0
- package/skills/init-project-research/references/confirmation-report.md +39 -0
- package/skills/init-project-research/references/domain-profile-template.md +104 -0
- package/skills/init-project-research/references/interview-round3.md +34 -0
- package/skills/init-project-research/references/literature-discovery.md +43 -0
- package/skills/init-project-research/references/scaffold-details.md +197 -0
- package/skills/init-project-research/templates/field-calibration.md +60 -0
- package/skills/init-project-research/templates/pipeline-manifest.md +63 -0
- package/skills/init-project-research/templates/run-all.sh +116 -0
- package/skills/init-project-research/templates/seed-files.md +337 -0
- package/skills/insights-deck/SKILL.md +151 -0
- package/skills/interview-me/SKILL.md +157 -0
- package/skills/latex/SKILL.md +141 -0
- package/skills/latex/references/latex-configs.md +183 -0
- package/skills/latex-autofix/SKILL.md +230 -0
- package/skills/latex-autofix/references/known-errors.md +183 -0
- package/skills/latex-autofix/references/quality-rubric.md +50 -0
- package/skills/latex-health-check/SKILL.md +161 -0
- package/skills/learn/SKILL.md +220 -0
- package/skills/learn/scripts/validate_skill.py +265 -0
- package/skills/lessons-learned/SKILL.md +201 -0
- package/skills/literature/SKILL.md +335 -0
- package/skills/literature/references/agent-templates.md +393 -0
- package/skills/literature/references/bibliometric-apis.md +44 -0
- package/skills/literature/references/cli-council-search.md +79 -0
- package/skills/literature/references/openalex-api-guide.md +371 -0
- package/skills/literature/references/openalex-common-queries.md +381 -0
- package/skills/literature/references/openalex-workflows.md +248 -0
- package/skills/literature/references/reference-manager-sync.md +36 -0
- package/skills/literature/references/scopus-api-guide.md +208 -0
- package/skills/literature/references/wos-api-guide.md +308 -0
- package/skills/multi-perspective/SKILL.md +311 -0
- package/skills/multi-perspective/references/computational-many-analysts.md +77 -0
- package/skills/pipeline-manifest/SKILL.md +226 -0
- package/skills/pre-submission-report/SKILL.md +153 -0
- package/skills/process-reviews/SKILL.md +244 -0
- package/skills/process-reviews/references/rr-routing.md +101 -0
- package/skills/project-deck/SKILL.md +87 -0
- package/skills/project-safety/SKILL.md +135 -0
- package/skills/proofread/SKILL.md +254 -0
- package/skills/proofread/references/quality-rubric.md +104 -0
- package/skills/python-env/SKILL.md +57 -0
- package/skills/quarto-deck/SKILL.md +226 -0
- package/skills/quarto-deck/references/markdown-format.md +143 -0
- package/skills/quarto-deck/references/quality-rubric.md +54 -0
- package/skills/save-context/SKILL.md +174 -0
- package/skills/session-log/SKILL.md +98 -0
- package/skills/shared/concept-validation-gate.md +161 -0
- package/skills/shared/council-protocol.md +265 -0
- package/skills/shared/distribution-diagnostics.md +164 -0
- package/skills/shared/engagement-stratified-sampling.md +218 -0
- package/skills/shared/escalation-protocol.md +74 -0
- package/skills/shared/external-audit-protocol.md +205 -0
- package/skills/shared/intercoder-reliability.md +256 -0
- package/skills/shared/mcp-degradation.md +81 -0
- package/skills/shared/method-probing-questions.md +163 -0
- package/skills/shared/multi-language-conventions.md +143 -0
- package/skills/shared/paid-api-safety.md +174 -0
- package/skills/shared/palettes.md +90 -0
- package/skills/shared/progressive-disclosure.md +92 -0
- package/skills/shared/project-documentation-content.md +443 -0
- package/skills/shared/project-documentation-format.md +281 -0
- package/skills/shared/project-documentation.md +100 -0
- package/skills/shared/publication-output.md +138 -0
- package/skills/shared/quality-scoring.md +70 -0
- package/skills/shared/reference-resolution.md +77 -0
- package/skills/shared/research-quality-rubric.md +165 -0
- package/skills/shared/rhetoric-principles.md +54 -0
- package/skills/shared/skill-design-patterns.md +272 -0
- package/skills/shared/skill-index.md +240 -0
- package/skills/shared/system-documentation.md +334 -0
- package/skills/shared/tikz-rules.md +402 -0
- package/skills/shared/validation-tiers.md +121 -0
- package/skills/shared/venue-guides/README.md +46 -0
- package/skills/shared/venue-guides/cell_press_style.md +483 -0
- package/skills/shared/venue-guides/conferences_formatting.md +564 -0
- package/skills/shared/venue-guides/cs_conference_style.md +463 -0
- package/skills/shared/venue-guides/examples/cell_summary_example.md +247 -0
- package/skills/shared/venue-guides/examples/medical_structured_abstract.md +313 -0
- package/skills/shared/venue-guides/examples/nature_abstract_examples.md +213 -0
- package/skills/shared/venue-guides/examples/neurips_introduction_example.md +245 -0
- package/skills/shared/venue-guides/journals_formatting.md +486 -0
- package/skills/shared/venue-guides/medical_journal_styles.md +535 -0
- package/skills/shared/venue-guides/ml_conference_style.md +556 -0
- package/skills/shared/venue-guides/nature_science_style.md +405 -0
- package/skills/shared/venue-guides/reviewer_expectations.md +417 -0
- package/skills/shared/venue-guides/venue_writing_styles.md +321 -0
- package/skills/split-pdf/SKILL.md +172 -0
- package/skills/split-pdf/methodology.md +48 -0
- package/skills/sync-notion/SKILL.md +93 -0
- package/skills/system-audit/SKILL.md +157 -0
- package/skills/system-audit/references/sub-agent-prompts.md +294 -0
- package/skills/task-management/SKILL.md +131 -0
- package/skills/update-focus/SKILL.md +204 -0
- package/skills/update-project-doc/SKILL.md +194 -0
- package/skills/validate-bib/SKILL.md +242 -0
- package/skills/validate-bib/references/council-mode.md +34 -0
- package/skills/validate-bib/references/deep-verify.md +71 -0
- package/skills/validate-bib/references/openalex-verification.md +45 -0
- package/skills/validate-bib/references/preprint-check.md +31 -0
- package/skills/validate-bib/references/report-template.md +62 -0
|
@@ -0,0 +1,165 @@
|
|
|
1
|
+
# Research Quality Rubric
|
|
2
|
+
|
|
3
|
+
> Dimension-weighted scoring rubric for evaluating research papers and proposals.
|
|
4
|
+
> Adapted from ScholarEval (Moussa et al., 2025, arXiv:2510.16234) with language
|
|
5
|
+
> adjusted for social science, management, economics, and political economy.
|
|
6
|
+
>
|
|
7
|
+
> **When to load:** Any agent performing a structured research quality assessment.
|
|
8
|
+
> **Complements:** `quality-scoring.md` (deduction-based) handles artifact polish;
|
|
9
|
+
> this rubric handles substantive research quality across 8 dimensions.
|
|
10
|
+
|
|
11
|
+
## Dimensions & Scales
|
|
12
|
+
|
|
13
|
+
Each dimension is scored 1-5. Only 3/5/5 anchors are provided to keep the file lean.
|
|
14
|
+
Score 2 = between Adequate and Weak; Score 1 = fundamental deficiencies.
|
|
15
|
+
|
|
16
|
+
### 1. Problem Formulation & Research Questions (15%)
|
|
17
|
+
|
|
18
|
+
| Score | Indicators |
|
|
19
|
+
|-------|-----------|
|
|
20
|
+
| **5 Excellent** | Novel contribution clearly situated in a gap; research questions are precise, falsifiable, and theoretically grounded; scope is well-bounded |
|
|
21
|
+
| **4 Good** | Clear contribution with minor gap-articulation issues; questions are specific but could be sharper; scope is reasonable |
|
|
22
|
+
| **3 Adequate** | Contribution exists but is vaguely stated; questions are broad or partially redundant; scope creep risk |
|
|
23
|
+
|
|
24
|
+
- [ ] Gap in literature explicitly identified
|
|
25
|
+
- [ ] Research questions are stated, specific, and answerable
|
|
26
|
+
- [ ] Scope is bounded (what is in/out)
|
|
27
|
+
- [ ] Contribution is distinguished from related work
|
|
28
|
+
- [ ] Practical or theoretical relevance is articulated
|
|
29
|
+
|
|
30
|
+
### 2. Literature Review (15%)
|
|
31
|
+
|
|
32
|
+
| Score | Indicators |
|
|
33
|
+
|-------|-----------|
|
|
34
|
+
| **5 Excellent** | Comprehensive coverage of key streams; synthesises rather than lists; positions paper in ongoing debates; includes seminal and recent work |
|
|
35
|
+
| **4 Good** | Covers main streams with minor omissions; mostly synthetic; positioning is clear |
|
|
36
|
+
| **3 Adequate** | Covers basics but misses a stream or relies on dated sources; more descriptive than analytical |
|
|
37
|
+
|
|
38
|
+
- [ ] Key theoretical streams identified and covered
|
|
39
|
+
- [ ] Seminal works cited (not just recent)
|
|
40
|
+
- [ ] Review is synthetic (themes/tensions) not a list
|
|
41
|
+
- [ ] Competing perspectives acknowledged
|
|
42
|
+
- [ ] Clear link from review to research questions
|
|
43
|
+
- [ ] No major stream conspicuously absent
|
|
44
|
+
|
|
45
|
+
### 3. Methodology & Research Design (20%)
|
|
46
|
+
|
|
47
|
+
| Score | Indicators |
|
|
48
|
+
|-------|-----------|
|
|
49
|
+
| **5 Excellent** | Design is well-justified for the research questions; identification strategy is explicit; assumptions stated and defended; threats to validity addressed |
|
|
50
|
+
| **4 Good** | Design fits questions; identification mostly clear; some assumptions implicit; validity threats partially addressed |
|
|
51
|
+
| **3 Adequate** | Design is standard but justification is thin; identification strategy unclear; limited validity discussion |
|
|
52
|
+
|
|
53
|
+
- [ ] Method choice justified for the research question
|
|
54
|
+
- [ ] Identification strategy / causal logic explicit (if causal claims made)
|
|
55
|
+
- [ ] Key assumptions stated
|
|
56
|
+
- [ ] Threats to internal and external validity discussed
|
|
57
|
+
- [ ] Robustness checks or sensitivity analyses planned
|
|
58
|
+
- [ ] Replicability: sufficient procedural detail
|
|
59
|
+
|
|
60
|
+
### 4. Data Collection & Sources (10%)
|
|
61
|
+
|
|
62
|
+
| Score | Indicators |
|
|
63
|
+
|-------|-----------|
|
|
64
|
+
| **5 Excellent** | Data sources are well-suited, clearly described, and limitations acknowledged; sampling strategy justified; measurement validity established |
|
|
65
|
+
| **4 Good** | Sources are appropriate with minor documentation gaps; sampling is reasonable; most measures validated |
|
|
66
|
+
| **3 Adequate** | Sources are acceptable but convenience-driven; limited discussion of measurement validity or representativeness |
|
|
67
|
+
|
|
68
|
+
- [ ] Data sources described (origin, timeframe, access)
|
|
69
|
+
- [ ] Sampling strategy explained and justified
|
|
70
|
+
- [ ] Key variables operationalised with clear measures
|
|
71
|
+
- [ ] Data limitations acknowledged
|
|
72
|
+
- [ ] Ethical considerations addressed (if human subjects)
|
|
73
|
+
|
|
74
|
+
### 5. Analysis & Interpretation (15%)
|
|
75
|
+
|
|
76
|
+
| Score | Indicators |
|
|
77
|
+
|-------|-----------|
|
|
78
|
+
| **5 Excellent** | Analysis follows logically from design; results interpreted carefully with appropriate caveats; alternative explanations considered; effect sizes contextualised |
|
|
79
|
+
| **4 Good** | Analysis is sound; interpretation is mostly careful; some alternative explanations explored |
|
|
80
|
+
| **3 Adequate** | Analysis is correct but mechanical; over-reliance on p-values; limited exploration of alternatives |
|
|
81
|
+
|
|
82
|
+
- [ ] Analysis matches the stated methodology
|
|
83
|
+
- [ ] Results interpreted with appropriate uncertainty
|
|
84
|
+
- [ ] Alternative explanations considered
|
|
85
|
+
- [ ] Effect sizes reported and contextualised (not just significance)
|
|
86
|
+
- [ ] Limitations of the analysis acknowledged
|
|
87
|
+
- [ ] Robustness checks reported
|
|
88
|
+
|
|
89
|
+
### 6. Results & Findings (10%)
|
|
90
|
+
|
|
91
|
+
| Score | Indicators |
|
|
92
|
+
|-------|-----------|
|
|
93
|
+
| **5 Excellent** | Findings directly address research questions; tables/figures are clear and self-contained; null or unexpected results reported honestly |
|
|
94
|
+
| **4 Good** | Findings address questions with minor gaps; presentation mostly clear; negative results acknowledged |
|
|
95
|
+
| **3 Adequate** | Findings partially address questions; tables overloaded or under-explained; selective reporting suspected |
|
|
96
|
+
|
|
97
|
+
- [ ] Each research question has a corresponding finding
|
|
98
|
+
- [ ] Tables and figures are well-labelled and interpretable
|
|
99
|
+
- [ ] Null or unexpected results reported
|
|
100
|
+
- [ ] Key numbers are precise (CIs, effect sizes, sample sizes)
|
|
101
|
+
- [ ] No cherry-picking of specifications
|
|
102
|
+
|
|
103
|
+
### 7. Scholarly Writing & Presentation (10%)
|
|
104
|
+
|
|
105
|
+
| Score | Indicators |
|
|
106
|
+
|-------|-----------|
|
|
107
|
+
| **5 Excellent** | Argument flows logically; prose is concise and precise; structure supports the narrative; transitions between sections are smooth |
|
|
108
|
+
| **4 Good** | Writing is clear with occasional redundancy; structure is sound; minor flow issues |
|
|
109
|
+
| **3 Adequate** | Writing is understandable but verbose or disorganised in places; structure follows conventions but feels mechanical |
|
|
110
|
+
|
|
111
|
+
- [ ] Abstract accurately summarises the paper
|
|
112
|
+
- [ ] Logical flow from introduction to conclusion
|
|
113
|
+
- [ ] No unnecessary jargon or undefined acronyms
|
|
114
|
+
- [ ] Consistent terminology throughout
|
|
115
|
+
- [ ] Within page/word limits (if applicable)
|
|
116
|
+
|
|
117
|
+
### 8. Citations & References (5%)
|
|
118
|
+
|
|
119
|
+
| Score | Indicators |
|
|
120
|
+
|-------|-----------|
|
|
121
|
+
| **5 Excellent** | All claims supported; no orphan citations or missing references; formatting consistent; bibliography is complete and accurate |
|
|
122
|
+
| **4 Good** | Minor formatting inconsistencies; one or two unsupported claims; bibliography mostly complete |
|
|
123
|
+
| **3 Adequate** | Several formatting issues; some claims lack support; bibliography has gaps or duplicates |
|
|
124
|
+
|
|
125
|
+
- [ ] All factual claims have citations
|
|
126
|
+
- [ ] No orphan `\cite{}` keys (all resolve)
|
|
127
|
+
- [ ] Reference list matches in-text citations (no extras, no missing)
|
|
128
|
+
- [ ] Formatting consistent with target venue style
|
|
129
|
+
- [ ] DOIs or stable URLs present where appropriate
|
|
130
|
+
|
|
131
|
+
## Weighted Scoring
|
|
132
|
+
|
|
133
|
+
```
|
|
134
|
+
Score = (0.15 x D1) + (0.15 x D2) + (0.20 x D3) + (0.10 x D4)
|
|
135
|
+
+ (0.15 x D5) + (0.10 x D6) + (0.10 x D7) + (0.05 x D8)
|
|
136
|
+
```
|
|
137
|
+
|
|
138
|
+
| Range | Verdict | Interpretation |
|
|
139
|
+
|-------|---------|---------------|
|
|
140
|
+
| 4.5 - 5.0 | Exceptional | Ready for top-tier venue submission |
|
|
141
|
+
| 4.0 - 4.4 | Strong | Minor revisions needed |
|
|
142
|
+
| 3.5 - 3.9 | Good | Major revisions required |
|
|
143
|
+
| 3.0 - 3.4 | Acceptable | Significant rework before submission |
|
|
144
|
+
| < 3.0 | Weak | Fundamental issues to address |
|
|
145
|
+
|
|
146
|
+
## Contextual Adjustments
|
|
147
|
+
|
|
148
|
+
**By stage:** Proposals — relax D4, D5, D6 (no results yet); weight D1, D2, D3 more heavily. Working drafts — expect D7, D8 to be lower; focus on D1-D6. Final manuscripts — full rubric, no relaxation.
|
|
149
|
+
|
|
150
|
+
**By venue tier:** Workshop/conference papers — accept 3.5+ overall. CABS 3 journals — target 4.0+. CABS 4/4* or FT 50 — target 4.5+; all dimensions should be 4+.
|
|
151
|
+
|
|
152
|
+
**By paper type:** Theoretical papers — D4 may be N/A (re-weight to D1-D3). Empirical papers — full rubric applies. Review/survey papers — D4-D6 may be N/A; weight D2 at 30%.
|
|
153
|
+
|
|
154
|
+
When a dimension is N/A, redistribute its weight proportionally across the remaining dimensions.
|
|
155
|
+
|
|
156
|
+
## Agent Integration
|
|
157
|
+
|
|
158
|
+
| Agent | Loads | Dimensions used |
|
|
159
|
+
|-------|-------|----------------|
|
|
160
|
+
| `referee2-reviewer` | Always | All 8 (full scorecard) |
|
|
161
|
+
| `domain-reviewer` | When scoring requested | D1, D3, D5 (overlap with its domain lenses) |
|
|
162
|
+
| `paper-critic` | When scoring requested | D7, D8 (overlap with writing/citation categories) |
|
|
163
|
+
| `/pre-submission-report` | Always | Aggregate weighted score in summary |
|
|
164
|
+
|
|
165
|
+
Agents should report per-dimension scores in a table, then the weighted aggregate, then the verdict. Use the checklist items as evidence anchors for each score.
|
|
@@ -0,0 +1,54 @@
|
|
|
1
|
+
# Rhetoric Principles for Presentations
|
|
2
|
+
|
|
3
|
+
> Shared reference for `/beamer-deck`, `/quarto-deck`, and `/quarto-course`. Drawn from Scott Cunningham's "The Rhetoric of Decks" essay.
|
|
4
|
+
> Full source: `resources/academics/scott-cunningham/MixtapeTools/presentations/rhetoric_of_decks.md`
|
|
5
|
+
|
|
6
|
+
## The Three Laws
|
|
7
|
+
|
|
8
|
+
| Law | Principle |
|
|
9
|
+
|-----|-----------|
|
|
10
|
+
| **Beauty Is Function** | Every element earns its presence. Decoration without function is noise. A slide with three words on a blank background can be the most beautiful. |
|
|
11
|
+
| **Cognitive Load Is the Enemy** | Audience has limited working memory. Too many points = zero points retained. One idea per slide — not a guideline, a law. |
|
|
12
|
+
| **The Slide Serves the Spoken Word** | Slides provide focal points, memory hooks, evidence, structural markers. If slides work without speaking, you've written a document, not a presentation. |
|
|
13
|
+
|
|
14
|
+
## MB/MC Equivalence
|
|
15
|
+
|
|
16
|
+
Optimal rhetoric equalises the marginal-benefit-to-marginal-cost ratio across all slides. For each slide ask: "If I added one more element, would benefit justify cognitive cost? If I removed one, would I lose more than I'd gain in clarity?" When every slide answers "I'm at the margin," you're optimised.
|
|
17
|
+
|
|
18
|
+
- **Overloaded slides** (MB/MC too low): text running to footer, competing ideas, charts with too many series.
|
|
19
|
+
- **Underloaded slides** (MB/MC too high): single word that could support a sentence, wasted attention.
|
|
20
|
+
- **Rhythm**: vary dense and light slides deliberately.
|
|
21
|
+
|
|
22
|
+
## Aristotelian Triad
|
|
23
|
+
|
|
24
|
+
| Mode | Role in decks | Danger if absent |
|
|
25
|
+
|------|---------------|------------------|
|
|
26
|
+
| **Ethos** (credibility) | Process diagrams, acknowledged limitations, Devil's Advocate slides | Audience won't trust you |
|
|
27
|
+
| **Pathos** (emotion) | Open with a recognised problem, connect data to human impact | Audience won't care |
|
|
28
|
+
| **Logos** (logic) | Data viz revealing patterns, comparison tables, problem→solution flow | Audience won't follow |
|
|
29
|
+
|
|
30
|
+
## Narrative Arc (Three Acts)
|
|
31
|
+
|
|
32
|
+
1. **Act I — Problem (Tension):** Establish status quo, introduce disruption, make audience feel the problem.
|
|
33
|
+
2. **Act II — Investigation (Development):** Show what you tried, present what you learned, build the logical case.
|
|
34
|
+
3. **Act III — Resolution (Release):** Deliver insight, show implications, call to action.
|
|
35
|
+
|
|
36
|
+
## Pyramid Principle
|
|
37
|
+
|
|
38
|
+
Lead with the conclusion, then support it. Humans want the punchline first, then the reasoning.
|
|
39
|
+
|
|
40
|
+
- **Opening slide**: Provocative question, surprising statistic, bold claim. Never "Today I'm going to talk about..."
|
|
41
|
+
- **Closing slide**: Single memorable takeaway or return to opening, resolved. Never "Questions?" or "Thank you."
|
|
42
|
+
|
|
43
|
+
## The Devil's Advocate
|
|
44
|
+
|
|
45
|
+
Before presenting your argument, present its strongest critique. Show anticipated objections, weaknesses you acknowledge, mitigations you've developed. Format: "A skeptic would say..." → your response.
|
|
46
|
+
|
|
47
|
+
## Context-Specific Guidance
|
|
48
|
+
|
|
49
|
+
| Audience | Ethos/Pathos/Logos | Key adjustments |
|
|
50
|
+
|----------|-------------------|-----------------|
|
|
51
|
+
| **Academic seminar** | 20% / 35% / 45% | Lead with the puzzle, not the literature. Identification strategy early. One coefficient at a time. Acknowledge limitations before Q&A. |
|
|
52
|
+
| **Conference talk** | 25% / 35% / 40% | Emphasise identification strategy and key results. Tight narrative — no tangents. |
|
|
53
|
+
| **Teaching lecture** | 20% / 30% / 50% | Brevity < clarity. Repetition is allowed. Show reasoning, not just conclusions. Pause points for questions. |
|
|
54
|
+
| **Working deck** | 30% / 10% / 60% | Document choices (why A over B?). Preserve uncertainty. Show the math. Prioritise rigor over polish. |
|
|
@@ -0,0 +1,272 @@
|
|
|
1
|
+
# Skill Design Patterns
|
|
2
|
+
|
|
3
|
+
> Reference for designing new skills. Read this before writing any SKILL.md.
|
|
4
|
+
> Loaded on demand by `/learn` and when manually creating skills.
|
|
5
|
+
|
|
6
|
+
## Structural Patterns
|
|
7
|
+
|
|
8
|
+
Choose based on what the skill does. Most skills fit one pattern; some combine two.
|
|
9
|
+
|
|
10
|
+
### Workflow-Based
|
|
11
|
+
|
|
12
|
+
Multi-step processes where order matters and user approval is needed between phases.
|
|
13
|
+
|
|
14
|
+
```
|
|
15
|
+
Phase 1: Gather → Phase 2: Process → REVIEW GATE → Phase 3: Output
|
|
16
|
+
```
|
|
17
|
+
|
|
18
|
+
**When to use:** Report generation, project bootstrapping, multi-pass analysis, content creation with review cycles.
|
|
19
|
+
|
|
20
|
+
**Key elements:**
|
|
21
|
+
- Numbered phases with explicit entry/exit criteria
|
|
22
|
+
- Review gates between phases (hard stops where the agent presents work and waits)
|
|
23
|
+
- Default assumptions table to reduce friction
|
|
24
|
+
|
|
25
|
+
|
|
26
|
+
### Task-Based
|
|
27
|
+
|
|
28
|
+
Focused input/output with clear processing rules. No phases — just do the thing well.
|
|
29
|
+
|
|
30
|
+
```
|
|
31
|
+
Input spec → Processing rules → Output spec → Quality checks
|
|
32
|
+
```
|
|
33
|
+
|
|
34
|
+
**When to use:** Code review, proofreading, validation, data transformation, file conversion.
|
|
35
|
+
|
|
36
|
+
**Key elements:**
|
|
37
|
+
- Explicit input format and constraints
|
|
38
|
+
- Processing rules as imperative instructions
|
|
39
|
+
- Output format specification
|
|
40
|
+
- Verification criteria
|
|
41
|
+
|
|
42
|
+
|
|
43
|
+
### Agent-Delegation
|
|
44
|
+
|
|
45
|
+
Multiple subagents, each responsible for a distinct concern. Coordinator manages handoffs.
|
|
46
|
+
|
|
47
|
+
```
|
|
48
|
+
Coordinator → Subagent A (generate) → Subagent B (critique) → Coordinator (synthesize)
|
|
49
|
+
```
|
|
50
|
+
|
|
51
|
+
**When to use:** Tasks where quality benefits from separation of concerns — writing with editing, analysis with critique, multi-perspective evaluation.
|
|
52
|
+
|
|
53
|
+
**Key elements:**
|
|
54
|
+
- `agents/` directory or Task tool delegation with focused system prompts
|
|
55
|
+
- Each subagent does one thing well
|
|
56
|
+
- Coordinator synthesizes outputs
|
|
57
|
+
|
|
58
|
+
|
|
59
|
+
### Reference-Based
|
|
60
|
+
|
|
61
|
+
Augments the agent with domain knowledge it lacks, loaded on demand.
|
|
62
|
+
|
|
63
|
+
```
|
|
64
|
+
SKILL.md (routing + procedures) → references/ (domain knowledge)
|
|
65
|
+
```
|
|
66
|
+
|
|
67
|
+
**When to use:** Specialized domains where training data is insufficient — niche APIs, internal conventions, proprietary formats, scoring rubrics.
|
|
68
|
+
|
|
69
|
+
**Key elements:**
|
|
70
|
+
- SKILL.md stays concise — routing logic and procedures only
|
|
71
|
+
- `references/` contains detailed specs, lookup tables, scoring rubrics
|
|
72
|
+
- Agent reads references on demand, not all at once
|
|
73
|
+
|
|
74
|
+
|
|
75
|
+
---
|
|
76
|
+
|
|
77
|
+
## Design Elements
|
|
78
|
+
|
|
79
|
+
### Review Gates
|
|
80
|
+
|
|
81
|
+
Prevent runaway execution by requiring user approval at checkpoints.
|
|
82
|
+
|
|
83
|
+
```markdown
|
|
84
|
+
### REVIEW GATE: [Gate Name]
|
|
85
|
+
|
|
86
|
+
Present the following to the user and STOP:
|
|
87
|
+
- [What to show]
|
|
88
|
+
- [What to show]
|
|
89
|
+
|
|
90
|
+
Do NOT proceed until the user explicitly approves.
|
|
91
|
+
```
|
|
92
|
+
|
|
93
|
+
Place gates after planning/before execution, and after first draft/before refinement.
|
|
94
|
+
|
|
95
|
+
### Anti-Pattern Lists
|
|
96
|
+
|
|
97
|
+
Tell the agent what NOT to do. Agents default to safe, generic patterns unless explicitly constrained. Negative constraints are as important as positive instructions.
|
|
98
|
+
|
|
99
|
+
```markdown
|
|
100
|
+
## Never Do These
|
|
101
|
+
|
|
102
|
+
- Never use "leverage" — use "use" instead
|
|
103
|
+
- Never produce bullet points when a table would be clearer
|
|
104
|
+
- Never assume verbose output — default to concise
|
|
105
|
+
```
|
|
106
|
+
|
|
107
|
+
### Default Assumptions Table
|
|
108
|
+
|
|
109
|
+
Reduce friction by pre-answering common questions. Only ask the user what can't be reasonably defaulted.
|
|
110
|
+
|
|
111
|
+
```markdown
|
|
112
|
+
## Defaults
|
|
113
|
+
|
|
114
|
+
| Setting | Default | Override |
|
|
115
|
+
|---------|---------|---------|
|
|
116
|
+
| Output format | Markdown | User specifies otherwise |
|
|
117
|
+
| Verbosity | Concise | User says "detailed" |
|
|
118
|
+
| Location | Project root | User provides path |
|
|
119
|
+
```
|
|
120
|
+
|
|
121
|
+
### Drift Prevention
|
|
122
|
+
|
|
123
|
+
Skills that reference external state (database schemas, file counts, theme lists, status options) go stale silently. Use this 3-part pattern when a skill has hardcoded values derived from mutable sources.
|
|
124
|
+
|
|
125
|
+
**Part 1: Runtime detection (Phase 0)**
|
|
126
|
+
|
|
127
|
+
Add a Phase 0 at the start of the skill that checks hardcoded values against reality before proceeding:
|
|
128
|
+
|
|
129
|
+
```markdown
|
|
130
|
+
## Phase 0: Drift Detection
|
|
131
|
+
|
|
132
|
+
Before starting, verify skill references:
|
|
133
|
+
1. Count actual {items} — compare against ~N in this skill
|
|
134
|
+
2. List actual {categories} — compare against the N listed here
|
|
135
|
+
3. Fetch {schema/config} — verify properties still match
|
|
136
|
+
|
|
137
|
+
If drift detected:
|
|
138
|
+
- Warn in output header: "⚠ Drift: {what} was {expected}, actually {found}"
|
|
139
|
+
- Use actual values for this run (self-heal)
|
|
140
|
+
- Log to references/drift-checks.md
|
|
141
|
+
```
|
|
142
|
+
|
|
143
|
+
**Part 2: Drift reference file**
|
|
144
|
+
|
|
145
|
+
Create `references/drift-checks.md` listing every drift-prone value:
|
|
146
|
+
|
|
147
|
+
```markdown
|
|
148
|
+
| Value | Location | Source of Truth | Trigger |
|
|
149
|
+
|-------|----------|-----------------|---------|
|
|
150
|
+
| Topic count (~67) | SKILL.md Phase 1 | ls topics/**/*.md | /scaffold-atlas-topics |
|
|
151
|
+
| Theme list (9) | SKILL.md, sa-prompts.md, build_report.py | ls Projects/ | New theme dir |
|
|
152
|
+
```
|
|
153
|
+
|
|
154
|
+
Include a Drift Log table where Phase 0 appends detected drift with timestamps.
|
|
155
|
+
|
|
156
|
+
**Part 3: Trigger notes in related skills**
|
|
157
|
+
|
|
158
|
+
Add cross-references in skills that commonly cause drift:
|
|
159
|
+
|
|
160
|
+
```markdown
|
|
161
|
+
| `/atlas-review` | **Drift trigger:** new topics change count — see drift-checks.md |
|
|
162
|
+
```
|
|
163
|
+
|
|
164
|
+
This ensures the person (or agent) running the triggering skill is aware that downstream skills may need updating.
|
|
165
|
+
|
|
166
|
+
**When to use:** Any skill with hardcoded counts, database IDs, schema property names, status option lists, file paths derived from external systems, or theme/category lists that grow over time.
|
|
167
|
+
|
|
168
|
+
**When to skip:** Skills that derive all values at runtime (no hardcoded references to external state).
|
|
169
|
+
|
|
170
|
+
**Example:** `/atlas-review` — topic count, theme list, vault schemas, stage mappings, rules count all drift. Phase 0 detects and self-heals; `references/drift-checks.md` tracks 8 drift-prone values.
|
|
171
|
+
|
|
172
|
+
### Progressive Disclosure
|
|
173
|
+
|
|
174
|
+
Control what goes where based on how often the agent needs it:
|
|
175
|
+
|
|
176
|
+
| Location | Loaded | Use for |
|
|
177
|
+
|----------|--------|---------|
|
|
178
|
+
| `name` + `description` | Always in context | Trigger matching — when to activate |
|
|
179
|
+
| SKILL.md body | When skill triggers | Core procedures, workflow, constraints |
|
|
180
|
+
| `references/` | On demand via Read | Detailed specs, large examples, lookup tables |
|
|
181
|
+
| `scripts/` | On demand via Bash | Deterministic operations (validation, formatting) |
|
|
182
|
+
|
|
183
|
+
**Rule of thumb:** If the agent needs it every time → SKILL.md body. If it needs it sometimes → `references/`. If it should execute it → `scripts/`.
|
|
184
|
+
|
|
185
|
+
---
|
|
186
|
+
|
|
187
|
+
## Writing Guidelines
|
|
188
|
+
|
|
189
|
+
### For the Description (Frontmatter)
|
|
190
|
+
|
|
191
|
+
The `description` determines when the skill activates. It's always in context. **Write in third person** — the description is injected into the system prompt, and inconsistent point-of-view causes discovery problems.
|
|
192
|
+
|
|
193
|
+
**Good:**
|
|
194
|
+
- `"Analyze datasets using statistical methods. Handles EDA, hypothesis testing, and causal inference. Use when asked to analyze CSV/Excel data or run A/B test analysis."`
|
|
195
|
+
- `"Academic proofreading for LaTeX papers. Grammar, notation consistency, citation format, tone. Report-only — never edits source files."`
|
|
196
|
+
|
|
197
|
+
**Bad:**
|
|
198
|
+
- `"A helpful skill"` — too vague, triggers on everything
|
|
199
|
+
- `"I can help you process files"` — wrong point-of-view (first person)
|
|
200
|
+
- `"Skill for doing things with files"` — will trigger on every file operation
|
|
201
|
+
|
|
202
|
+
**Tips:**
|
|
203
|
+
- Lead with the primary capability
|
|
204
|
+
- Include concrete task types as trigger phrases
|
|
205
|
+
- End with "Use when..." to define activation conditions
|
|
206
|
+
- State what it does NOT do if there's a common confusion
|
|
207
|
+
- Max 1024 characters; name max 64 characters (lowercase, hyphens, numbers only)
|
|
208
|
+
|
|
209
|
+
### For the Body (System Prompt)
|
|
210
|
+
|
|
211
|
+
- **Write for an AI agent, not a human.** Procedural knowledge the agent cannot infer.
|
|
212
|
+
- **Imperative form.** "Parse the input" not "You should parse the input."
|
|
213
|
+
- **Be specific about what NOT to do.** Anti-pattern lists are highly effective.
|
|
214
|
+
- **Include concrete examples.** Input/output pairs and good/bad snippets beat abstract rules.
|
|
215
|
+
- **Keep SKILL.md under 500 lines** (Anthropic official threshold). Move detailed specs and large examples to `references/`. Aim for under 300 lines when possible.
|
|
216
|
+
- **Every instruction must be actionable.** If the agent cannot act on a sentence, delete it.
|
|
217
|
+
- **Use tables for structured data.** Defaults, field specs, command references — tables are faster to parse than prose.
|
|
218
|
+
- **One section, one concern.** Don't mix workflow steps with quality criteria.
|
|
219
|
+
|
|
220
|
+
### For allowed_tools
|
|
221
|
+
|
|
222
|
+
Follow principle of least privilege:
|
|
223
|
+
|
|
224
|
+
| Skill type | Tools |
|
|
225
|
+
|-----------|-------|
|
|
226
|
+
| Report-only | `Read`, `Glob`, `Grep` |
|
|
227
|
+
| File-creating | + `Write`, `Edit` |
|
|
228
|
+
| Shell-needing | + specific `Bash(command*)` patterns |
|
|
229
|
+
| Interactive | + `AskUserQuestion` |
|
|
230
|
+
| Delegating | + `Task` |
|
|
231
|
+
|
|
232
|
+
---
|
|
233
|
+
|
|
234
|
+
## Quality Checklist
|
|
235
|
+
|
|
236
|
+
Before finalising any skill, verify:
|
|
237
|
+
|
|
238
|
+
| Check | Question |
|
|
239
|
+
|-------|----------|
|
|
240
|
+
| Trigger clarity | Would Claude know when to invoke this from the description alone? |
|
|
241
|
+
| Pattern fit | Does the structure match one of the 4 patterns above? |
|
|
242
|
+
| No duplication | Does this overlap with an existing skill? |
|
|
243
|
+
| Anti-patterns | Does it say what NOT to do, not just what to do? |
|
|
244
|
+
| Examples present | Are there concrete before/after or good/bad examples? |
|
|
245
|
+
| Minimum tools | Only the tools actually needed are in `allowed_tools`? |
|
|
246
|
+
| No secrets | No API keys, tokens, or passwords? |
|
|
247
|
+
| References extracted | Is detailed reference material in `references/`, not inline? |
|
|
248
|
+
| Description specificity | Would the description distinguish this from similar skills? |
|
|
249
|
+
| Tested | Has the procedure been tested, not just theorised? |
|
|
250
|
+
| Refs one-level deep | All reference files link directly from SKILL.md (no nested refs)? |
|
|
251
|
+
| Third-person description | Description uses "Processes..." not "I can..." or "You can..."? |
|
|
252
|
+
|
|
253
|
+
---
|
|
254
|
+
|
|
255
|
+
## Evaluation-Driven Development
|
|
256
|
+
|
|
257
|
+
Per [Anthropic's official best practices](https://platform.claude.com/docs/en/agents-and-tools/agent-skills/best-practices):
|
|
258
|
+
|
|
259
|
+
1. **Identify gaps:** Run Claude on representative tasks without the skill. Document specific failures.
|
|
260
|
+
2. **Create evaluations:** Build 3+ scenarios that test these gaps.
|
|
261
|
+
3. **Establish baseline:** Measure Claude's performance without the skill.
|
|
262
|
+
4. **Write minimal instructions:** Just enough to address the gaps and pass evaluations.
|
|
263
|
+
5. **Iterate:** Execute evaluations, compare against baseline, refine.
|
|
264
|
+
|
|
265
|
+
### Claude A / Claude B Pattern
|
|
266
|
+
|
|
267
|
+
- **Claude A** (the expert): helps design and refine the skill
|
|
268
|
+
- **Claude B** (the user): tests the skill in real tasks with fresh context
|
|
269
|
+
- Observe Claude B's behavior, bring insights back to Claude A
|
|
270
|
+
- Repeat: observe → refine → test
|
|
271
|
+
|
|
272
|
+
This is similar to the existing `/skill-creator` workflow but emphasises real-task testing over synthetic evaluation.
|