flonat-research 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (285) hide show
  1. package/.claude/agents/domain-reviewer.md +336 -0
  2. package/.claude/agents/fixer.md +226 -0
  3. package/.claude/agents/paper-critic.md +370 -0
  4. package/.claude/agents/peer-reviewer.md +289 -0
  5. package/.claude/agents/proposal-reviewer.md +215 -0
  6. package/.claude/agents/referee2-reviewer.md +367 -0
  7. package/.claude/agents/references/journal-referee-profiles.md +354 -0
  8. package/.claude/agents/references/paper-critic/council-personas.md +77 -0
  9. package/.claude/agents/references/paper-critic/council-prompts.md +198 -0
  10. package/.claude/agents/references/peer-reviewer/report-template.md +199 -0
  11. package/.claude/agents/references/peer-reviewer/sa-prompts.md +260 -0
  12. package/.claude/agents/references/peer-reviewer/security-scan.md +188 -0
  13. package/.claude/agents/references/proposal-reviewer/report-template.md +144 -0
  14. package/.claude/agents/references/proposal-reviewer/sa-prompts.md +149 -0
  15. package/.claude/agents/references/referee-config.md +114 -0
  16. package/.claude/agents/references/referee2-reviewer/audit-checklists.md +287 -0
  17. package/.claude/agents/references/referee2-reviewer/report-template.md +334 -0
  18. package/.claude/rules/design-before-results.md +52 -0
  19. package/.claude/rules/ignore-agents-md.md +17 -0
  20. package/.claude/rules/ignore-gemini-md.md +17 -0
  21. package/.claude/rules/lean-claude-md.md +45 -0
  22. package/.claude/rules/learn-tags.md +99 -0
  23. package/.claude/rules/overleaf-separation.md +67 -0
  24. package/.claude/rules/plan-first.md +175 -0
  25. package/.claude/rules/read-docs-first.md +50 -0
  26. package/.claude/rules/scope-discipline.md +28 -0
  27. package/.claude/settings.json +125 -0
  28. package/.context/current-focus.md +33 -0
  29. package/.context/preferences/priorities.md +36 -0
  30. package/.context/preferences/task-naming.md +28 -0
  31. package/.context/profile.md +29 -0
  32. package/.context/projects/_index.md +41 -0
  33. package/.context/projects/papers/nudge-exp.md +22 -0
  34. package/.context/projects/papers/uncertainty.md +31 -0
  35. package/.context/resources/claude-scientific-writer-review.md +48 -0
  36. package/.context/resources/cunningham-multi-analyst-agents.md +104 -0
  37. package/.context/resources/cunningham-multilang-code-audit.md +62 -0
  38. package/.context/resources/google-ai-co-scientist-review.md +72 -0
  39. package/.context/resources/karpathy-llm-council-review.md +58 -0
  40. package/.context/resources/multi-coder-reliability-protocol.md +175 -0
  41. package/.context/resources/pedro-santanna-takeaways.md +96 -0
  42. package/.context/resources/venue-rankings/abs_ajg_2024.csv +1823 -0
  43. package/.context/resources/venue-rankings/abs_ajg_2024_econ.csv +356 -0
  44. package/.context/resources/venue-rankings/cabs_4_4star_theory.csv +40 -0
  45. package/.context/resources/venue-rankings/core_2026.csv +801 -0
  46. package/.context/resources/venue-rankings.md +147 -0
  47. package/.context/workflows/README.md +69 -0
  48. package/.context/workflows/daily-review.md +91 -0
  49. package/.context/workflows/meeting-actions.md +108 -0
  50. package/.context/workflows/replication-protocol.md +155 -0
  51. package/.context/workflows/weekly-review.md +113 -0
  52. package/.mcp-server-biblio/formatters.py +158 -0
  53. package/.mcp-server-biblio/pyproject.toml +11 -0
  54. package/.mcp-server-biblio/server.py +678 -0
  55. package/.mcp-server-biblio/sources/__init__.py +14 -0
  56. package/.mcp-server-biblio/sources/base.py +73 -0
  57. package/.mcp-server-biblio/sources/formatters.py +83 -0
  58. package/.mcp-server-biblio/sources/models.py +22 -0
  59. package/.mcp-server-biblio/sources/multi_source.py +243 -0
  60. package/.mcp-server-biblio/sources/openalex_source.py +183 -0
  61. package/.mcp-server-biblio/sources/scopus_source.py +309 -0
  62. package/.mcp-server-biblio/sources/wos_source.py +508 -0
  63. package/.mcp-server-biblio/uv.lock +896 -0
  64. package/.scripts/README.md +161 -0
  65. package/.scripts/ai_pattern_density.py +446 -0
  66. package/.scripts/conf +445 -0
  67. package/.scripts/config.py +122 -0
  68. package/.scripts/count_inventory.py +275 -0
  69. package/.scripts/daily_digest.py +288 -0
  70. package/.scripts/done +177 -0
  71. package/.scripts/extract_meeting_actions.py +223 -0
  72. package/.scripts/focus +176 -0
  73. package/.scripts/generate-codex-agents-md.py +217 -0
  74. package/.scripts/inbox +194 -0
  75. package/.scripts/notion_helpers.py +325 -0
  76. package/.scripts/openalex/query_helpers.py +306 -0
  77. package/.scripts/papers +227 -0
  78. package/.scripts/query +223 -0
  79. package/.scripts/session-history.py +201 -0
  80. package/.scripts/skill-health.py +516 -0
  81. package/.scripts/skill-log-miner.py +273 -0
  82. package/.scripts/sync-to-codex.sh +252 -0
  83. package/.scripts/task +213 -0
  84. package/.scripts/tasks +190 -0
  85. package/.scripts/week +206 -0
  86. package/CLAUDE.md +197 -0
  87. package/LICENSE +21 -0
  88. package/MEMORY.md +38 -0
  89. package/README.md +269 -0
  90. package/docs/agents.md +44 -0
  91. package/docs/bibliography-setup.md +55 -0
  92. package/docs/council-mode.md +36 -0
  93. package/docs/getting-started.md +245 -0
  94. package/docs/hooks.md +38 -0
  95. package/docs/mcp-servers.md +82 -0
  96. package/docs/notion-setup.md +109 -0
  97. package/docs/rules.md +33 -0
  98. package/docs/scripts.md +303 -0
  99. package/docs/setup-overview/setup-overview.pdf +0 -0
  100. package/docs/skills.md +70 -0
  101. package/docs/system.md +159 -0
  102. package/hooks/block-destructive-git.sh +66 -0
  103. package/hooks/context-monitor.py +114 -0
  104. package/hooks/postcompact-restore.py +157 -0
  105. package/hooks/precompact-autosave.py +181 -0
  106. package/hooks/promise-checker.sh +124 -0
  107. package/hooks/protect-source-files.sh +81 -0
  108. package/hooks/resume-context-loader.sh +53 -0
  109. package/hooks/startup-context-loader.sh +102 -0
  110. package/package.json +51 -0
  111. package/packages/cli-council/.github/workflows/claude-code-review.yml +44 -0
  112. package/packages/cli-council/.github/workflows/claude.yml +50 -0
  113. package/packages/cli-council/README.md +100 -0
  114. package/packages/cli-council/pyproject.toml +43 -0
  115. package/packages/cli-council/src/cli_council/__init__.py +19 -0
  116. package/packages/cli-council/src/cli_council/__main__.py +185 -0
  117. package/packages/cli-council/src/cli_council/backends/__init__.py +8 -0
  118. package/packages/cli-council/src/cli_council/backends/base.py +81 -0
  119. package/packages/cli-council/src/cli_council/backends/claude.py +25 -0
  120. package/packages/cli-council/src/cli_council/backends/codex.py +27 -0
  121. package/packages/cli-council/src/cli_council/backends/gemini.py +26 -0
  122. package/packages/cli-council/src/cli_council/checkpoint.py +212 -0
  123. package/packages/cli-council/src/cli_council/config.py +51 -0
  124. package/packages/cli-council/src/cli_council/council.py +391 -0
  125. package/packages/cli-council/src/cli_council/models.py +46 -0
  126. package/packages/llm-council/.github/workflows/claude-code-review.yml +44 -0
  127. package/packages/llm-council/.github/workflows/claude.yml +50 -0
  128. package/packages/llm-council/README.md +453 -0
  129. package/packages/llm-council/pyproject.toml +42 -0
  130. package/packages/llm-council/src/llm_council/__init__.py +23 -0
  131. package/packages/llm-council/src/llm_council/__main__.py +259 -0
  132. package/packages/llm-council/src/llm_council/checkpoint.py +193 -0
  133. package/packages/llm-council/src/llm_council/client.py +253 -0
  134. package/packages/llm-council/src/llm_council/config.py +232 -0
  135. package/packages/llm-council/src/llm_council/council.py +482 -0
  136. package/packages/llm-council/src/llm_council/models.py +46 -0
  137. package/packages/mcp-bibliography/MEMORY.md +31 -0
  138. package/packages/mcp-bibliography/_app.py +226 -0
  139. package/packages/mcp-bibliography/formatters.py +158 -0
  140. package/packages/mcp-bibliography/log/2026-03-13-2100.md +35 -0
  141. package/packages/mcp-bibliography/pyproject.toml +15 -0
  142. package/packages/mcp-bibliography/run.sh +20 -0
  143. package/packages/mcp-bibliography/scholarly_formatters.py +83 -0
  144. package/packages/mcp-bibliography/server.py +1857 -0
  145. package/packages/mcp-bibliography/tools/__init__.py +28 -0
  146. package/packages/mcp-bibliography/tools/_registry.py +19 -0
  147. package/packages/mcp-bibliography/tools/altmetric.py +107 -0
  148. package/packages/mcp-bibliography/tools/core.py +92 -0
  149. package/packages/mcp-bibliography/tools/dblp.py +52 -0
  150. package/packages/mcp-bibliography/tools/openalex.py +296 -0
  151. package/packages/mcp-bibliography/tools/opencitations.py +102 -0
  152. package/packages/mcp-bibliography/tools/openreview.py +179 -0
  153. package/packages/mcp-bibliography/tools/orcid.py +131 -0
  154. package/packages/mcp-bibliography/tools/scholarly.py +575 -0
  155. package/packages/mcp-bibliography/tools/unpaywall.py +63 -0
  156. package/packages/mcp-bibliography/tools/zenodo.py +123 -0
  157. package/packages/mcp-bibliography/uv.lock +711 -0
  158. package/scripts/setup.sh +143 -0
  159. package/skills/beamer-deck/SKILL.md +199 -0
  160. package/skills/beamer-deck/references/quality-rubric.md +54 -0
  161. package/skills/beamer-deck/references/review-prompts.md +106 -0
  162. package/skills/bib-validate/SKILL.md +261 -0
  163. package/skills/bib-validate/references/council-mode.md +34 -0
  164. package/skills/bib-validate/references/deep-verify.md +79 -0
  165. package/skills/bib-validate/references/fix-mode.md +36 -0
  166. package/skills/bib-validate/references/openalex-verification.md +45 -0
  167. package/skills/bib-validate/references/preprint-check.md +31 -0
  168. package/skills/bib-validate/references/ref-manager-crossref.md +41 -0
  169. package/skills/bib-validate/references/report-template.md +82 -0
  170. package/skills/code-archaeology/SKILL.md +141 -0
  171. package/skills/code-review/SKILL.md +265 -0
  172. package/skills/code-review/references/quality-rubric.md +67 -0
  173. package/skills/consolidate-memory/SKILL.md +208 -0
  174. package/skills/context-status/SKILL.md +126 -0
  175. package/skills/creation-guard/SKILL.md +230 -0
  176. package/skills/devils-advocate/SKILL.md +130 -0
  177. package/skills/devils-advocate/references/competing-hypotheses.md +83 -0
  178. package/skills/init-project/SKILL.md +115 -0
  179. package/skills/init-project-course/references/memory-and-settings.md +92 -0
  180. package/skills/init-project-course/references/organise-templates.md +94 -0
  181. package/skills/init-project-course/skill.md +147 -0
  182. package/skills/init-project-light/skill.md +139 -0
  183. package/skills/init-project-research/SKILL.md +368 -0
  184. package/skills/init-project-research/references/atlas-pipeline-sync.md +70 -0
  185. package/skills/init-project-research/references/atlas-schema.md +81 -0
  186. package/skills/init-project-research/references/confirmation-report.md +39 -0
  187. package/skills/init-project-research/references/domain-profile-template.md +104 -0
  188. package/skills/init-project-research/references/interview-round3.md +34 -0
  189. package/skills/init-project-research/references/literature-discovery.md +43 -0
  190. package/skills/init-project-research/references/scaffold-details.md +197 -0
  191. package/skills/init-project-research/templates/field-calibration.md +60 -0
  192. package/skills/init-project-research/templates/pipeline-manifest.md +63 -0
  193. package/skills/init-project-research/templates/run-all.sh +116 -0
  194. package/skills/init-project-research/templates/seed-files.md +337 -0
  195. package/skills/insights-deck/SKILL.md +151 -0
  196. package/skills/interview-me/SKILL.md +157 -0
  197. package/skills/latex/SKILL.md +141 -0
  198. package/skills/latex/references/latex-configs.md +183 -0
  199. package/skills/latex-autofix/SKILL.md +230 -0
  200. package/skills/latex-autofix/references/known-errors.md +183 -0
  201. package/skills/latex-autofix/references/quality-rubric.md +50 -0
  202. package/skills/latex-health-check/SKILL.md +161 -0
  203. package/skills/learn/SKILL.md +220 -0
  204. package/skills/learn/scripts/validate_skill.py +265 -0
  205. package/skills/lessons-learned/SKILL.md +201 -0
  206. package/skills/literature/SKILL.md +335 -0
  207. package/skills/literature/references/agent-templates.md +393 -0
  208. package/skills/literature/references/bibliometric-apis.md +44 -0
  209. package/skills/literature/references/cli-council-search.md +79 -0
  210. package/skills/literature/references/openalex-api-guide.md +371 -0
  211. package/skills/literature/references/openalex-common-queries.md +381 -0
  212. package/skills/literature/references/openalex-workflows.md +248 -0
  213. package/skills/literature/references/reference-manager-sync.md +36 -0
  214. package/skills/literature/references/scopus-api-guide.md +208 -0
  215. package/skills/literature/references/wos-api-guide.md +308 -0
  216. package/skills/multi-perspective/SKILL.md +311 -0
  217. package/skills/multi-perspective/references/computational-many-analysts.md +77 -0
  218. package/skills/pipeline-manifest/SKILL.md +226 -0
  219. package/skills/pre-submission-report/SKILL.md +153 -0
  220. package/skills/process-reviews/SKILL.md +244 -0
  221. package/skills/process-reviews/references/rr-routing.md +101 -0
  222. package/skills/project-deck/SKILL.md +87 -0
  223. package/skills/project-safety/SKILL.md +135 -0
  224. package/skills/proofread/SKILL.md +254 -0
  225. package/skills/proofread/references/quality-rubric.md +104 -0
  226. package/skills/python-env/SKILL.md +57 -0
  227. package/skills/quarto-deck/SKILL.md +226 -0
  228. package/skills/quarto-deck/references/markdown-format.md +143 -0
  229. package/skills/quarto-deck/references/quality-rubric.md +54 -0
  230. package/skills/save-context/SKILL.md +174 -0
  231. package/skills/session-log/SKILL.md +98 -0
  232. package/skills/shared/concept-validation-gate.md +161 -0
  233. package/skills/shared/council-protocol.md +265 -0
  234. package/skills/shared/distribution-diagnostics.md +164 -0
  235. package/skills/shared/engagement-stratified-sampling.md +218 -0
  236. package/skills/shared/escalation-protocol.md +74 -0
  237. package/skills/shared/external-audit-protocol.md +205 -0
  238. package/skills/shared/intercoder-reliability.md +256 -0
  239. package/skills/shared/mcp-degradation.md +81 -0
  240. package/skills/shared/method-probing-questions.md +163 -0
  241. package/skills/shared/multi-language-conventions.md +143 -0
  242. package/skills/shared/paid-api-safety.md +174 -0
  243. package/skills/shared/palettes.md +90 -0
  244. package/skills/shared/progressive-disclosure.md +92 -0
  245. package/skills/shared/project-documentation-content.md +443 -0
  246. package/skills/shared/project-documentation-format.md +281 -0
  247. package/skills/shared/project-documentation.md +100 -0
  248. package/skills/shared/publication-output.md +138 -0
  249. package/skills/shared/quality-scoring.md +70 -0
  250. package/skills/shared/reference-resolution.md +77 -0
  251. package/skills/shared/research-quality-rubric.md +165 -0
  252. package/skills/shared/rhetoric-principles.md +54 -0
  253. package/skills/shared/skill-design-patterns.md +272 -0
  254. package/skills/shared/skill-index.md +240 -0
  255. package/skills/shared/system-documentation.md +334 -0
  256. package/skills/shared/tikz-rules.md +402 -0
  257. package/skills/shared/validation-tiers.md +121 -0
  258. package/skills/shared/venue-guides/README.md +46 -0
  259. package/skills/shared/venue-guides/cell_press_style.md +483 -0
  260. package/skills/shared/venue-guides/conferences_formatting.md +564 -0
  261. package/skills/shared/venue-guides/cs_conference_style.md +463 -0
  262. package/skills/shared/venue-guides/examples/cell_summary_example.md +247 -0
  263. package/skills/shared/venue-guides/examples/medical_structured_abstract.md +313 -0
  264. package/skills/shared/venue-guides/examples/nature_abstract_examples.md +213 -0
  265. package/skills/shared/venue-guides/examples/neurips_introduction_example.md +245 -0
  266. package/skills/shared/venue-guides/journals_formatting.md +486 -0
  267. package/skills/shared/venue-guides/medical_journal_styles.md +535 -0
  268. package/skills/shared/venue-guides/ml_conference_style.md +556 -0
  269. package/skills/shared/venue-guides/nature_science_style.md +405 -0
  270. package/skills/shared/venue-guides/reviewer_expectations.md +417 -0
  271. package/skills/shared/venue-guides/venue_writing_styles.md +321 -0
  272. package/skills/split-pdf/SKILL.md +172 -0
  273. package/skills/split-pdf/methodology.md +48 -0
  274. package/skills/sync-notion/SKILL.md +93 -0
  275. package/skills/system-audit/SKILL.md +157 -0
  276. package/skills/system-audit/references/sub-agent-prompts.md +294 -0
  277. package/skills/task-management/SKILL.md +131 -0
  278. package/skills/update-focus/SKILL.md +204 -0
  279. package/skills/update-project-doc/SKILL.md +194 -0
  280. package/skills/validate-bib/SKILL.md +242 -0
  281. package/skills/validate-bib/references/council-mode.md +34 -0
  282. package/skills/validate-bib/references/deep-verify.md +71 -0
  283. package/skills/validate-bib/references/openalex-verification.md +45 -0
  284. package/skills/validate-bib/references/preprint-check.md +31 -0
  285. package/skills/validate-bib/references/report-template.md +62 -0
@@ -0,0 +1,165 @@
1
+ # Research Quality Rubric
2
+
3
+ > Dimension-weighted scoring rubric for evaluating research papers and proposals.
4
+ > Adapted from ScholarEval (Moussa et al., 2025, arXiv:2510.16234) with language
5
+ > adjusted for social science, management, economics, and political economy.
6
+ >
7
+ > **When to load:** Any agent performing a structured research quality assessment.
8
+ > **Complements:** `quality-scoring.md` (deduction-based) handles artifact polish;
9
+ > this rubric handles substantive research quality across 8 dimensions.
10
+
11
+ ## Dimensions & Scales
12
+
13
+ Each dimension is scored 1-5. Only 3/5/5 anchors are provided to keep the file lean.
14
+ Score 2 = between Adequate and Weak; Score 1 = fundamental deficiencies.
15
+
16
+ ### 1. Problem Formulation & Research Questions (15%)
17
+
18
+ | Score | Indicators |
19
+ |-------|-----------|
20
+ | **5 Excellent** | Novel contribution clearly situated in a gap; research questions are precise, falsifiable, and theoretically grounded; scope is well-bounded |
21
+ | **4 Good** | Clear contribution with minor gap-articulation issues; questions are specific but could be sharper; scope is reasonable |
22
+ | **3 Adequate** | Contribution exists but is vaguely stated; questions are broad or partially redundant; scope creep risk |
23
+
24
+ - [ ] Gap in literature explicitly identified
25
+ - [ ] Research questions are stated, specific, and answerable
26
+ - [ ] Scope is bounded (what is in/out)
27
+ - [ ] Contribution is distinguished from related work
28
+ - [ ] Practical or theoretical relevance is articulated
29
+
30
+ ### 2. Literature Review (15%)
31
+
32
+ | Score | Indicators |
33
+ |-------|-----------|
34
+ | **5 Excellent** | Comprehensive coverage of key streams; synthesises rather than lists; positions paper in ongoing debates; includes seminal and recent work |
35
+ | **4 Good** | Covers main streams with minor omissions; mostly synthetic; positioning is clear |
36
+ | **3 Adequate** | Covers basics but misses a stream or relies on dated sources; more descriptive than analytical |
37
+
38
+ - [ ] Key theoretical streams identified and covered
39
+ - [ ] Seminal works cited (not just recent)
40
+ - [ ] Review is synthetic (themes/tensions) not a list
41
+ - [ ] Competing perspectives acknowledged
42
+ - [ ] Clear link from review to research questions
43
+ - [ ] No major stream conspicuously absent
44
+
45
+ ### 3. Methodology & Research Design (20%)
46
+
47
+ | Score | Indicators |
48
+ |-------|-----------|
49
+ | **5 Excellent** | Design is well-justified for the research questions; identification strategy is explicit; assumptions stated and defended; threats to validity addressed |
50
+ | **4 Good** | Design fits questions; identification mostly clear; some assumptions implicit; validity threats partially addressed |
51
+ | **3 Adequate** | Design is standard but justification is thin; identification strategy unclear; limited validity discussion |
52
+
53
+ - [ ] Method choice justified for the research question
54
+ - [ ] Identification strategy / causal logic explicit (if causal claims made)
55
+ - [ ] Key assumptions stated
56
+ - [ ] Threats to internal and external validity discussed
57
+ - [ ] Robustness checks or sensitivity analyses planned
58
+ - [ ] Replicability: sufficient procedural detail
59
+
60
+ ### 4. Data Collection & Sources (10%)
61
+
62
+ | Score | Indicators |
63
+ |-------|-----------|
64
+ | **5 Excellent** | Data sources are well-suited, clearly described, and limitations acknowledged; sampling strategy justified; measurement validity established |
65
+ | **4 Good** | Sources are appropriate with minor documentation gaps; sampling is reasonable; most measures validated |
66
+ | **3 Adequate** | Sources are acceptable but convenience-driven; limited discussion of measurement validity or representativeness |
67
+
68
+ - [ ] Data sources described (origin, timeframe, access)
69
+ - [ ] Sampling strategy explained and justified
70
+ - [ ] Key variables operationalised with clear measures
71
+ - [ ] Data limitations acknowledged
72
+ - [ ] Ethical considerations addressed (if human subjects)
73
+
74
+ ### 5. Analysis & Interpretation (15%)
75
+
76
+ | Score | Indicators |
77
+ |-------|-----------|
78
+ | **5 Excellent** | Analysis follows logically from design; results interpreted carefully with appropriate caveats; alternative explanations considered; effect sizes contextualised |
79
+ | **4 Good** | Analysis is sound; interpretation is mostly careful; some alternative explanations explored |
80
+ | **3 Adequate** | Analysis is correct but mechanical; over-reliance on p-values; limited exploration of alternatives |
81
+
82
+ - [ ] Analysis matches the stated methodology
83
+ - [ ] Results interpreted with appropriate uncertainty
84
+ - [ ] Alternative explanations considered
85
+ - [ ] Effect sizes reported and contextualised (not just significance)
86
+ - [ ] Limitations of the analysis acknowledged
87
+ - [ ] Robustness checks reported
88
+
89
+ ### 6. Results & Findings (10%)
90
+
91
+ | Score | Indicators |
92
+ |-------|-----------|
93
+ | **5 Excellent** | Findings directly address research questions; tables/figures are clear and self-contained; null or unexpected results reported honestly |
94
+ | **4 Good** | Findings address questions with minor gaps; presentation mostly clear; negative results acknowledged |
95
+ | **3 Adequate** | Findings partially address questions; tables overloaded or under-explained; selective reporting suspected |
96
+
97
+ - [ ] Each research question has a corresponding finding
98
+ - [ ] Tables and figures are well-labelled and interpretable
99
+ - [ ] Null or unexpected results reported
100
+ - [ ] Key numbers are precise (CIs, effect sizes, sample sizes)
101
+ - [ ] No cherry-picking of specifications
102
+
103
+ ### 7. Scholarly Writing & Presentation (10%)
104
+
105
+ | Score | Indicators |
106
+ |-------|-----------|
107
+ | **5 Excellent** | Argument flows logically; prose is concise and precise; structure supports the narrative; transitions between sections are smooth |
108
+ | **4 Good** | Writing is clear with occasional redundancy; structure is sound; minor flow issues |
109
+ | **3 Adequate** | Writing is understandable but verbose or disorganised in places; structure follows conventions but feels mechanical |
110
+
111
+ - [ ] Abstract accurately summarises the paper
112
+ - [ ] Logical flow from introduction to conclusion
113
+ - [ ] No unnecessary jargon or undefined acronyms
114
+ - [ ] Consistent terminology throughout
115
+ - [ ] Within page/word limits (if applicable)
116
+
117
+ ### 8. Citations & References (5%)
118
+
119
+ | Score | Indicators |
120
+ |-------|-----------|
121
+ | **5 Excellent** | All claims supported; no orphan citations or missing references; formatting consistent; bibliography is complete and accurate |
122
+ | **4 Good** | Minor formatting inconsistencies; one or two unsupported claims; bibliography mostly complete |
123
+ | **3 Adequate** | Several formatting issues; some claims lack support; bibliography has gaps or duplicates |
124
+
125
+ - [ ] All factual claims have citations
126
+ - [ ] No orphan `\cite{}` keys (all resolve)
127
+ - [ ] Reference list matches in-text citations (no extras, no missing)
128
+ - [ ] Formatting consistent with target venue style
129
+ - [ ] DOIs or stable URLs present where appropriate
130
+
131
+ ## Weighted Scoring
132
+
133
+ ```
134
+ Score = (0.15 x D1) + (0.15 x D2) + (0.20 x D3) + (0.10 x D4)
135
+ + (0.15 x D5) + (0.10 x D6) + (0.10 x D7) + (0.05 x D8)
136
+ ```
137
+
138
+ | Range | Verdict | Interpretation |
139
+ |-------|---------|---------------|
140
+ | 4.5 - 5.0 | Exceptional | Ready for top-tier venue submission |
141
+ | 4.0 - 4.4 | Strong | Minor revisions needed |
142
+ | 3.5 - 3.9 | Good | Major revisions required |
143
+ | 3.0 - 3.4 | Acceptable | Significant rework before submission |
144
+ | < 3.0 | Weak | Fundamental issues to address |
145
+
146
+ ## Contextual Adjustments
147
+
148
+ **By stage:** Proposals — relax D4, D5, D6 (no results yet); weight D1, D2, D3 more heavily. Working drafts — expect D7, D8 to be lower; focus on D1-D6. Final manuscripts — full rubric, no relaxation.
149
+
150
+ **By venue tier:** Workshop/conference papers — accept 3.5+ overall. CABS 3 journals — target 4.0+. CABS 4/4* or FT 50 — target 4.5+; all dimensions should be 4+.
151
+
152
+ **By paper type:** Theoretical papers — D4 may be N/A (re-weight to D1-D3). Empirical papers — full rubric applies. Review/survey papers — D4-D6 may be N/A; weight D2 at 30%.
153
+
154
+ When a dimension is N/A, redistribute its weight proportionally across the remaining dimensions.
155
+
156
+ ## Agent Integration
157
+
158
+ | Agent | Loads | Dimensions used |
159
+ |-------|-------|----------------|
160
+ | `referee2-reviewer` | Always | All 8 (full scorecard) |
161
+ | `domain-reviewer` | When scoring requested | D1, D3, D5 (overlap with its domain lenses) |
162
+ | `paper-critic` | When scoring requested | D7, D8 (overlap with writing/citation categories) |
163
+ | `/pre-submission-report` | Always | Aggregate weighted score in summary |
164
+
165
+ Agents should report per-dimension scores in a table, then the weighted aggregate, then the verdict. Use the checklist items as evidence anchors for each score.
@@ -0,0 +1,54 @@
1
+ # Rhetoric Principles for Presentations
2
+
3
+ > Shared reference for `/beamer-deck`, `/quarto-deck`, and `/quarto-course`. Drawn from Scott Cunningham's "The Rhetoric of Decks" essay.
4
+ > Full source: `resources/academics/scott-cunningham/MixtapeTools/presentations/rhetoric_of_decks.md`
5
+
6
+ ## The Three Laws
7
+
8
+ | Law | Principle |
9
+ |-----|-----------|
10
+ | **Beauty Is Function** | Every element earns its presence. Decoration without function is noise. A slide with three words on a blank background can be the most beautiful. |
11
+ | **Cognitive Load Is the Enemy** | Audience has limited working memory. Too many points = zero points retained. One idea per slide — not a guideline, a law. |
12
+ | **The Slide Serves the Spoken Word** | Slides provide focal points, memory hooks, evidence, structural markers. If slides work without speaking, you've written a document, not a presentation. |
13
+
14
+ ## MB/MC Equivalence
15
+
16
+ Optimal rhetoric equalises the marginal-benefit-to-marginal-cost ratio across all slides. For each slide ask: "If I added one more element, would benefit justify cognitive cost? If I removed one, would I lose more than I'd gain in clarity?" When every slide answers "I'm at the margin," you're optimised.
17
+
18
+ - **Overloaded slides** (MB/MC too low): text running to footer, competing ideas, charts with too many series.
19
+ - **Underloaded slides** (MB/MC too high): single word that could support a sentence, wasted attention.
20
+ - **Rhythm**: vary dense and light slides deliberately.
21
+
22
+ ## Aristotelian Triad
23
+
24
+ | Mode | Role in decks | Danger if absent |
25
+ |------|---------------|------------------|
26
+ | **Ethos** (credibility) | Process diagrams, acknowledged limitations, Devil's Advocate slides | Audience won't trust you |
27
+ | **Pathos** (emotion) | Open with a recognised problem, connect data to human impact | Audience won't care |
28
+ | **Logos** (logic) | Data viz revealing patterns, comparison tables, problem→solution flow | Audience won't follow |
29
+
30
+ ## Narrative Arc (Three Acts)
31
+
32
+ 1. **Act I — Problem (Tension):** Establish status quo, introduce disruption, make audience feel the problem.
33
+ 2. **Act II — Investigation (Development):** Show what you tried, present what you learned, build the logical case.
34
+ 3. **Act III — Resolution (Release):** Deliver insight, show implications, call to action.
35
+
36
+ ## Pyramid Principle
37
+
38
+ Lead with the conclusion, then support it. Humans want the punchline first, then the reasoning.
39
+
40
+ - **Opening slide**: Provocative question, surprising statistic, bold claim. Never "Today I'm going to talk about..."
41
+ - **Closing slide**: Single memorable takeaway or return to opening, resolved. Never "Questions?" or "Thank you."
42
+
43
+ ## The Devil's Advocate
44
+
45
+ Before presenting your argument, present its strongest critique. Show anticipated objections, weaknesses you acknowledge, mitigations you've developed. Format: "A skeptic would say..." → your response.
46
+
47
+ ## Context-Specific Guidance
48
+
49
+ | Audience | Ethos/Pathos/Logos | Key adjustments |
50
+ |----------|-------------------|-----------------|
51
+ | **Academic seminar** | 20% / 35% / 45% | Lead with the puzzle, not the literature. Identification strategy early. One coefficient at a time. Acknowledge limitations before Q&A. |
52
+ | **Conference talk** | 25% / 35% / 40% | Emphasise identification strategy and key results. Tight narrative — no tangents. |
53
+ | **Teaching lecture** | 20% / 30% / 50% | Brevity < clarity. Repetition is allowed. Show reasoning, not just conclusions. Pause points for questions. |
54
+ | **Working deck** | 30% / 10% / 60% | Document choices (why A over B?). Preserve uncertainty. Show the math. Prioritise rigor over polish. |
@@ -0,0 +1,272 @@
1
+ # Skill Design Patterns
2
+
3
+ > Reference for designing new skills. Read this before writing any SKILL.md.
4
+ > Loaded on demand by `/learn` and when manually creating skills.
5
+
6
+ ## Structural Patterns
7
+
8
+ Choose based on what the skill does. Most skills fit one pattern; some combine two.
9
+
10
+ ### Workflow-Based
11
+
12
+ Multi-step processes where order matters and user approval is needed between phases.
13
+
14
+ ```
15
+ Phase 1: Gather → Phase 2: Process → REVIEW GATE → Phase 3: Output
16
+ ```
17
+
18
+ **When to use:** Report generation, project bootstrapping, multi-pass analysis, content creation with review cycles.
19
+
20
+ **Key elements:**
21
+ - Numbered phases with explicit entry/exit criteria
22
+ - Review gates between phases (hard stops where the agent presents work and waits)
23
+ - Default assumptions table to reduce friction
24
+
25
+
26
+ ### Task-Based
27
+
28
+ Focused input/output with clear processing rules. No phases — just do the thing well.
29
+
30
+ ```
31
+ Input spec → Processing rules → Output spec → Quality checks
32
+ ```
33
+
34
+ **When to use:** Code review, proofreading, validation, data transformation, file conversion.
35
+
36
+ **Key elements:**
37
+ - Explicit input format and constraints
38
+ - Processing rules as imperative instructions
39
+ - Output format specification
40
+ - Verification criteria
41
+
42
+
43
+ ### Agent-Delegation
44
+
45
+ Multiple subagents, each responsible for a distinct concern. Coordinator manages handoffs.
46
+
47
+ ```
48
+ Coordinator → Subagent A (generate) → Subagent B (critique) → Coordinator (synthesize)
49
+ ```
50
+
51
+ **When to use:** Tasks where quality benefits from separation of concerns — writing with editing, analysis with critique, multi-perspective evaluation.
52
+
53
+ **Key elements:**
54
+ - `agents/` directory or Task tool delegation with focused system prompts
55
+ - Each subagent does one thing well
56
+ - Coordinator synthesizes outputs
57
+
58
+
59
+ ### Reference-Based
60
+
61
+ Augments the agent with domain knowledge it lacks, loaded on demand.
62
+
63
+ ```
64
+ SKILL.md (routing + procedures) → references/ (domain knowledge)
65
+ ```
66
+
67
+ **When to use:** Specialized domains where training data is insufficient — niche APIs, internal conventions, proprietary formats, scoring rubrics.
68
+
69
+ **Key elements:**
70
+ - SKILL.md stays concise — routing logic and procedures only
71
+ - `references/` contains detailed specs, lookup tables, scoring rubrics
72
+ - Agent reads references on demand, not all at once
73
+
74
+
75
+ ---
76
+
77
+ ## Design Elements
78
+
79
+ ### Review Gates
80
+
81
+ Prevent runaway execution by requiring user approval at checkpoints.
82
+
83
+ ```markdown
84
+ ### REVIEW GATE: [Gate Name]
85
+
86
+ Present the following to the user and STOP:
87
+ - [What to show]
88
+ - [What to show]
89
+
90
+ Do NOT proceed until the user explicitly approves.
91
+ ```
92
+
93
+ Place gates after planning/before execution, and after first draft/before refinement.
94
+
95
+ ### Anti-Pattern Lists
96
+
97
+ Tell the agent what NOT to do. Agents default to safe, generic patterns unless explicitly constrained. Negative constraints are as important as positive instructions.
98
+
99
+ ```markdown
100
+ ## Never Do These
101
+
102
+ - Never use "leverage" — use "use" instead
103
+ - Never produce bullet points when a table would be clearer
104
+ - Never assume verbose output — default to concise
105
+ ```
106
+
107
+ ### Default Assumptions Table
108
+
109
+ Reduce friction by pre-answering common questions. Only ask the user what can't be reasonably defaulted.
110
+
111
+ ```markdown
112
+ ## Defaults
113
+
114
+ | Setting | Default | Override |
115
+ |---------|---------|---------|
116
+ | Output format | Markdown | User specifies otherwise |
117
+ | Verbosity | Concise | User says "detailed" |
118
+ | Location | Project root | User provides path |
119
+ ```
120
+
121
+ ### Drift Prevention
122
+
123
+ Skills that reference external state (database schemas, file counts, theme lists, status options) go stale silently. Use this 3-part pattern when a skill has hardcoded values derived from mutable sources.
124
+
125
+ **Part 1: Runtime detection (Phase 0)**
126
+
127
+ Add a Phase 0 at the start of the skill that checks hardcoded values against reality before proceeding:
128
+
129
+ ```markdown
130
+ ## Phase 0: Drift Detection
131
+
132
+ Before starting, verify skill references:
133
+ 1. Count actual {items} — compare against ~N in this skill
134
+ 2. List actual {categories} — compare against the N listed here
135
+ 3. Fetch {schema/config} — verify properties still match
136
+
137
+ If drift detected:
138
+ - Warn in output header: "⚠ Drift: {what} was {expected}, actually {found}"
139
+ - Use actual values for this run (self-heal)
140
+ - Log to references/drift-checks.md
141
+ ```
142
+
143
+ **Part 2: Drift reference file**
144
+
145
+ Create `references/drift-checks.md` listing every drift-prone value:
146
+
147
+ ```markdown
148
+ | Value | Location | Source of Truth | Trigger |
149
+ |-------|----------|-----------------|---------|
150
+ | Topic count (~67) | SKILL.md Phase 1 | ls topics/**/*.md | /scaffold-atlas-topics |
151
+ | Theme list (9) | SKILL.md, sa-prompts.md, build_report.py | ls Projects/ | New theme dir |
152
+ ```
153
+
154
+ Include a Drift Log table where Phase 0 appends detected drift with timestamps.
155
+
156
+ **Part 3: Trigger notes in related skills**
157
+
158
+ Add cross-references in skills that commonly cause drift:
159
+
160
+ ```markdown
161
+ | `/atlas-review` | **Drift trigger:** new topics change count — see drift-checks.md |
162
+ ```
163
+
164
+ This ensures the person (or agent) running the triggering skill is aware that downstream skills may need updating.
165
+
166
+ **When to use:** Any skill with hardcoded counts, database IDs, schema property names, status option lists, file paths derived from external systems, or theme/category lists that grow over time.
167
+
168
+ **When to skip:** Skills that derive all values at runtime (no hardcoded references to external state).
169
+
170
+ **Example:** `/atlas-review` — topic count, theme list, vault schemas, stage mappings, rules count all drift. Phase 0 detects and self-heals; `references/drift-checks.md` tracks 8 drift-prone values.
171
+
172
+ ### Progressive Disclosure
173
+
174
+ Control what goes where based on how often the agent needs it:
175
+
176
+ | Location | Loaded | Use for |
177
+ |----------|--------|---------|
178
+ | `name` + `description` | Always in context | Trigger matching — when to activate |
179
+ | SKILL.md body | When skill triggers | Core procedures, workflow, constraints |
180
+ | `references/` | On demand via Read | Detailed specs, large examples, lookup tables |
181
+ | `scripts/` | On demand via Bash | Deterministic operations (validation, formatting) |
182
+
183
+ **Rule of thumb:** If the agent needs it every time → SKILL.md body. If it needs it sometimes → `references/`. If it should execute it → `scripts/`.
184
+
185
+ ---
186
+
187
+ ## Writing Guidelines
188
+
189
+ ### For the Description (Frontmatter)
190
+
191
+ The `description` determines when the skill activates. It's always in context. **Write in third person** — the description is injected into the system prompt, and inconsistent point-of-view causes discovery problems.
192
+
193
+ **Good:**
194
+ - `"Analyze datasets using statistical methods. Handles EDA, hypothesis testing, and causal inference. Use when asked to analyze CSV/Excel data or run A/B test analysis."`
195
+ - `"Academic proofreading for LaTeX papers. Grammar, notation consistency, citation format, tone. Report-only — never edits source files."`
196
+
197
+ **Bad:**
198
+ - `"A helpful skill"` — too vague, triggers on everything
199
+ - `"I can help you process files"` — wrong point-of-view (first person)
200
+ - `"Skill for doing things with files"` — will trigger on every file operation
201
+
202
+ **Tips:**
203
+ - Lead with the primary capability
204
+ - Include concrete task types as trigger phrases
205
+ - End with "Use when..." to define activation conditions
206
+ - State what it does NOT do if there's a common confusion
207
+ - Max 1024 characters; name max 64 characters (lowercase, hyphens, numbers only)
208
+
209
+ ### For the Body (System Prompt)
210
+
211
+ - **Write for an AI agent, not a human.** Procedural knowledge the agent cannot infer.
212
+ - **Imperative form.** "Parse the input" not "You should parse the input."
213
+ - **Be specific about what NOT to do.** Anti-pattern lists are highly effective.
214
+ - **Include concrete examples.** Input/output pairs and good/bad snippets beat abstract rules.
215
+ - **Keep SKILL.md under 500 lines** (Anthropic official threshold). Move detailed specs and large examples to `references/`. Aim for under 300 lines when possible.
216
+ - **Every instruction must be actionable.** If the agent cannot act on a sentence, delete it.
217
+ - **Use tables for structured data.** Defaults, field specs, command references — tables are faster to parse than prose.
218
+ - **One section, one concern.** Don't mix workflow steps with quality criteria.
219
+
220
+ ### For allowed_tools
221
+
222
+ Follow principle of least privilege:
223
+
224
+ | Skill type | Tools |
225
+ |-----------|-------|
226
+ | Report-only | `Read`, `Glob`, `Grep` |
227
+ | File-creating | + `Write`, `Edit` |
228
+ | Shell-needing | + specific `Bash(command*)` patterns |
229
+ | Interactive | + `AskUserQuestion` |
230
+ | Delegating | + `Task` |
231
+
232
+ ---
233
+
234
+ ## Quality Checklist
235
+
236
+ Before finalising any skill, verify:
237
+
238
+ | Check | Question |
239
+ |-------|----------|
240
+ | Trigger clarity | Would Claude know when to invoke this from the description alone? |
241
+ | Pattern fit | Does the structure match one of the 4 patterns above? |
242
+ | No duplication | Does this overlap with an existing skill? |
243
+ | Anti-patterns | Does it say what NOT to do, not just what to do? |
244
+ | Examples present | Are there concrete before/after or good/bad examples? |
245
+ | Minimum tools | Only the tools actually needed are in `allowed_tools`? |
246
+ | No secrets | No API keys, tokens, or passwords? |
247
+ | References extracted | Is detailed reference material in `references/`, not inline? |
248
+ | Description specificity | Would the description distinguish this from similar skills? |
249
+ | Tested | Has the procedure been tested, not just theorised? |
250
+ | Refs one-level deep | All reference files link directly from SKILL.md (no nested refs)? |
251
+ | Third-person description | Description uses "Processes..." not "I can..." or "You can..."? |
252
+
253
+ ---
254
+
255
+ ## Evaluation-Driven Development
256
+
257
+ Per [Anthropic's official best practices](https://platform.claude.com/docs/en/agents-and-tools/agent-skills/best-practices):
258
+
259
+ 1. **Identify gaps:** Run Claude on representative tasks without the skill. Document specific failures.
260
+ 2. **Create evaluations:** Build 3+ scenarios that test these gaps.
261
+ 3. **Establish baseline:** Measure Claude's performance without the skill.
262
+ 4. **Write minimal instructions:** Just enough to address the gaps and pass evaluations.
263
+ 5. **Iterate:** Execute evaluations, compare against baseline, refine.
264
+
265
+ ### Claude A / Claude B Pattern
266
+
267
+ - **Claude A** (the expert): helps design and refine the skill
268
+ - **Claude B** (the user): tests the skill in real tasks with fresh context
269
+ - Observe Claude B's behavior, bring insights back to Claude A
270
+ - Repeat: observe → refine → test
271
+
272
+ This is similar to the existing `/skill-creator` workflow but emphasises real-task testing over synthetic evaluation.