superlab 0.1.28 → 0.1.30

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (29)
  1. package/lib/auto_contracts.cjs +1 -3
  2. package/lib/auto_runner.cjs +0 -1
  3. package/lib/context.cjs +22 -30
  4. package/lib/i18n.cjs +152 -44
  5. package/lib/lab_idea_contract.json +8 -0
  6. package/package-assets/claude/commands/lab-idea.md +1 -1
  7. package/package-assets/claude/commands/lab.md +2 -4
  8. package/package-assets/codex/prompts/lab-idea.md +1 -1
  9. package/package-assets/codex/prompts/lab.md +2 -4
  10. package/package-assets/shared/lab/.managed/scripts/validate_idea_artifact.py +240 -1
  11. package/package-assets/shared/lab/.managed/templates/idea-source-log.md +37 -0
  12. package/package-assets/shared/lab/.managed/templates/idea.md +67 -1
  13. package/package-assets/shared/lab/context/auto-mode.md +2 -2
  14. package/package-assets/shared/lab/context/session-brief.md +3 -14
  15. package/package-assets/shared/lab/context/state.md +2 -0
  16. package/package-assets/shared/lab/system/core.md +4 -3
  17. package/package-assets/shared/skills/lab/SKILL.md +19 -11
  18. package/package-assets/shared/skills/lab/references/workflow.md +14 -0
  19. package/package-assets/shared/skills/lab/stages/auto.md +6 -3
  20. package/package-assets/shared/skills/lab/stages/data.md +1 -1
  21. package/package-assets/shared/skills/lab/stages/framing.md +1 -1
  22. package/package-assets/shared/skills/lab/stages/idea.md +55 -14
  23. package/package-assets/shared/skills/lab/stages/iterate.md +3 -1
  24. package/package-assets/shared/skills/lab/stages/report.md +2 -1
  25. package/package-assets/shared/skills/lab/stages/review.md +3 -1
  26. package/package-assets/shared/skills/lab/stages/run.md +4 -1
  27. package/package-assets/shared/skills/lab/stages/spec.md +2 -1
  28. package/package-assets/shared/skills/lab/stages/write.md +1 -1
  29. package/package.json +1 -1
@@ -16,7 +16,7 @@ Use the same repository artifacts and stage boundaries every time.
  ## Stage Aliases
 
  - `/lab idea ...` or `/lab-idea`
- Research the idea, define the problem and failure case, classify the contribution and breakthrough level, compare against existing methods, end with three meaningful points, and keep an explicit approval gate before any implementation.
+ Research the idea through two brainstorm passes and two literature sweeps, define the problem and failure case, compare against closest prior work, then end with a source-backed recommendation and an explicit approval gate before any implementation.
 
  - `/lab data ...` or `/lab-data`
  Turn the approved idea into an approved dataset and benchmark package with dataset years, papers that used each dataset, source audit, download plan, classic-public versus recent-strong-public versus claim-specific benchmark roles, and explicit rationale for canonical baselines, strong historical baselines, recent strong public methods, and closest prior work.
@@ -58,9 +58,7 @@ Use the same repository artifacts and stage boundaries every time.
  - `iterate` requires a normalized summary from `scripts/eval_report.py`.
  - `run`, `iterate`, `auto`, and `report` should all follow `.lab/context/eval-protocol.md`, including its recorded sources for metrics and comparison implementations.
  - `write` requires an approved framing artifact from the `framing` stage.
- - `write` requires stable report artifacts and should only change one section per round while following the installed write-stage contract under `skills/lab/stages/write.md`.
- - `write` should use `.lab/writing/plan.md` as the write-time source of truth for planned tables, figures, citations, and asset coverage.
- - `write` should treat section-quality, claim-safety, and manuscript-delivery validators as the canonical acceptance gates for final-draft or export rounds.
+ - `write` requires stable report artifacts and must follow the installed write-stage contract under `skills/lab/stages/write.md` instead of re-stating write-specific rules here.
 
  ## How to Ask for `/lab auto`
 
@@ -6,4 +6,4 @@ argument-hint: idea or research problem
  Use the installed `lab` skill at `.codex/skills/lab/SKILL.md`.
 
  Execute the requested `/lab:idea` stage against the user's argument now. Do not only recommend another lab stage. If a blocking prerequisite is missing, say exactly what is missing and ask at most one clarifying question.
- This command runs the `/lab:idea` stage. It must produce a collaborator-readable proposal memo with a plain-language scenario, problem, why-it-matters explanation, explicit current-method landscape, closest-prior-work comparison, a literature scoping bundle that defaults to roughly 20 relevant sources unless the field is too narrow, a rough approach description, and a minimum viable experiment before the approval gate.
+ This command runs the `/lab:idea` stage. Use `.codex/skills/lab/stages/idea.md` as the single source of truth for the two brainstorm passes, two literature sweeps, closest-prior comparison, source-backed proposal memo, evaluation sketch, tentative contributions, user guidance, minimum viable experiment, and approval gate. Start with brainstorm pass 1 over 2-4 candidate directions, run literature sweep 1 with real closest-prior references for each direction, narrow the field with brainstorm pass 2, then run literature sweep 2 to build the final source bundle before producing a collaborator-readable recommendation. The final idea memo must explain the real-world scenario, the problem solved, why current methods fall short, roughly how the idea would work, how it would be evaluated, what the tentative contributions are, and what the user should decide next. Keep `.lab/writing/idea-source-log.md` synchronized with the actual search queries, bucketed sources, and final source count used in both sweeps. The literature bundle should default to about 20 sources unless the field is genuinely narrow and that smaller bundle is explicitly justified.
@@ -10,7 +10,7 @@ argument-hint: workflow question or stage choice
  ## Subcommands
 
  - `/lab:idea`
- Research the idea, define the problem and failure case, classify the contribution and breakthrough level, compare against existing methods, end with three meaningful points, and keep an explicit approval gate before any implementation.
+ Research the idea through two brainstorm passes and two literature sweeps, define the problem and failure case, compare against closest prior work, then end with a source-backed recommendation and an explicit approval gate before any implementation.
 
  - `/lab:data`
  Turn the approved idea into an approved dataset and benchmark package with dataset years, papers that used each dataset, source audit, download plan, classic-public versus recent-strong-public versus claim-specific benchmark roles, and explicit rationale for canonical baselines, strong historical baselines, recent strong public methods, and closest prior work.
@@ -52,9 +52,7 @@ argument-hint: workflow question or stage choice
  - `/lab:iterate` requires a normalized summary from `scripts/eval_report.py`.
  - `/lab:run`, `/lab:iterate`, `/lab:auto`, and `/lab:report` should all follow `.lab/context/eval-protocol.md`, including its recorded sources for metrics and comparison implementations.
  - `/lab:write` requires an approved framing artifact from `/lab:framing`.
- - `/lab:write` requires stable report artifacts and should only change one section per round while following the installed write-stage contract under `skills/lab/stages/write.md`.
- - `/lab:write` should use `.lab/writing/plan.md` as the write-time source of truth for planned tables, figures, citations, and asset coverage.
- - `/lab:write` should treat section-quality, claim-safety, and manuscript-delivery validators as the canonical acceptance gates for final-draft or export rounds.
+ - `/lab:write` requires stable report artifacts and must follow the installed write-stage contract under `skills/lab/stages/write.md` instead of re-stating write-specific rules here.
 
  ## How to Ask for `/lab:auto`
 
@@ -11,19 +11,60 @@ REQUIRED_SECTIONS = {
      "One-Sentence Problem": [r"^##\s+One-Sentence Problem\s*$", r"^##\s+一句话问题(?:定义)?\s*$"],
      "Why It Matters": [r"^##\s+Why It Matters\s*$", r"^##\s+为什么重要\s*$"],
      "Existing Methods": [r"^##\s+Existing Methods\s*$", r"^##\s+现有方法(?:与失败模式)?\s*$"],
+     "Brainstorm Pass 1": [r"^##\s+Brainstorm Pass 1\s*$", r"^##\s+第一轮脑暴\s*$"],
+     "Literature Sweep 1": [r"^##\s+Literature Sweep 1\s*$", r"^##\s+第一轮文献(?:检索|收敛)?\s*$"],
      "Literature Scoping Bundle": [r"^##\s+Literature Scoping Bundle\s*$", r"^##\s+文献范围(?:包)?\s*$"],
      "Closest Prior Work Comparison": [r"^##\s+Closest Prior Work Comparison\s*$", r"^##\s+最接近前作对照\s*$"],
+     "Brainstorm Pass 2": [r"^##\s+Brainstorm Pass 2\s*$", r"^##\s+第二轮脑暴\s*$"],
+     "Literature Sweep 2": [r"^##\s+Literature Sweep 2\s*$", r"^##\s+第二轮文献(?:检索|收敛)?\s*$"],
      "Rough Approach": [r"^##\s+Rough Approach\s*$", r"^##\s+我们准备怎么做\s*$"],
+     "Problem Solved": [r"^##\s+Problem Solved\s*$", r"^##\s+解决了什么问题\s*$"],
+     "Evaluation Sketch": [r"^##\s+Evaluation Sketch\s*$", r"^##\s+评测草图\s*$"],
+     "Tentative Contributions": [r"^##\s+Tentative Contributions\s*$", r"^##\s+暂定贡献\s*$"],
      "Candidate Experiment": [r"^##\s+Candidate Experiment\s*$", r"^##\s+(?:最小实验|候选实验)\s*$"],
      "Falsifiable Hypothesis": [r"^##\s+Falsifiable Hypothesis\s*$", r"^##\s+可证伪假设\s*$"],
+     "Final Recommendation": [r"^##\s+Final Recommendation\s*$", r"^##\s+最终推荐\s*$"],
+     "User Guidance": [r"^##\s+User Guidance\s*$", r"^##\s+用户引导\s*$"],
+ }
+
+ SOURCE_LOG_SECTIONS = {
+     "Search Intent": [r"^##\s+Search Intent\s*$", r"^##\s+检索意图\s*$"],
+     "Sweep 1 Log": [r"^##\s+Sweep 1 Log\s*$", r"^##\s+第一轮检索记录\s*$"],
+     "Sweep 2 Log": [r"^##\s+Sweep 2 Log\s*$", r"^##\s+第二轮检索记录\s*$"],
+     "Source Integrity Notes": [r"^##\s+Source Integrity Notes\s*$", r"^##\s+来源完整性说明\s*$"],
+ }
+
+ REFERENCE_PATTERN = re.compile(
+     r"(https?://\S+|arxiv\.org/\S+|doi\.org/\S+|doi:\s*\S+|\[[^\]]+\]\([^)]+\))",
+     flags=re.IGNORECASE,
+ )
+
+ MANDATORY_SOURCE_BUCKETS = {
+     "Closest prior": (
+         ("Closest prior bucket", "最接近前作来源数", "最接近前作 bucket"),
+         ("Closest prior", "最接近前作"),
+     ),
+     "Recent strong papers": (
+         ("Recent strong papers", "近期强相关论文", "近期强论文"),
+         ("Recent strong papers", "近期强相关论文", "近期强论文"),
+     ),
+     "Benchmark or evaluation papers": (
+         ("Benchmark or evaluation papers", "基准或评测论文", "基准论文"),
+         ("Benchmark or evaluation papers", "基准或评测论文", "基准论文"),
+     ),
+     "Survey or taxonomy papers": (
+         ("Survey or taxonomy papers", "综述或 taxonomy 论文", "综述论文"),
+         ("Survey or taxonomy papers", "综述或 taxonomy 论文", "综述论文"),
+     ),
  }
 
 
  def parse_args():
      parser = argparse.ArgumentParser(
-         description="Validate that an idea artifact is source-backed, plain-language, and aligned with workflow language."
+         description="Validate that an idea artifact and idea source log are source-backed, plain-language, and aligned with workflow language."
      )
      parser.add_argument("--idea", required=True, help="Path to the idea artifact markdown file")
+     parser.add_argument("--source-log", required=True, help="Path to the idea source log markdown file")
      parser.add_argument("--workflow-config", required=True, help="Path to .lab/config/workflow.json")
      return parser.parse_args()
 
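The new `REFERENCE_PATTERN` treats bare URLs, arXiv and DOI paths, `doi:` strings, and markdown links as reference markers. A quick runnable sketch of its counting behavior, re-stated outside the diff; the sample citations below are made-up placeholders, not real sources:

```python
import re

# Copy of the REFERENCE_PATTERN added in validate_idea_artifact.py.
REFERENCE_PATTERN = re.compile(
    r"(https?://\S+|arxiv\.org/\S+|doi\.org/\S+|doi:\s*\S+|\[[^\]]+\]\([^)]+\))",
    flags=re.IGNORECASE,
)

# Hypothetical sweep excerpt; every citation here is a placeholder.
sample = """
- Closest prior: [Foo et al. 2023](https://arxiv.org/abs/2301.00001)
- Recent strong: arxiv.org/abs/2405.12345
- Benchmark: doi:10.1000/xyz123
"""

# The markdown link matches as a single reference (the link branch consumes
# the embedded URL), and the bare arXiv path and doi: form match once each.
matches = REFERENCE_PATTERN.findall(sample)
print(len(matches))  # 3
```

Because the markdown-link alternative swallows the whole `[text](url)` span, a linked citation is not double-counted against the `https?://` branch.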
@@ -57,11 +98,36 @@ def missing_sections(text: str) -> list[str]:
      return missing
 
 
+ def missing_source_log_sections(text: str) -> list[str]:
+     missing = []
+     for section_name, patterns in SOURCE_LOG_SECTIONS.items():
+         if not any(re.search(pattern, text, flags=re.MULTILINE) for pattern in patterns):
+             missing.append(section_name)
+     return missing
+
+
  def contains_any(text: str, needles: tuple[str, ...]) -> bool:
      lowered = text.lower()
      return any(needle.lower() in lowered for needle in needles)
 
 
+ def count_references(text: str) -> int:
+     return len(REFERENCE_PATTERN.findall(text))
+
+
+ def unique_references(text: str) -> set[str]:
+     return {match[0] if isinstance(match, tuple) else match for match in REFERENCE_PATTERN.findall(text)}
+
+
+ def extract_numeric_field(body: str, labels: tuple[str, ...]) -> int | None:
+     for label in labels:
+         pattern = re.compile(rf"{re.escape(label)}\s*[::]\s*(\d+)", flags=re.IGNORECASE)
+         match = pattern.search(body)
+         if match:
+             return int(match.group(1))
+     return None
+
+
  def has_field_value(body: str, labels: tuple[str, ...]) -> bool:
      for label in labels:
          pattern = re.compile(rf"^\s*(?:-|\d+\.)\s*{re.escape(label)}[::][ \t]*([^\n]+?)\s*$", flags=re.MULTILINE)
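The added `extract_numeric_field` helper reads `Label: 18` style fields and accepts either an ASCII or a full-width colon, which is what lets the English and Chinese templates share one validator. A runnable sketch, with a hypothetical bundle excerpt:

```python
import re

# Sketch of the extract_numeric_field helper added in this version.
def extract_numeric_field(body, labels):
    for label in labels:
        # The [::] character class accepts an ASCII or full-width colon.
        pattern = re.compile(rf"{re.escape(label)}\s*[::]\s*(\d+)", flags=re.IGNORECASE)
        match = pattern.search(body)
        if match:
            return int(match.group(1))
    return None

# Hypothetical literature-bundle excerpt with one English and one Chinese label.
bundle = "- Default target source count: 20\n- 当前已覆盖来源数:18\n"

print(extract_numeric_field(bundle, ("Default target source count",)))  # 20
print(extract_numeric_field(bundle, ("Actual source count", "当前已覆盖来源数")))  # 18
print(extract_numeric_field(bundle, ("Closest prior bucket",)))  # None
```

Labels are tried in order, so the English field name can be listed first with the Chinese alias as a fallback.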
@@ -72,6 +138,38 @@ def has_field_value(body: str, labels: tuple[str, ...]) -> bool:
      return False
 
 
+ def has_non_placeholder_field_value(body: str, labels: tuple[str, ...]) -> bool:
+     return has_field_value(body, labels)
+
+
+ def extract_bucket_body(body: str, labels: tuple[str, ...]) -> str:
+     lines = body.splitlines()
+     start_index = None
+     start_indent = 0
+     for index, line in enumerate(lines):
+         stripped = line.lstrip()
+         indent = len(line) - len(stripped)
+         for label in labels:
+             if re.match(rf"^-\s*{re.escape(label)}\s*:\s*$", stripped, flags=re.IGNORECASE):
+                 start_index = index + 1
+                 start_indent = indent
+                 break
+         if start_index is not None:
+             break
+
+     if start_index is None:
+         return ""
+
+     captured: list[str] = []
+     for line in lines[start_index:]:
+         stripped = line.lstrip()
+         indent = len(line) - len(stripped)
+         if stripped.startswith("- ") and indent <= start_indent:
+             break
+         captured.append(line)
+     return "\n".join(captured).strip()
+
+
  def validate_content(text: str) -> list[str]:
      issues: list[str] = []
      scenario = extract_section_body(text, REQUIRED_SECTIONS["Scenario"])
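The new `extract_bucket_body` helper finds a `- Label:` bullet and captures the indented lines beneath it, stopping at the next bullet at the same or a shallower indent. A runnable sketch against a hypothetical Sweep 2 Log excerpt (the URLs are placeholders):

```python
import re

# Sketch of the extract_bucket_body helper added in this version.
def extract_bucket_body(body, labels):
    lines = body.splitlines()
    start_index = None
    start_indent = 0
    for index, line in enumerate(lines):
        stripped = line.lstrip()
        indent = len(line) - len(stripped)
        for label in labels:
            # The bullet must end at the colon; nested lines carry the values.
            if re.match(rf"^-\s*{re.escape(label)}\s*:\s*$", stripped, flags=re.IGNORECASE):
                start_index = index + 1
                start_indent = indent
                break
        if start_index is not None:
            break
    if start_index is None:
        return ""
    captured = []
    for line in lines[start_index:]:
        stripped = line.lstrip()
        indent = len(line) - len(stripped)
        # Stop at the next bullet at the same or a shallower indent.
        if stripped.startswith("- ") and indent <= start_indent:
            break
        captured.append(line)
    return "\n".join(captured).strip()

sweep = (
    "- Closest prior:\n"
    "  - https://arxiv.org/abs/2301.00001\n"
    "- Recent strong papers:\n"
    "  - https://arxiv.org/abs/2405.12345\n"
)

closest = extract_bucket_body(sweep, ("Closest prior", "最接近前作"))
print(closest)  # only the reference nested under "Closest prior:" survives
```

This is how the per-bucket reference checks below isolate one bucket's sources without being fooled by references in neighboring buckets.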
@@ -86,22 +184,88 @@ def validate_content(text: str) -> list[str]:
      if not contains_any(existing_methods, ("mainstream", "current", "shared assumption", "主流", "共同假设", "不够", "不足")):
          issues.append("idea artifact is missing a concrete current-methods landscape")
 
+     brainstorm_1 = extract_section_body(text, REQUIRED_SECTIONS["Brainstorm Pass 1"])
+     if not contains_any(brainstorm_1, ("candidate direction", "候选方向", "worth checking", "值得检查")):
+         issues.append("idea artifact is missing a real brainstorm pass 1 shortlist")
+
+     sweep_1 = extract_section_body(text, REQUIRED_SECTIONS["Literature Sweep 1"])
+     if count_references(sweep_1) < 3:
+         issues.append("idea artifact is missing literature sweep 1 with real references")
+
      literature = extract_section_body(text, REQUIRED_SECTIONS["Literature Scoping Bundle"])
      if not contains_any(literature, ("target", "source count", "closest prior", "recent strong", "benchmark", "survey", "adjacent", "目标", "来源数", "最接近前作", "近期", "基准", "综述", "相邻领域")):
          issues.append("idea artifact is missing a literature scoping bundle")
+     if not re.search(r"(actual source count|当前已覆盖来源数|实际来源数)\s*[::]\s*\d+", literature, flags=re.IGNORECASE):
+         issues.append("idea artifact is missing a concrete literature source count")
+     default_target = extract_numeric_field(literature, ("Default target source count", "默认目标来源数", "默认来源目标数"))
+     actual_source_count = extract_numeric_field(literature, ("Actual source count", "当前已覆盖来源数", "实际来源数"))
+     if default_target is None:
+         issues.append("idea artifact is missing a default target source count")
+     if actual_source_count is not None and default_target is not None and actual_source_count < default_target:
+         if not has_non_placeholder_field_value(
+             literature,
+             ("If the total is below the default target, why", "如果总数低于默认目标,为什么"),
+         ):
+             issues.append("idea artifact is below the default target without explaining why the smaller source bundle is acceptable")
+     for bucket_name, (count_labels, _) in MANDATORY_SOURCE_BUCKETS.items():
+         bucket_count = extract_numeric_field(literature, count_labels)
+         if bucket_count is None or bucket_count <= 0:
+             issues.append(f"idea artifact is missing mandatory literature coverage for {bucket_name.lower()}")
 
      closest_prior = extract_section_body(text, REQUIRED_SECTIONS["Closest Prior Work Comparison"])
      if not contains_any(closest_prior, ("citation", "difference", "limitation", "引用", "差异", "局限")):
          issues.append("idea artifact is missing a closest prior work comparison")
+     if count_references(closest_prior) < 1:
+         issues.append("idea artifact is missing real reference markers in the closest prior work comparison")
+
+     brainstorm_2 = extract_section_body(text, REQUIRED_SECTIONS["Brainstorm Pass 2"])
+     if not contains_any(brainstorm_2, ("surviving direction", "recommended narrowed direction", "surviving", "幸存方向", "推荐收敛方向", "淘汰")):
+         issues.append("idea artifact is missing a real brainstorm pass 2 narrowing step")
+
+     sweep_2 = extract_section_body(text, REQUIRED_SECTIONS["Literature Sweep 2"])
+     if count_references(sweep_2) < 5:
+         issues.append("idea artifact is missing literature sweep 2 with real references")
 
      rough_approach = extract_section_body(text, REQUIRED_SECTIONS["Rough Approach"])
      if not contains_any(rough_approach, ("plain-language", "how this would work", "粗略做法", "怎么做", "why this design", "为什么")):
          issues.append("idea artifact is missing a rough plain-language approach")
 
+     problem_solved = extract_section_body(text, REQUIRED_SECTIONS["Problem Solved"])
+     if not has_field_value(problem_solved, ("In plain language", "白话问题", "用大白话说")):
+         issues.append("idea artifact is missing a plain-language statement of what problem the idea actually solves")
+     if not has_field_value(problem_solved, ("What becomes possible if this works", "如果这条路成立", "如果这条路可行")):
+         issues.append("idea artifact is missing the payoff of solving the proposed problem")
+
+     evaluation_sketch = extract_section_body(text, REQUIRED_SECTIONS["Evaluation Sketch"])
+     if not has_field_value(evaluation_sketch, ("Evaluation subject", "评测对象")):
+         issues.append("idea artifact is missing an evaluation sketch with the evaluation subject")
+     if not has_field_value(evaluation_sketch, ("Proxy or simulator, if any", "代理或模拟器")):
+         issues.append("idea artifact is missing an evaluation sketch that states any proxy or simulator")
+     if not has_field_value(evaluation_sketch, ("Main outcome to observe", "主要观察结果")):
+         issues.append("idea artifact is missing the main outcome in the evaluation sketch")
+     if not has_field_value(evaluation_sketch, ("Main validity risk", "主要有效性风险")):
+         issues.append("idea artifact is missing the main validity risk in the evaluation sketch")
+
+     tentative_contributions = extract_section_body(text, REQUIRED_SECTIONS["Tentative Contributions"])
+     if sum(1 for label in ("Contribution 1", "Contribution 2", "Contribution 3", "贡献 1", "贡献 2", "贡献 3") if has_field_value(tentative_contributions, (label,))) < 2:
+         issues.append("idea artifact is missing tentative contributions stated at the idea level")
+
      experiment = extract_section_body(text, REQUIRED_SECTIONS["Candidate Experiment"])
      if not contains_any(experiment, ("minimum viable experiment", "minimum experiment", "dataset", "metric", "最小实验", "主指标", "次指标")):
          issues.append("idea artifact is missing a minimum experiment")
 
+     final_recommendation = extract_section_body(text, REQUIRED_SECTIONS["Final Recommendation"])
+     if not contains_any(final_recommendation, ("recommended direction", "paper-worthy", "推荐方向", "值得做论文")):
+         issues.append("idea artifact is missing a final recommendation after the second sweep")
+
+     user_guidance = extract_section_body(text, REQUIRED_SECTIONS["User Guidance"])
+     if not has_field_value(user_guidance, ("Immediate decision needed from the user", "现在最需要你确认的选择", "Immediate decision")):
+         issues.append("idea artifact is missing user guidance about the next decision")
+     if not has_field_value(user_guidance, ("Information that would sharpen the idea", "哪些信息会显著提高下一轮判断质量", "Information that would sharpen")):
+         issues.append("idea artifact is missing user guidance about what information would sharpen the idea")
+     if not has_field_value(user_guidance, ("Recommended next stage", "推荐下一步")):
+         issues.append("idea artifact is missing user guidance about the next lab stage")
+
      return issues
 
 
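All of these checks hang off the bilingual section regexes at the top of the file: each required section accepts either an English or a Chinese `## ` heading, matched with `re.MULTILINE`. A minimal runnable sketch using just two of the new entries:

```python
import re

# Two of the new REQUIRED_SECTIONS entries, re-stated for a standalone demo.
REQUIRED = {
    "Brainstorm Pass 1": [r"^##\s+Brainstorm Pass 1\s*$", r"^##\s+第一轮脑暴\s*$"],
    "Final Recommendation": [r"^##\s+Final Recommendation\s*$", r"^##\s+最终推荐\s*$"],
}

def missing_sections(text):
    # A section passes if any of its heading patterns matches a whole line.
    missing = []
    for name, patterns in REQUIRED.items():
        if not any(re.search(p, text, flags=re.MULTILINE) for p in patterns):
            missing.append(name)
    return missing

# The Chinese heading satisfies "Brainstorm Pass 1"; nothing satisfies
# "Final Recommendation", so only that section is reported missing.
idea = "## 第一轮脑暴\n- Candidate direction 1: ...\n"
print(missing_sections(idea))  # ['Final Recommendation']
```

This is why an artifact written entirely in Chinese, or entirely in English, validates against the same section table.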
@@ -114,25 +278,100 @@ def validate_language(text: str, workflow_language: str) -> list[str]:
      return []
 
 
+ def validate_source_log(text: str) -> list[str]:
+     issues: list[str] = []
+     search_intent = extract_section_body(text, SOURCE_LOG_SECTIONS["Search Intent"])
+     if not contains_any(search_intent, ("problem framing", "search constraints", "search window", "问题 framing", "检索约束", "检索窗口", "目标方向")):
+         issues.append("idea source log is missing search intent and search boundary details")
+
+     sweep_1 = extract_section_body(text, SOURCE_LOG_SECTIONS["Sweep 1 Log"])
+     if not contains_any(sweep_1, ("query strings", "sources found", "查询", "已找到来源")):
+         issues.append("idea source log is missing sweep 1 search queries or source listings")
+     if count_references(sweep_1) < 3:
+         issues.append("idea source log is missing enough real references for sweep 1")
+
+     sweep_2 = extract_section_body(text, SOURCE_LOG_SECTIONS["Sweep 2 Log"])
+     if not contains_any(sweep_2, ("final source bundle", "closest prior", "recent strong", "benchmark", "survey", "adjacent", "final source bundle", "最终来源包")):
+         issues.append("idea source log is missing a bucketed sweep 2 source bundle")
+     if count_references(sweep_2) < 5:
+         issues.append("idea source log is missing enough real references for sweep 2")
+     actual_count = extract_numeric_field(sweep_2, ("Actual source count", "实际来源数", "当前已覆盖来源数"))
+     if actual_count is None:
+         issues.append("idea source log is missing an actual source count")
+     elif len(unique_references(text)) < actual_count:
+         issues.append("idea source log actual source count exceeds the unique references recorded in the source log")
+     for bucket_name, (_, bucket_labels) in MANDATORY_SOURCE_BUCKETS.items():
+         bucket_body = extract_bucket_body(sweep_2, bucket_labels)
+         if count_references(bucket_body) < 1:
+             issues.append(f"idea source log is missing mandatory sweep 2 references for {bucket_name.lower()}")
+
+     integrity = extract_section_body(text, SOURCE_LOG_SECTIONS["Source Integrity Notes"])
+     if not contains_any(integrity, ("duplicates removed", "unused or weak sources", "caveat", "去重", "未直接依赖", "备注", "caveat")):
+         issues.append("idea source log is missing source integrity notes")
+
+     return issues
+
+
+ def cross_validate_idea_and_source_log(idea_text: str, source_log_text: str) -> list[str]:
+     issues: list[str] = []
+     literature = extract_section_body(idea_text, REQUIRED_SECTIONS["Literature Scoping Bundle"])
+     source_sweep_2 = extract_section_body(source_log_text, SOURCE_LOG_SECTIONS["Sweep 2 Log"])
+     idea_count = extract_numeric_field(literature, ("Actual source count", "当前已覆盖来源数", "实际来源数"))
+     source_count = extract_numeric_field(source_sweep_2, ("Actual source count", "当前已覆盖来源数", "实际来源数"))
+     default_target = extract_numeric_field(literature, ("Default target source count", "默认目标来源数", "默认来源目标数"))
+     source_integrity = extract_section_body(source_log_text, SOURCE_LOG_SECTIONS["Source Integrity Notes"])
+
+     if idea_count is not None and source_count is not None and idea_count != source_count:
+         issues.append("idea source log actual source count does not match the literature scoping bundle in the idea artifact")
+
+     if count_references(source_sweep_2) < count_references(extract_section_body(idea_text, REQUIRED_SECTIONS["Literature Sweep 2"])):
+         issues.append("idea source log sweep 2 is missing references that appear in the idea artifact literature sweep 2")
+
+     if idea_count is not None and default_target is not None and idea_count < default_target:
+         if not (
+             has_non_placeholder_field_value(
+                 source_sweep_2,
+                 ("Why the bundle is below the default target", "Why the bundle stays below the default target", "为什么最终来源包低于默认目标"),
+             )
+             or has_non_placeholder_field_value(
+                 source_integrity,
+                 ("Why the bundle is below the default target", "Why the bundle stays below the default target", "为什么最终来源包低于默认目标"),
+             )
+         ):
+             issues.append("idea source log is below the default target without recording why the smaller source bundle is acceptable")
+
+     return issues
+
+
  def main():
      args = parse_args()
      idea_path = Path(args.idea)
+     source_log_path = Path(args.source_log)
      config_path = Path(args.workflow_config)
      if not idea_path.exists():
          print(f"idea artifact does not exist: {idea_path}", file=sys.stderr)
          return 1
+     if not source_log_path.exists():
+         print(f"idea source log does not exist: {source_log_path}", file=sys.stderr)
+         return 1
      if not config_path.exists():
          print(f"workflow config does not exist: {config_path}", file=sys.stderr)
          return 1
 
      text = read_text(idea_path)
+     source_log_text = read_text(source_log_path)
      workflow_language = load_workflow_language(config_path)
      issues = []
      missing = missing_sections(text)
      if missing:
          issues.append(f"idea artifact is missing required sections: {', '.join(missing)}")
+     missing_source_sections = missing_source_log_sections(source_log_text)
+     if missing_source_sections:
+         issues.append(f"idea source log is missing required sections: {', '.join(missing_source_sections)}")
      issues.extend(validate_content(text))
      issues.extend(validate_language(text, workflow_language))
+     issues.extend(validate_source_log(source_log_text))
+     issues.extend(cross_validate_idea_and_source_log(text, source_log_text))
 
      if issues:
          for issue in issues:
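One consistency rule these new checks enforce is that the claimed `Actual source count` may not exceed the number of unique reference markers actually recorded in the log. A simplified runnable sketch (the log text is a made-up example, and the tuple-handling branch of the real `unique_references` is omitted since the pattern has a single group):

```python
import re

# Same reference pattern as in the validator.
REFERENCE_PATTERN = re.compile(
    r"(https?://\S+|arxiv\.org/\S+|doi\.org/\S+|doi:\s*\S+|\[[^\]]+\]\([^)]+\))",
    flags=re.IGNORECASE,
)

def unique_references(text):
    # With a single capture group, findall yields plain strings.
    return set(REFERENCE_PATTERN.findall(text))

# Hypothetical source-log excerpt with a duplicated entry.
log = (
    "- Actual source count: 3\n"
    "- https://arxiv.org/abs/2301.00001\n"
    "- https://arxiv.org/abs/2301.00001\n"  # duplicate entry
    "- https://arxiv.org/abs/2405.12345\n"
)
claimed = 3

# The duplicate collapses to one unique reference, so the claim of 3 sources
# is not backed by the log and the validator would flag it.
print(len(unique_references(log)))  # 2
print(len(unique_references(log)) >= claimed)  # False
```

Padding the log with repeated links therefore cannot satisfy the count check, which is the point of deduplicating before comparing.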
@@ -0,0 +1,37 @@
+ # Idea Source Log
+
+ ## Search Intent
+
+ - Problem framing:
+ - Search constraints:
+ - Search window:
+
+ ## Sweep 1 Log
+
+ - Direction 1 query strings:
+   - Sources found:
+ - Direction 2 query strings:
+   - Sources found:
+ - Direction 3 query strings:
+   - Sources found:
+ - Direction 4 query strings:
+   - Sources found:
+ - Early elimination notes:
+
+ ## Sweep 2 Log
+
+ - Surviving directions:
+ - Final source bundle:
+   - Closest prior:
+   - Recent strong papers:
+   - Benchmark or evaluation papers:
+   - Survey or taxonomy papers:
+   - Adjacent-field papers:
+ - Actual source count:
+
+ ## Source Integrity Notes
+
+ - Duplicates removed:
+ - Unused or weak sources not relied on:
+ - Why the bundle stays below the default target:
+ - Caveats:
@@ -54,10 +54,32 @@ Suggested levels:
  ## Existing Methods
 
  - Mainstream line 1:
+   - Citation:
+   - What it solves:
+   - Why it still falls short here:
  - Mainstream line 2:
+   - Citation:
+   - What it solves:
+   - Why it still falls short here:
  - Shared assumption:
  - Why that assumption breaks here:
 
+ ## Brainstorm Pass 1
+
+ - Candidate direction 1:
+ - Candidate direction 2:
+ - Candidate direction 3:
+ - Candidate direction 4:
+ - Why these directions are worth checking:
+
+ ## Literature Sweep 1
+
+ - Direction 1 seed references:
+ - Direction 2 seed references:
+ - Direction 3 seed references:
+ - Direction 4 seed references:
+ - Early conclusion from the first sweep:
+
  ## Literature Scoping Bundle
 
  - Default target source count:
@@ -67,7 +89,7 @@ Suggested levels:
  - Benchmark or evaluation papers:
  - Survey or taxonomy papers:
  - Adjacent-field papers:
- - If the total is below the default target, why:
+ - If the total is below the default target, why is the smaller bundle still acceptable:
 
  ## Closest Prior Work Comparison
 
@@ -84,6 +106,21 @@ Suggested levels:
  - Limitation for the current problem:
  - Difference from our direction:
 
+ ## Brainstorm Pass 2
+
+ - Surviving direction 1:
+ - Surviving direction 2:
+ - Rejected directions and why:
+ - Recommended narrowed direction:
+
+ ## Literature Sweep 2
+
+ - Recent strong papers:
+ - Benchmark or evaluation papers:
+ - Survey or taxonomy papers:
+ - Adjacent-field papers:
+ - Final literature takeaway:
+
  ## Why Ours Is Different
 
  - Existing methods rely on:
@@ -96,6 +133,24 @@ Suggested levels:
  - Plain-language description of how this would work:
  - Why this design might resolve the failure case:
 
+ ## Problem Solved
+
+ - In plain language:
+ - What becomes possible if this works:
+
+ ## Evaluation Sketch
+
+ - Evaluation subject:
+ - Proxy or simulator, if any:
+ - Main outcome to observe:
+ - Main validity risk:
+
+ ## Tentative Contributions
+
+ - Contribution 1:
+ - Contribution 2:
+ - Contribution 3:
+
  ## Three Meaningful Points
 
  1. Significance:
@@ -140,6 +195,17 @@ Suggested levels:
  - What must be validated before implementation:
  - Kill criteria:
 
+ ## Final Recommendation
+
+ - Recommended direction after two sweeps:
+ - Why this is still paper-worthy:
+
+ ## User Guidance
+
+ - Immediate decision needed from the user:
+ - Information that would sharpen the idea:
+ - Recommended next stage:
+
  ## Approval Gate
 
  - User-approved direction:
@@ -51,7 +51,7 @@ If `eval-protocol.md` declares structured rung entries, auto mode follows those
 
  - Run stage contract: write persistent outputs under `results_root`.
  - Iterate stage contract: update persistent outputs under `results_root`.
- - Review stage contract: update canonical review context such as `.lab/context/decisions.md`, `state.md`, `workflow-state.md`, `open-questions.md`, or `evidence-index.md`.
+ - Review stage contract: update canonical review context such as `.lab/context/decisions.md`, `workflow-state.md`, `open-questions.md`, or `evidence-index.md`, then refresh derived views.
  - Report stage contract: write `<deliverables_root>/report.md`, `<deliverables_root>/main-tables.md`, and `<deliverables_root>/artifact-status.md`.
  - Write stage contract: write LaTeX output under `<deliverables_root>/paper/`.
 
@@ -68,4 +68,4 @@ If `eval-protocol.md` declares structured rung entries, auto mode follows those
 
  - Stop conditions:
  - Escalation conditions:
- - Canonical promotion writeback: update `.lab/context/data-decisions.md`, `.lab/context/decisions.md`, `.lab/context/state.md`, and `.lab/context/workflow-state.md`.
+ - Canonical promotion writeback: update `.lab/context/data-decisions.md`, `.lab/context/decisions.md`, and `.lab/context/workflow-state.md`, then refresh derived views such as `state.md`.
@@ -1,27 +1,16 @@
  # Session Brief
 
- ## Active Stage
+ ## Immediate Focus
 
  - Stage:
  - Current objective:
  - Immediate next action:
 
- ## Mission
-
- One sentence describing the active research mission.
-
- ## Best Current Path
+ ## Mission Snapshot
 
+ - Mission:
  - Approved direction:
  - Strongest supported claim:
- - Auto mode:
- - Auto objective:
- - Auto decision:
- - Collaborator report mode:
- - Canonical context readiness:
- - Method name:
- - Primary metrics:
- - Secondary metrics:
 
  ## Main Risk
 
@@ -1,5 +1,7 @@
  # Research State
 
+ > This file is a derived durable snapshot. Update canonical context files such as `mission.md`, `decisions.md`, `data-decisions.md`, `evidence-index.md`, `eval-protocol.md`, and `open-questions.md`, then refresh derived context instead of editing this file directly.
+
  ## Approved Direction
 
  - One-sentence problem:
@@ -31,7 +31,7 @@ For auto-mode orchestration or long-running experiment campaigns, also read:
  - Figures and plots belong under the configured `figures_root`, not inside `.lab/changes/`.
  - Deliverables belong under the configured `deliverables_root`, not inside `.lab/context/`.
  - Change-local `data/` directories may hold lightweight manifests or batch specs, but not the canonical dataset copy.
- - `.lab/context/state.md` holds durable research state; `.lab/context/workflow-state.md` holds live workflow state.
+ - `.lab/context/state.md` is a derived durable research snapshot; `.lab/context/workflow-state.md` holds live workflow state.
  - `.lab/context/summary.md` is the durable project summary; `.lab/context/session-brief.md` is the next-session startup brief.
  - `.lab/context/auto-mode.md` defines the bounded autonomous envelope; `.lab/context/auto-status.md` records live state for resume and handoff.
  - If the user provides a LaTeX template directory, validate it and attach it through `paper_template_root` before drafting.
@@ -55,6 +55,7 @@ Do not force `/lab:*` onto unrelated engineering tasks.
 
  ## State Discipline
 
- - Treat `.lab/context/*` as durable project state.
- - Do not silently overwrite context files.
+ - Treat canonical context files such as `mission.md`, `decisions.md`, `data-decisions.md`, `evidence-index.md`, `eval-protocol.md`, and `open-questions.md` as durable project state.
+ - Treat `state.md`, `summary.md`, `session-brief.md`, and `next-action.md` as derived views.
+ - Do not silently overwrite canonical context files.
  - Keep sourced evidence separate from generated hypotheses.