@wentorai/research-plugins 1.2.3 → 1.3.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +16 -8
- package/openclaw.plugin.json +10 -3
- package/package.json +2 -5
- package/skills/analysis/dataviz/SKILL.md +25 -0
- package/skills/analysis/dataviz/chart-image-generator/SKILL.md +1 -1
- package/skills/analysis/econometrics/SKILL.md +23 -0
- package/skills/analysis/econometrics/robustness-checks/SKILL.md +1 -1
- package/skills/analysis/statistics/SKILL.md +21 -0
- package/skills/analysis/statistics/data-anomaly-detection/SKILL.md +1 -1
- package/skills/analysis/statistics/ml-experiment-tracker/SKILL.md +1 -1
- package/skills/analysis/statistics/{senior-data-scientist-guide → modeling-strategy-guide}/SKILL.md +5 -5
- package/skills/analysis/wrangling/SKILL.md +21 -0
- package/skills/analysis/wrangling/csv-data-analyzer/SKILL.md +1 -1
- package/skills/analysis/wrangling/data-cog-guide/SKILL.md +1 -1
- package/skills/domains/ai-ml/SKILL.md +37 -0
- package/skills/domains/biomedical/SKILL.md +28 -0
- package/skills/domains/biomedical/genomas-guide/SKILL.md +1 -1
- package/skills/domains/biomedical/med-researcher-guide/SKILL.md +1 -1
- package/skills/domains/biomedical/medgeclaw-guide/SKILL.md +1 -1
- package/skills/domains/business/SKILL.md +17 -0
- package/skills/domains/business/architecture-design-guide/SKILL.md +1 -1
- package/skills/domains/chemistry/SKILL.md +19 -0
- package/skills/domains/chemistry/computational-chemistry-guide/SKILL.md +1 -1
- package/skills/domains/cs/SKILL.md +21 -0
- package/skills/domains/ecology/SKILL.md +16 -0
- package/skills/domains/economics/SKILL.md +20 -0
- package/skills/domains/economics/post-labor-economics/SKILL.md +1 -1
- package/skills/domains/economics/pricing-psychology-guide/SKILL.md +1 -1
- package/skills/domains/education/SKILL.md +19 -0
- package/skills/domains/education/academic-study-methods/SKILL.md +1 -1
- package/skills/domains/education/edumcp-guide/SKILL.md +1 -1
- package/skills/domains/finance/SKILL.md +19 -0
- package/skills/domains/finance/akshare-finance-data/SKILL.md +1 -1
- package/skills/domains/finance/options-analytics-agent-guide/SKILL.md +1 -1
- package/skills/domains/finance/stata-accounting-research/SKILL.md +1 -1
- package/skills/domains/geoscience/SKILL.md +17 -0
- package/skills/domains/humanities/SKILL.md +16 -0
- package/skills/domains/humanities/history-research-guide/SKILL.md +1 -1
- package/skills/domains/humanities/political-history-guide/SKILL.md +1 -1
- package/skills/domains/law/SKILL.md +19 -0
- package/skills/domains/math/SKILL.md +17 -0
- package/skills/domains/pharma/SKILL.md +17 -0
- package/skills/domains/physics/SKILL.md +16 -0
- package/skills/domains/social-science/SKILL.md +17 -0
- package/skills/domains/social-science/sociology-research-methods/SKILL.md +1 -1
- package/skills/literature/discovery/SKILL.md +20 -0
- package/skills/literature/discovery/paper-recommendation-guide/SKILL.md +1 -1
- package/skills/literature/discovery/semantic-paper-radar/SKILL.md +1 -1
- package/skills/literature/fulltext/SKILL.md +26 -0
- package/skills/literature/metadata/SKILL.md +35 -0
- package/skills/literature/metadata/doi-content-negotiation/SKILL.md +4 -0
- package/skills/literature/metadata/doi-resolution-guide/SKILL.md +4 -0
- package/skills/literature/metadata/orcid-api/SKILL.md +4 -0
- package/skills/literature/metadata/orcid-integration-guide/SKILL.md +4 -0
- package/skills/literature/search/SKILL.md +43 -0
- package/skills/literature/search/paper-search-mcp-guide/SKILL.md +1 -1
- package/skills/research/automation/SKILL.md +21 -0
- package/skills/research/deep-research/SKILL.md +24 -0
- package/skills/research/deep-research/auto-deep-research-guide/SKILL.md +1 -1
- package/skills/research/deep-research/in-depth-research-guide/SKILL.md +1 -1
- package/skills/research/funding/SKILL.md +20 -0
- package/skills/research/methodology/SKILL.md +24 -0
- package/skills/research/paper-review/SKILL.md +19 -0
- package/skills/research/paper-review/paper-critique-framework/SKILL.md +1 -1
- package/skills/tools/code-exec/SKILL.md +18 -0
- package/skills/tools/diagram/SKILL.md +20 -0
- package/skills/tools/document/SKILL.md +21 -0
- package/skills/tools/knowledge-graph/SKILL.md +21 -0
- package/skills/tools/ocr-translate/SKILL.md +18 -0
- package/skills/tools/ocr-translate/handwriting-recognition-guide/SKILL.md +2 -0
- package/skills/tools/ocr-translate/latex-ocr-guide/SKILL.md +2 -0
- package/skills/tools/scraping/SKILL.md +17 -0
- package/skills/writing/citation/SKILL.md +33 -0
- package/skills/writing/citation/zotfile-attachment-guide/SKILL.md +2 -0
- package/skills/writing/composition/SKILL.md +22 -0
- package/skills/writing/composition/research-paper-writer/SKILL.md +1 -1
- package/skills/writing/composition/scientific-writing-wrapper/SKILL.md +1 -1
- package/skills/writing/latex/SKILL.md +22 -0
- package/skills/writing/latex/academic-writing-latex/SKILL.md +1 -1
- package/skills/writing/latex/latex-drawing-guide/SKILL.md +1 -1
- package/skills/writing/polish/SKILL.md +20 -0
- package/skills/writing/polish/chinese-text-humanizer/SKILL.md +1 -1
- package/skills/writing/templates/SKILL.md +22 -0
- package/skills/writing/templates/beamer-presentation-guide/SKILL.md +1 -1
- package/skills/writing/templates/scientific-article-pdf/SKILL.md +1 -1
- package/skills/analysis/dataviz/citation-map-guide/SKILL.md +0 -184
- package/skills/analysis/dataviz/data-visualization-principles/SKILL.md +0 -171
- package/skills/analysis/econometrics/empirical-paper-analysis/SKILL.md +0 -192
- package/skills/analysis/econometrics/panel-data-regression-workflow/SKILL.md +0 -267
- package/skills/analysis/econometrics/stata-regression/SKILL.md +0 -117
- package/skills/analysis/statistics/general-statistics-guide/SKILL.md +0 -226
- package/skills/analysis/statistics/infiagent-benchmark-guide/SKILL.md +0 -106
- package/skills/analysis/statistics/pywayne-statistics-guide/SKILL.md +0 -192
- package/skills/analysis/statistics/quantitative-methods-guide/SKILL.md +0 -193
- package/skills/analysis/wrangling/claude-data-analysis-guide/SKILL.md +0 -100
- package/skills/analysis/wrangling/open-data-scientist-guide/SKILL.md +0 -197
- package/skills/domains/ai-ml/annotated-dl-papers-guide/SKILL.md +0 -159
- package/skills/domains/humanities/digital-humanities-methods/SKILL.md +0 -232
- package/skills/domains/law/legal-research-methods/SKILL.md +0 -190
- package/skills/domains/social-science/sociology-research-guide/SKILL.md +0 -238
- package/skills/literature/discovery/arxiv-paper-monitoring/SKILL.md +0 -233
- package/skills/literature/discovery/paper-tracking-guide/SKILL.md +0 -211
- package/skills/literature/fulltext/zotero-scihub-guide/SKILL.md +0 -168
- package/skills/literature/search/arxiv-osiris/SKILL.md +0 -199
- package/skills/literature/search/deepgit-search-guide/SKILL.md +0 -147
- package/skills/literature/search/multi-database-literature-search/SKILL.md +0 -198
- package/skills/literature/search/papers-chat-guide/SKILL.md +0 -194
- package/skills/literature/search/pasa-paper-search-guide/SKILL.md +0 -138
- package/skills/literature/search/scientify-literature-survey/SKILL.md +0 -203
- package/skills/research/automation/ai-scientist-guide/SKILL.md +0 -228
- package/skills/research/automation/coexist-ai-guide/SKILL.md +0 -149
- package/skills/research/automation/foam-agent-guide/SKILL.md +0 -203
- package/skills/research/automation/research-paper-orchestrator/SKILL.md +0 -254
- package/skills/research/deep-research/academic-deep-research/SKILL.md +0 -190
- package/skills/research/deep-research/cognitive-kernel-guide/SKILL.md +0 -200
- package/skills/research/deep-research/corvus-research-guide/SKILL.md +0 -132
- package/skills/research/deep-research/deep-research-pro/SKILL.md +0 -213
- package/skills/research/deep-research/deep-research-work/SKILL.md +0 -204
- package/skills/research/deep-research/research-cog/SKILL.md +0 -153
- package/skills/research/methodology/academic-mentor-guide/SKILL.md +0 -169
- package/skills/research/methodology/deep-innovator-guide/SKILL.md +0 -242
- package/skills/research/methodology/research-pipeline-units-guide/SKILL.md +0 -169
- package/skills/research/paper-review/paper-compare-guide/SKILL.md +0 -238
- package/skills/research/paper-review/paper-digest-guide/SKILL.md +0 -240
- package/skills/research/paper-review/paper-research-assistant/SKILL.md +0 -231
- package/skills/research/paper-review/research-quality-filter/SKILL.md +0 -261
- package/skills/tools/code-exec/contextplus-mcp-guide/SKILL.md +0 -110
- package/skills/tools/diagram/clawphd-guide/SKILL.md +0 -149
- package/skills/tools/diagram/scientific-graphical-abstract/SKILL.md +0 -201
- package/skills/tools/document/md2pdf-xelatex/SKILL.md +0 -212
- package/skills/tools/document/openpaper-guide/SKILL.md +0 -232
- package/skills/tools/document/qq-connect/SKILL.md +0 -227
- package/skills/tools/document/weknora-guide/SKILL.md +0 -216
- package/skills/tools/knowledge-graph/mimir-memory-guide/SKILL.md +0 -135
- package/skills/tools/knowledge-graph/open-webui-tools-guide/SKILL.md +0 -156
- package/skills/tools/ocr-translate/formula-recognition-guide/SKILL.md +0 -367
- package/skills/tools/ocr-translate/math-equation-renderer/SKILL.md +0 -198
- package/skills/tools/scraping/api-data-collection-guide/SKILL.md +0 -301
- package/skills/writing/citation/academic-citation-manager-guide/SKILL.md +0 -182
- package/skills/writing/composition/opendraft-thesis-guide/SKILL.md +0 -200
- package/skills/writing/composition/paper-debugger-guide/SKILL.md +0 -143
- package/skills/writing/composition/paperforge-guide/SKILL.md +0 -205
--- package/skills/research/methodology/research-pipeline-units-guide/SKILL.md
+++ /dev/null
@@ -1,169 +0,0 @@
----
-name: research-pipeline-units-guide
-description: "Evidence-first semantic research pipeline methodology"
-metadata:
-  openclaw:
-    emoji: "🔬"
-    category: "research"
-    subcategory: "methodology"
-    keywords: ["research pipeline", "evidence-first", "semantic units", "methodology", "research workflow", "structured inquiry"]
-    source: "https://github.com/WILLOSCAR/research-units-pipeline-skills"
----
-
-# Research Pipeline Units Guide
-
-## Overview
-
-Research Pipeline Units is a methodology for structuring research as composable, evidence-first semantic units. Instead of monolithic literature reviews, it decomposes research into atomic claims, evidence units, and reasoning chains that can be independently verified, combined, and reused. Each unit has clear provenance, confidence levels, and connections to other units. Suited for systematic reviews and evidence synthesis.
-
-## Core Concepts
-
-```
-Research Pipeline
-├── Claim Unit
-│   ├── Statement (falsifiable assertion)
-│   ├── Evidence (supporting sources)
-│   ├── Confidence (high/medium/low)
-│   └── Counter-evidence (opposing sources)
-├── Evidence Unit
-│   ├── Source (paper, dataset, experiment)
-│   ├── Extraction (quote, data point, figure)
-│   ├── Quality (study design, bias risk)
-│   └── Relevance (to claim)
-├── Reasoning Chain
-│   ├── Premises (claim units)
-│   ├── Logic (deductive, inductive, abductive)
-│   └── Conclusion (derived claim)
-└── Knowledge Map
-    ├── Clusters (related claims)
-    ├── Contradictions (conflicting evidence)
-    └── Gaps (missing evidence)
-```
-
-## Building Claim Units
-
-```python
-from research_units import ClaimUnit, EvidenceUnit
-
-# Create an evidence-backed claim
-claim = ClaimUnit(
-    statement="Retrieval-augmented generation reduces "
-              "hallucination in LLMs by 40-60%",
-    confidence="medium",
-    evidence=[
-        EvidenceUnit(
-            source="Lewis et al., 2020",
-            doi="10.48550/arXiv.2005.11401",
-            extraction="RAG reduces factual errors by 43% "
-                       "on Natural Questions",
-            quality="high",  # Peer-reviewed, strong methodology
-        ),
-        EvidenceUnit(
-            source="Shuster et al., 2021",
-            doi="10.48550/arXiv.2104.07567",
-            extraction="Knowledge-grounded dialogue 54% fewer "
-                       "hallucinated facts",
-            quality="medium",
-        ),
-    ],
-    counter_evidence=[
-        EvidenceUnit(
-            source="Mallen et al., 2023",
-            extraction="RAG helps less for popular knowledge "
-                       "that LLMs already encode",
-            quality="high",
-        ),
-    ],
-)
-
-print(f"Claim: {claim.statement}")
-print(f"Confidence: {claim.confidence}")
-print(f"Supporting: {len(claim.evidence)} sources")
-print(f"Opposing: {len(claim.counter_evidence)} sources")
-```
-
-## Reasoning Chains
-
-```python
-from research_units import ReasoningChain
-
-chain = ReasoningChain(
-    premises=[
-        ClaimUnit("RAG reduces hallucination (Lewis 2020)"),
-        ClaimUnit("Knowledge conflicts degrade RAG quality "
-                  "(Chen 2022)"),
-        ClaimUnit("Adaptive retrieval mitigates conflicts "
-                  "(Jiang 2023)"),
-    ],
-    logic="inductive",
-    conclusion=ClaimUnit(
-        statement="Adaptive retrieval-augmented generation "
-                  "with conflict resolution is the most "
-                  "promising approach to LLM factuality",
-        confidence="medium",
-    ),
-)
-
-# Validate chain
-validation = chain.validate()
-print(f"Valid: {validation.is_valid}")
-print(f"Gaps: {validation.gaps}")
-print(f"Strength: {validation.strength}")
-```
-
-## Knowledge Maps
-
-```python
-from research_units import KnowledgeMap
-
-kmap = KnowledgeMap("RAG for Factuality")
-
-# Add clusters of related claims
-kmap.add_cluster("retrieval_methods", [claim1, claim2, claim3])
-kmap.add_cluster("conflict_resolution", [claim4, claim5])
-kmap.add_cluster("evaluation", [claim6, claim7])
-
-# Identify contradictions
-contradictions = kmap.find_contradictions()
-for c in contradictions:
-    print(f"Conflict: {c.claim_a.statement[:50]}...")
-    print(f"     vs: {c.claim_b.statement[:50]}...")
-    print(f"  Resolution: {c.suggested_resolution}")
-
-# Identify gaps
-gaps = kmap.find_gaps()
-for g in gaps:
-    print(f"Gap: {g.description}")
-    print(f"  Between: {g.cluster_a} ↔ {g.cluster_b}")
-    print(f"  Suggested research: {g.suggestion}")
-
-# Export
-kmap.export("knowledge_map.json")
-kmap.visualize("knowledge_map.html")
-```
-
-## Pipeline Workflow
-
-```markdown
-### Step-by-Step Research Pipeline
-1. **Define scope**: Research question → decomposed sub-questions
-2. **Search**: Literature search per sub-question
-3. **Extract**: Create evidence units from each paper
-4. **Claim**: Formulate claims supported by evidence
-5. **Chain**: Build reasoning chains linking claims
-6. **Map**: Assemble knowledge map showing landscape
-7. **Synthesize**: Write narrative from structured units
-8. **Validate**: Peer review individual units + chains
-```
-
-## Use Cases
-
-1. **Systematic reviews**: Structured evidence synthesis
-2. **Research proposals**: Evidence-backed argument construction
-3. **Collaborative research**: Shared, verifiable claim units
-4. **Teaching**: Demonstrate evidence-based reasoning
-5. **Knowledge management**: Reusable research building blocks
-
-## References
-
-- [research-units-pipeline-skills](https://github.com/WILLOSCAR/research-units-pipeline-skills)
--- package/skills/research/paper-review/paper-compare-guide/SKILL.md
+++ /dev/null
@@ -1,238 +0,0 @@
----
-name: paper-compare-guide
-description: "Compare research papers side-by-side on methodology and findings"
-metadata:
-  openclaw:
-    emoji: "⚖️"
-    category: "research"
-    subcategory: "paper-review"
-    keywords: ["paper comparison", "side-by-side analysis", "methodology comparison", "research synthesis", "critical analysis"]
-    source: "https://github.com/AcademicSkills/paper-compare-guide"
----
-
-# Paper Compare Guide
-
-A skill for conducting structured side-by-side comparisons of research papers. Extracts and aligns key dimensions across papers including research questions, methodologies, datasets, results, and conclusions, producing comparison matrices and synthesis narratives that highlight agreements, contradictions, and complementary contributions.
-
-## Overview
-
-Comparing multiple papers on the same topic is a fundamental task in academic research, required for literature reviews, related work sections, research gap identification, and method selection. However, ad hoc comparisons often miss important dimensions or apply inconsistent criteria across papers. This skill provides a systematic framework that ensures comprehensive, fair comparisons by defining comparison dimensions upfront and extracting standardized data from each paper.
-
-The approach works for comparing 2-10 papers and produces two types of output: a structured comparison matrix (tabular, suitable for inclusion in papers) and a narrative synthesis (prose, suitable for literature review sections). It handles comparisons across papers that address the same question with different methods, papers that use the same method on different data, and papers that reach conflicting conclusions about the same phenomenon.
-
-## Comparison Framework
-
-### Defining Comparison Dimensions
-
-Before reading the papers, define the dimensions you will compare:
-
-```python
-COMPARISON_DIMENSIONS = {
-    'basic': {
-        'title': str,
-        'authors': list,
-        'year': int,
-        'venue': str,
-        'citation_count': int,
-    },
-    'research_design': {
-        'research_question': str,
-        'hypothesis': str,
-        'study_type': str,  # RCT, observational, simulation, etc.
-        'theoretical_framework': str,
-    },
-    'methodology': {
-        'data_source': str,
-        'sample_size': str,
-        'sampling_method': str,
-        'variables': {
-            'independent': list,
-            'dependent': list,
-            'control': list,
-        },
-        'analysis_method': str,
-        'tools_used': list,
-    },
-    'results': {
-        'main_finding': str,
-        'effect_size': str,
-        'statistical_significance': str,
-        'secondary_findings': list,
-    },
-    'quality': {
-        'strengths': list,
-        'limitations': list,
-        'reproducibility': str,  # high, medium, low
-        'generalizability': str,
-    }
-}
-```
-
-### Extraction Protocol
-
-For each paper, systematically extract data for every dimension:
-
-```
-Paper A: Smith et al. (2024)
-  research_question: "Does intervention X improve outcome Y in population Z?"
-  study_type: "Randomized controlled trial"
-  sample_size: "n=200 (100 treatment, 100 control)"
-  analysis_method: "Mixed-effects logistic regression"
-  main_finding: "Intervention X improved Y by 23% (OR=1.45, 95% CI [1.12, 1.88])"
-  limitations: ["Single-site study", "Short follow-up (6 months)"]
-
-Paper B: Jones et al. (2025)
-  research_question: "What is the effect of intervention X on outcome Y?"
-  study_type: "Quasi-experimental (pre-post with control group)"
-  sample_size: "n=350 (200 treatment, 150 control)"
-  analysis_method: "Difference-in-differences regression"
-  main_finding: "Intervention X improved Y by 18% (beta=0.18, p<0.01)"
-  limitations: ["Non-random assignment", "Potential selection bias"]
-```
-
-## Comparison Matrix Generation
-
-### Tabular Comparison
-
-| Dimension | Smith et al. (2024) | Jones et al. (2025) | Chen et al. (2025) |
-|-----------|-------------------|-------------------|--------------------|
-| **Study type** | RCT | Quasi-experimental | Observational cohort |
-| **Sample size** | n=200 | n=350 | n=1,200 |
-| **Population** | University students | Working adults | General population |
-| **Intervention** | X (standardized) | X (adapted version) | X (self-selected) |
-| **Primary outcome** | Y (binary) | Y (continuous) | Y (composite score) |
-| **Main effect** | OR=1.45 | beta=0.18 | HR=1.32 |
-| **Significant?** | Yes (p=.003) | Yes (p<.01) | Yes (p=.02) |
-| **Follow-up** | 6 months | 12 months | 24 months |
-| **Key limitation** | Single-site | Non-random | Self-selection |
-| **Code available** | Yes (GitHub) | No | Data only (Zenodo) |
-
-### Automated Matrix Builder
-
-```python
-import pandas as pd
-
-def build_comparison_matrix(papers: list, dimensions: list) -> pd.DataFrame:
-    """
-    Build a comparison matrix from extracted paper data.
-
-    Args:
-        papers: List of dicts, each containing extracted dimensions
-        dimensions: List of dimension keys to include in the matrix
-    """
-    matrix = {}
-    for paper in papers:
-        label = f"{paper['authors'][0].split()[-1]} et al. ({paper['year']})"
-        matrix[label] = {}
-        for dim in dimensions:
-            value = paper.get(dim, 'Not reported')
-            if isinstance(value, list):
-                value = '; '.join(str(v) for v in value)
-            matrix[label][dim] = value
-
-    df = pd.DataFrame(matrix).T
-    return df
-```
-
-## Synthesis Narratives
-
-### Agreement Synthesis
-
-When papers agree on a finding:
-
-```
-Template:
-"Multiple studies converge on the finding that [finding]. [Author A] (year)
-demonstrated this using [method A] with [sample A], finding [specific result].
-This was corroborated by [Author B] (year) using [different method/sample],
-who reported [similar result]. The consistency across [different methods/
-populations/contexts] strengthens confidence in this finding."
-```
-
-### Contradiction Synthesis
-
-When papers disagree:
-
-```
-Template:
-"The evidence on [topic] is mixed. [Author A] (year) found [finding A]
-using [method], while [Author B] (year) reported [contradictory finding B].
-Several factors may explain this discrepancy: (1) [methodological difference],
-(2) [population difference], (3) [measurement difference]. Further research
-using [suggested approach] is needed to resolve this inconsistency."
-```
-
-### Complementary Synthesis
-
-When papers address different aspects of the same topic:
-
-```
-Template:
-"The studies contribute complementary evidence on [topic]. [Author A] (year)
-addressed [aspect 1] by [method], establishing that [finding]. Building on
-this, [Author B] (year) examined [aspect 2] and found [finding]. Together,
-these studies suggest that [integrated conclusion], though [gap] remains
-unaddressed."
-```
-
-## Advanced Comparison Techniques
-
-### Methodological Quality Comparison
-
-Apply a standardized quality assessment tool to all papers:
-
-| Quality Criterion | Paper A | Paper B | Paper C |
-|------------------|---------|---------|---------|
-| Clear research question | Yes | Yes | Partially |
-| Appropriate design | Yes | Mostly | Yes |
-| Adequate sample size | No (underpowered) | Yes | Yes |
-| Valid measurement | Yes | Yes | Questionable |
-| Controls for confounders | Yes | Partially | No |
-| Appropriate analysis | Yes | Yes | Yes |
-| Effect size reported | Yes | No | Yes |
-| Limitations discussed | Yes | Yes | Minimal |
-| **Overall quality** | **High** | **Medium** | **Medium** |
-
-### Visualizing Paper Relationships
-
-```
-Conceptual Map:
-
-Smith (2024) ──agrees──→ Jones (2025)
-     │                        │
-     │                        │
-  extends                contradicts
-     │                        │
-     ↓                        ↓
- Lee (2023)              Chen (2025)
-     │
- replicates
-     │
-     ↓
- Park (2022) [foundational study]
-```
-
-## Output Formats
-
-The comparison can be output in several formats depending on the use case:
-
-1. **Comparison table**: For inclusion in a paper's related work section or supplementary materials.
-2. **Narrative synthesis**: For the body of a literature review chapter.
-3. **Gap analysis**: For identifying your own research contribution.
-4. **Presentation slide**: A single-slide summary for lab meetings or conference talks.
-5. **Decision matrix**: For choosing which method to adopt in your own research.
-
-## Best Practices
-
-- Define comparison dimensions before reading the papers to avoid post hoc cherry-picking.
-- Compare no more than 8-10 papers in a single matrix; beyond that, group papers by theme.
-- Always note when a dimension is "not reported" rather than leaving it blank.
-- Be fair: apply the same critical lens to all papers, including those you agree with.
-- When results conflict, investigate whether the conflict is real (different findings) or apparent (different operationalizations of the same concept).
-- Include both quantitative comparisons (effect sizes, sample sizes) and qualitative assessments (strengths, limitations).
-
-## References
-
-- Snyder, H. (2019). Literature Review as a Research Methodology. *Journal of Business Research*, 104, 333-339.
-- Templier, M. & Pare, G. (2015). A Framework for Guiding and Evaluating Literature Reviews. *Communications of the AIS*, 37, 112-137.
-- Webster, J. & Watson, R. T. (2002). Analyzing the Past to Prepare for the Future: Writing a Literature Review. *MIS Quarterly*, 26(2), xiii-xxiii.
@@ -1,240 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: paper-digest-guide
|
|
3
|
-
description: "Fetch paper and spawn sub-agents to read citations recursively"
|
|
4
|
-
metadata:
|
|
5
|
-
openclaw:
|
|
6
|
-
emoji: "🔗"
|
|
7
|
-
category: "research"
|
|
8
|
-
subcategory: "paper-review"
|
|
9
|
-
keywords: ["citation analysis", "recursive reading", "paper digestion", "citation graph", "reference mining", "snowball search"]
|
|
10
|
-
source: "https://github.com/AcademicSkills/paper-digest-guide"
|
|
11
|
-
---
|
|
12
|
-
|
|
13
|
-
# Paper Digest Guide
|
|
14
|
-
|
|
15
|
-
A skill for deeply digesting academic papers by recursively following and analyzing their citation networks. Starting from a seed paper, it spawns sub-tasks to read the most important cited references, extracts the intellectual lineage of key claims, and builds a citation-aware knowledge graph that shows how ideas developed and where the research frontier stands.
|
|
16
|
-
|
|
17
|
-
## Overview
|
|
18
|
-
|
|
19
|
-
A single paper rarely tells the full story. Key claims rest on cited evidence, methodologies build on prior work, and the significance of findings depends on what came before. The Paper Digest approach addresses this by not just reading a paper but also selectively reading its most important references, and optionally their references, building a multi-level understanding of the intellectual context. This process, often called "citation snowballing" or "reference mining," is one of the most effective ways to deeply understand a research area.
|
|
20
|
-
|
|
21
|
-
The skill implements a controlled recursive reading strategy with configurable depth (typically 1-2 levels) and breadth (top 5-10 most relevant citations per paper). It produces a citation knowledge graph annotating each reference with its role in the seed paper's argument, along with a synthesis report that traces the evolution of key ideas.
|
|
22
|
-
|
|
23
|
-
## Citation Digestion Pipeline
|
|
24
|
-
|
|
25
|
-
### Pipeline Architecture
|
|
26
|
-
|
|
27
|
-
```
|
|
28
|
-
Seed Paper
|
|
29
|
-
│
|
|
30
|
-
├── Parse references (extract all citations)
|
|
31
|
-
│
|
|
32
|
-
├── Classify citation role for each reference
|
|
33
|
-
│ ├── Foundational (key theory/framework the paper builds on)
|
|
34
|
-
│ ├── Methodological (method the paper adopts/adapts)
|
|
35
|
-
│ ├── Empirical (prior findings the paper extends)
|
|
36
|
-
│ ├── Contrasting (work the paper disagrees with)
|
|
37
|
-
│ └── Peripheral (background/context, low priority)
|
|
38
|
-
│
|
|
39
|
-
├── Prioritize: rank by role importance + citation count
|
|
40
|
-
│
|
|
41
|
-
├── Spawn sub-reads for top-N references
|
|
42
|
-
│ ├── Sub-read 1: [Foundational paper] → structured summary
|
|
43
|
-
│ ├── Sub-read 2: [Key method paper] → structured summary
|
|
44
|
-
│ ├── Sub-read 3: [Contrasting paper] → structured summary
|
|
45
|
-
│ └── ...
|
|
46
|
-
│
|
|
47
|
-
├── (Optional) Depth 2: repeat for sub-read citations
|
|
48
|
-
│
|
|
49
|
-
└── Synthesize: build citation knowledge graph + narrative
|
|
50
|
-
```

### Citation Role Classification

```python
def classify_citation_role(citation_context: str, paper_section: str) -> str:
    """
    Classify the role of a citation based on its context in the paper.

    Args:
        citation_context: The sentence(s) containing the citation
        paper_section: Which section the citation appears in
    """
    # Heuristic classification based on section and language
    role_indicators = {
        'foundational': {
            'sections': ['introduction', 'background', 'related work'],
            'phrases': ['builds on', 'based on', 'following', 'framework of',
                        'theory proposed by', 'seminal work']
        },
        'methodological': {
            'sections': ['methods', 'methodology', 'approach'],
            'phrases': ['we adopt', 'following the approach of', 'using the method',
                        'as proposed by', 'we extend the method']
        },
        'empirical': {
            'sections': ['introduction', 'results', 'discussion'],
            'phrases': ['found that', 'showed that', 'demonstrated',
                        'reported', 'consistent with', 'prior studies']
        },
        'contrasting': {
            'sections': ['introduction', 'related work', 'discussion'],
            'phrases': ['unlike', 'in contrast to', 'however', 'whereas',
                        'disagree', 'limitations of', 'fails to']
        },
        'peripheral': {
            'sections': ['introduction'],
            'phrases': ['for example', 'such as', 'see also', 'e.g.',
                        'for a review see', 'has been studied']
        }
    }

    for role, indicators in role_indicators.items():
        section_match = paper_section.lower() in [s.lower() for s in indicators['sections']]
        phrase_match = any(p in citation_context.lower() for p in indicators['phrases'])
        if section_match and phrase_match:
            return role

    return 'peripheral'  # default
```

## Citation Priority Ranking

### Prioritization Criteria

| Criterion | Weight | Rationale |
|-----------|--------|-----------|
| Citation role: foundational | 5 | Must understand to grasp the paper |
| Citation role: contrasting | 4 | Shows alternative perspectives |
| Citation role: methodological | 4 | Needed to evaluate the approach |
| Citation role: empirical | 3 | Supports key claims |
| Citation role: peripheral | 1 | Low priority for deep reading |
| Cited multiple times in paper | +2 | Indicates higher importance |
| High citation count (>100) | +1 | Widely recognized work |
| Recent (<3 years old) | +1 | Current state of the art |

```python
def rank_citations(citations: list) -> list:
    """
    Rank citations by priority for recursive reading.
    Returns sorted list with top candidates first.
    """
    role_weights = {
        'foundational': 5, 'contrasting': 4, 'methodological': 4,
        'empirical': 3, 'peripheral': 1
    }

    for c in citations:
        score = role_weights.get(c['role'], 1)
        if c.get('times_cited_in_paper', 1) > 1:
            score += 2
        if c.get('global_citations', 0) > 100:
            score += 1
        if c.get('year', 0) >= 2023:
            score += 1
        c['priority_score'] = score

    return sorted(citations, key=lambda x: x['priority_score'], reverse=True)
```
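
A quick standalone usage sketch of the ranking logic (the function restated in condensed form so the snippet runs on its own; the sample citation entries are invented for illustration):

```python
def rank_citations(citations):
    """Condensed restatement of the scoring above, for a self-contained example."""
    role_weights = {'foundational': 5, 'contrasting': 4,
                    'methodological': 4, 'empirical': 3, 'peripheral': 1}
    for c in citations:
        score = role_weights.get(c['role'], 1)
        score += 2 if c.get('times_cited_in_paper', 1) > 1 else 0
        score += 1 if c.get('global_citations', 0) > 100 else 0
        score += 1 if c.get('year', 0) >= 2023 else 0
        c['priority_score'] = score
    return sorted(citations, key=lambda c: c['priority_score'], reverse=True)

sample = [
    {'role': 'peripheral', 'year': 2024},                 # 1 + 1 (recent) = 2
    {'role': 'foundational', 'times_cited_in_paper': 4,
     'global_citations': 850, 'year': 1998},              # 5 + 2 + 1 = 8
    {'role': 'methodological', 'year': 2023},             # 4 + 1 (recent) = 5
]
ranked = rank_citations(sample)
# Foundational first (8), then methodological (5), then peripheral (2)
```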

## Recursive Reading Configuration

### Depth and Breadth Control

```yaml
digest_config:
  # Depth: how many levels of citations to follow
  max_depth: 2                # 1 = direct citations only, 2 = citations of citations

  # Breadth: how many citations to read at each level
  top_n_per_level:
    depth_1: 8                # Read top 8 references from seed paper
    depth_2: 3                # Read top 3 references from each depth-1 paper

  # Focus: which citation roles to prioritize
  priority_roles: ["foundational", "methodological", "contrasting"]

  # Read depth for sub-papers
  sub_read_depth: "pass_2"    # pass_1 (survey), pass_2 (comprehension), pass_3 (critical)

  # Termination conditions
  max_total_papers: 30        # Hard cap on total papers read
  stop_on_saturation: true    # Stop when no new concepts are found
```

### Estimated Time and Output

| Depth | Breadth | Papers Read | Time Estimate | Output Size |
|-------|---------|-------------|---------------|-------------|
| 1 | Top 5 | ~6 | 1-2 hours | 3-5 page report |
| 1 | Top 10 | ~11 | 2-4 hours | 5-8 page report |
| 2 | Top 8 + Top 3 | ~30 | 4-8 hours | 10-15 page report |
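
The "Papers Read" column follows from a simple count: the seed paper plus, at each level, the product of the breadths so far. A minimal sketch (the depth-2 figure lands near ~30 only after deduplication and the `max_total_papers` cap):

```python
def papers_read(top_n_per_level):
    """Worst-case paper count: seed + sum of products of per-level breadths."""
    total, width = 1, 1                    # start with the seed paper itself
    for n in top_n_per_level:
        width *= n                         # papers added at this depth level
        total += width
    return total

papers_read([5])      # depth 1, top 5   -> 6
papers_read([10])     # depth 1, top 10  -> 11
papers_read([8, 3])   # depth 2: 1 + 8 + 8*3 -> 33, trimmed to ~30 by the cap
```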

## Citation Knowledge Graph

### Building the Graph

```python
def build_citation_graph(seed_paper: dict, digested_papers: list) -> dict:
    """
    Build a knowledge graph from the digested papers.
    Nodes are papers, edges are citation relationships with roles.
    """
    graph = {
        'nodes': [],
        'edges': [],
        'clusters': {}
    }

    # Add seed paper as root node
    graph['nodes'].append({
        'id': seed_paper['doi'],
        'label': f"{seed_paper['first_author']} ({seed_paper['year']})",
        'type': 'seed',
        'summary': seed_paper['main_finding']
    })

    # Add digested papers and their relationships
    for paper in digested_papers:
        graph['nodes'].append({
            'id': paper['doi'],
            'label': f"{paper['first_author']} ({paper['year']})",
            'type': 'reference',
            'role': paper['citation_role'],
            'depth': paper['depth_level'],
            'summary': paper['main_finding']
        })
        graph['edges'].append({
            'source': paper['cited_by_doi'],
            'target': paper['doi'],
            'role': paper['citation_role'],
            'context': paper['citation_context']
        })

    return graph
```
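
For visual inspection, the graph dict serializes naturally to Graphviz DOT. A minimal sketch, assuming the node and edge field names produced by `build_citation_graph` above; the `graph_to_dot` helper and the role-to-color mapping are illustrative, not part of the skill:

```python
def graph_to_dot(graph: dict) -> str:
    """Render the citation graph as Graphviz DOT, coloring edges by citation role."""
    role_colors = {'foundational': 'blue', 'methodological': 'green',
                   'contrasting': 'red', 'empirical': 'black', 'peripheral': 'gray'}
    lines = ['digraph citations {']
    for n in graph['nodes']:
        # Highlight the seed paper with a distinct shape
        shape = 'doublecircle' if n['type'] == 'seed' else 'ellipse'
        lines.append(f'  "{n["id"]}" [label="{n["label"]}", shape={shape}];')
    for e in graph['edges']:
        color = role_colors.get(e['role'], 'black')
        lines.append(f'  "{e["source"]}" -> "{e["target"]}" [color={color}];')
    lines.append('}')
    return '\n'.join(lines)
```

Feed the result to `dot -Tsvg` to get a rendered citation map.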

### Interpreting the Graph

The resulting knowledge graph reveals:

- **Intellectual lineage**: The chain of foundational works that led to the seed paper.
- **Methodological genealogy**: Which methods were inherited and from where.
- **Debate structure**: Papers on opposing sides of a disagreement, traced through contrasting citations.
- **Convergence points**: Papers cited by multiple branches, indicating central concepts.
- **Research frontier**: Recent papers at the leaves of the graph, indicating current directions.
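
The convergence-points bullet is straightforward to operationalize: a convergence point is any node targeted by edges from two or more distinct sources. A minimal sketch over the edge list produced by `build_citation_graph` (the helper name is illustrative):

```python
from collections import defaultdict

def convergence_points(graph: dict, min_in: int = 2) -> list:
    """Return node ids cited by at least `min_in` distinct papers in the graph."""
    sources_by_target = defaultdict(set)
    for e in graph['edges']:
        sources_by_target[e['target']].add(e['source'])
    return [t for t, srcs in sources_by_target.items() if len(srcs) >= min_in]
```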

## Best Practices

- Start with depth 1 and increase only if the topic requires deeper historical context.
- Prioritize contrasting citations; they provide the most critical perspective on the seed paper.
- When a citation cannot be accessed (paywall, unavailable), note it as a gap rather than skipping it silently.
- Use the citation knowledge graph to identify your own paper's position in the literature.
- Save the full graph for reuse; it can seed future literature reviews on related topics.
- Cross-check that the seed paper's characterization of cited work is accurate by reading the originals.

## References

- Wohlin, C. (2014). Guidelines for Snowballing in Systematic Literature Studies. *EASE 2014*.
- Greenhalgh, T. & Peacock, R. (2005). Effectiveness and Efficiency of Search Methods. *BMJ*, 331, 1064-1065.
- Chen, C. (2006). CiteSpace II: Detecting and Visualizing Emerging Trends and Transient Patterns. *JASIST*, 57(3), 359-377.