@wentorai/research-plugins 1.4.0 → 1.4.3
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.en.md +143 -0
- package/README.md +98 -131
- package/curated/literature/README.md +2 -2
- package/curated/writing/README.md +1 -1
- package/openclaw.plugin.json +1 -1
- package/package.json +1 -1
- package/skills/literature/discovery/SKILL.md +1 -1
- package/skills/literature/discovery/citation-alert-guide/SKILL.md +2 -2
- package/skills/literature/discovery/conference-proceedings-guide/SKILL.md +2 -2
- package/skills/literature/discovery/literature-mapping-guide/SKILL.md +1 -1
- package/skills/literature/discovery/paper-recommendation-guide/SKILL.md +8 -14
- package/skills/literature/discovery/rss-paper-feeds/SKILL.md +20 -14
- package/skills/literature/discovery/semantic-paper-radar/SKILL.md +8 -8
- package/skills/literature/discovery/semantic-scholar-recs-guide/SKILL.md +103 -86
- package/skills/literature/fulltext/open-access-guide/SKILL.md +1 -1
- package/skills/literature/fulltext/open-access-mining-guide/SKILL.md +5 -5
- package/skills/literature/metadata/citation-network-guide/SKILL.md +3 -3
- package/skills/literature/metadata/h-index-guide/SKILL.md +0 -27
- package/skills/literature/search/SKILL.md +1 -1
- package/skills/literature/search/citation-chaining-guide/SKILL.md +42 -32
- package/skills/literature/search/database-comparison-guide/SKILL.md +1 -1
- package/skills/literature/search/semantic-scholar-api/SKILL.md +56 -53
- package/skills/research/automation/paper-to-agent-guide/SKILL.md +1 -1
- package/skills/research/deep-research/in-depth-research-guide/SKILL.md +1 -1
- package/skills/research/deep-research/kosmos-scientist-guide/SKILL.md +3 -3
- package/skills/research/deep-research/llm-scientific-discovery-guide/SKILL.md +1 -1
- package/skills/research/deep-research/local-deep-research-guide/SKILL.md +6 -6
- package/skills/research/deep-research/open-researcher-guide/SKILL.md +3 -3
- package/skills/research/deep-research/tongyi-deep-research-guide/SKILL.md +4 -4
- package/skills/research/methodology/grad-school-guide/SKILL.md +1 -1
- package/skills/research/paper-review/automated-review-guide/SKILL.md +1 -1
- package/skills/tools/diagram/excalidraw-diagram-guide/SKILL.md +1 -1
- package/skills/tools/diagram/mermaid-architect-guide/SKILL.md +1 -1
- package/skills/tools/diagram/plantuml-guide/SKILL.md +1 -1
- package/skills/tools/document/grobid-pdf-parsing/SKILL.md +1 -1
- package/skills/tools/document/paper-parse-guide/SKILL.md +2 -2
- package/skills/tools/knowledge-graph/citation-network-builder/SKILL.md +5 -5
- package/skills/tools/knowledge-graph/knowledge-graph-construction/SKILL.md +1 -1
- package/skills/tools/scraping/academic-web-scraping/SKILL.md +1 -2
- package/skills/tools/scraping/google-scholar-scraper/SKILL.md +7 -7
- package/skills/writing/citation/SKILL.md +1 -1
- package/skills/writing/citation/academic-citation-manager/SKILL.md +20 -17
- package/skills/writing/citation/citation-assistant-skill/SKILL.md +72 -58
- package/skills/writing/citation/onecite-reference-guide/SKILL.md +1 -1
- package/skills/writing/citation/zotero-reference-guide/SKILL.md +1 -1
- package/skills/writing/citation/zotero-scholar-guide/SKILL.md +1 -1
- package/src/tools/arxiv.ts +13 -3
- package/src/tools/biorxiv.ts +21 -5
- package/src/tools/crossref.ts +13 -6
- package/src/tools/datacite.ts +7 -3
- package/src/tools/doaj.ts +3 -2
- package/src/tools/europe-pmc.ts +4 -3
- package/src/tools/hal.ts +6 -4
- package/src/tools/inspire-hep.ts +3 -2
- package/src/tools/openaire.ts +11 -6
- package/src/tools/openalex.ts +17 -2
- package/src/tools/opencitations.ts +9 -0
- package/src/tools/orcid.ts +3 -0
- package/src/tools/osf-preprints.ts +3 -2
- package/src/tools/pubmed.ts +12 -5
- package/src/tools/unpaywall.ts +3 -0
- package/src/tools/util.ts +33 -0
- package/src/tools/zenodo.ts +10 -4
@@ -1,134 +1,137 @@
 ---
 name: semantic-scholar-api
-description: "Search papers and analyze citation graphs via
+description: "Search papers and analyze citation graphs via OpenAlex and CrossRef APIs"
 metadata:
   openclaw:
     emoji: "🔍"
     category: "literature"
     subcategory: "search"
     keywords: ["academic database search", "semantic search", "AI-powered literature search", "citation analysis", "citation network"]
-    source: "https://api.
+    source: "https://api.openalex.org/"
 ---
 
-#
+# OpenAlex & CrossRef API Guide
 
 ## Overview
 
-
+OpenAlex is a free, open catalog of the global research system, indexing over 250 million academic works across all fields of science. It provides structured access to papers, authors, institutions, concepts, and citation networks. OpenAlex is the successor to Microsoft Academic Graph and is maintained by OurResearch (the team behind Unpaywall).
 
-
+CrossRef is the official DOI registration agency for scholarly content, providing metadata for over 150 million DOIs across all publishers and disciplines. Together, OpenAlex and CrossRef provide comprehensive coverage for academic search, citation analysis, and bibliometric research.
 
-
+Both APIs are free to use without authentication. OpenAlex requests a polite `User-Agent` header; CrossRef requests a `User-Agent` with contact email for access to the polite pool (faster rate limits).
 
 ## Authentication
 
-No authentication is required for
+No authentication is required for either API.
 
+OpenAlex: Include a `User-Agent` header for polite access:
 ```
-
+User-Agent: ResearchPlugins/1.0 (https://wentor.ai)
 ```
 
-
+CrossRef: Include a `User-Agent` header with contact email for polite pool:
+```
+User-Agent: ResearchPlugins/1.0 (https://wentor.ai; mailto:dev@wentor.ai)
+```
 
 ## Core Endpoints
 
-###
+### OpenAlex: Search Works
 
-- **URL**: `GET https://api.
+- **URL**: `GET https://api.openalex.org/works`
 - **Parameters**:
 | Param | Type | Required | Description |
 |-------|------|----------|-------------|
-
-
-
-
-
-| fieldsOfStudy | string | No | Filter by field (e.g., Computer Science, Medicine) |
+| search | string | No | Full-text search query |
+| filter | string | No | Filter expression (e.g., `from_publication_date:2024-01-01`) |
+| sort | string | No | Sort field (e.g., `cited_by_count:desc`, `publication_date:desc`) |
+| per_page | integer | No | Results per page (default: 25, max: 200) |
+| page | integer | No | Page number (default: 1) |
 - **Example**:
 ```bash
-curl "https://api.
+curl "https://api.openalex.org/works?search=attention+is+all+you+need&per_page=5"
 ```
-- **Response**: JSON with `
+- **Response**: JSON with `meta` (count, page info) and `results` array containing work objects.
 
-###
+### OpenAlex: Get Work Details
 
-- **URL**: `GET https://api.
+- **URL**: `GET https://api.openalex.org/works/{id}`
 - **Parameters**:
 | Param | Type | Required | Description |
 |-------|------|----------|-------------|
-
-| fields | string | No | Comma-separated fields to return |
+| id | string | Yes | OpenAlex ID (e.g., `W2741809807`), DOI URL, or other identifier |
 - **Example**:
 ```bash
-curl "https://api.
+curl "https://api.openalex.org/works/W2741809807"
 ```
-- **Response**: JSON with full
+- **Response**: JSON with full work metadata including `id`, `title`, `abstract_inverted_index`, `publication_year`, `cited_by_count`, `authorships`, `concepts`, `referenced_works`.
 
-###
+### OpenAlex: Search Authors
 
-- **URL**: `GET https://api.
+- **URL**: `GET https://api.openalex.org/authors`
 - **Parameters**:
 | Param | Type | Required | Description |
 |-------|------|----------|-------------|
-
-
-
-| fields | string | No | Fields to return (e.g., name,paperCount,citationCount,hIndex) |
+| search | string | No | Author name search |
+| filter | string | No | Filter expression |
+| per_page | integer | No | Results per page (max: 200) |
 - **Example**:
 ```bash
-curl "https://api.
+curl "https://api.openalex.org/authors?search=Yoshua+Bengio&per_page=5"
 ```
-- **Response**: JSON with author profiles including
+- **Response**: JSON with author profiles including `works_count`, `cited_by_count`, `summary_stats.h_index`, affiliations.
 
-###
+### CrossRef: Resolve DOI
 
-- **URL**: `GET https://api.
+- **URL**: `GET https://api.crossref.org/works/{doi}`
 - **Parameters**:
 | Param | Type | Required | Description |
 |-------|------|----------|-------------|
-
+| doi | string | Yes | DOI to resolve (e.g., `10.1038/nature12373`) |
 - **Example**:
 ```bash
-curl "https://api.
+curl "https://api.crossref.org/works/10.18653/v1/N19-1423"
 ```
-- **Response**: JSON
+- **Response**: JSON with full bibliographic metadata including title, authors, journal, dates, references count, and citation count.
 
 ## Rate Limits
 
-
+OpenAlex: No strict rate limit, but use polite `User-Agent` header. Recommended: max 10 requests per second. The API returns HTTP 429 when limits are exceeded.
+
+CrossRef: Without polite pool: ~50 requests per second. With polite pool (contact email in User-Agent): higher limits. The API returns HTTP 429 when limits are exceeded.
 
 ## Common Patterns
 
 ### Build a Citation Network
 
-Retrieve a paper and
+Retrieve a paper and find all works that cite it:
 
 ```bash
-# Get paper
-curl "https://api.
+# Get paper details
+curl "https://api.openalex.org/works/W2741809807"
+
+# Get works citing this paper, sorted by citation count
+curl "https://api.openalex.org/works?filter=cites:W2741809807&sort=cited_by_count:desc&per_page=20"
 ```
 
 ### Find Influential Papers on a Topic
 
-Search for highly cited
+Search for highly cited works on a topic:
 
 ```bash
-curl "https://api.
+curl "https://api.openalex.org/works?search=graph+neural+networks&sort=cited_by_count:desc&per_page=20"
 ```
 
-### Batch Paper Lookup
+### Batch Paper Lookup via CrossRef
 
-
+Search CrossRef for papers matching a query, sorted by citation count:
 
 ```bash
-curl
--H "Content-Type: application/json" \
--d '{"ids": ["DOI:10.1038/s41586-021-03819-2", "CorpusID:49313245"]}' \
---url-query "fields=title,year,citationCount"
+curl "https://api.crossref.org/works?query=graph+neural+networks&sort=is-referenced-by-count&order=desc&rows=20"
 ```
 
 ## References
 
--
-- API
--
+- OpenAlex documentation: https://docs.openalex.org/
+- CrossRef API documentation: https://api.crossref.org/swagger-ui/index.html
+- OpenAlex source: https://github.com/ourresearch/openalex-guts
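One practical wrinkle with the OpenAlex work objects referenced in this version: abstracts are returned as an `abstract_inverted_index` (word → list of positions), not plain text. A minimal sketch of the rebuild step, assuming you already have a decoded JSON work object (the helper name is ours, not part of the package):

```python
def reconstruct_abstract(inverted_index):
    """Rebuild plain abstract text from OpenAlex's abstract_inverted_index,
    which maps each word to the positions where it occurs."""
    positions = {}
    for word, idxs in inverted_index.items():
        for i in idxs:
            positions[i] = word
    # Emit words in position order
    return " ".join(positions[i] for i in sorted(positions))

# Tiny hand-made index for illustration
idx = {"Attention": [0], "is": [1], "all": [2], "you": [3], "need": [4]}
print(reconstruct_abstract(idx))  # Attention is all you need
```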
@@ -83,7 +83,7 @@ The skill supports building knowledge graphs from processed papers:
 
 - Extract entities (methods, datasets, metrics, tools, concepts)
 - Map relationships between entities (uses, extends, contradicts, supports)
-- Link to external knowledge bases (
+- Link to external knowledge bases (OpenAlex, CrossRef, DOI)
 - Track citation chains for key claims
 - Identify research lineages and methodological evolution
 
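When linking graph nodes to external knowledge bases as the hunk above describes, the same DOI often arrives in several surface forms (URL, `doi:` prefix, mixed case). A small normalization sketch so one paper maps to one node (function name ours):

```python
def normalize_doi(raw):
    """Normalize DOI variants to one canonical lowercase form,
    so duplicate nodes are not created for the same paper."""
    doi = raw.strip().lower()
    for prefix in ("https://doi.org/", "http://doi.org/",
                   "https://dx.doi.org/", "doi:"):
        if doi.startswith(prefix):
            doi = doi[len(prefix):]
            break
    return doi

print(normalize_doi("https://doi.org/10.1038/Nature12373"))  # 10.1038/nature12373
```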
@@ -52,7 +52,7 @@ Search systematically across source tiers:
 
 | Tier | Source Type | Examples | Purpose |
 |------|-----------|---------|---------|
-| **1** | Academic databases |
+| **1** | Academic databases | OpenAlex, PubMed, Scopus, Web of Science | Peer-reviewed primary research |
 | **2** | Preprint servers | arXiv, bioRxiv, SSRN, medRxiv | Cutting-edge, not yet reviewed |
 | **3** | Grey literature | WHO reports, World Bank, NBER working papers | Policy and institutional knowledge |
 | **4** | Patents and standards | Google Patents, USPTO, IEEE standards | Technical implementations |
@@ -48,7 +48,7 @@ You are an AI Scientist conducting rigorous research.
 Follow the scientific method strictly:
 
 1. **Literature Review**: Search for related work before
-proposing anything new. Use
+proposing anything new. Use OpenAlex API.
 2. **Hypothesis**: State falsifiable hypotheses clearly.
 3. **Experiment Design**: Define independent/dependent
 variables, controls, evaluation metrics.
@@ -62,7 +62,7 @@ Follow the scientific method strictly:
 ## Tools Available
 - Python 3.11+ with PyTorch, NumPy, SciPy
 - LaTeX (pdflatex + bibtex)
--
+- OpenAlex API for literature
 - W&B for experiment tracking (optional)
 ```
 
@@ -153,7 +153,7 @@ Analyze results and write paper:
 - Method (formal description)
 - Experiments (setup + results + analysis)
 - Conclusion (summary + limitations + future)
-5. Verify all citations are real (
+5. Verify all citations are real (OpenAlex/CrossRef)
 """
 ```
 
@@ -16,7 +16,7 @@ metadata:
 
 Local Deep Research is an open-source deep research tool with over 4,000 GitHub stars that conducts comprehensive multi-source research using either local LLMs (via Ollama, LM Studio, or vLLM) or cloud-based models. It searches across 10+ academic and web sources simultaneously, synthesizes the findings, and produces well-cited research reports. The project is designed for researchers who need thorough, multi-perspective research coverage while maintaining the option to keep everything running locally for privacy.
 
-What makes Local Deep Research stand out is its breadth of search integration. Rather than relying on a single search API, it queries multiple sources in parallel -- including Google Scholar,
+What makes Local Deep Research stand out is its breadth of search integration. Rather than relying on a single search API, it queries multiple sources in parallel -- including Google Scholar, OpenAlex, arXiv, PubMed, Wikipedia, web search engines, and more -- then cross-references and synthesizes the results. This multi-source approach produces more comprehensive and balanced research outputs compared to single-source tools.
 
 The tool is particularly well-suited for academic researchers who need to conduct preliminary literature reviews, verify claims across multiple databases, or explore interdisciplinary topics where relevant work may be scattered across different platforms and publication venues.
 
@@ -94,7 +94,7 @@ from local_deep_research import DeepResearcher
 researcher = DeepResearcher(
     llm_provider="ollama",
     llm_model="llama3.1:70b",
-    search_sources=["google_scholar", "
+    search_sources=["google_scholar", "openalex",
                     "arxiv", "web"],
     max_iterations=10,
 )
@@ -114,7 +114,7 @@ Local Deep Research queries multiple sources in parallel for each research sub-q
 | Source | Type | API Key Required | Best For |
 |--------|------|-----------------|----------|
 | Google Scholar | Academic | No (via scraping) | Broad academic search |
-
+| OpenAlex | Academic | No | Cross-disciplinary, citation data |
 | arXiv | Academic | No | Preprints, ML/physics/math |
 | PubMed | Academic | No | Biomedical literature |
 | Wikipedia | Encyclopedia | No | Background and definitions |
@@ -128,12 +128,12 @@ Local Deep Research queries multiple sources in parallel for each research sub-q
 # Customize source priorities for your research domain
 researcher = DeepResearcher(
     search_sources={
-        "primary": ["
+        "primary": ["openalex", "arxiv"],
         "secondary": ["google_scholar", "web"],
         "reference": ["wikipedia", "crossref"],
     },
     source_weights={
-        "
+        "openalex": 1.5,  # Prioritize academic sources
         "arxiv": 1.5,
         "web": 0.8,
     },
@@ -249,5 +249,5 @@ local-deep-research "Your sensitive research query here"
 - Repository: https://github.com/LearningCircuit/local-deep-research
 - Ollama: https://ollama.com/
 - SearXNG: https://github.com/searxng/searxng
--
+- OpenAlex API: https://api.openalex.org/
 - arXiv API: https://info.arxiv.org/help/api/
@@ -43,14 +43,14 @@ result = researcher.research(
 
 ```python
 # Each sub-question triggers:
-# - Academic search (
+# - Academic search (OpenAlex, arXiv)
 # - Paper reading (abstract + key sections)
 # - Evidence extraction
 # - Follow-up question generation
 
 # Configuration
 researcher = OpenResearcher(
-    search_backends=["
+    search_backends=["openalex", "arxiv"],
     max_iterations=5,  # Research rounds per sub-question
     papers_per_iteration=10,  # Papers to read per round
     follow_up_questions=True,  # Generate follow-up questions
@@ -96,7 +96,7 @@ researcher = OpenResearcher(
     llm_provider="anthropic",
     model="claude-sonnet-4-20250514",
     search_config={
-        "backends": ["
+        "backends": ["openalex", "arxiv"],
         "max_results_per_query": 20,
     },
     reading_config={
@@ -119,12 +119,12 @@ DeepResearch integrates with multiple search providers to cast a wide net:
 - **Tavily**: AI-optimized search API designed for research agents
 - **Serper**: Fast Google search results API
 - **SearXNG**: Self-hosted meta-search engine for privacy-focused deployments
-- **
+- **OpenAlex API**: Direct academic paper search (free, no API key required)
 
 ```python
 # Configure multiple search backends for comprehensive coverage
 agent = DeepResearch(
-    search_engines=["bing", "
+    search_engines=["bing", "openalex"],
     search_strategy="parallel",  # Search all engines simultaneously
 )
 ```
@@ -151,7 +151,7 @@ Create research profiles optimized for specific academic domains:
 # Biomedical research profile
 bio_config = {
     "preferred_sources": ["pubmed", "biorxiv", "nature", "science"],
-    "search_engines": ["
+    "search_engines": ["openalex", "bing"],
     "terminology_mode": "technical",
     "citation_format": "apa",
 }
@@ -214,4 +214,4 @@ The trace includes all search queries, retrieved documents, LLM prompts and resp
 - Repository: https://github.com/Alibaba-NLP/DeepResearch
 - Qwen model family: https://github.com/QwenLM/Qwen
 - Alibaba NLP group: https://github.com/Alibaba-NLP
--
+- OpenAlex API: https://api.openalex.org/
@@ -30,7 +30,7 @@ A strong research question is the foundation of any good paper. It should be spe
 |-----------|-------------|---------------|
 | **F**easible | Can be answered with available resources | Do you have the data, compute, and time? |
 | **I**nteresting | Engages the research community | Would peers read this at a top venue? |
-| **N**ovel | Not already answered | Has
+| **N**ovel | Not already answered | Has OpenAlex/CrossRef search been done? |
 | **E**thical | Follows research ethics standards | Does it require IRB approval? |
 | **R**elevant | Advances the field meaningfully | Does it connect to open problems? |
 
@@ -274,7 +274,7 @@ Plagiarism and integrity:
 
 Reference management:
 - scite.ai: smart citation analysis (supporting/contrasting)
--
+- OpenAlex: related work discovery
 - Connected Papers: citation graph visualization
 ```
 
@@ -86,7 +86,7 @@ For software or experimental system diagrams, use grouped rectangles with labele
 Input: "Draw a system architecture with three layers:
 Frontend (React dashboard),
 Backend (FastAPI + PostgreSQL),
-External (
+External (OpenAlex API, CrossRef API)"
 ```
 
 The output places each layer as a dashed-border container with internal component boxes and inter-layer arrows.
@@ -56,7 +56,7 @@ C4Context
 
 System(platform, "Wentor Platform", "AI-powered research assistant ecosystem")
 
-System_Ext(scholar, "
+System_Ext(scholar, "OpenAlex", "Academic paper database")
 System_Ext(crossref, "CrossRef", "DOI resolution and metadata")
 System_Ext(github, "GitHub", "Code and skill repositories")
 
@@ -16,7 +16,7 @@ metadata:
 
 Academic PDFs are the primary format for distributing research, yet extracting structured data from them remains challenging. PDFs encode visual layout, not semantic structure -- headings, paragraphs, equations, tables, and citations are all just positioned text and graphics. GROBID (GeneRation Of BIbliographic Data) is the leading open-source tool for parsing academic PDFs into structured XML/TEI format, extracting metadata, body text, references, and figures with high accuracy.
 
-GROBID is used by major academic platforms including
+GROBID is used by major academic platforms including CORE, ResearchGate, and others for large-scale document processing. It combines machine learning models (CRF and deep learning) with heuristic rules to handle the diverse formatting of academic papers across publishers and disciplines.
 
 This guide covers installing and running GROBID, using its REST API for batch processing, extracting specific elements (metadata, references, body sections), and integrating GROBID output into downstream workflows such as knowledge bases, systematic reviews, and literature analysis pipelines.
 
@@ -32,7 +32,7 @@ Both modes begin by parsing the paper's structure from its PDF or HTML source, e
 | DOI | Resolve via CrossRef/Unpaywall | Auto-fetches open access version |
 | arXiv ID | `https://arxiv.org/pdf/{id}` | Always available |
 | URL | Direct download | May require institutional access |
-
+| OpenAlex ID | OpenAlex API + OA link | Includes metadata |
 
 ### PDF Parsing Pipeline
 
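The input-resolution table in the hunk above accepts several identifier types (DOI, arXiv ID, URL, OpenAlex ID). A heuristic sketch for routing an input string to the right resolver; the patterns and function name are ours, not taken from the skill:

```python
import re

def classify_paper_id(identifier):
    """Guess which resolution route applies to an input identifier.
    Heuristic only: real inputs may need stricter validation."""
    s = identifier.strip()
    if s.startswith(("http://", "https://")):
        return "url"
    if re.fullmatch(r"10\.\d{4,9}/\S+", s):
        return "doi"
    if re.fullmatch(r"\d{4}\.\d{4,5}(v\d+)?", s):  # new-style arXiv IDs
        return "arxiv"
    if re.fullmatch(r"[Ww]\d+", s):
        return "openalex"
    return "unknown"

print(classify_paper_id("10.1038/nature12373"))  # doi
```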
@@ -238,6 +238,6 @@ comparison = create_comparison_table(summaries,
 
 - GROBID: https://github.com/kermitt2/grobid
 - PyMuPDF: https://pymupdf.readthedocs.io
--
+- OpenAlex API: https://api.openalex.org
 - Unpaywall API: https://unpaywall.org/products/api
 - S. Keshav, "How to Read a Paper" (2007): http://ccr.sigcomm.org/online/files/p83-keshavA.pdf
@@ -44,12 +44,12 @@ OpenAlex (free):
 - Limits: Reference linking less complete than WoS
 - Best for: Large-scale analysis, reproducible research
 
-
+CrossRef (free):
 - Format: JSON via REST API
-- Coverage: ~
-- Strengths: Free,
-- Limits:
-- Best for:
+- Coverage: ~150M DOIs across all publishers
+- Strengths: Free, authoritative DOI metadata, reference linking
+- Limits: No abstract text, citation counts may lag
+- Best for: Cross-publisher networks, DOI resolution
 ```
 
 ### Data Cleaning for Network Construction
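The trade-offs listed in the hunk above (CrossRef lacks abstracts and its citation counts may lag) suggest merging the two sources per paper. A sketch under assumed field names, which combines records by preferring CrossRef's DOI metadata and filling gaps from OpenAlex:

```python
def merge_records(crossref, openalex):
    """Combine metadata for one paper: keep CrossRef fields as the base,
    fill the missing abstract from OpenAlex, and take the fresher
    (larger) citation count. Field names are illustrative assumptions."""
    merged = dict(crossref)
    if not merged.get("abstract"):
        merged["abstract"] = openalex.get("abstract")
    merged["cited_by_count"] = max(
        crossref.get("is-referenced-by-count", 0),
        openalex.get("cited_by_count", 0),
    )
    return merged
```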
@@ -293,7 +293,7 @@ Provide a detailed answer citing specific papers, methods, and findings from the
 - **Start with a clear schema.** Define your entity types and relations before extracting data. A schema change later requires re-processing.
 - **Use persistent identifiers.** DOIs for papers, ORCIDs for authors, and canonical names for methods prevent duplicate nodes.
 - **Validate extracted triples.** LLM extraction is imperfect. Sample and manually verify 5-10% of extractions.
-- **Enrich with external data.** Link your KG to OpenAlex,
+- **Enrich with external data.** Link your KG to OpenAlex, CrossRef, or Wikidata for additional metadata.
 - **Version your graph.** Export snapshots regularly and track changes over time.
 - **Design queries before building.** Know what questions you want to answer before deciding on the schema.
 
@@ -28,7 +28,6 @@ APIs are always preferable to scraping when available. They provide structured d
 
 | API | Data | Rate Limit | Auth |
 |-----|------|-----------|------|
-| Semantic Scholar | Papers, authors, citations | 100 req/sec (with key) | API key (free) |
 | OpenAlex | Papers, authors, venues, concepts | 100K req/day | Email in header |
 | Crossref | DOI metadata | 50 req/sec (polite pool) | Email in header |
 | PubMed (Entrez) | Biomedical literature | 10 req/sec (with key) | API key (free) |
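The per-second limits in the table above are easy to respect with a client-side throttle. A minimal sketch (not part of the package; `requests`-style calls would go inside the loop that calls `wait()`):

```python
import time

class PoliteClient:
    """Space out requests so a per-second API limit is never exceeded,
    e.g. PoliteClient(50) for Crossref's ~50 req/sec polite pool."""

    def __init__(self, max_per_sec):
        self.min_interval = 1.0 / max_per_sec
        self.last = 0.0

    def wait(self):
        # Sleep just long enough since the previous request
        sleep_for = self.min_interval - (time.monotonic() - self.last)
        if sleep_for > 0:
            time.sleep(sleep_for)
        self.last = time.monotonic()
```

Call `client.wait()` before each HTTP request; the first call returns immediately and later calls pace themselves.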
@@ -319,7 +318,7 @@ class DataCollector:
 ## References
 
 - [OpenAlex API Documentation](https://docs.openalex.org/) -- Open bibliographic data API
-- [
+- [CrossRef API](https://api.crossref.org/) -- DOI resolution and metadata
 - [BeautifulSoup Documentation](https://www.crummy.com/software/BeautifulSoup/bs4/doc/) -- HTML parsing
 - [Scrapy Documentation](https://docs.scrapy.org/) -- Web scraping framework
 - [Playwright Documentation](https://playwright.dev/python/) -- Browser automation
@@ -41,7 +41,7 @@ Ethical guidelines:
 OpenAlex could answer it instead
 
 Official and semi-official alternatives:
--
+- OpenAlex API: free, no key required, excellent coverage
 - OpenAlex API: free, comprehensive, well-documented
 - Crossref API: free, DOI-based metadata and citation counts
 - CORE API: free, full-text open access content
@@ -232,12 +232,12 @@ OpenAlex (openalex.org):
 - Data: titles, abstracts, citations, authors, institutions
 - Best for: large-scale bibliometric analysis
 
-
-- Coverage:
-- API: REST,
-- Rate limit:
-- Data: titles, abstracts, citations,
-- Best for:
+OpenAlex (openalex.org):
+- Coverage: 250M+ works, all disciplines
+- API: REST, no key required
+- Rate limit: ~10 requests/sec polite
+- Data: titles, abstracts, citations, concepts, author profiles
+- Best for: cross-disciplinary analysis, open data research
 
 Crossref (crossref.org):
 - Coverage: 130M+ DOIs
@@ -11,7 +11,7 @@ Select the skill matching the user's need, then `read` its SKILL.md.
 |-------|-------------|
 | [academic-citation-manager](./academic-citation-manager/SKILL.md) | Manage academic citations across BibTeX, APA, MLA, and Chicago formats |
 | [bibtex-management-guide](./bibtex-management-guide/SKILL.md) | Clean, format, deduplicate, and manage BibTeX bibliography files for LaTeX |
-| [citation-assistant-skill](./citation-assistant-skill/SKILL.md) | Claude Code skill for citation workflow via
+| [citation-assistant-skill](./citation-assistant-skill/SKILL.md) | Claude Code skill for citation workflow via OpenAlex and CrossRef |
 | [citation-style-guide](./citation-style-guide/SKILL.md) | APA, MLA, Chicago citation format guide with CSL configuration |
 | [jabref-reference-guide](./jabref-reference-guide/SKILL.md) | Guide to JabRef open-source BibTeX and BibLaTeX reference manager |
 | [jasminum-zotero-guide](./jasminum-zotero-guide/SKILL.md) | Guide to Jasminum for retrieving CNKI Chinese academic metadata in Zotero |
@@ -18,7 +18,7 @@ Manage academic citations across multiple formats (BibTeX, APA 7th, MLA 9th, Chi
 
 Citation management is a persistent friction point in academic writing. Researchers collect references from multiple sources (databases, PDFs, colleagues, web pages), store them in different formats, and must output them in the specific style required by each target journal. Errors in citations -- misspelled author names, incorrect years, broken DOIs, inconsistent formatting -- are among the most common reasons for desk rejection and reviewer criticism.
 
-This skill provides a comprehensive citation management workflow that goes beyond what GUI reference managers offer. It can retrieve complete metadata from a DOI in seconds, convert between any citation format, detect and merge duplicate entries, validate entries against CrossRef and
+This skill provides a comprehensive citation management workflow that goes beyond what GUI reference managers offer. It can retrieve complete metadata from a DOI in seconds, convert between any citation format, detect and merge duplicate entries, validate entries against CrossRef and OpenAlex databases, and generate properly formatted bibliographies for any major citation style.
 
 The approach is text-based and scriptable, making it ideal for integration with LaTeX workflows, Markdown writing pipelines, and automated document generation. All citation data is stored in standard BibTeX format as the canonical source, with on-demand conversion to other formats for specific manuscript requirements.
 
@@ -52,33 +52,36 @@ print(bibtex)
 # }
 ```
 
-### From
+### From OpenAlex
 
 ```python
-def
-    """Retrieve citation data from
-    url = f"https://api.
-
-    response = requests.get(url,
+def get_citation_from_openalex(work_id):
+    """Retrieve citation data from OpenAlex API."""
+    url = f"https://api.openalex.org/works/{work_id}"
+    headers = {"User-Agent": "ResearchPlugins/1.0 (https://wentor.ai)"}
+    response = requests.get(url, headers=headers)
     if response.status_code == 200:
         data = response.json()
         return format_as_bibtex(data)
     return None
 
-def format_as_bibtex(
-    """Convert
-
-    author_str = " and ".join(a["
-    first_author =
-    year =
+def format_as_bibtex(oa_data):
+    """Convert OpenAlex data to BibTeX."""
+    authorships = oa_data.get("authorships", [])
+    author_str = " and ".join(a["author"]["display_name"] for a in authorships)
+    first_author = authorships[0]["author"]["display_name"].split()[-1] if authorships else "Unknown"
+    year = str(oa_data.get("publication_year", ""))
     key = f"{first_author}_{year}"
 
+    venue = oa_data.get("primary_location", {}) or {}
+    journal = (venue.get("source") or {}).get("display_name", "")
+
     return f"""@article{{{key},
-  title={{{
+  title={{{oa_data.get('title', '')}}},
   author={{{author_str}}},
   year={{{year}}},
-  journal={{{
-  doi={{{
+  journal={{{journal}}},
+  doi={{{oa_data.get('doi', '')}}}
 }}"""
 ```
 
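The new `format_as_bibtex` converter in the hunk above can be exercised offline with a hand-made OpenAlex-shaped record (the function body is copied from the added lines so the demo is self-contained; the sample data is invented):

```python
def format_as_bibtex(oa_data):
    """Convert OpenAlex data to BibTeX (copy of the converter added in 1.4.3)."""
    authorships = oa_data.get("authorships", [])
    author_str = " and ".join(a["author"]["display_name"] for a in authorships)
    first_author = authorships[0]["author"]["display_name"].split()[-1] if authorships else "Unknown"
    year = str(oa_data.get("publication_year", ""))
    key = f"{first_author}_{year}"

    venue = oa_data.get("primary_location", {}) or {}
    journal = (venue.get("source") or {}).get("display_name", "")

    return f"""@article{{{key},
  title={{{oa_data.get('title', '')}}},
  author={{{author_str}}},
  year={{{year}}},
  journal={{{journal}}},
  doi={{{oa_data.get('doi', '')}}}
}}"""

# Invented sample record mimicking the OpenAlex work shape
sample = {
    "title": "An Example Paper",
    "publication_year": 2020,
    "doi": "https://doi.org/10.0000/example",
    "authorships": [{"author": {"display_name": "Ada Lovelace"}}],
    "primary_location": {"source": {"display_name": "Journal of Examples"}},
}
print(format_as_bibtex(sample))
```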
@@ -308,7 +311,7 @@ pandoc paper.md --citeproc --bibliography=references.bib \
 ## References
 
 - CrossRef API: https://api.crossref.org
--
+- OpenAlex API: https://api.openalex.org
 - APA 7th Edition Manual: https://apastyle.apa.org/products/publication-manual-7th-edition
 - BibTeX documentation: http://www.bibtex.org
 - CSL styles repository: https://github.com/citation-style-language/styles