@wentorai/research-plugins 1.4.0 → 1.4.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (53)
  1. package/curated/literature/README.md +2 -2
  2. package/curated/writing/README.md +1 -1
  3. package/package.json +1 -1
  4. package/skills/literature/discovery/SKILL.md +1 -1
  5. package/skills/literature/discovery/citation-alert-guide/SKILL.md +2 -2
  6. package/skills/literature/discovery/conference-proceedings-guide/SKILL.md +2 -2
  7. package/skills/literature/discovery/literature-mapping-guide/SKILL.md +1 -1
  8. package/skills/literature/discovery/paper-recommendation-guide/SKILL.md +8 -14
  9. package/skills/literature/discovery/rss-paper-feeds/SKILL.md +20 -14
  10. package/skills/literature/discovery/semantic-paper-radar/SKILL.md +8 -8
  11. package/skills/literature/discovery/semantic-scholar-recs-guide/SKILL.md +103 -86
  12. package/skills/literature/fulltext/open-access-guide/SKILL.md +1 -1
  13. package/skills/literature/fulltext/open-access-mining-guide/SKILL.md +5 -5
  14. package/skills/literature/metadata/citation-network-guide/SKILL.md +3 -3
  15. package/skills/literature/metadata/h-index-guide/SKILL.md +0 -27
  16. package/skills/literature/search/SKILL.md +1 -1
  17. package/skills/literature/search/citation-chaining-guide/SKILL.md +42 -32
  18. package/skills/literature/search/database-comparison-guide/SKILL.md +1 -1
  19. package/skills/literature/search/semantic-scholar-api/SKILL.md +56 -53
  20. package/skills/research/automation/paper-to-agent-guide/SKILL.md +1 -1
  21. package/skills/research/deep-research/in-depth-research-guide/SKILL.md +1 -1
  22. package/skills/research/deep-research/kosmos-scientist-guide/SKILL.md +3 -3
  23. package/skills/research/deep-research/llm-scientific-discovery-guide/SKILL.md +1 -1
  24. package/skills/research/deep-research/local-deep-research-guide/SKILL.md +6 -6
  25. package/skills/research/deep-research/open-researcher-guide/SKILL.md +3 -3
  26. package/skills/research/deep-research/tongyi-deep-research-guide/SKILL.md +4 -4
  27. package/skills/research/methodology/grad-school-guide/SKILL.md +1 -1
  28. package/skills/research/paper-review/automated-review-guide/SKILL.md +1 -1
  29. package/skills/tools/diagram/excalidraw-diagram-guide/SKILL.md +1 -1
  30. package/skills/tools/diagram/mermaid-architect-guide/SKILL.md +1 -1
  31. package/skills/tools/diagram/plantuml-guide/SKILL.md +1 -1
  32. package/skills/tools/document/grobid-pdf-parsing/SKILL.md +1 -1
  33. package/skills/tools/document/paper-parse-guide/SKILL.md +2 -2
  34. package/skills/tools/knowledge-graph/citation-network-builder/SKILL.md +5 -5
  35. package/skills/tools/knowledge-graph/knowledge-graph-construction/SKILL.md +1 -1
  36. package/skills/tools/scraping/academic-web-scraping/SKILL.md +1 -2
  37. package/skills/tools/scraping/google-scholar-scraper/SKILL.md +7 -7
  38. package/skills/writing/citation/SKILL.md +1 -1
  39. package/skills/writing/citation/academic-citation-manager/SKILL.md +20 -17
  40. package/skills/writing/citation/citation-assistant-skill/SKILL.md +72 -58
  41. package/skills/writing/citation/onecite-reference-guide/SKILL.md +1 -1
  42. package/skills/writing/citation/zotero-reference-guide/SKILL.md +1 -1
  43. package/skills/writing/citation/zotero-scholar-guide/SKILL.md +1 -1
  44. package/src/tools/arxiv.ts +3 -0
  45. package/src/tools/biorxiv.ts +19 -3
  46. package/src/tools/crossref.ts +3 -0
  47. package/src/tools/datacite.ts +3 -0
  48. package/src/tools/openalex.ts +6 -0
  49. package/src/tools/opencitations.ts +9 -0
  50. package/src/tools/orcid.ts +3 -0
  51. package/src/tools/pubmed.ts +3 -0
  52. package/src/tools/unpaywall.ts +3 -0
  53. package/src/tools/zenodo.ts +3 -0
@@ -40,24 +40,30 @@ Examine the reference list of each seed paper and identify which cited works are
  ```python
  import requests

- def get_references(paper_id, limit=100):
-     """Get all references of a paper via Semantic Scholar."""
-     url = f"https://api.semanticscholar.org/graph/v1/paper/{paper_id}/references"
-     response = requests.get(url, params={
-         "fields": "title,year,citationCount,externalIds,abstract",
-         "limit": limit
-     })
-     refs = response.json().get("data", [])
-     return [r["citedPaper"] for r in refs if r["citedPaper"].get("title")]
+ HEADERS = {"User-Agent": "ResearchPlugins/1.0 (https://wentor.ai)"}
+
+ def get_references(work_id):
+     """Get all references of a paper via OpenAlex."""
+     url = f"https://api.openalex.org/works/{work_id}"
+     response = requests.get(url, headers=HEADERS)
+     paper = response.json()
+     ref_ids = paper.get("referenced_works", [])
+
+     references = []
+     for ref_id in ref_ids:
+         ref = requests.get(f"https://api.openalex.org/works/{ref_id.split('/')[-1]}", headers=HEADERS).json()
+         if ref.get("title"):
+             references.append(ref)
+     return references

  # Get references of a seed paper
- seed_doi = "DOI:10.1038/s41586-021-03819-2"
- references = get_references(seed_doi)
+ seed_id = "W2741809807"
+ references = get_references(seed_id)

  # Sort by citation count to find the most influential foundations
- references.sort(key=lambda p: p.get("citationCount", 0), reverse=True)
+ references.sort(key=lambda p: p.get("cited_by_count", 0), reverse=True)

  for ref in references[:15]:
-     print(f"[{ref.get('year', '?')}] {ref['title']} ({ref.get('citationCount', 0)} citations)")
+     print(f"[{ref.get('publication_year', '?')}] {ref['title']} ({ref.get('cited_by_count', 0)} citations)")
  ```

  ### Step 3: Forward Chaining (Citation Tracking)
@@ -65,28 +71,32 @@ for ref in references[:15]:
  Find all papers that have cited your seed paper.

  ```python
- def get_citations(paper_id, limit=200):
-     """Get papers citing a given paper via Semantic Scholar."""
-     url = f"https://api.semanticscholar.org/graph/v1/paper/{paper_id}/citations"
+ def get_citations(work_id, limit=200):
+     """Get papers citing a given paper via OpenAlex."""
      all_citations = []
-     offset = 0
-     while offset < limit:
-         response = requests.get(url, params={
-             "fields": "title,year,citationCount,externalIds,abstract",
-             "limit": min(100, limit - offset),
-             "offset": offset
-         })
-         data = response.json().get("data", [])
-         if not data:
+     page = 1
+     while len(all_citations) < limit:
+         response = requests.get(
+             "https://api.openalex.org/works",
+             params={
+                 "filter": f"cites:{work_id}",
+                 "sort": "cited_by_count:desc",
+                 "per_page": min(200, limit - len(all_citations)),
+                 "page": page
+             },
+             headers=HEADERS
+         )
+         results = response.json().get("results", [])
+         if not results:
              break
-         all_citations.extend([c["citingPaper"] for c in data if c["citingPaper"].get("title")])
-         offset += len(data)
+         all_citations.extend(results)
+         page += 1
      return all_citations

- citations = get_citations(seed_doi)
+ citations = get_citations(seed_id)
  # Filter for recent, well-cited papers
- recent_impactful = [c for c in citations if c.get("year", 0) >= 2022 and c.get("citationCount", 0) >= 5]
- recent_impactful.sort(key=lambda p: p.get("citationCount", 0), reverse=True)
+ recent_impactful = [c for c in citations if c.get("publication_year", 0) >= 2022 and c.get("cited_by_count", 0) >= 5]
+ recent_impactful.sort(key=lambda p: p.get("cited_by_count", 0), reverse=True)
  ```

  ### Step 4: Co-Citation and Bibliographic Coupling
@@ -134,7 +144,7 @@ Repeat the process with the most relevant papers discovered in each round:
  | Google Scholar "Cited by" | Forward chaining | Free |
  | Web of Science "Cited References" / "Times Cited" | Both directions | Subscription |
  | Scopus "References" / "Cited by" | Both directions | Subscription |
- | Semantic Scholar API | Programmatic, both directions | Free |
+ | OpenAlex API | Programmatic, both directions | Free |
  | Connected Papers (connectedpapers.com) | Visual co-citation graph | Free (limited) |
  | Litmaps (litmaps.com) | Visual citation network | Free tier |
  | CoCites (cocites.com) | Co-citation analysis | Free |
@@ -145,4 +155,4 @@ Repeat the process with the most relevant papers discovered in each round:
  - **Citation bias**: Highly cited papers are not always the best or most relevant. Pay attention to less-cited but methodologically sound papers.
  - **Recency bias**: Forward chaining favors recent papers with fewer citations. Allow time for citation accumulation or use Mendeley readership as a proxy.
  - **Field boundaries**: Citation chains tend to stay within disciplinary silos. Combine with keyword searches in adjacent-field databases to break out.
- - **Incomplete coverage**: No single database indexes all citations. Cross-check with at least two sources (e.g., Semantic Scholar + Google Scholar).
+ - **Incomplete coverage**: No single database indexes all citations. Cross-check with at least two sources (e.g., OpenAlex + Google Scholar).
@@ -96,5 +96,5 @@ A robust literature search should query multiple databases to maximize recall:

  - **Scopus vs. Web of Science**: Scopus has broader coverage (especially post-2000 and non-English journals); WoS has deeper historical archives and the Journal Impact Factor.
  - **Google Scholar** finds the most results but lacks advanced filtering. Use it for snowball searches and finding grey literature, not as your primary systematic search tool.
- - **API access**: PubMed (E-utilities), Semantic Scholar, OpenAlex, and Crossref all offer free APIs for programmatic searching. Scopus and WoS require institutional API keys.
+ - **API access**: PubMed (E-utilities), OpenAlex, and Crossref all offer free APIs for programmatic searching. Scopus and WoS require institutional API keys.
  - **Alert services**: Set up saved search alerts on PubMed, Scopus, and Google Scholar to stay current in fast-moving fields.
@@ -1,134 +1,137 @@
  ---
  name: semantic-scholar-api
- description: "Search papers and analyze citation graphs via Semantic Scholar"
+ description: "Search papers and analyze citation graphs via OpenAlex and CrossRef APIs"
  metadata:
    openclaw:
      emoji: "🔍"
      category: "literature"
      subcategory: "search"
      keywords: ["academic database search", "semantic search", "AI-powered literature search", "citation analysis", "citation network"]
-     source: "https://api.semanticscholar.org/"
+     source: "https://api.openalex.org/"
  ---

- # Semantic Scholar API Guide
+ # OpenAlex & CrossRef API Guide

  ## Overview

- Semantic Scholar is a free, AI-powered research tool created by the Allen Institute for AI (AI2) that indexes over 200 million academic papers across all fields of science. Unlike traditional keyword-based search engines, Semantic Scholar uses natural language processing and machine learning to understand paper content, identify influential citations, and surface the most relevant results.
+ OpenAlex is a free, open catalog of the global research system, indexing over 250 million academic works across all fields of science. It provides structured access to papers, authors, institutions, concepts, and citation networks. OpenAlex is the successor to Microsoft Academic Graph and is maintained by OurResearch (the team behind Unpaywall).

- The Semantic Scholar Academic Graph API provides structured access to papers, authors, citations, and references. It distinguishes between influential and non-influential citations using a trained classifier, helping researchers quickly identify the most impactful works in any field. The API also provides TLDR summaries generated by AI for many papers.
+ CrossRef is the official DOI registration agency for scholarly content, providing metadata for over 150 million DOIs across all publishers and disciplines. Together, OpenAlex and CrossRef provide comprehensive coverage for academic search, citation analysis, and bibliometric research.

- The API can be used without authentication for basic access. Registering for a free API key unlocks higher rate limits and is recommended for production applications. The API returns clean JSON responses and supports field selection to minimize response payload size.
+ Both APIs are free to use without authentication. OpenAlex requests a polite `User-Agent` header; CrossRef requests a `User-Agent` with contact email for access to the polite pool (faster rate limits).

  ## Authentication

- No authentication is required for basic usage. For higher rate limits, request a free API key at https://www.semanticscholar.org/product/api and include it as a header:
+ No authentication is required for either API.

+ OpenAlex: Include a `User-Agent` header for polite access:
  ```
- x-api-key: YOUR_API_KEY
+ User-Agent: ResearchPlugins/1.0 (https://wentor.ai)
  ```

- Without an API key, rate limits are 5,000 requests per 5 minutes. With a key, limits are significantly higher (up to 1 request per second sustained).
+ CrossRef: Include a `User-Agent` header with contact email for polite pool:
+ ```
+ User-Agent: ResearchPlugins/1.0 (https://wentor.ai; mailto:dev@wentor.ai)
+ ```

  ## Core Endpoints

- ### Paper Search: Find Papers by Query
+ ### OpenAlex: Search Works

- - **URL**: `GET https://api.semanticscholar.org/graph/v1/paper/search`
+ - **URL**: `GET https://api.openalex.org/works`
  - **Parameters**:
  | Param | Type | Required | Description |
  |-------|------|----------|-------------|
- | query | string | Yes | Search query string |
- | offset | integer | No | Pagination offset (default: 0) |
- | limit | integer | No | Results per page (default: 10, max: 100) |
- | fields | string | No | Comma-separated fields to return (e.g., title,abstract,year,citationCount) |
- | year | string | No | Year range filter (e.g., 2020-2024 or 2024-) |
- | fieldsOfStudy | string | No | Filter by field (e.g., Computer Science, Medicine) |
+ | search | string | No | Full-text search query |
+ | filter | string | No | Filter expression (e.g., `from_publication_date:2024-01-01`) |
+ | sort | string | No | Sort field (e.g., `cited_by_count:desc`, `publication_date:desc`) |
+ | per_page | integer | No | Results per page (default: 25, max: 200) |
+ | page | integer | No | Page number (default: 1) |
  - **Example**:
  ```bash
- curl "https://api.semanticscholar.org/graph/v1/paper/search?query=attention+is+all+you+need&limit=5&fields=title,year,citationCount,authors,tldr"
+ curl "https://api.openalex.org/works?search=attention+is+all+you+need&per_page=5"
  ```
- - **Response**: JSON with `total`, `offset`, and `data` array containing paper objects with requested fields.
+ - **Response**: JSON with `meta` (count, page info) and `results` array containing work objects.

- ### Paper Details: Retrieve Full Paper Metadata
+ ### OpenAlex: Get Work Details

- - **URL**: `GET https://api.semanticscholar.org/graph/v1/paper/{paper_id}`
+ - **URL**: `GET https://api.openalex.org/works/{id}`
  - **Parameters**:
  | Param | Type | Required | Description |
  |-------|------|----------|-------------|
- | paper_id | string | Yes | Semantic Scholar ID, DOI, ArXiv ID, or other identifier (e.g., DOI:10.1234/...) |
- | fields | string | No | Comma-separated fields to return |
+ | id | string | Yes | OpenAlex ID (e.g., `W2741809807`), DOI URL, or other identifier |
  - **Example**:
  ```bash
- curl "https://api.semanticscholar.org/graph/v1/paper/DOI:10.18653/v1/N19-1423?fields=title,abstract,year,citationCount,influentialCitationCount,references,citations"
+ curl "https://api.openalex.org/works/W2741809807"
  ```
- - **Response**: JSON with full paper metadata including `paperId`, `title`, `abstract`, `year`, `citationCount`, `influentialCitationCount`, `references`, and `citations`.
+ - **Response**: JSON with full work metadata including `id`, `title`, `abstract_inverted_index`, `publication_year`, `cited_by_count`, `authorships`, `concepts`, `referenced_works`.

- ### Author Search: Find Researchers
+ ### OpenAlex: Search Authors

- - **URL**: `GET https://api.semanticscholar.org/graph/v1/author/search`
+ - **URL**: `GET https://api.openalex.org/authors`
  - **Parameters**:
  | Param | Type | Required | Description |
  |-------|------|----------|-------------|
- | query | string | Yes | Author name query |
- | offset | integer | No | Pagination offset |
- | limit | integer | No | Results per page (max: 1000) |
- | fields | string | No | Fields to return (e.g., name,paperCount,citationCount,hIndex) |
+ | search | string | No | Author name search |
+ | filter | string | No | Filter expression |
+ | per_page | integer | No | Results per page (max: 200) |
  - **Example**:
  ```bash
- curl "https://api.semanticscholar.org/graph/v1/author/search?query=Yoshua+Bengio&fields=name,paperCount,citationCount,hIndex"
+ curl "https://api.openalex.org/authors?search=Yoshua+Bengio&per_page=5"
  ```
- - **Response**: JSON with author profiles including publication and citation metrics.
+ - **Response**: JSON with author profiles including `works_count`, `cited_by_count`, `summary_stats.h_index`, affiliations.

- ### Dataset Releases: Bulk Data Access
+ ### CrossRef: Resolve DOI

- - **URL**: `GET https://api.semanticscholar.org/datasets/v1/release`
+ - **URL**: `GET https://api.crossref.org/works/{doi}`
  - **Parameters**:
  | Param | Type | Required | Description |
  |-------|------|----------|-------------|
- | (none) | - | - | Returns list of available dataset releases |
+ | doi | string | Yes | DOI to resolve (e.g., `10.1038/nature12373`) |
  - **Example**:
  ```bash
- curl "https://api.semanticscholar.org/datasets/v1/release"
+ curl "https://api.crossref.org/works/10.18653/v1/N19-1423"
  ```
- - **Response**: JSON array of release identifiers (dates) for bulk dataset downloads.
+ - **Response**: JSON with full bibliographic metadata including title, authors, journal, dates, references count, and citation count.

  ## Rate Limits

- Without API key: 5,000 requests per 5 minutes (approximately 16.7 requests per second in bursts). With API key: higher sustained throughput, varies by key tier. The API returns HTTP 429 when limits are exceeded. Use the `Retry-After` header value to determine wait time before retrying. Batch endpoints are available for retrieving multiple papers or authors in a single request, which is more efficient than individual lookups.
+ OpenAlex: No strict rate limit, but use polite `User-Agent` header. Recommended: max 10 requests per second. The API returns HTTP 429 when limits are exceeded.
+
+ CrossRef: Without polite pool: ~50 requests per second. With polite pool (contact email in User-Agent): higher limits. The API returns HTTP 429 when limits are exceeded.

  ## Common Patterns

  ### Build a Citation Network

- Retrieve a paper and its citation tree to map influence:
+ Retrieve a paper and find all works that cite it:

  ```bash
- # Get paper with its references and citations
- curl "https://api.semanticscholar.org/graph/v1/paper/CorpusID:49313245?fields=title,citations.title,citations.citationCount,references.title,references.citationCount"
+ # Get paper details
+ curl "https://api.openalex.org/works/W2741809807"
+
+ # Get works citing this paper, sorted by citation count
+ curl "https://api.openalex.org/works?filter=cites:W2741809807&sort=cited_by_count:desc&per_page=20"
  ```

  ### Find Influential Papers on a Topic

- Search for highly cited and influential works:
+ Search for highly cited works on a topic:

  ```bash
- curl "https://api.semanticscholar.org/graph/v1/paper/search?query=graph+neural+networks&fields=title,year,citationCount,influentialCitationCount&limit=20"
+ curl "https://api.openalex.org/works?search=graph+neural+networks&sort=cited_by_count:desc&per_page=20"
  ```

- ### Batch Paper Lookup
+ ### Batch Paper Lookup via CrossRef

- Retrieve metadata for multiple papers in a single request using the batch endpoint:
+ Search CrossRef for papers matching a query, sorted by citation count:

  ```bash
- curl -X POST "https://api.semanticscholar.org/graph/v1/paper/batch" \
-     -H "Content-Type: application/json" \
-     -d '{"ids": ["DOI:10.1038/s41586-021-03819-2", "CorpusID:49313245"]}' \
-     --url-query "fields=title,year,citationCount"
+ curl "https://api.crossref.org/works?query=graph+neural+networks&sort=is-referenced-by-count&order=desc&rows=20"
  ```

  ## References

- - Official documentation: https://api.semanticscholar.org/
- - API tutorial: https://www.semanticscholar.org/product/api/tutorial
- - Semantic Scholar paper: https://arxiv.org/abs/2301.10140
+ - OpenAlex documentation: https://docs.openalex.org/
+ - CrossRef API documentation: https://api.crossref.org/swagger-ui/index.html
+ - OpenAlex source: https://github.com/ourresearch/openalex-guts
@@ -83,7 +83,7 @@ The skill supports building knowledge graphs from processed papers:

  - Extract entities (methods, datasets, metrics, tools, concepts)
  - Map relationships between entities (uses, extends, contradicts, supports)
- - Link to external knowledge bases (Semantic Scholar, OpenAlex, DOI)
+ - Link to external knowledge bases (OpenAlex, CrossRef, DOI)
  - Track citation chains for key claims
  - Identify research lineages and methodological evolution

@@ -52,7 +52,7 @@ Search systematically across source tiers:

  | Tier | Source Type | Examples | Purpose |
  |------|-----------|---------|---------|
- | **1** | Academic databases | Semantic Scholar, PubMed, Scopus, Web of Science | Peer-reviewed primary research |
+ | **1** | Academic databases | OpenAlex, PubMed, Scopus, Web of Science | Peer-reviewed primary research |
  | **2** | Preprint servers | arXiv, bioRxiv, SSRN, medRxiv | Cutting-edge, not yet reviewed |
  | **3** | Grey literature | WHO reports, World Bank, NBER working papers | Policy and institutional knowledge |
  | **4** | Patents and standards | Google Patents, USPTO, IEEE standards | Technical implementations |
@@ -48,7 +48,7 @@ You are an AI Scientist conducting rigorous research.
  Follow the scientific method strictly:

  1. **Literature Review**: Search for related work before
-    proposing anything new. Use Semantic Scholar API.
+    proposing anything new. Use OpenAlex API.
  2. **Hypothesis**: State falsifiable hypotheses clearly.
  3. **Experiment Design**: Define independent/dependent
     variables, controls, evaluation metrics.
@@ -62,7 +62,7 @@ Follow the scientific method strictly:
  ## Tools Available
  - Python 3.11+ with PyTorch, NumPy, SciPy
  - LaTeX (pdflatex + bibtex)
- - Semantic Scholar API for literature
+ - OpenAlex API for literature
  - W&B for experiment tracking (optional)
  ```

@@ -153,7 +153,7 @@ Analyze results and write paper:
  - Method (formal description)
  - Experiments (setup + results + analysis)
  - Conclusion (summary + limitations + future)
- 5. Verify all citations are real (Semantic Scholar)
+ 5. Verify all citations are real (OpenAlex/CrossRef)
  """
  ```

@@ -62,7 +62,7 @@ from scientific_agent import HypothesisGenerator

  generator = HypothesisGenerator(
      llm_provider="anthropic",
-     knowledge_sources=["pubmed", "semantic_scholar"],
+     knowledge_sources=["pubmed", "openalex"],
  )

  hypotheses = generator.generate(
@@ -16,7 +16,7 @@ metadata:

  Local Deep Research is an open-source deep research tool with over 4,000 GitHub stars that conducts comprehensive multi-source research using either local LLMs (via Ollama, LM Studio, or vLLM) or cloud-based models. It searches across 10+ academic and web sources simultaneously, synthesizes the findings, and produces well-cited research reports. The project is designed for researchers who need thorough, multi-perspective research coverage while maintaining the option to keep everything running locally for privacy.

- What makes Local Deep Research stand out is its breadth of search integration. Rather than relying on a single search API, it queries multiple sources in parallel -- including Google Scholar, Semantic Scholar, arXiv, PubMed, Wikipedia, web search engines, and more -- then cross-references and synthesizes the results. This multi-source approach produces more comprehensive and balanced research outputs compared to single-source tools.
+ What makes Local Deep Research stand out is its breadth of search integration. Rather than relying on a single search API, it queries multiple sources in parallel -- including Google Scholar, OpenAlex, arXiv, PubMed, Wikipedia, web search engines, and more -- then cross-references and synthesizes the results. This multi-source approach produces more comprehensive and balanced research outputs compared to single-source tools.

  The tool is particularly well-suited for academic researchers who need to conduct preliminary literature reviews, verify claims across multiple databases, or explore interdisciplinary topics where relevant work may be scattered across different platforms and publication venues.

@@ -94,7 +94,7 @@ from local_deep_research import DeepResearcher
  researcher = DeepResearcher(
      llm_provider="ollama",
      llm_model="llama3.1:70b",
-     search_sources=["google_scholar", "semantic_scholar",
+     search_sources=["google_scholar", "openalex",
                      "arxiv", "web"],
      max_iterations=10,
  )
@@ -114,7 +114,7 @@ Local Deep Research queries multiple sources in parallel for each research sub-q
  | Source | Type | API Key Required | Best For |
  |--------|------|-----------------|----------|
  | Google Scholar | Academic | No (via scraping) | Broad academic search |
- | Semantic Scholar | Academic | Optional | CS/AI papers, citation data |
+ | OpenAlex | Academic | No | Cross-disciplinary, citation data |
  | arXiv | Academic | No | Preprints, ML/physics/math |
  | PubMed | Academic | No | Biomedical literature |
  | Wikipedia | Encyclopedia | No | Background and definitions |
@@ -128,12 +128,12 @@ Local Deep Research queries multiple sources in parallel for each research sub-q
  # Customize source priorities for your research domain
  researcher = DeepResearcher(
      search_sources={
-         "primary": ["semantic_scholar", "arxiv"],
+         "primary": ["openalex", "arxiv"],
          "secondary": ["google_scholar", "web"],
          "reference": ["wikipedia", "crossref"],
      },
      source_weights={
-         "semantic_scholar": 1.5,  # Prioritize academic sources
+         "openalex": 1.5,  # Prioritize academic sources
          "arxiv": 1.5,
          "web": 0.8,
      },
@@ -249,5 +249,5 @@ local-deep-research "Your sensitive research query here"
  - Repository: https://github.com/LearningCircuit/local-deep-research
  - Ollama: https://ollama.com/
  - SearXNG: https://github.com/searxng/searxng
- - Semantic Scholar API: https://api.semanticscholar.org/
+ - OpenAlex API: https://api.openalex.org/
  - arXiv API: https://info.arxiv.org/help/api/
@@ -43,14 +43,14 @@ result = researcher.research(

  ```python
  # Each sub-question triggers:
- # - Academic search (Semantic Scholar, arXiv)
+ # - Academic search (OpenAlex, arXiv)
  # - Paper reading (abstract + key sections)
  # - Evidence extraction
  # - Follow-up question generation

  # Configuration
  researcher = OpenResearcher(
-     search_backends=["semantic_scholar", "arxiv"],
+     search_backends=["openalex", "arxiv"],
      max_iterations=5,  # Research rounds per sub-question
      papers_per_iteration=10,  # Papers to read per round
      follow_up_questions=True,  # Generate follow-up questions
@@ -96,7 +96,7 @@ researcher = OpenResearcher(
      llm_provider="anthropic",
      model="claude-sonnet-4-20250514",
      search_config={
-         "backends": ["semantic_scholar", "arxiv"],
+         "backends": ["openalex", "arxiv"],
          "max_results_per_query": 20,
      },
      reading_config={
@@ -119,12 +119,12 @@ DeepResearch integrates with multiple search providers to cast a wide net:
  - **Tavily**: AI-optimized search API designed for research agents
  - **Serper**: Fast Google search results API
  - **SearXNG**: Self-hosted meta-search engine for privacy-focused deployments
- - **Semantic Scholar API**: Direct academic paper search (no API key required for basic access)
+ - **OpenAlex API**: Direct academic paper search (free, no API key required)

  ```python
  # Configure multiple search backends for comprehensive coverage
  agent = DeepResearch(
-     search_engines=["bing", "semantic_scholar"],
+     search_engines=["bing", "openalex"],
      search_strategy="parallel",  # Search all engines simultaneously
  )
  ```
@@ -151,7 +151,7 @@ Create research profiles optimized for specific academic domains:
  # Biomedical research profile
  bio_config = {
      "preferred_sources": ["pubmed", "biorxiv", "nature", "science"],
-     "search_engines": ["semantic_scholar", "bing"],
+     "search_engines": ["openalex", "bing"],
      "terminology_mode": "technical",
      "citation_format": "apa",
  }
@@ -214,4 +214,4 @@ The trace includes all search queries, retrieved documents, LLM prompts and resp
  - Repository: https://github.com/Alibaba-NLP/DeepResearch
  - Qwen model family: https://github.com/QwenLM/Qwen
  - Alibaba NLP group: https://github.com/Alibaba-NLP
- - Semantic Scholar API: https://api.semanticscholar.org/
+ - OpenAlex API: https://api.openalex.org/
@@ -30,7 +30,7 @@ A strong research question is the foundation of any good paper. It should be spe
  |-----------|-------------|---------------|
  | **F**easible | Can be answered with available resources | Do you have the data, compute, and time? |
  | **I**nteresting | Engages the research community | Would peers read this at a top venue? |
- | **N**ovel | Not already answered | Has Semantic Scholar search been done? |
+ | **N**ovel | Not already answered | Has OpenAlex/CrossRef search been done? |
  | **E**thical | Follows research ethics standards | Does it require IRB approval? |
  | **R**elevant | Advances the field meaningfully | Does it connect to open problems? |

@@ -274,7 +274,7 @@ Plagiarism and integrity:

  Reference management:
  - scite.ai: smart citation analysis (supporting/contrasting)
- - Semantic Scholar: related work discovery
+ - OpenAlex: related work discovery
  - Connected Papers: citation graph visualization
  ```
 
@@ -86,7 +86,7 @@ For software or experimental system diagrams, use grouped rectangles with labele
  Input: "Draw a system architecture with three layers:
  Frontend (React dashboard),
  Backend (FastAPI + PostgreSQL),
- External (Semantic Scholar API, CrossRef API)"
+ External (OpenAlex API, CrossRef API)"
  ```

  The output places each layer as a dashed-border container with internal component boxes and inter-layer arrows.
@@ -56,7 +56,7 @@ C4Context

  System(platform, "Wentor Platform", "AI-powered research assistant ecosystem")

- System_Ext(scholar, "Semantic Scholar", "Academic paper database")
+ System_Ext(scholar, "OpenAlex", "Academic paper database")
  System_Ext(crossref, "CrossRef", "DOI resolution and metadata")
  System_Ext(github, "GitHub", "Code and skill repositories")
 
@@ -225,7 +225,7 @@ package "Data Layer" {
  }

  package "External APIs" {
- [Semantic Scholar] as S2
+ [Unpaywall] as UP
  [CrossRef] as CR
  [OpenAlex] as OA
  }
@@ -16,7 +16,7 @@ metadata:

  Academic PDFs are the primary format for distributing research, yet extracting structured data from them remains challenging. PDFs encode visual layout, not semantic structure -- headings, paragraphs, equations, tables, and citations are all just positioned text and graphics. GROBID (GeneRation Of BIbliographic Data) is the leading open-source tool for parsing academic PDFs into structured XML/TEI format, extracting metadata, body text, references, and figures with high accuracy.

- GROBID is used by major academic platforms including Semantic Scholar, CORE, and ResearchGate for large-scale document processing. It combines machine learning models (CRF and deep learning) with heuristic rules to handle the diverse formatting of academic papers across publishers and disciplines.
+ GROBID is used by major academic platforms including CORE, ResearchGate, and others for large-scale document processing. It combines machine learning models (CRF and deep learning) with heuristic rules to handle the diverse formatting of academic papers across publishers and disciplines.

  This guide covers installing and running GROBID, using its REST API for batch processing, extracting specific elements (metadata, references, body sections), and integrating GROBID output into downstream workflows such as knowledge bases, systematic reviews, and literature analysis pipelines.
 
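GROBID's batch processing mentioned above goes through its REST API; here is a sketch of one full-text parsing call, assuming a local server on GROBID's default port 8070 and an illustrative PDF path (`requests` is a third-party dependency):

```python
import requests

GROBID_URL = "http://localhost:8070/api/processFulltextDocument"

def pdf_to_tei(pdf_path: str) -> str:
    """Send one PDF to a local GROBID server; returns TEI XML
    containing header metadata, body text, and parsed references."""
    with open(pdf_path, "rb") as f:
        resp = requests.post(GROBID_URL, files={"input": f}, timeout=120)
    resp.raise_for_status()
    return resp.text

# tei = pdf_to_tei("paper.pdf")  # requires a running GROBID instance
```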
@@ -32,7 +32,7 @@ Both modes begin by parsing the paper's structure from its PDF or HTML source, e
  | DOI | Resolve via CrossRef/Unpaywall | Auto-fetches open access version |
  | arXiv ID | `https://arxiv.org/pdf/{id}` | Always available |
  | URL | Direct download | May require institutional access |
- | Semantic Scholar ID | S2 API + PDF link | Includes metadata |
+ | OpenAlex ID | OpenAlex API + OA link | Includes metadata |

  ### PDF Parsing Pipeline
 
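The resolution rules in the table above can be sketched as a small dispatcher (the patterns follow the table; the metadata-lookup URLs are simplified and the example IDs are illustrative):

```python
def resolve(ref: str) -> str:
    """Map a paper reference to the fetch route from the table above."""
    if ref.startswith("10."):                      # bare DOI
        return f"https://api.crossref.org/works/{ref}"
    if ref.lower().startswith("arxiv:"):           # arXiv ID
        return f"https://arxiv.org/pdf/{ref.split(':', 1)[1]}"
    if ref.startswith(("http://", "https://")):    # direct URL
        return ref
    if ref.startswith("W") and ref[1:].isdigit():  # OpenAlex work ID
        return f"https://api.openalex.org/works/{ref}"
    raise ValueError(f"unrecognized reference: {ref!r}")

print(resolve("arXiv:2301.00001"))
```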
@@ -238,6 +238,6 @@ comparison = create_comparison_table(summaries,

  - GROBID: https://github.com/kermitt2/grobid
  - PyMuPDF: https://pymupdf.readthedocs.io
- - Semantic Scholar API: https://api.semanticscholar.org
+ - OpenAlex API: https://api.openalex.org
  - Unpaywall API: https://unpaywall.org/products/api
  - S. Keshav, "How to Read a Paper" (2007): http://ccr.sigcomm.org/online/files/p83-keshavA.pdf
@@ -44,12 +44,12 @@ OpenAlex (free):
  - Limits: Reference linking less complete than WoS
  - Best for: Large-scale analysis, reproducible research

- Semantic Scholar (free):
+ CrossRef (free):
  - Format: JSON via REST API
- - Coverage: ~200M papers, strong in CS/Biomed
- - Strengths: Free, citation context, citation intents
- - Limits: Weaker coverage in humanities and social sciences
- - Best for: CS/AI-focused networks, citation intent analysis
+ - Coverage: ~150M DOIs across all publishers
+ - Strengths: Free, authoritative DOI metadata, reference linking
+ - Limits: No abstract text, citation counts may lag
+ - Best for: Cross-publisher networks, DOI resolution
  ```
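For the CrossRef source described above, per-DOI metadata comes from the `/works/{doi}` endpoint, with linked references under `message.reference`; a sketch against a truncated, illustrative response (the DOIs are placeholders):

```python
import json

def crossref_work_url(doi: str) -> str:
    """Build the CrossRef works URL for one DOI."""
    return f"https://api.crossref.org/works/{doi}"

# Truncated, illustrative shape of a CrossRef works response.
sample = json.loads("""{
  "message": {
    "DOI": "10.1000/example",
    "reference": [
      {"DOI": "10.1000/cited-1"},
      {"unstructured": "Smith et al., 2019"}
    ]
  }
}""")

# Keep only references CrossRef could link to a DOI -- the basis of
# the reference linking noted above.
linked = [r["DOI"] for r in sample["message"]["reference"] if "DOI" in r]
print(linked)
```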
  ### Data Cleaning for Network Construction
@@ -293,7 +293,7 @@ Provide a detailed answer citing specific papers, methods, and findings from the
  - **Start with a clear schema.** Define your entity types and relations before extracting data. A schema change later requires re-processing.
  - **Use persistent identifiers.** DOIs for papers, ORCIDs for authors, and canonical names for methods prevent duplicate nodes.
  - **Validate extracted triples.** LLM extraction is imperfect. Sample and manually verify 5-10% of extractions.
- - **Enrich with external data.** Link your KG to OpenAlex, Semantic Scholar, or Wikidata for additional metadata.
+ - **Enrich with external data.** Link your KG to OpenAlex, CrossRef, or Wikidata for additional metadata.
  - **Version your graph.** Export snapshots regularly and track changes over time.
  - **Design queries before building.** Know what questions you want to answer before deciding on the schema.
 
@@ -28,7 +28,6 @@ APIs are always preferable to scraping when available. They provide structured d

  | API | Data | Rate Limit | Auth |
  |-----|------|-----------|------|
- | Semantic Scholar | Papers, authors, citations | 100 req/sec (with key) | API key (free) |
  | OpenAlex | Papers, authors, venues, concepts | 100K req/day | Email in header |
  | Crossref | DOI metadata | 50 req/sec (polite pool) | Email in header |
  | PubMed (Entrez) | Biomedical literature | 10 req/sec (with key) | API key (free) |
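The per-second limits in the table can be honored client-side with a minimal inter-request throttle; a sketch (the limits are taken from the table; the class name and structure are our own):

```python
import time

class Throttle:
    """Enforce a minimum interval between successive API calls."""

    def __init__(self, max_per_sec: float):
        self.min_interval = 1.0 / max_per_sec
        self._last = 0.0

    def wait(self) -> None:
        """Sleep just long enough to respect the configured rate."""
        elapsed = time.monotonic() - self._last
        if elapsed < self.min_interval:
            time.sleep(self.min_interval - elapsed)
        self._last = time.monotonic()

crossref = Throttle(max_per_sec=50)   # polite-pool limit from the table
for _ in range(3):
    crossref.wait()                   # call before each request
```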
@@ -319,7 +318,7 @@ class DataCollector:
  ## References

  - [OpenAlex API Documentation](https://docs.openalex.org/) -- Open bibliographic data API
- - [Semantic Scholar API](https://api.semanticscholar.org/) -- Paper and author data
+ - [CrossRef API](https://api.crossref.org/) -- DOI resolution and metadata
  - [BeautifulSoup Documentation](https://www.crummy.com/software/BeautifulSoup/bs4/doc/) -- HTML parsing
  - [Scrapy Documentation](https://docs.scrapy.org/) -- Web scraping framework
  - [Playwright Documentation](https://playwright.dev/python/) -- Browser automation