npm - @xiaotianxt/skills - Versions diffs - 0.1.0 - Mend

@xiaotianxt/skills 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (120) hide show

package/EXCLUDED.md +42 -0
package/LICENSE +21 -0
package/README.md +165 -0
package/SECURITY.md +23 -0
package/SOURCES.md +45 -0
package/bin/skills.mjs +241 -0
package/package.json +38 -0
package/skills/1password/SKILL.md +94 -0
package/skills/1password/agents/openai.yaml +4 -0
package/skills/1password/references/item-management.md +80 -0
package/skills/1password/references/op-cli.md +107 -0
package/skills/apple-calendar-event/SKILL.md +81 -0
package/skills/apple-calendar-event/agents/openai.yaml +4 -0
package/skills/apple-calendar-event/scripts/calendar_audit.py +201 -0
package/skills/apple-calendar-event/scripts/calendar_event.py +164 -0
package/skills/bro-browser/SKILL.md +118 -0
package/skills/bro-browser/agents/openai.yaml +4 -0
package/skills/bro-browser/references/tool-map.md +102 -0
package/skills/bro-browser/references/workflows.md +146 -0
package/skills/bro-browser/scripts/bro-call.mjs +189 -0
package/skills/calendar/SKILL.md +182 -0
package/skills/calendar/agents/openai.yaml +4 -0
package/skills/calendar/references/operations.md +255 -0
package/skills/calendar/scripts/calendar_list_review.py +157 -0
package/skills/calendar/scripts/event_dedupe_preview.py +155 -0
package/skills/canvas/SKILL.md +70 -0
package/skills/canvas/agents/openai.yaml +4 -0
package/skills/canvas/references/canvas-api.md +76 -0
package/skills/course-exam-review-planner/SKILL.md +127 -0
package/skills/cx/SKILL.md +25 -0
package/skills/gh-fix-ci/LICENSE.txt +201 -0
package/skills/gh-fix-ci/SKILL.md +81 -0
package/skills/gh-fix-ci/agents/openai.yaml +6 -0
package/skills/gh-fix-ci/assets/github-small.svg +3 -0
package/skills/gh-fix-ci/assets/github.png +0 -0
package/skills/gh-fix-ci/scripts/inspect_pr_checks.py +509 -0
package/skills/gh-review-workflow/SKILL.md +61 -0
package/skills/gh-review-workflow/agents/openai.yaml +4 -0
package/skills/gh-review-workflow/references/workflow.md +48 -0
package/skills/gh-review-workflow/scripts/fetch_review_state.py +222 -0
package/skills/gh-review-workflow/scripts/resolve_review_threads.py +83 -0
package/skills/github/SKILL.md +74 -0
package/skills/github/agents/openai.yaml +6 -0
package/skills/github/assets/github-small.svg +3 -0
package/skills/github/assets/github.png +0 -0
package/skills/gws-calendar/SKILL.md +126 -0
package/skills/gws-calendar-agenda/SKILL.md +52 -0
package/skills/gws-calendar-insert/SKILL.md +66 -0
package/skills/gws-docs/SKILL.md +48 -0
package/skills/gws-docs-write/SKILL.md +49 -0
package/skills/gws-drive/SKILL.md +137 -0
package/skills/gws-drive-upload/SKILL.md +52 -0
package/skills/gws-gmail/SKILL.md +62 -0
package/skills/gws-gmail-forward/SKILL.md +55 -0
package/skills/gws-gmail-reply/SKILL.md +58 -0
package/skills/gws-gmail-reply-all/SKILL.md +62 -0
package/skills/gws-gmail-send/SKILL.md +57 -0
package/skills/gws-gmail-triage/SKILL.md +50 -0
package/skills/gws-gmail-watch/SKILL.md +58 -0
package/skills/gws-shared/SKILL.md +27 -0
package/skills/helium-browser-mcp/SKILL.md +137 -0
package/skills/helium-browser-mcp/agents/openai.yaml +4 -0
package/skills/helium-browser-mcp/scripts/obmcp.mjs +92 -0
package/skills/helium-browser-mcp/scripts/openbrowsermcp-stdio-proxy.mjs +170 -0
package/skills/learn/SKILL.md +122 -0
package/skills/learn/agents/openai.yaml +7 -0
package/skills/learn/assets/AGENTS.template.md +33 -0
package/skills/learn/assets/errorlog.template.typ +61 -0
package/skills/learn/assets/reading-sequence.template.md +23 -0
package/skills/learn/assets/source-index.template.md +17 -0
package/skills/learn/assets/tasklog.template.typ +57 -0
package/skills/learn/assets/workbook.template.typ +60 -0
package/skills/learn/references/learning-science.md +103 -0
package/skills/learn/scripts/init_learning_workspace.py +70 -0
package/skills/macos-messages/SKILL.md +258 -0
package/skills/memory/SKILL.md +33 -0
package/skills/memory/codex.md +186 -0
package/skills/memory/opencode.md +164 -0
package/skills/mimestreamctl/SKILL.md +170 -0
package/skills/mimestreamctl/agents/openai.yaml +4 -0
package/skills/mimestreamctl/scripts/mimestreamctl +33 -0
package/skills/mon/SKILL.md +51 -0
package/skills/mon/scripts/mon_spend_review.py +458 -0
package/skills/ocr/SKILL.md +136 -0
package/skills/ocr/agents/openai.yaml +4 -0
package/skills/ocr/references/local-ocr-best-practices.md +297 -0
package/skills/ocr/references/mineru-api.md +159 -0
package/skills/ocr/scripts/ocr-router +22 -0
package/skills/ocr/scripts/ocr_router.py +741 -0
package/skills/panopto-mp4-bulk-download/SKILL.md +57 -0
package/skills/panopto-mp4-bulk-download/agents/openai.yaml +4 -0
package/skills/panopto-mp4-bulk-download/references/url-patterns.md +26 -0
package/skills/panopto-mp4-bulk-download/scripts/panopto_bulk_mp4.sh +213 -0
package/skills/rust-systems-style/SKILL.md +109 -0
package/skills/rust-systems-style/agents/openai.yaml +4 -0
package/skills/rust-systems-style/references/rust-review-checklist.md +77 -0
package/skills/rust-systems-style/references/style-sources.md +68 -0
package/skills/ship-ai-native-cli/SKILL.md +76 -0
package/skills/ship-ai-native-cli/agents/openai.yaml +4 -0
package/skills/ship-ai-native-cli/references/case-notes.md +83 -0
package/skills/ship-ai-native-cli/references/product-method.md +82 -0
package/skills/ship-ai-native-cli/references/release-checklist.md +147 -0
package/skills/ship-ai-native-cli/references/rust-cli-shape.md +111 -0
package/skills/telegram-mtproto-session/SKILL.md +125 -0
package/skills/telegram-mtproto-session/agents/openai.yaml +4 -0
package/skills/telegram-mtproto-session/scripts/telegram_session.py +687 -0
package/skills/tg/SKILL.md +173 -0
package/skills/things3-manager/SKILL.md +116 -0
package/skills/things3-manager/scripts/things +42 -0
package/skills/things3-manager/scripts/things_cli.py +514 -0
package/skills/web-artifacts-builder/LICENSE.txt +202 -0
package/skills/web-artifacts-builder/SKILL.md +74 -0
package/skills/web-artifacts-builder/scripts/bundle-artifact.sh +54 -0
package/skills/web-artifacts-builder/scripts/init-artifact.sh +379 -0
package/skills/web-artifacts-builder/scripts/shadcn-components.tar.gz +0 -0
package/skills/yeet/LICENSE.txt +201 -0
package/skills/yeet/SKILL.md +71 -0
package/skills/yeet/agents/openai.yaml +6 -0
package/skills/yeet/assets/yeet-small.svg +3 -0
package/skills/yeet/assets/yeet.png +0 -0

package/skills/mon/scripts/mon_spend_review.py ADDED Viewed

@@ -0,0 +1,458 @@
+#!/usr/bin/env python3
+"""Review Monarch transactions for shared-expense reimbursements.
+This script is intentionally local-only: it reads JSON produced by `mon
+transactions --json` and emits a compact spending review. It never reads tokens
+or calls Monarch directly.
+"""
+from __future__ import annotations
+import argparse
+import datetime as dt
+import json
+import re
+import sys
+from dataclasses import dataclass, field
+from pathlib import Path
+from typing import Any
+SPEND_CATEGORIES = {
+    "Restaurants & Bars",
+    "Groceries",
+    "Entertainment & Recreation",
+    "Travel & Vacation",
+    "Shopping",
+    "Coffee Shops",
+    "Public Transit",
+    "Internet & Cable",
+    "Insurance",
+    "Office Supplies & Expenses",
+    "Miscellaneous",
+    "Cash & ATM",
+}
+NON_CONSUMPTION_OUTFLOW = {
+    "Transfer",
+    "Credit Card Payment",
+    "Rent",
+}
+NON_REIMBURSEMENT_INFLOW = {
+    "Paychecks",
+    "Interest",
+    "Credit Card Payment",
+}
+DEFAULT_OWN_NAMES = {
+    "yupei tian",
+}
+INFRA_NAMES = {
+    "carnegie mellon",
+    "chase",
+    "discover",
+}
+@dataclass
+class Tx:
+    id: str
+    date: dt.date
+    amount: float
+    category: str
+    merchant: str
+    raw_name: str
+    account: str
+    pending: bool
+    hidden: bool
+    @classmethod
+    def from_monarch(cls, value: dict[str, Any]) -> "Tx":
+        category = (value.get("category") or {}).get("name") or ""
+        merchant = (value.get("merchant") or {}).get("name") or ""
+        raw_name = value.get("plaidName") or ""
+        account = (value.get("account") or {}).get("displayName") or ""
+        return cls(
+            id=str(value.get("id") or ""),
+            date=dt.date.fromisoformat(value["date"]),
+            amount=float(value["amount"]),
+            category=category,
+            merchant=merchant,
+            raw_name=raw_name,
+            account=account,
+            pending=bool(value.get("pending")),
+            hidden=bool(value.get("hideFromReports")),
+        )
+    @property
+    def name(self) -> str:
+        return self.merchant or self.raw_name
+    @property
+    def abs_amount(self) -> float:
+        return abs(self.amount)
+@dataclass
+class Event:
+    date: dt.date
+    expenses: list[Tx]
+    reimbursements: list[Tx] = field(default_factory=list)
+    @property
+    def gross(self) -> float:
+        return sum(tx.abs_amount for tx in self.expenses)
+    @property
+    def reimbursement_total(self) -> float:
+        return sum(tx.amount for tx in self.reimbursements)
+    @property
+    def net(self) -> float:
+        return self.gross - self.reimbursement_total
+    @property
+    def confidence(self) -> str:
+        if not self.reimbursements:
+            return "none"
+        ratio = self.reimbursement_total / self.gross if self.gross else 0
+        if 0.55 <= ratio <= 1.05:
+            return "high"
+        if 0.25 <= ratio < 0.55 or 1.05 < ratio <= 1.25:
+            return "medium"
+        return "low"
+def money(value: float) -> str:
+    return f"${value:,.2f}"
+def load_transactions(path: Path) -> list[Tx]:
+    data = json.loads(path.read_text())
+    values = data.get("allTransactions", {}).get("results", [])
+    txs = [Tx.from_monarch(value) for value in values]
+    seen: set[str] = set()
+    unique: list[Tx] = []
+    for tx in txs:
+        key = tx.id or f"{tx.date}:{tx.amount}:{tx.name}:{tx.account}"
+        if key in seen:
+            continue
+        seen.add(key)
+        unique.append(tx)
+    return sorted(unique, key=lambda tx: (tx.date, tx.amount, tx.name))
+def compact_name(tx: Tx) -> str:
+    text = tx.raw_name or tx.merchant
+    match = re.search(r"zelle payment from\s+(.+?)(?:\s+[A-Z0-9]{8,}|\s+\d{8,}|$)", text, re.I)
+    if match:
+        return f"Zelle from {match.group(1).strip()}"
+    match = re.search(r"payment from\s+(.+)", text, re.I)
+    if match:
+        return f"Payment from {match.group(1).strip()}"
+    return tx.name
+def text_has_any(text: str, needles: set[str]) -> bool:
+    low = text.lower()
+    return any(needle in low for needle in needles)
+def is_visible_settled(tx: Tx, include_pending: bool) -> bool:
+    if tx.hidden:
+        return False
+    if tx.pending and not include_pending:
+        return False
+    return True
+def is_consumption_outflow(tx: Tx, include_pending: bool) -> bool:
+    if not is_visible_settled(tx, include_pending) or tx.amount >= 0:
+        return False
+    if tx.category in NON_CONSUMPTION_OUTFLOW:
+        return False
+    return tx.category in SPEND_CATEGORIES or bool(tx.category)
+def is_reimbursement_candidate(tx: Tx, include_pending: bool, own_names: set[str]) -> bool:
+    if not is_visible_settled(tx, include_pending) or tx.amount <= 0:
+        return False
+    if tx.category in NON_REIMBURSEMENT_INFLOW:
+        return False
+    full_text = f"{tx.merchant} {tx.raw_name}"
+    if text_has_any(full_text, own_names | INFRA_NAMES):
+        return False
+    if tx.category in {"Transfer", "Other Income", "Business Income", "Shopping"}:
+        return True
+    low = full_text.lower()
+    return "zelle payment from" in low or "payment from" in low
+def is_merchant_credit(tx: Tx, include_pending: bool, own_names: set[str]) -> bool:
+    if not is_visible_settled(tx, include_pending) or tx.amount <= 0:
+        return False
+    if is_reimbursement_candidate(tx, include_pending, own_names):
+        return False
+    if tx.category in NON_REIMBURSEMENT_INFLOW:
+        return False
+    return tx.category in SPEND_CATEGORIES
+def build_events(txs: list[Tx], min_anchor: float, include_pending: bool) -> list[Event]:
+    by_day: dict[dt.date, list[Tx]] = {}
+    for tx in txs:
+        if is_consumption_outflow(tx, include_pending):
+            by_day.setdefault(tx.date, []).append(tx)
+    events: list[Event] = []
+    for day, day_txs in sorted(by_day.items()):
+        anchors = [
+            tx
+            for tx in day_txs
+            if tx.category in {"Restaurants & Bars", "Groceries", "Entertainment & Recreation", "Travel & Vacation", "Shopping"}
+            and tx.abs_amount >= min_anchor
+        ]
+        if not anchors:
+            continue
+        events.append(Event(date=day, expenses=sorted(anchors, key=lambda tx: tx.amount)))
+    return events
+def assign_reimbursements(events: list[Event], reimbursements: list[Tx], window_days: int) -> None:
+    for reimb in sorted(reimbursements, key=lambda tx: (tx.date, tx.amount)):
+        candidates = [
+            event
+            for event in events
+            if dt.timedelta(days=0) <= reimb.date - event.date <= dt.timedelta(days=window_days)
+        ]
+        if not candidates:
+            continue
+        candidates.sort(
+            key=lambda event: (
+                (reimb.date - event.date).days,
+                abs(event.net - reimb.amount),
+                -event.gross,
+            )
+        )
+        chosen = candidates[0]
+        if chosen.reimbursement_total + reimb.amount <= chosen.gross * 1.25:
+            chosen.reimbursements.append(reimb)
+def summarize(txs: list[Tx], events: list[Event], reimbursements: list[Tx], merchant_credits: list[Tx], include_pending: bool) -> dict[str, Any]:
+    consumption = [tx for tx in txs if is_consumption_outflow(tx, include_pending)]
+    raw_spend = sum(tx.abs_amount for tx in consumption)
+    assigned_reimbursements = {tx.id for event in events for tx in event.reimbursements}
+    assigned_total = sum(tx.amount for tx in reimbursements if tx.id in assigned_reimbursements)
+    merchant_credit_total = sum(tx.amount for tx in merchant_credits)
+    adjusted = raw_spend - assigned_total
+    cash_adjusted = adjusted - merchant_credit_total
+    by_category: dict[str, float] = {}
+    for tx in consumption:
+        by_category[tx.category] = by_category.get(tx.category, 0.0) + tx.abs_amount
+    reclassification_rows: list[dict[str, Any]] = []
+    category_offsets: dict[str, float] = {}
+    for event in events:
+        for reimb in event.reimbursements:
+            for category, amount in allocate_reimbursement_to_categories(event, reimb.amount):
+                category_offsets[category] = category_offsets.get(category, 0.0) + amount
+                reclassification_rows.append(
+                    {
+                        "date": reimb.date.isoformat(),
+                        "source": compact_name(reimb),
+                        "originalCategory": reimb.category,
+                        "assignedEventDate": event.date.isoformat(),
+                        "assignedCategory": category,
+                        "signedAmount": round(-amount, 2),
+                        "eventGross": round(event.gross, 2),
+                        "eventMerchants": [tx.name for tx in event.expenses],
+                    }
+                )
+    category_after = {
+        category: amount - category_offsets.get(category, 0.0)
+        for category, amount in by_category.items()
+    }
+    unresolved_large = [
+        tx
+        for tx in consumption
+        if tx.abs_amount >= 80 and all(tx not in event.expenses for event in events if event.reimbursements)
+    ]
+    return {
+        "transactionCount": len(txs),
+        "rawConsumptionSpend": round(raw_spend, 2),
+        "assignedReimbursements": round(assigned_total, 2),
+        "merchantCreditTotal": round(merchant_credit_total, 2),
+        "adjustedConsumptionSpend": round(adjusted, 2),
+        "cashImpactAfterCredits": round(cash_adjusted, 2),
+        "categorySpend": [
+            {"category": category, "amount": round(amount, 2)}
+            for category, amount in sorted(by_category.items(), key=lambda item: -item[1])
+        ],
+        "categorySpendAfterReimbursements": [
+            {
+                "category": category,
+                "raw": round(by_category[category], 2),
+                "aaOffset": round(-category_offsets.get(category, 0.0), 2),
+                "adjusted": round(amount, 2),
+            }
+            for category, amount in sorted(category_after.items(), key=lambda item: -item[1])
+        ],
+        "reclassificationLedger": reclassification_rows,
+        "events": [
+            {
+                "date": event.date.isoformat(),
+                "gross": round(event.gross, 2),
+                "reimbursements": round(event.reimbursement_total, 2),
+                "net": round(event.net, 2),
+                "confidence": event.confidence,
+                "expenses": [
+                    {
+                        "amount": round(tx.abs_amount, 2),
+                        "category": tx.category,
+                        "merchant": tx.name,
+                    }
+                    for tx in event.expenses
+                ],
+                "matchedInflows": [
+                    {
+                        "date": tx.date.isoformat(),
+                        "amount": round(tx.amount, 2),
+                        "category": tx.category,
+                        "source": compact_name(tx),
+                    }
+                    for tx in event.reimbursements
+                ],
+            }
+            for event in events
+            if event.reimbursements
+        ],
+        "unresolvedLargeOutflows": [
+            {
+                "date": tx.date.isoformat(),
+                "amount": round(tx.abs_amount, 2),
+                "category": tx.category,
+                "merchant": tx.name,
+            }
+            for tx in unresolved_large
+        ],
+        "merchantCredits": [
+            {
+                "date": tx.date.isoformat(),
+                "amount": round(tx.amount, 2),
+                "category": tx.category,
+                "merchant": tx.name,
+                "pending": tx.pending,
+            }
+            for tx in merchant_credits
+        ],
+    }
+def allocate_reimbursement_to_categories(event: Event, amount: float) -> list[tuple[str, float]]:
+    by_category: dict[str, float] = {}
+    for tx in event.expenses:
+        by_category[tx.category] = by_category.get(tx.category, 0.0) + tx.abs_amount
+    if len(by_category) == 1:
+        category = next(iter(by_category))
+        return [(category, round(amount, 2))]
+    allocations: list[tuple[str, float]] = []
+    remaining = round(amount, 2)
+    categories = sorted(by_category.items(), key=lambda item: -item[1])
+    for index, (category, gross) in enumerate(categories):
+        if index == len(categories) - 1:
+            allocated = remaining
+        else:
+            allocated = round(amount * gross / event.gross, 2)
+            remaining = round(remaining - allocated, 2)
+        allocations.append((category, allocated))
+    return allocations
+def render_markdown(summary: dict[str, Any]) -> str:
+    lines = [
+        "# Monarch Spend Review",
+        "",
+        f"- Transactions: {summary['transactionCount']}",
+        f"- Raw consumption spend: {money(summary['rawConsumptionSpend'])}",
+        f"- Matched reimbursements / AA: {money(summary['assignedReimbursements'])}",
+        f"- Adjusted consumption spend: {money(summary['adjustedConsumptionSpend'])}",
+        f"- Merchant credits / refunds, listed separately: {money(summary['merchantCreditTotal'])}",
+        f"- Cash impact after credits: {money(summary['cashImpactAfterCredits'])}",
+        "",
+        "## Matched Shared-Spend Events",
+    ]
+    for event in summary["events"]:
+        lines.append(
+            f"- {event['date']}: gross {money(event['gross'])}, reimbursed {money(event['reimbursements'])}, net {money(event['net'])}, confidence {event['confidence']}"
+        )
+        for tx in event["expenses"]:
+            lines.append(f"  - expense: {money(tx['amount'])} {tx['category']} at {tx['merchant']}")
+        for tx in event["matchedInflows"]:
+            lines.append(f"  - inflow: {money(tx['amount'])} on {tx['date']} from {tx['source']} ({tx['category']})")
+    lines.extend(["", "## Category Spend Before Reimbursements"])
+    for row in summary["categorySpend"]:
+        lines.append(f"- {row['category']}: {money(row['amount'])}")
+    lines.extend(["", "## Category Spend After AA Reclassification"])
+    for row in summary["categorySpendAfterReimbursements"]:
+        lines.append(
+            f"- {row['category']}: raw {money(row['raw'])}, AA offset {money(row['aaOffset'])}, adjusted {money(row['adjusted'])}"
+        )
+    lines.extend(["", "## Reclassification Ledger"])
+    for row in summary["reclassificationLedger"]:
+        merchants = ", ".join(row["eventMerchants"])
+        lines.append(
+            f"- {row['date']}: {money(row['signedAmount'])} from {row['source']} -> {row['assignedCategory']} for {row['assignedEventDate']} event ({merchants})"
+        )
+    lines.extend(["", "## Unresolved Large Outflows"])
+    for tx in summary["unresolvedLargeOutflows"]:
+        lines.append(f"- {tx['date']}: {money(tx['amount'])} {tx['category']} at {tx['merchant']}")
+    if summary["merchantCredits"]:
+        lines.extend(["", "## Merchant Credits / Refunds"])
+        for tx in summary["merchantCredits"]:
+            pending = " pending" if tx["pending"] else ""
+            lines.append(f"- {tx['date']}: {money(tx['amount'])} {tx['category']} from {tx['merchant']}{pending}")
+    return "\n".join(lines) + "\n"
+def main() -> int:
+    parser = argparse.ArgumentParser(description="Review Monarch transaction JSON for shared-spend reimbursements.")
+    parser.add_argument("--input", required=True, type=Path, help="JSON file from `mon transactions --json`.")
+    parser.add_argument("--format", choices=["json", "markdown"], default="markdown")
+    parser.add_argument("--min-anchor", type=float, default=45.0, help="Minimum outflow amount to treat as a shared-spend anchor.")
+    parser.add_argument("--window-days", type=int, default=3, help="Days after an expense to match incoming reimbursements.")
+    parser.add_argument("--include-pending", action="store_true", help="Include pending transactions.")
+    parser.add_argument("--own-name", action="append", default=[], help="Name fragment to treat as own/internal transfer. Repeatable.")
+    args = parser.parse_args()
+    own_names = DEFAULT_OWN_NAMES | {value.lower() for value in args.own_name}
+    txs = load_transactions(args.input)
+    reimbursements = [tx for tx in txs if is_reimbursement_candidate(tx, args.include_pending, own_names)]
+    merchant_credits = [tx for tx in txs if is_merchant_credit(tx, args.include_pending, own_names)]
+    events = build_events(txs, args.min_anchor, args.include_pending)
+    assign_reimbursements(events, reimbursements, args.window_days)
+    summary = summarize(txs, events, reimbursements, merchant_credits, args.include_pending)
+    if args.format == "json":
+        print(json.dumps(summary, indent=2, sort_keys=True))
+    else:
+        sys.stdout.write(render_markdown(summary))
+    return 0
+if __name__ == "__main__":
+    raise SystemExit(main())

package/skills/ocr/SKILL.md ADDED Viewed

@@ -0,0 +1,136 @@
+---
+name: ocr
+description: Extract text, Markdown, tables, formulas, and structured content from PDFs, scanned documents, screenshots, and images using the best available local or cloud OCR route. Use when Codex needs OCR, PDF text-layer extraction, MinerU local or official API parsing, VLM document parsing, table/formula extraction, scanned PDF handling, or when deciding whether a PDF should be read directly, OCRed, parsed by MinerU, or uploaded to a cloud API.
+---
+# OCR
+Use this skill to choose and run the right document extraction path instead of defaulting to OCR for every PDF.
+## Quick Start
+Use the router first for PDFs and files where the best path is unclear:
+```bash
+ocr-doc /path/to/file.pdf --profile-only
+```
+The installed wrapper is:
+```bash
+/Users/yupeit/bin/ocr-doc
+```
+It runs:
+```bash
+/Users/yupeit/dev/skills/skills/ocr/scripts/ocr-router
+```
+The wrapper uses the skill-local virtualenv at `.venv/` when present. If the venv is missing, recreate it:
+```bash
+python3 -m venv /Users/yupeit/dev/skills/skills/ocr/.venv
+/Users/yupeit/dev/skills/skills/ocr/.venv/bin/python -m pip install requests pymupdf pyyaml
+```
+## Decision Tree
+1. For screenshots or single images, use the offline Apple Vision CLI:
+```bash
+ocr /path/to/image.png
+ocr capture
+ocr fullscreen
+```
+2. For a PDF with a strong text layer and no need for formulas/tables/layout JSON, use native text extraction:
+```bash
+ocr-doc file.pdf --engine native-text
+```
+Use `--show-profile` when you want the profile printed and extraction to continue. Use `--profile-only` when you only want the recommendation.
+3. For PDFs where tables, formulas, layout, page markers, or image assets matter, prefer MinerU:
+```bash
+ocr-doc file.pdf --engine mineru-local --require-structure --need-formulas --need-tables
+```
+4. For non-confidential documents where local MinerU is too slow or unavailable, use the official MinerU API only after explicit upload permission:
+```bash
+ocr-doc file.pdf --engine mineru-api --allow-cloud --model-version vlm
+```
+5. For small non-confidential documents needing a quick agent-friendly Markdown result, use MinerU Agent API:
+```bash
+ocr-doc file.pdf --engine mineru-agent --allow-cloud
+```
+6. For images/PDFs where semantic visual understanding is more important than deterministic layout, use Gemini VLM:
+```bash
+ocr-doc file.pdf --engine gemini-vlm --allow-cloud
+```
+## Cloud Safety
+Never upload confidential, private, school-restricted, client, credential-bearing, or unknown-sensitivity documents to cloud OCR.
+The router refuses cloud upload unless either:
+- The user interactively types `UPLOAD`.
+- The caller passes `--allow-cloud`, which must only be used after the user explicitly allows cloud upload for that document.
+Use `--no-cloud` when confidentiality is unknown:
+```bash
+ocr-doc file.pdf --no-cloud
+```
+MinerU official API credentials are read from `MINERU_API_TOKEN` / `MINERU_TOKEN`, then Keychain service `codex.mineru`, account `credential`. Never print the token.
+## MinerU Local Lessons
+For long technical books with a real PDF text layer, do not run full OCR blindly. Use MinerU `pipeline + txt` as the base when formulas/tables matter:
+```bash
+uvx 'mineru[all]' -p file.pdf -o out -b pipeline -m txt -l en -f true -t true --image-analysis false
+```
+For full textbooks or long technical PDFs, do not use a single `ocr-doc --engine mineru-local` whole-book run as the default. Use chunked local MinerU scripts or an equivalent chunked workflow:
+```bash
+python3 /Users/yupeit/dev/learn/quant/scripts/run_mineru_chunks.py \
+  --pdf file.pdf \
+  --output-dir out_chunks \
+  --page-count PAGE_COUNT \
+  --chunk-size 64 \
+  --backend pipeline \
+  --method txt \
+  --lang en \
+  --formula \
+  --table \
+  --no-image-analysis \
+  --timeout-seconds 86400
+```
+Then merge `*_content_list.json` chunks with the existing merge script. This is aligned with the prior John Hull textbook workflow: `pipeline + txt` for the whole book, then `vlm-auto-engine` only on selected formula-heavy pages for overlay.
+For cloud MinerU on a non-confidential born-digital textbook, override the generic cloud defaults: start with `--model-version pipeline --no-is-ocr --enable-formula --enable-table`, then compare `vlm` on selected difficult pages. Do not default a full textbook to `vlm + OCR` without a cost/quality reason.
+For large documents, chunk the run and keep logs; local MinerU may wait for the final result before writing the user-facing output. A previous 880-page technical book worked best with 64-page chunks and long timeouts.
+For map-like or diagram-heavy pages, MinerU may output only an image reference. If visible labels are the goal, compare native text-layer extraction and Apple Vision OCR.
+Apple Vision can fail inside a restricted Codex sandbox with a Foundation/Vision error. In that case rerun through the approved local entrypoint `/Users/yupeit/bin/ocr-doc` or `/Users/yupeit/bin/ocr` outside the sandbox.
+Read [references/local-ocr-best-practices.md](references/local-ocr-best-practices.md) before doing long or quality-sensitive extraction.
+## References
+- Read [references/mineru-api.md](references/mineru-api.md) before changing MinerU official/agent API calls.
+- Read [references/local-ocr-best-practices.md](references/local-ocr-best-practices.md) for local tool choices, PDF shape heuristics, and previous MinerU findings.

package/skills/ocr/agents/openai.yaml ADDED Viewed

@@ -0,0 +1,4 @@
+interface:
+  display_name: "OCR"
+  short_description: "Local and MinerU document OCR routing"
+  default_prompt: "Use $ocr to extract text or Markdown from this PDF using the best local or MinerU workflow."