@agentutility/mcp-mediakit 0.11.3 → 0.11.7
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +47 -47
- package/dist/tools.generated.js +58 -50
- package/package.json +1 -1
package/README.md
CHANGED
|
@@ -52,71 +52,71 @@ USDC on Base contract: `0x833589fCD6eDb6E08f4c7C32D4f71b54bdA02913`
|
|
|
52
52
|
|---|---|
|
|
53
53
|
| `add-watermark` | (0.02 USDC/call) Add watermark to PDF, image, or video. Applies text or image overlays with configurable position, opacity, font, rotation, and margin. CloudConvert backend for PDFs, PNG/JPG/GIF, and MP4/MOV/WEBM. |
|
|
54
54
|
| `audio-loudnorm` | (0.02 USDC/call) Audio loudness normalizer (EBU R128 LUFS). Podcast / Spotify / YouTube target presets (-23 / -16 / -14). Two-pass dynamic mode. Returns hosted MP3. |
|
|
55
|
-
| `audio-transcribe` | (0.01 USDC/call)
|
|
55
|
+
| `audio-transcribe` | (0.01 USDC/call) Transcribes audio to text with whisper-large-v3. Server-side fetches the audio URL (max 25 MB), relays it to Venice's audio/transcriptions endpoint, and returns the transcript with detected language, duration, and per-segment timestamps when response_format='verbose_json' (default). Also supports raw text, SRT, and VTT outputs. Use it as a speech-to-text or multi-language ASR endpoint with OpenAI Whisper API compatibility. |
|
|
56
56
|
| `compress-pdf` | (0.005 USDC/call) PDF compressor / PDF size reducer. CloudConvert optimize task. Profiles: web (default), print, archive, mrc (scanned), max. Reports % saved. |
|
|
57
57
|
| `convert-html-to-markdown` | (0.005 USDC/call) Convert HTML to Markdown. Strips nav, scripts, ads, and other boilerplate. Preserves headings, lists, tables, code blocks, links, and images. Accepts raw HTML or a URL. Returns clean Markdown ideal for LLM context windows or RAG ingestion. |
|
|
58
|
-
| `convert-pdf` | (0.
|
|
59
|
-
| `csv-to-ics` | (0.01 USDC/call) CSV
|
|
60
|
-
| `csv-to-jsonl` | (0.02 USDC/call)
|
|
61
|
-
| `doc-to-json` | (0.10 USDC/call)
|
|
62
|
-
| `excel-to-csv` | (0.005 USDC/call) Excel (.xlsx / .xls) to CSV
|
|
63
|
-
| `excel-to-google-sheets` | (0.005 USDC/call)
|
|
64
|
-
| `extract-tables` | (0.10 USDC/call)
|
|
65
|
-
| `html-markdown` | (0.005 USDC/call)
|
|
58
|
+
| `convert-pdf` | (0.0025 USDC/call) Converts PDFs to Markdown, HTML, JSON, or structured text with the Datalab Marker AI pipeline, preserving headings, tables, equations, and multi-column flow. Note: this endpoint outputs text-format conversions only. For PDF to JPG/PNG use pdf-to-jpg, for PDF merge use pdf-merge, for PDF split use pdf-split, for PDF compression use compress-pdf, and for PDF to Word route through this then a markdown to docx step. 30 pages max. Use it as a general PDF converter. |
|
|
59
|
+
| `csv-to-ics` | (0.01 USDC/call) Converts a CSV of events into an RFC 5545 compliant ICS calendar file (VCALENDAR/VEVENT) for Google Calendar, Outlook, and Apple Calendar import. Auto-detects column mapping (summary, date, time, location, description) or accepts an explicit mapping; handles all-day and timed events, configurable default duration, and custom delimiters. Up to 1000 rows. Use it as a CSV to iCal converter, calendar file generator, or .ics builder. |
|
|
60
|
+
| `csv-to-jsonl` | (0.02 USDC/call) Converts CSV or TSV data into JSON, JSONL/NDJSON, or column-oriented arrays. RFC 4180 parser with type inference (booleans, integers, floats, ISO dates, null tokens) and configurable delimiter, quote char, header presence, column rename, drop columns, and whitespace trim. Outputs newline-delimited JSON (jsonl), a JSON array (json), or column-oriented arrays (ndarray). Use it as a CSV to JSON, CSV to NDJSON, TSV to JSON, or spreadsheet to JSON converter, data pipeline preprocessor, or LLM training-data prep step. |
|
|
61
|
+
| `doc-to-json` | (0.10 USDC/call) Converts any document (PDF, DOCX, PPT, XLSX, or image) into structured JSON matching a caller-supplied schema. The file is converted to plain text via CloudConvert, then Morpheus mistral-31-24b runs with strict json_schema response_format to force the output shape. OCR-aware for images and scanned PDFs. Use it as a file parser with schema, invoice extractor, resume parser, contract extractor, RFP parser, receipt extractor, or form data extraction tool. |
|
|
62
|
+
| `excel-to-csv` | (0.005 USDC/call) Converts Excel workbooks (.xlsx / .xls) to CSV, TSV, or JSON with multi-sheet handling, returning each sheet by name. Adjacent to the 'convert excel to google sheets' demand cluster. Use it as an Excel to CSV or spreadsheet converter. |
|
|
63
|
+
| `excel-to-google-sheets` | (0.005 USDC/call) Converts Excel files for import into Google Sheets. Outputs CSV that imports directly via File > Import > Upload (or paste-into-cells), with multi-sheet handling, encoding control, and quote style. Same handler as excel-to-csv / xlsx-to-csv under a Google-Sheets-named slug. Use it as an XLSX to Google Sheets, Numbers to Google Sheets, or Excel to gsheet spreadsheet import tool. |
|
|
64
|
+
| `extract-tables` | (0.10 USDC/call) Detects and extracts every table from a PDF document, returning structured JSON or CSV per table. 30 pages max via Datalab Marker. Use it as a table extractor, PDF-to-CSV converter, or spreadsheet-from-PDF tool. |
|
|
65
|
+
| `html-markdown` | (0.005 USDC/call) Converts raw HTML or a URL into clean Markdown, preserving headings, lists, tables, code blocks, links, and images. Use it as an HTML to Markdown API, webpage to Markdown converter, or clean page text extractor. |
|
|
66
66
|
| `html-to-markdown` | (0.005 USDC/call) Convert HTML to Markdown. Strips nav, scripts, ads, and other boilerplate. Preserves headings, lists, tables, code blocks, links, and images. Accepts raw HTML or a URL — when given a URL the server fetches the page and runs the converter on the body. Returns clean Markdown plus title and char counts. Ideal for LLM context windows, RAG ingestion, and content archival. |
|
|
67
|
-
| `html-to-pdf` | (0.08 USDC/call)
|
|
67
|
+
| `html-to-pdf` | (0.08 USDC/call) Converts a URL to PDF with CloudConvert capture-website; renders JS. Configurable page size, orientation, margins, wait conditions. Use it for HTML to PDF or webpage screenshot to PDF conversion. |
|
|
68
68
|
| `image-convert` | (0.01 USDC/call) Universal image format converter (PNG, JPG, WEBP, AVIF, GIF, BMP, TIFF, ICO, HEIC, HEIF, PSD, SVG). Optional resize + quality. CloudConvert engine. |
|
|
69
69
|
| `image-format-convert` | (0.01 USDC/call) Image converter. Convert any image between PNG, JPG, WEBP, AVIF, GIF, BMP, TIFF, ICO, HEIC, HEIF, PSD, and SVG. Optional resize and quality. CloudConvert engine. Same backend as image-convert under a more search-friendly slug. |
|
|
70
70
|
| `image-translate` | (0.02 USDC/call) Image translator. Translate text in any image — signs, menus, screenshots, photos of documents — by running vision-OCR plus Venice translation. Returns the detected source text plus the target-language translation. Optional preserve_layout mode keeps the original spatial arrangement. |
|
|
71
|
-
| `image-upscale` | (0.02 USDC/call)
|
|
71
|
+
| `image-upscale` | (0.02 USDC/call) Upscales an image 2x or 4x via Venice's image/upscale endpoint (default model: venice-sd35). Returns a permanent fal-hosted URL. Use it as a super-resolution tool to sharpen or enlarge an image without loss. |
|
|
72
72
|
| `image-watermark` | (0.02 USDC/call) Image watermark / add text or logo watermark to image files. Also supports PDFs and videos through the same CloudConvert-backed watermark handler. Configurable opacity, position, rotation, font, and margin. |
|
|
73
|
-
| `json-yaml` | (0.
|
|
73
|
+
| `json-yaml` | (0.005 USDC/call) Converts between JSON and YAML in either direction. Auto-detects input format from the first non-whitespace byte ('{'/'[' = JSON, otherwise YAML), parses, and re-serializes in the requested target format; pretty-print and indent are configurable. Use it as a format transcoder, config-file translator, Helm values prepper, Kubernetes manifest converter, GitHub Actions workflow YAML helper, or round-trip parser. |
|
|
74
74
|
| `logo-detect` | (0.03 USDC/call) Brand logo detection / brand recognition in images. Venice qwen3-5-9b vision LLM. Returns detected brands with confidence, location, evidence (wordmark/logomark/lockup/color_scheme), element_type, and supports hint_brands. |
|
|
75
|
-
| `merge-pdf` | (0.01 USDC/call)
|
|
76
|
-
| `movie-database` | (0.01 USDC/call)
|
|
77
|
-
| `movie-database-api` | (0.01 USDC/call)
|
|
78
|
-
| `movie-info` | (0.01 USDC/call)
|
|
79
|
-
| `mp4-to-mp3` | (0.
|
|
80
|
-
| `mp4-to-mp3-api` | (0.
|
|
81
|
-
| `mp4-to-mp3-converter` | (0.
|
|
82
|
-
| `ocr` | (0.
|
|
75
|
+
| `merge-pdf` | (0.01 USDC/call) Combines 2-50 input PDFs from URLs into one PDF, preserving bookmarks. CloudConvert engine. Use it as a PDF merger to concatenate PDF files or join multiple PDFs into one. |
|
|
76
|
+
| `movie-database` | (0.01 USDC/call) Finds movies or TV shows by title, with optional year and region, and returns release year, poster, overview, and language. Uses TMDB when TMDB_API_KEY is configured (adding movie ratings, popularity, and an optional streaming provider summary) and falls back to no-key Wikidata metadata otherwise. Use it as a movie database API, TV database API, film lookup, title metadata search, or TMDB / Wikidata movie lookup. |
|
|
77
|
+
| `movie-database-api` | (0.01 USDC/call) Searches movies and TV shows by title and optional year, returning release date, rating, popularity, overview, poster URLs, TMDB links, and optional provider summary. Use it as a movie database API, TV database API, or TMDB metadata lookup. |
|
|
78
|
+
| `movie-info` | (0.01 USDC/call) Looks up movie and TV metadata: title, release year, rating, overview, poster, and optional streaming providers. Same TMDB-backed response as movie-database with a compact search slug for agents. Use it as a movie info API, film metadata lookup, or TV metadata lookup. |
|
|
79
|
+
| `mp4-to-mp3` | (0.02 USDC/call) Converts MP4, MOV, WebM, MKV, AVI, M4V, and FLV video files to MP3 via CloudConvert, with selectable bitrate (96/128/192 kbps). 60-min / 500MB max. Use it as an MP4 to MP3 converter, video to audio converter, or audio extractor API. |
|
|
80
|
+
| `mp4-to-mp3-api` | (0.02 USDC/call) Converts MP4, MOV, WEBM, MKV, AVI, M4V, or FLV URLs into hosted MP3 output with selectable 96, 128, or 192 kbps bitrate. CloudConvert backend, 60-min / 500MB max. Use it as an MP4 to MP3 API, video to MP3 converter, or extract-audio-from-video endpoint. |
|
|
81
|
+
| `mp4-to-mp3-converter` | (0.02 USDC/call) Converts MP4 video URLs into hosted MP3 files with selectable bitrate; also accepts MOV, WEBM, MKV, AVI, M4V, or FLV URLs. Same CloudConvert-backed handler as mp4-to-mp3. Use it as an MP4 to MP3 converter API, to extract audio from MP4, or for video-to-MP3 conversion. |
|
|
82
|
+
| `ocr` | (0.0025 USDC/call) Runs OCR on scanned PDFs and image-based documents, returning clean Markdown or plain text. The Datalab Marker engine preserves layout, tables, and math. 30 pages max. Use it as an OCR API, optical character recognition service, scanned document extractor, or image-PDF to text and scanned PDF OCR tool. |
|
|
83
83
|
| `office-to-pdf` | (0.05 USDC/call) Office to PDF converter — DOCX/DOC, XLSX/XLS, PPTX/PPT, ODT/ODS/ODP, RTF, TXT, CSV, EPUB, MD, HTML, Apple Pages/Numbers/Keynote to PDF. CloudConvert engine. |
|
|
84
|
-
| `pdf-compress` | (0.005 USDC/call)
|
|
85
|
-
| `pdf-extract-tables` | (0.10 USDC/call)
|
|
86
|
-
| `pdf-merge` | (0.01 USDC/call)
|
|
87
|
-
| `pdf-parser-api` | (0.
|
|
84
|
+
| `pdf-compress` | (0.005 USDC/call) Shrinks PDFs to a smaller size for email or the web, with three quality levels: ebook (lowest, web-quality), printer (medium), prepress (highest, archival). CloudConvert engine. Use it as a PDF compressor or PDF size reducer. |
|
|
85
|
+
| `pdf-extract-tables` | (0.10 USDC/call) Extracts every table from a PDF, digital or scanned, and returns row-by-column text matrices page-by-page. AI + OCR pipeline with optional cell bounding boxes for downstream layout reconstruction and an optional page_range filter ('1-5', '3', '1,3,5'). Handles merged headers, multi-page financial statements, balance sheets, lab results, scanned reports. 30 pages max. Sibling of pdf-to-markdown using the same Datalab backend, but pre-parsed to tables only. Use it as a PDF table extractor, scanned-table parser, financial-table OCR, multi-page table consolidator, or Datalab Marker tables endpoint. |
|
|
86
|
+
| `pdf-merge` | (0.01 USDC/call) Merges 2-50 PDFs from URLs into a single PDF, preserving bookmarks. CloudConvert engine. Use it as a PDF merger, PDF combiner, or PDF concatenator. |
|
|
87
|
+
| `pdf-parser-api` | (0.0025 USDC/call) Parses a public PDF URL into Markdown, HTML, or JSON blocks with layout-aware text, headings, tables, and equations. Datalab Marker backend, 30 pages max. Use it as a PDF parser API, PDF content extractor, or scanned PDF OCR API. |
|
|
88
88
|
| `pdf-split` | (0.04 USDC/call) PDF splitter / PDF page extractor. Two modes: page ranges (['1-3','5','7-end']) or one PDF per source page. CloudConvert engine. |
|
|
89
|
-
| `pdf-table-extract` | (0.10 USDC/call)
|
|
90
|
-
| `pdf-table-extractor` | (0.10 USDC/call)
|
|
91
|
-
| `pdf-text-extractor` | (0.
|
|
92
|
-
| `pdf-to-jpg` | (0.10 USDC/call) PDF to JPG
|
|
93
|
-
| `pdf-to-markdown` | (0.
|
|
94
|
-
| `pdf-to-markdown-api` | (0.
|
|
95
|
-
| `pdf-to-text` | (0.
|
|
96
|
-
| `pdf-to-text-api` | (0.
|
|
97
|
-
| `pdf-watermark` | (0.02 USDC/call)
|
|
89
|
+
| `pdf-table-extract` | (0.10 USDC/call) Extracts tables from digital or scanned PDFs, returning row/column matrices, CSV output, page numbers, and optional cell boxes. Same Datalab Marker-backed handler as pdf-extract-tables. Use it as a PDF table extract API, to extract tables from PDF, or as a PDF to CSV table parser. |
|
|
90
|
+
| `pdf-table-extractor` | (0.10 USDC/call) Finds tables in digital or scanned PDFs and returns row-by-column matrices, page numbers, and optional cell bounding boxes. Datalab Marker backend, 30 pages max. Use it as a PDF table extractor or scanned-table OCR. |
|
|
91
|
+
| `pdf-text-extractor` | (0.0025 USDC/call) Extracts clean Markdown, HTML, or structured JSON from digital or scanned PDFs while preserving reading order, tables, and equations. Datalab Marker backend, 30 pages max. Use it as a PDF text extractor, PDF to text API, or OCR PDF reader. |
|
|
92
|
+
| `pdf-to-jpg` | (0.10 USDC/call) Converts a PDF to JPG, PNG, or WEBP images, rendering every page at configurable DPI (36-600) and returning one image URL per page. CloudConvert backend. |
|
|
93
|
+
| `pdf-to-markdown` | (0.0025 USDC/call) Converts digital or scanned PDFs to clean Markdown with AI-powered, layout-aware extraction on the Datalab Marker engine. Preserves headings, tables, equations (LaTeX), bulleted lists, and multi-column flow; outputs Markdown (default), HTML, or structured JSON with per-page blocks. 30 pages max. Use it as a PDF parser, PDF to text converter, OCR PDF reader, extract-tables-from-PDF tool, equation-aware PDF parser, scanned-PDF OCR, or PDF data extractor. |
|
|
94
|
+
| `pdf-to-markdown-api` | (0.0025 USDC/call) Converts a public PDF URL into clean Markdown, HTML, or structured JSON while preserving headings, tables, equations, and reading order. Datalab Marker backend, 30 pages max. Use it as a PDF to Markdown API, PDF parser API, PDF to text API, or scanned PDF OCR API. |
|
|
95
|
+
| `pdf-to-text` | (0.0025 USDC/call) Extracts text from digital or scanned PDFs, preserving reading order across multi-column layouts with an AI + OCR pipeline (Datalab Marker). Recognizes scanned pages and returns Markdown by default (clean text with structure) or HTML / JSON. 30 pages max. Works as a PDF to plain text converter, pdftotext or pdf2txt replacement, PDF text extractor, scanned PDF OCR, or read/parse PDF content extractor. |
|
|
96
|
+
| `pdf-to-text-api` | (0.0025 USDC/call) Extracts text from digital and scanned PDFs as Markdown, plain text, HTML, or JSON with layout-aware reading order. Same Datalab Marker-backed handler as pdf-to-text. Use it as a PDF to text API, PDF text extractor, or OCR PDF reader. |
|
|
97
|
+
| `pdf-watermark` | (0.02 USDC/call) Overlays a text or image watermark on PDFs, PNG/JPG/GIF images, or MP4/MOV/WEBM videos. Configurable position, opacity, font, rotation, and margin. CloudConvert engine. Use it as a PDF watermark, image watermark, or video watermark tool. |
|
|
98
98
|
| `pdf2md` | (0.20 USDC/call) PDF to Markdown converter. AI PDF extractor. Datalab Marker — OCR + layout-aware. Best-in-class for tables, equations, multi-column. |
|
|
99
99
|
| `receipt-ocr` | (0.01 USDC/call) Receipt OCR. Reads any receipt photo and returns a structured JSON object with vendor, address, date, line items (qty / unit_price / total), subtotal, tax, tip, total, and payment method. Vision-LLM powered. Same backend as receipt-parser under a clearer slug for expense + accounting + reimbursement workflows. |
|
|
100
100
|
| `receipt-parser` | (0.01 USDC/call) Receipt parser. Extracts structured JSON from any receipt image — vendor, address, date, line items (qty / unit_price / total), subtotal, tax, tip, grand total, and payment method. Vision LLM powered. Useful for expense automation, accounting integrations, and travel-claim flows. |
|
|
101
101
|
| `speaker-diarize` | (0.10 USDC/call) Speaker diarization / who-said-what transcription. Whisper v3 + speaker labels. Returns utterances grouped by speaker, plus per-speaker stats (count, seconds, words). 60 min max. |
|
|
102
|
-
| `split-pdf` | (0.04 USDC/call)
|
|
102
|
+
| `split-pdf` | (0.04 USDC/call) Splits a PDF by page ranges into multiple PDFs. CloudConvert engine. Use it as a PDF splitter, PDF page extractor, split-PDF-by-range tool, or PDF to multiple files converter. |
|
|
103
103
|
| `subtitles` | (0.08 USDC/call) SRT / VTT subtitle generator from video or audio. Whisper v3. Word-wrapped, ready for VLC / Premiere / FFmpeg. |
|
|
104
104
|
| `transcribe` | (0.10 USDC/call) Transcribe video to text. Whisper v3 large transcription for any video URL. Auto-detects 90+ languages. Translate-to-English mode. 60-min / 500MB max. |
|
|
105
|
-
| `upscale-image` | (0.
|
|
106
|
-
| `url-to-markdown` | (0.005 USDC/call)
|
|
107
|
-
| `video-summarize` | (0.10 USDC/call)
|
|
105
|
+
| `upscale-image` | (0.02 USDC/call) Upscales images 2x or 4x with AI super-resolution, best for photos and illustrations. Venice image/upscale backend (venice-sd35), re-hosted for a permanent URL. Use it as an AI image upscaler or image enlarger. |
|
|
106
|
+
| `url-to-markdown` | (0.005 USDC/call) Fetches a public URL, strips boilerplate, and returns clean Markdown plus title and character counts. Use it as a URL to Markdown converter, webpage to Markdown tool, or article cleanup API. |
|
|
107
|
+
| `video-summarize` | (0.10 USDC/call) Summarizes videos, podcasts, and lectures in one call: Whisper v3 transcribes, then Mistral summarizes. 5 styles (tldr, bullets, paragraph, executive, chapters); returns summary + transcript; 60 min max. Use it as a video summarizer, podcast summarizer, or lecture notes generator. |
|
|
108
108
|
| `video-thumbnail` | (0.03 USDC/call) Video thumbnail / video frame extractor. First, middle, or last frame as JPG. fal.ai ffmpeg. Fast — no full transcode. |
|
|
109
|
-
| `video-to-audio` | (0.
|
|
110
|
-
| `video-to-mp3` | (0.
|
|
111
|
-
| `video-to-subtitles` | (0.02 USDC/call)
|
|
112
|
-
| `video-to-text` | (0.10 USDC/call)
|
|
113
|
-
| `video-transcribe` | (0.10 USDC/call)
|
|
114
|
-
| `video-trim` | (0.02 USDC/call)
|
|
115
|
-
| `watermark` | (0.02 USDC/call)
|
|
109
|
+
| `video-to-audio` | (0.02 USDC/call) Video to audio extractor / video to audio converter. Extract MP3 audio track from any video URL (MP4, MOV, WEBM, MKV, AVI, M4V, FLV). Selectable bitrate (96/128/192 kbps). Useful for podcast extraction, audio archival, transcription pre-processing. 60-min / 500MB max. CloudConvert backend. |
|
|
110
|
+
| `video-to-mp3` | (0.02 USDC/call) Video to MP3 / extract MP3 audio from video URL. Converts MP4, MOV, WEBM, MKV, AVI, M4V, or FLV into an MP3 file for transcription, podcast clips, and media archival. CloudConvert backend, 60-min / 500MB max. |
|
|
111
|
+
| `video-to-subtitles` | (0.02 USDC/call) Generates subtitles from video with Whisper v3, word-wrapped and ready for VLC / Premiere / FFmpeg. Auto-detects language and can translate to English. Use it as a video subtitle generator, auto-subtitle and closed captions tool, SRT generator, VTT generator, video CC endpoint, or accessibility captions source. |
|
|
112
|
+
| `video-to-text` | (0.10 USDC/call) Transcribe any video URL to text with Whisper v3 large — audio is extracted internally. Auto-detects 90+ languages, offers a translate-to-English mode and optional speaker diarization, and handles files up to 60 minutes / 500MB. Covers video transcription, video speech-to-text, and video ASR in one call. |
|
|
113
|
+
| `video-transcribe` | (0.10 USDC/call) Transcribes any video URL to text with Whisper v3 large. Auto-detects 90+ languages, offers a translate-to-English mode, and optional speaker diarization. 60-min / 500MB max. Same backend as video-to-text under a clearer slug. Use it for video transcription, video to audio transcription, or video-to-text. |
|
|
114
|
+
| `video-trim` | (0.02 USDC/call) Trims a video by passing start + end OR start + duration, in HH:MM:SS, MM:SS, or seconds. CloudConvert + x264 re-encode. Use it as a video trimmer, video cutter, or video clip tool. |
|
|
115
|
+
| `watermark` | (0.02 USDC/call) Adds a text or image watermark overlay to PDFs, PNG/JPG/GIF images, and MP4/MOV/WEBM videos. CloudConvert engine. Configurable position, opacity, font, rotation, and margin. Use it for PDF, image, or video watermarking. |
|
|
116
116
|
| `watermark-pdf` | (0.02 USDC/call) Add watermark to PDF. Text or image overlay on PDFs, PNG / JPG / GIF, or MP4 / MOV / WEBM with configurable position, opacity, font, rotation, and margin. Same backend as watermark / pdf-watermark under a clearer search slug. CloudConvert engine. |
|
|
117
|
-
| `xlsx-to-csv` | (0.005 USDC/call)
|
|
117
|
+
| `xlsx-to-csv` | (0.005 USDC/call) Converts any sheet of an .xlsx, .xlsm, .xls, or .ods workbook to CSV, with sheet selection, encoding, and quote style controls. Powered by CloudConvert. Use it as an Excel to CSV, XLSX to CSV, Numbers to CSV, or spreadsheet to CSV converter. |
|
|
118
118
|
| `xml-to-word` | (0.05 USDC/call) XML to Microsoft Word (DOCX) converter via CloudConvert. Demand-intel: 43 unmet signals. |
|
|
119
|
-
| `youtube-transcript` | (0.01 USDC/call)
|
|
119
|
+
| `youtube-transcript` | (0.01 USDC/call) Fetch the transcript of any YouTube video. Pulls auto-generated or manual captions and returns full text plus per-segment {start, duration, text}, with an optional language pick. Backed by Supadata's transcript pipeline server-side — no caller key required; returns a clear 404 when a video has no transcript. Use it as a YouTube closed-caption fetcher or video subtitles puller. |
|
|
120
120
|
|
|
121
121
|
## How it works
|
|
122
122
|
|
|
@@ -137,4 +137,4 @@ The agent never sees the payment flow — it just gets the result.
|
|
|
137
137
|
|
|
138
138
|
---
|
|
139
139
|
|
|
140
|
-
**Version:** 0.11.
|
|
140
|
+
**Version:** 0.11.7 · **License:** MIT
|
package/dist/tools.generated.js
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
/** Auto-generated by scripts/generate-mcp-clusters.mjs. Do not edit by hand. */
|
|
2
2
|
export const CLUSTER_SLUG = "mediakit";
|
|
3
|
-
export const VERSION = "0.11.
|
|
3
|
+
export const VERSION = "0.11.7";
|
|
4
4
|
export const TOOLS = [
|
|
5
5
|
{
|
|
6
6
|
"name": "add-watermark",
|
|
@@ -81,7 +81,7 @@ export const TOOLS = [
|
|
|
81
81
|
{
|
|
82
82
|
"name": "audio-transcribe",
|
|
83
83
|
"http_name": "audio-transcribe",
|
|
84
|
-
"description": "(0.01 USDC/call)
|
|
84
|
+
"description": "(0.01 USDC/call) Transcribes audio to text with whisper-large-v3. Server-side fetches the audio URL (max 25 MB), relays it to Venice's audio/transcriptions endpoint, and returns the transcript with detected language, duration, and per-segment timestamps when response_format='verbose_json' (default). Also supports raw text, SRT, and VTT outputs. Use it as a speech-to-text or multi-language ASR endpoint with OpenAI Whisper API compatibility.",
|
|
85
85
|
"method": "POST",
|
|
86
86
|
"input_schema": {
|
|
87
87
|
"type": "object",
|
|
@@ -176,7 +176,7 @@ export const TOOLS = [
|
|
|
176
176
|
{
|
|
177
177
|
"name": "convert-pdf",
|
|
178
178
|
"http_name": "convert-pdf",
|
|
179
|
-
"description": "(0.
|
|
179
|
+
"description": "(0.0025 USDC/call) Converts PDFs to Markdown, HTML, JSON, or structured text with the Datalab Marker AI pipeline, preserving headings, tables, equations, and multi-column flow. Note: this endpoint outputs text-format conversions only. For PDF to JPG/PNG use pdf-to-jpg, for PDF merge use pdf-merge, for PDF split use pdf-split, for PDF compression use compress-pdf, and for PDF to Word route through this then a markdown to docx step. 30 pages max. Use it as a general PDF converter.",
|
|
180
180
|
"method": "POST",
|
|
181
181
|
"input_schema": {
|
|
182
182
|
"type": "object",
|
|
@@ -203,7 +203,7 @@ export const TOOLS = [
|
|
|
203
203
|
{
|
|
204
204
|
"name": "csv-to-ics",
|
|
205
205
|
"http_name": "csv-to-ics",
|
|
206
|
-
"description": "(0.01 USDC/call) CSV
|
|
206
|
+
"description": "(0.01 USDC/call) Converts a CSV of events into an RFC 5545 compliant ICS calendar file (VCALENDAR/VEVENT) for Google Calendar, Outlook, and Apple Calendar import. Auto-detects column mapping (summary, date, time, location, description) or accepts an explicit mapping; handles all-day and timed events, configurable default duration, and custom delimiters. Up to 1000 rows. Use it as a CSV to iCal converter, calendar file generator, or .ics builder.",
|
|
207
207
|
"method": "POST",
|
|
208
208
|
"input_schema": {
|
|
209
209
|
"type": "object",
|
|
@@ -237,7 +237,7 @@ export const TOOLS = [
|
|
|
237
237
|
{
|
|
238
238
|
"name": "csv-to-jsonl",
|
|
239
239
|
"http_name": "csv-to-jsonl",
|
|
240
|
-
"description": "(0.02 USDC/call)
|
|
240
|
+
"description": "(0.02 USDC/call) Converts CSV or TSV data into JSON, JSONL/NDJSON, or column-oriented arrays. RFC 4180 parser with type inference (booleans, integers, floats, ISO dates, null tokens) and configurable delimiter, quote char, header presence, column rename, drop columns, and whitespace trim. Outputs newline-delimited JSON (jsonl), a JSON array (json), or column-oriented arrays (ndarray). Use it as a CSV to JSON, CSV to NDJSON, TSV to JSON, or spreadsheet to JSON converter, data pipeline preprocessor, or LLM training-data prep step.",
|
|
241
241
|
"method": "POST",
|
|
242
242
|
"input_schema": {
|
|
243
243
|
"type": "object",
|
|
@@ -296,7 +296,7 @@ export const TOOLS = [
|
|
|
296
296
|
{
|
|
297
297
|
"name": "doc-to-json",
|
|
298
298
|
"http_name": "doc-to-json",
|
|
299
|
-
"description": "(0.10 USDC/call)
|
|
299
|
+
"description": "(0.10 USDC/call) Converts any document (PDF, DOCX, PPT, XLSX, or image) into structured JSON matching a caller-supplied schema. The file is converted to plain text via CloudConvert, then Morpheus mistral-31-24b runs with strict json_schema response_format to force the output shape. OCR-aware for images and scanned PDFs. Use it as a file parser with schema, invoice extractor, resume parser, contract extractor, RFP parser, receipt extractor, or form data extraction tool.",
|
|
300
300
|
"method": "POST",
|
|
301
301
|
"input_schema": {
|
|
302
302
|
"type": "object",
|
|
@@ -327,7 +327,7 @@ export const TOOLS = [
|
|
|
327
327
|
{
|
|
328
328
|
"name": "excel-to-csv",
|
|
329
329
|
"http_name": "excel-to-csv",
|
|
330
|
-
"description": "(0.005 USDC/call) Excel (.xlsx / .xls) to CSV
|
|
330
|
+
"description": "(0.005 USDC/call) Converts Excel workbooks (.xlsx / .xls) to CSV, TSV, or JSON with multi-sheet handling, returning each sheet by name. Adjacent to the 'convert excel to google sheets' demand cluster. Use it as an Excel to CSV or spreadsheet converter.",
|
|
331
331
|
"method": "POST",
|
|
332
332
|
"input_schema": {
|
|
333
333
|
"type": "object",
|
|
@@ -358,7 +358,7 @@ export const TOOLS = [
|
|
|
358
358
|
{
|
|
359
359
|
"name": "excel-to-google-sheets",
|
|
360
360
|
"http_name": "excel-to-google-sheets",
|
|
361
|
-
"description": "(0.005 USDC/call)
|
|
361
|
+
"description": "(0.005 USDC/call) Converts Excel files for import into Google Sheets. Outputs CSV that imports directly via File > Import > Upload (or paste-into-cells), with multi-sheet handling, encoding control, and quote style. Same handler as excel-to-csv / xlsx-to-csv under a Google-Sheets-named slug. Use it as an XLSX to Google Sheets, Numbers to Google Sheets, or Excel to gsheet spreadsheet import tool.",
|
|
362
362
|
"method": "POST",
|
|
363
363
|
"input_schema": {
|
|
364
364
|
"type": "object",
|
|
@@ -389,7 +389,7 @@ export const TOOLS = [
|
|
|
389
389
|
{
|
|
390
390
|
"name": "extract-tables",
|
|
391
391
|
"http_name": "extract-tables",
|
|
392
|
-
"description": "(0.10 USDC/call)
|
|
392
|
+
"description": "(0.10 USDC/call) Detects and extracts every table from a PDF document, returning structured JSON or CSV per table. 30 pages max via Datalab Marker. Use it as a table extractor, PDF-to-CSV converter, or spreadsheet-from-PDF tool.",
|
|
393
393
|
"method": "POST",
|
|
394
394
|
"input_schema": {
|
|
395
395
|
"type": "object",
|
|
@@ -411,7 +411,7 @@ export const TOOLS = [
|
|
|
411
411
|
{
|
|
412
412
|
"name": "html-markdown",
|
|
413
413
|
"http_name": "html-markdown",
|
|
414
|
-
"description": "(0.005 USDC/call)
|
|
414
|
+
"description": "(0.005 USDC/call) Converts raw HTML or a URL into clean Markdown, preserving headings, lists, tables, code blocks, links, and images. Use it as an HTML to Markdown API, webpage to Markdown converter, or clean page text extractor.",
|
|
415
415
|
"method": "POST",
|
|
416
416
|
"input_schema": {
|
|
417
417
|
"type": "object",
|
|
@@ -473,7 +473,7 @@ export const TOOLS = [
|
|
|
473
473
|
{
|
|
474
474
|
"name": "html-to-pdf",
|
|
475
475
|
"http_name": "html-to-pdf",
|
|
476
|
-
"description": "(0.08 USDC/call)
|
|
476
|
+
"description": "(0.08 USDC/call) Converts a URL to PDF with CloudConvert capture-website; renders JS. Configurable page size, orientation, margins, wait conditions. Use it for HTML to PDF or webpage screenshot to PDF conversion.",
|
|
477
477
|
"method": "POST",
|
|
478
478
|
"input_schema": {
|
|
479
479
|
"type": "object",
|
|
@@ -612,7 +612,7 @@ export const TOOLS = [
|
|
|
612
612
|
{
|
|
613
613
|
"name": "image-upscale",
|
|
614
614
|
"http_name": "image-upscale",
|
|
615
|
-
"description": "(0.02 USDC/call)
|
|
615
|
+
"description": "(0.02 USDC/call) Upscales an image 2x or 4x via Venice's image/upscale endpoint (default model: venice-sd35). Returns a permanent fal-hosted URL. Use it as a super-resolution tool to sharpen or enlarge an image without loss.",
|
|
616
616
|
"method": "POST",
|
|
617
617
|
"input_schema": {
|
|
618
618
|
"type": "object",
|
|
@@ -693,7 +693,7 @@ export const TOOLS = [
|
|
|
693
693
|
{
|
|
694
694
|
"name": "json-yaml",
|
|
695
695
|
"http_name": "json-yaml",
|
|
696
|
-
"description": "(0.
|
|
696
|
+
"description": "(0.005 USDC/call) Converts between JSON and YAML in either direction. Auto-detects input format from the first non-whitespace byte ('{'/'[' = JSON, otherwise YAML), parses, and re-serializes in the requested target format; pretty-print and indent are configurable. Use it as a format transcoder, config-file translator, Helm values prepper, Kubernetes manifest converter, GitHub Actions workflow YAML helper, or round-trip parser.",
|
|
697
697
|
"method": "POST",
|
|
698
698
|
"input_schema": {
|
|
699
699
|
"type": "object",
|
|
@@ -748,7 +748,7 @@ export const TOOLS = [
|
|
|
748
748
|
{
|
|
749
749
|
"name": "merge-pdf",
|
|
750
750
|
"http_name": "merge-pdf",
|
|
751
|
-
"description": "(0.01 USDC/call)
|
|
751
|
+
"description": "(0.01 USDC/call) Combines 2-50 input PDFs from URLs into one PDF, preserving bookmarks. CloudConvert engine. Use it as a PDF merger to concatenate PDF files or join multiple PDFs into one.",
|
|
752
752
|
"method": "POST",
|
|
753
753
|
"input_schema": {
|
|
754
754
|
"type": "object",
|
|
@@ -771,7 +771,7 @@ export const TOOLS = [
|
|
|
771
771
|
{
|
|
772
772
|
"name": "movie-database",
|
|
773
773
|
"http_name": "movie-database",
|
|
774
|
-
"description": "(0.01 USDC/call)
|
|
774
|
+
"description": "(0.01 USDC/call) Finds movies or TV shows by title, with optional year and region, and returns release year, poster, overview, and language. Uses TMDB when TMDB_API_KEY is configured (adding movie ratings, popularity, and an optional streaming provider summary) and falls back to no-key Wikidata metadata otherwise. Use it as a movie database API, TV database API, film lookup, title metadata search, or TMDB / Wikidata movie lookup.",
|
|
775
775
|
"method": "POST",
|
|
776
776
|
"input_schema": {
|
|
777
777
|
"type": "object",
|
|
@@ -814,7 +814,7 @@ export const TOOLS = [
|
|
|
814
814
|
{
|
|
815
815
|
"name": "movie-database-api",
|
|
816
816
|
"http_name": "movie-database-api",
|
|
817
|
-
"description": "(0.01 USDC/call)
|
|
817
|
+
"description": "(0.01 USDC/call) Searches movies and TV shows by title and optional year, returning release date, rating, popularity, overview, poster URLs, TMDB links, and optional provider summary. Use it as a movie database API, TV database API, or TMDB metadata lookup.",
|
|
818
818
|
"method": "POST",
|
|
819
819
|
"input_schema": {
|
|
820
820
|
"type": "object",
|
|
@@ -857,7 +857,7 @@ export const TOOLS = [
|
|
|
857
857
|
{
|
|
858
858
|
"name": "movie-info",
|
|
859
859
|
"http_name": "movie-info",
|
|
860
|
-
"description": "(0.01 USDC/call)
|
|
860
|
+
"description": "(0.01 USDC/call) Looks up movie and TV metadata: title, release year, rating, overview, poster, and optional streaming providers. Same TMDB-backed response as movie-database with a compact search slug for agents. Use it as a movie info API, film metadata lookup, or TV metadata lookup.",
|
|
861
861
|
"method": "POST",
|
|
862
862
|
"input_schema": {
|
|
863
863
|
"type": "object",
|
|
@@ -900,7 +900,7 @@ export const TOOLS = [
|
|
|
900
900
|
{
|
|
901
901
|
"name": "mp4-to-mp3",
|
|
902
902
|
"http_name": "mp4-to-mp3",
|
|
903
|
-
"description": "(0.
|
|
903
|
+
"description": "(0.02 USDC/call) Converts MP4, MOV, WebM, MKV, AVI, M4V, and FLV video files to MP3 via CloudConvert, with selectable bitrate (96/128/192 kbps). 60-min / 500MB max. Use it as an MP4 to MP3 converter, video to audio converter, or audio extractor API.",
|
|
904
904
|
"method": "POST",
|
|
905
905
|
"input_schema": {
|
|
906
906
|
"type": "object",
|
|
@@ -927,7 +927,7 @@ export const TOOLS = [
|
|
|
927
927
|
{
|
|
928
928
|
"name": "mp4-to-mp3-api",
|
|
929
929
|
"http_name": "mp4-to-mp3-api",
|
|
930
|
-
"description": "(0.
|
|
930
|
+
"description": "(0.02 USDC/call) Converts MP4, MOV, WEBM, MKV, AVI, M4V, or FLV URLs into hosted MP3 output with selectable 96, 128, or 192 kbps bitrate. CloudConvert backend, 60-min / 500MB max. Use it as an MP4 to MP3 API, video to MP3 converter, or extract-audio-from-video endpoint.",
|
|
931
931
|
"method": "POST",
|
|
932
932
|
"input_schema": {
|
|
933
933
|
"type": "object",
|
|
@@ -954,7 +954,7 @@ export const TOOLS = [
|
|
|
954
954
|
{
|
|
955
955
|
"name": "mp4-to-mp3-converter",
|
|
956
956
|
"http_name": "mp4-to-mp3-converter",
|
|
957
|
-
"description": "(0.
|
|
957
|
+
"description": "(0.02 USDC/call) Converts MP4 video URLs into hosted MP3 files with selectable bitrate; also accepts MOV, WEBM, MKV, AVI, M4V, or FLV URLs. Same CloudConvert-backed handler as mp4-to-mp3. Use it as an MP4 to MP3 converter API, to extract audio from MP4, or for video-to-MP3 conversion.",
|
|
958
958
|
"method": "POST",
|
|
959
959
|
"input_schema": {
|
|
960
960
|
"type": "object",
|
|
@@ -981,7 +981,7 @@ export const TOOLS = [
|
|
|
981
981
|
{
|
|
982
982
|
"name": "ocr",
|
|
983
983
|
"http_name": "ocr",
|
|
984
|
-
"description": "(0.
|
|
984
|
+
"description": "(0.0025 USDC/call) Runs OCR on scanned PDFs and image-based documents, returning clean Markdown or plain text. The Datalab Marker engine preserves layout, tables, and math. 30 pages max. Use it as an OCR API, optical character recognition service, scanned document extractor, or image-PDF to text and scanned PDF OCR tool.",
|
|
985
985
|
"method": "POST",
|
|
986
986
|
"input_schema": {
|
|
987
987
|
"type": "object",
|
|
@@ -1025,7 +1025,7 @@ export const TOOLS = [
|
|
|
1025
1025
|
{
|
|
1026
1026
|
"name": "pdf-compress",
|
|
1027
1027
|
"http_name": "pdf-compress",
|
|
1028
|
-
"description": "(0.005 USDC/call)
|
|
1028
|
+
"description": "(0.005 USDC/call) Shrinks PDFs to a smaller size for email or the web, with three quality levels: ebook (lowest, web-quality), printer (medium), prepress (highest, archival). CloudConvert engine. Use it as a PDF compressor or PDF size reducer.",
|
|
1029
1029
|
"method": "POST",
|
|
1030
1030
|
"input_schema": {
|
|
1031
1031
|
"type": "object",
|
|
@@ -1052,7 +1052,7 @@ export const TOOLS = [
|
|
|
1052
1052
|
{
|
|
1053
1053
|
"name": "pdf-extract-tables",
|
|
1054
1054
|
"http_name": "pdf-extract-tables",
|
|
1055
|
-
"description": "(0.10 USDC/call)
|
|
1055
|
+
"description": "(0.10 USDC/call) Extracts every table from a PDF, digital or scanned, and returns row-by-column text matrices page-by-page. AI + OCR pipeline with optional cell bounding boxes for downstream layout reconstruction and an optional page_range filter ('1-5', '3', '1,3,5'). Handles merged headers, multi-page financial statements, balance sheets, lab results, scanned reports. 30 pages max. Sibling of pdf-to-markdown using the same Datalab backend, but pre-parsed to tables only. Use it as a PDF table extractor, scanned-table parser, financial-table OCR, multi-page table consolidator, or Datalab Marker tables endpoint.",
|
|
1056
1056
|
"method": "POST",
|
|
1057
1057
|
"input_schema": {
|
|
1058
1058
|
"type": "object",
|
|
@@ -1074,7 +1074,7 @@ export const TOOLS = [
|
|
|
1074
1074
|
{
|
|
1075
1075
|
"name": "pdf-merge",
|
|
1076
1076
|
"http_name": "pdf-merge",
|
|
1077
|
-
"description": "(0.01 USDC/call)
|
|
1077
|
+
"description": "(0.01 USDC/call) Merges 2-50 PDFs from URLs into a single PDF, preserving bookmarks. CloudConvert engine. Use it as a PDF merger, PDF combiner, or PDF concatenator.",
|
|
1078
1078
|
"method": "POST",
|
|
1079
1079
|
"input_schema": {
|
|
1080
1080
|
"type": "object",
|
|
@@ -1097,7 +1097,7 @@ export const TOOLS = [
|
|
|
1097
1097
|
{
|
|
1098
1098
|
"name": "pdf-parser-api",
|
|
1099
1099
|
"http_name": "pdf-parser-api",
|
|
1100
|
-
"description": "(0.
|
|
1100
|
+
"description": "(0.0025 USDC/call) Parses a public PDF URL into Markdown, HTML, or JSON blocks with layout-aware text, headings, tables, and equations. Datalab Marker backend, 30 pages max. Use it as a PDF parser API, PDF content extractor, or scanned PDF OCR API.",
|
|
1101
1101
|
"method": "POST",
|
|
1102
1102
|
"input_schema": {
|
|
1103
1103
|
"type": "object",
|
|
@@ -1150,7 +1150,7 @@ export const TOOLS = [
|
|
|
1150
1150
|
{
|
|
1151
1151
|
"name": "pdf-table-extract",
|
|
1152
1152
|
"http_name": "pdf-table-extract",
|
|
1153
|
-
"description": "(0.10 USDC/call)
|
|
1153
|
+
"description": "(0.10 USDC/call) Extracts tables from digital or scanned PDFs, returning row/column matrices, CSV output, page numbers, and optional cell boxes. Same Datalab Marker-backed handler as pdf-extract-tables. Use it as a PDF table extract API, to extract tables from PDF, or as a PDF to CSV table parser.",
|
|
1154
1154
|
"method": "POST",
|
|
1155
1155
|
"input_schema": {
|
|
1156
1156
|
"type": "object",
|
|
@@ -1172,7 +1172,7 @@ export const TOOLS = [
|
|
|
1172
1172
|
{
|
|
1173
1173
|
"name": "pdf-table-extractor",
|
|
1174
1174
|
"http_name": "pdf-table-extractor",
|
|
1175
|
-
"description": "(0.10 USDC/call)
|
|
1175
|
+
"description": "(0.10 USDC/call) Finds tables in digital or scanned PDFs and returns row-by-column matrices, page numbers, and optional cell bounding boxes. Datalab Marker backend, 30 pages max. Use it as a PDF table extractor or scanned-table OCR.",
|
|
1176
1176
|
"method": "POST",
|
|
1177
1177
|
"input_schema": {
|
|
1178
1178
|
"type": "object",
|
|
@@ -1194,7 +1194,7 @@ export const TOOLS = [
|
|
|
1194
1194
|
{
|
|
1195
1195
|
"name": "pdf-text-extractor",
|
|
1196
1196
|
"http_name": "pdf-text-extractor",
|
|
1197
|
-
"description": "(0.
|
|
1197
|
+
"description": "(0.0025 USDC/call) Extracts clean Markdown, HTML, or structured JSON from digital or scanned PDFs while preserving reading order, tables, and equations. Datalab Marker backend, 30 pages max. Use it as a PDF text extractor, PDF to text API, or OCR PDF reader.",
|
|
1198
1198
|
"method": "POST",
|
|
1199
1199
|
"input_schema": {
|
|
1200
1200
|
"type": "object",
|
|
@@ -1221,7 +1221,7 @@ export const TOOLS = [
|
|
|
1221
1221
|
{
|
|
1222
1222
|
"name": "pdf-to-jpg",
|
|
1223
1223
|
"http_name": "pdf-to-jpg",
|
|
1224
|
-
"description": "(0.10 USDC/call) PDF to JPG
|
|
1224
|
+
"description": "(0.10 USDC/call) Converts a PDF to JPG, PNG, or WEBP images, rendering every page at configurable DPI (36-600) and returning one image URL per page. CloudConvert backend.",
|
|
1225
1225
|
"method": "POST",
|
|
1226
1226
|
"input_schema": {
|
|
1227
1227
|
"type": "object",
|
|
@@ -1252,7 +1252,7 @@ export const TOOLS = [
|
|
|
1252
1252
|
{
|
|
1253
1253
|
"name": "pdf-to-markdown",
|
|
1254
1254
|
"http_name": "pdf-to-markdown",
|
|
1255
|
-
"description": "(0.
|
|
1255
|
+
"description": "(0.0025 USDC/call) Converts digital or scanned PDFs to clean Markdown with AI-powered, layout-aware extraction on the Datalab Marker engine. Preserves headings, tables, equations (LaTeX), bulleted lists, and multi-column flow; outputs Markdown (default), HTML, or structured JSON with per-page blocks. 30 pages max. Use it as a PDF parser, PDF to text converter, OCR PDF reader, extract-tables-from-PDF tool, equation-aware PDF parser, scanned-PDF OCR, or PDF data extractor.",
|
|
1256
1256
|
"method": "POST",
|
|
1257
1257
|
"input_schema": {
|
|
1258
1258
|
"type": "object",
|
|
@@ -1279,7 +1279,7 @@ export const TOOLS = [
|
|
|
1279
1279
|
{
|
|
1280
1280
|
"name": "pdf-to-markdown-api",
|
|
1281
1281
|
"http_name": "pdf-to-markdown-api",
|
|
1282
|
-
"description": "(0.
|
|
1282
|
+
"description": "(0.0025 USDC/call) Converts a public PDF URL into clean Markdown, HTML, or structured JSON while preserving headings, tables, equations, and reading order. Datalab Marker backend, 30 pages max. Use it as a PDF to Markdown API, PDF parser API, PDF to text API, or scanned PDF OCR API.",
|
|
1283
1283
|
"method": "POST",
|
|
1284
1284
|
"input_schema": {
|
|
1285
1285
|
"type": "object",
|
|
@@ -1306,7 +1306,7 @@ export const TOOLS = [
|
|
|
1306
1306
|
{
|
|
1307
1307
|
"name": "pdf-to-text",
|
|
1308
1308
|
"http_name": "pdf-to-text",
|
|
1309
|
-
"description": "(0.
|
|
1309
|
+
"description": "(0.0025 USDC/call) Extracts text from digital or scanned PDFs, preserving reading order across multi-column layouts with an AI + OCR pipeline (Datalab Marker). Recognizes scanned pages and returns Markdown by default (clean text with structure) or HTML / JSON. 30 pages max. Works as a PDF to plain text converter, pdftotext or pdf2txt replacement, PDF text extractor, scanned PDF OCR, or read/parse PDF content extractor.",
|
|
1310
1310
|
"method": "POST",
|
|
1311
1311
|
"input_schema": {
|
|
1312
1312
|
"type": "object",
|
|
@@ -1333,7 +1333,7 @@ export const TOOLS = [
|
|
|
1333
1333
|
{
|
|
1334
1334
|
"name": "pdf-to-text-api",
|
|
1335
1335
|
"http_name": "pdf-to-text-api",
|
|
1336
|
-
"description": "(0.
|
|
1336
|
+
"description": "(0.0025 USDC/call) Extracts text from digital and scanned PDFs as Markdown, plain text, HTML, or JSON with layout-aware reading order. Same Datalab Marker-backed handler as pdf-to-text. Use it as a PDF to text API, PDF text extractor, or OCR PDF reader.",
|
|
1337
1337
|
"method": "POST",
|
|
1338
1338
|
"input_schema": {
|
|
1339
1339
|
"type": "object",
|
|
@@ -1360,7 +1360,7 @@ export const TOOLS = [
|
|
|
1360
1360
|
{
|
|
1361
1361
|
"name": "pdf-watermark",
|
|
1362
1362
|
"http_name": "pdf-watermark",
|
|
1363
|
-
"description": "(0.02 USDC/call)
|
|
1363
|
+
"description": "(0.02 USDC/call) Overlays a text or image watermark on PDFs, PNG/JPG/GIF images, or MP4/MOV/WEBM videos. Configurable position, opacity, font, rotation, and margin. CloudConvert engine. Use it as a PDF watermark, image watermark, or video watermark tool.",
|
|
1364
1364
|
"method": "POST",
|
|
1365
1365
|
"input_schema": {
|
|
1366
1366
|
"type": "object",
|
|
@@ -1494,7 +1494,7 @@ export const TOOLS = [
|
|
|
1494
1494
|
{
|
|
1495
1495
|
"name": "split-pdf",
|
|
1496
1496
|
"http_name": "split-pdf",
|
|
1497
|
-
"description": "(0.04 USDC/call)
|
|
1497
|
+
"description": "(0.04 USDC/call) Splits a PDF by page ranges into multiple PDFs. CloudConvert engine. Use it as a PDF splitter, PDF page extractor, split-PDF-by-range tool, or PDF to multiple files converter.",
|
|
1498
1498
|
"method": "POST",
|
|
1499
1499
|
"input_schema": {
|
|
1500
1500
|
"type": "object",
|
|
@@ -1575,18 +1575,26 @@ export const TOOLS = [
|
|
|
1575
1575
|
{
|
|
1576
1576
|
"name": "upscale-image",
|
|
1577
1577
|
"http_name": "upscale-image",
|
|
1578
|
-
"description": "(0.
|
|
1578
|
+
"description": "(0.02 USDC/call) Upscales images 2x or 4x with AI super-resolution, best for photos and illustrations. Venice image/upscale backend (venice-sd35), re-hosted for a permanent URL. Use it as an AI image upscaler or image enlarger.",
|
|
1579
1579
|
"method": "POST",
|
|
1580
1580
|
"input_schema": {
|
|
1581
1581
|
"type": "object",
|
|
1582
1582
|
"properties": {
|
|
1583
1583
|
"image_url": {
|
|
1584
|
-
"type": "string"
|
|
1584
|
+
"type": "string",
|
|
1585
|
+
"description": "Public http(s) URL of the source image."
|
|
1585
1586
|
},
|
|
1586
1587
|
"scale": {
|
|
1587
1588
|
"type": "number",
|
|
1588
|
-
"
|
|
1589
|
-
|
|
1589
|
+
"enum": [
|
|
1590
|
+
2,
|
|
1591
|
+
4
|
|
1592
|
+
],
|
|
1593
|
+
"description": "Upscale factor. Default 2."
|
|
1594
|
+
},
|
|
1595
|
+
"model": {
|
|
1596
|
+
"type": "string",
|
|
1597
|
+
"description": "Override the upscaler model. Default 'venice-sd35'."
|
|
1590
1598
|
}
|
|
1591
1599
|
},
|
|
1592
1600
|
"required": [
|
|
@@ -1597,7 +1605,7 @@ export const TOOLS = [
|
|
|
1597
1605
|
{
|
|
1598
1606
|
"name": "url-to-markdown",
|
|
1599
1607
|
"http_name": "url-to-markdown",
|
|
1600
|
-
"description": "(0.005 USDC/call)
|
|
1608
|
+
"description": "(0.005 USDC/call) Fetches a public URL, strips boilerplate, and returns clean Markdown plus title and character counts. Use it as a URL to Markdown converter, webpage to Markdown tool, or article cleanup API.",
|
|
1601
1609
|
"method": "POST",
|
|
1602
1610
|
"input_schema": {
|
|
1603
1611
|
"type": "object",
|
|
@@ -1628,7 +1636,7 @@ export const TOOLS = [
|
|
|
1628
1636
|
{
|
|
1629
1637
|
"name": "video-summarize",
|
|
1630
1638
|
"http_name": "video-summarize",
|
|
1631
|
-
"description": "(0.10 USDC/call)
|
|
1639
|
+
"description": "(0.10 USDC/call) Summarizes videos, podcasts, and lectures in one call: Whisper v3 transcribes, then Mistral summarizes. 5 styles (tldr, bullets, paragraph, executive, chapters); returns summary + transcript; 60 min max. Use it as a video summarizer, podcast summarizer, or lecture notes generator.",
|
|
1632
1640
|
"method": "POST",
|
|
1633
1641
|
"input_schema": {
|
|
1634
1642
|
"type": "object",
|
|
@@ -1686,7 +1694,7 @@ export const TOOLS = [
|
|
|
1686
1694
|
{
|
|
1687
1695
|
"name": "video-to-audio",
|
|
1688
1696
|
"http_name": "video-to-audio",
|
|
1689
|
-
"description": "(0.
|
|
1697
|
+
"description": "(0.02 USDC/call) Video to audio extractor / video to audio converter. Extract MP3 audio track from any video URL (MP4, MOV, WEBM, MKV, AVI, M4V, FLV). Selectable bitrate (96/128/192 kbps). Useful for podcast extraction, audio archival, transcription pre-processing. 60-min / 500MB max. CloudConvert backend.",
|
|
1690
1698
|
"method": "POST",
|
|
1691
1699
|
"input_schema": {
|
|
1692
1700
|
"type": "object",
|
|
@@ -1713,7 +1721,7 @@ export const TOOLS = [
|
|
|
1713
1721
|
{
|
|
1714
1722
|
"name": "video-to-mp3",
|
|
1715
1723
|
"http_name": "video-to-mp3",
|
|
1716
|
-
"description": "(0.
|
|
1724
|
+
"description": "(0.02 USDC/call) Video to MP3 / extract MP3 audio from video URL. Converts MP4, MOV, WEBM, MKV, AVI, M4V, or FLV into an MP3 file for transcription, podcast clips, and media archival. CloudConvert backend, 60-min / 500MB max.",
|
|
1717
1725
|
"method": "POST",
|
|
1718
1726
|
"input_schema": {
|
|
1719
1727
|
"type": "object",
|
|
@@ -1740,7 +1748,7 @@ export const TOOLS = [
|
|
|
1740
1748
|
{
|
|
1741
1749
|
"name": "video-to-subtitles",
|
|
1742
1750
|
"http_name": "video-to-subtitles",
|
|
1743
|
-
"description": "(0.02 USDC/call)
|
|
1751
|
+
"description": "(0.02 USDC/call) Generates subtitles from video with Whisper v3, word-wrapped and ready for VLC / Premiere / FFmpeg. Auto-detects language and can translate to English. Use it as a video subtitle generator, auto-subtitle and closed captions tool, SRT generator, VTT generator, video CC endpoint, or accessibility captions source.",
|
|
1744
1752
|
"method": "POST",
|
|
1745
1753
|
"input_schema": {
|
|
1746
1754
|
"type": "object",
|
|
@@ -1777,7 +1785,7 @@ export const TOOLS = [
|
|
|
1777
1785
|
{
|
|
1778
1786
|
"name": "video-to-text",
|
|
1779
1787
|
"http_name": "video-to-text",
|
|
1780
|
-
"description": "(0.10 USDC/call)
|
|
1788
|
+
"description": "(0.10 USDC/call) Transcribe any video URL to text with Whisper v3 large — audio is extracted internally. Auto-detects 90+ languages, offers a translate-to-English mode and optional speaker diarization, and handles files up to 60 minutes / 500MB. Covers video transcription, video speech-to-text, and video ASR in one call.",
|
|
1781
1789
|
"method": "POST",
|
|
1782
1790
|
"input_schema": {
|
|
1783
1791
|
"type": "object",
|
|
@@ -1807,7 +1815,7 @@ export const TOOLS = [
|
|
|
1807
1815
|
{
|
|
1808
1816
|
"name": "video-transcribe",
|
|
1809
1817
|
"http_name": "video-transcribe",
|
|
1810
|
-
"description": "(0.10 USDC/call)
|
|
1818
|
+
"description": "(0.10 USDC/call) Transcribes any video URL to text with Whisper v3 large. Auto-detects 90+ languages, offers a translate-to-English mode, and optional speaker diarization. 60-min / 500MB max. Same backend as video-to-text under a clearer slug. Use it for video transcription, video to audio transcription, or video-to-text.",
|
|
1811
1819
|
"method": "POST",
|
|
1812
1820
|
"input_schema": {
|
|
1813
1821
|
"type": "object",
|
|
@@ -1837,7 +1845,7 @@ export const TOOLS = [
|
|
|
1837
1845
|
{
|
|
1838
1846
|
"name": "video-trim",
|
|
1839
1847
|
"http_name": "video-trim",
|
|
1840
|
-
"description": "(0.02 USDC/call)
|
|
1848
|
+
"description": "(0.02 USDC/call) Trims a video by passing start + end OR start + duration, in HH:MM:SS, MM:SS, or seconds. CloudConvert + x264 re-encode. Use it as a video trimmer, video cutter, or video clip tool.",
|
|
1841
1849
|
"method": "POST",
|
|
1842
1850
|
"input_schema": {
|
|
1843
1851
|
"type": "object",
|
|
@@ -1861,7 +1869,7 @@ export const TOOLS = [
|
|
|
1861
1869
|
{
|
|
1862
1870
|
"name": "watermark",
|
|
1863
1871
|
"http_name": "watermark",
|
|
1864
|
-
"description": "(0.02 USDC/call)
|
|
1872
|
+
"description": "(0.02 USDC/call) Adds a text or image watermark overlay to PDFs, PNG/JPG/GIF images, and MP4/MOV/WEBM videos. CloudConvert engine. Configurable position, opacity, font, rotation, and margin. Use it for PDF, image, or video watermarking.",
|
|
1865
1873
|
"method": "POST",
|
|
1866
1874
|
"input_schema": {
|
|
1867
1875
|
"type": "object",
|
|
@@ -1963,7 +1971,7 @@ export const TOOLS = [
|
|
|
1963
1971
|
{
|
|
1964
1972
|
"name": "xlsx-to-csv",
|
|
1965
1973
|
"http_name": "xlsx-to-csv",
|
|
1966
|
-
"description": "(0.005 USDC/call)
|
|
1974
|
+
"description": "(0.005 USDC/call) Converts any sheet of an .xlsx, .xlsm, .xls, or .ods workbook to CSV, with sheet selection, encoding, and quote style controls. Powered by CloudConvert. Use it as an Excel to CSV, XLSX to CSV, Numbers to CSV, or spreadsheet to CSV converter.",
|
|
1967
1975
|
"method": "POST",
|
|
1968
1976
|
"input_schema": {
|
|
1969
1977
|
"type": "object",
|
|
@@ -2011,7 +2019,7 @@ export const TOOLS = [
|
|
|
2011
2019
|
{
|
|
2012
2020
|
"name": "youtube-transcript",
|
|
2013
2021
|
"http_name": "youtube-transcript",
|
|
2014
|
-
"description": "(0.01 USDC/call)
|
|
2022
|
+
"description": "(0.01 USDC/call) Fetch the transcript of any YouTube video. Pulls auto-generated or manual captions and returns full text plus per-segment {start, duration, text}, with an optional language pick. Backed by Supadata's transcript pipeline server-side — no caller key required; returns a clear 404 when a video has no transcript. Use it as a YouTube closed-caption fetcher or video subtitles puller.",
|
|
2015
2023
|
"method": "POST",
|
|
2016
2024
|
"input_schema": {
|
|
2017
2025
|
"type": "object",
|
package/package.json
CHANGED