aidevops 2.99.0 → 2.100.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +25 -3
- package/VERSION +1 -1
- package/aidevops.sh +1 -1
- package/package.json +1 -1
- package/setup.sh +1 -1
package/README.md
CHANGED
|
@@ -707,10 +707,29 @@ The setup script offers to install these tools automatically.
|
|
|
707
707
|
- **[DSPyGround](https://dspyground.com/)**: Interactive playground for prompt optimization
|
|
708
708
|
- **[TOON Format](https://github.com/marcusquinn/aidevops/blob/main/.agent/toon-format.md)**: Token-Oriented Object Notation - 20-60% token reduction for LLM prompts
|
|
709
709
|
|
|
710
|
-
### **Document Processing**
|
|
710
|
+
### **Document Processing & OCR**
|
|
711
711
|
|
|
712
712
|
- **[LibPDF](https://libpdf.dev/)**: PDF form filling, digital signatures (PAdES B-B/T/LT/LTA), encryption, merge/split, text extraction
|
|
713
713
|
- **[Unstract](https://github.com/Zipstack/unstract)**: LLM-powered structured data extraction from unstructured documents (PDF, images, DOCX)
|
|
714
|
+
- **[GLM-OCR](https://ollama.com/library/glm-ocr)**: Local OCR via Ollama - purpose-built for document text extraction (tables, forms, complex layouts) with zero cloud dependency
|
|
715
|
+
|
|
716
|
+
**OCR Tool Selection:**
|
|
717
|
+
|
|
718
|
+
| Need | Tool | Why |
|
|
719
|
+
|------|------|-----|
|
|
720
|
+
| **Quick text extraction** | GLM-OCR | Local, fast, no API keys, privacy-first |
|
|
721
|
+
| **Structured JSON output** | Unstract | Schema-based extraction, complex documents |
|
|
722
|
+
| **Screen/window OCR** | Peekaboo + GLM-OCR | `peekaboo image --analyze --model ollama/glm-ocr` |
|
|
723
|
+
| **PDF text extraction** | LibPDF | Native PDF parsing, no AI needed |
|
|
724
|
+
|
|
725
|
+
**Quick start:**
|
|
726
|
+
|
|
727
|
+
```bash
|
|
728
|
+
ollama pull glm-ocr
|
|
729
|
+
ollama run glm-ocr "Extract all text" --images /path/to/document.png
|
|
730
|
+
```
|
|
731
|
+
|
|
732
|
+
See `.agent/tools/ocr/glm-ocr.md` for batch processing, PDF workflows, and Peekaboo integration.
|
|
714
733
|
|
|
715
734
|
### **Communications**
|
|
716
735
|
|
|
@@ -796,7 +815,7 @@ These use direct API calls via curl, avoiding MCP server startup entirely:
|
|
|
796
815
|
- [Playwright](https://playwright.dev/) - Fastest engine (0.9s form fill), parallel contexts, extensions, proxy (auto-installed)
|
|
797
816
|
- [playwright-cli](https://github.com/microsoft/playwright-cli) - Microsoft official CLI for AI agents, `--session` isolation, built-in tracing
|
|
798
817
|
- [dev-browser](https://github.com/nicholasgriffintn/dev-browser) - Persistent profile, stays logged in, ARIA snapshots, pairs with DevTools
|
|
799
|
-
- [agent-browser](https://github.com/vercel-labs/agent-browser) - CLI/CI/CD, `--session` parallel, ref-based element targeting
|
|
818
|
+
- [agent-browser](https://github.com/vercel-labs/agent-browser) - CLI/CI/CD, `--session` parallel, ref-based element targeting, **iOS Simulator support** (macOS)
|
|
800
819
|
- [Crawl4AI](https://github.com/unclecode/crawl4ai) - Bulk extraction, `arun_many` parallel (1.7x), LLM-ready markdown
|
|
801
820
|
- [WaterCrawl](https://github.com/watercrawl/watercrawl) - Self-hosted crawling with web search, sitemap generation, JS rendering, proxy support
|
|
802
821
|
- [Playwriter](https://github.com/nicholasgriffintn/playwriter) - Your browser's extensions/passwords/proxy, already unlocked
|
|
@@ -848,10 +867,11 @@ These use direct API calls via curl, avoiding MCP server startup entirely:
|
|
|
848
867
|
- [QuickFile](https://github.com/marcusquinn/quickfile-mcp) - Accounting API integration (MCP)
|
|
849
868
|
- [Amazon Order History](https://github.com/marcusquinn/amazon-order-history-csv-download-mcp) - Order data extraction (MCP)
|
|
850
869
|
|
|
851
|
-
**Document Processing:**
|
|
870
|
+
**Document Processing & OCR:**
|
|
852
871
|
|
|
853
872
|
- [LibPDF](https://libpdf.dev/) - PDF form filling, digital signatures, encryption, merge/split (via helper script)
|
|
854
873
|
- [Unstract](https://github.com/Zipstack/unstract) - LLM-powered structured data extraction from PDFs, images, DOCX (MCP)
|
|
874
|
+
- [GLM-OCR](https://ollama.com/library/glm-ocr) - Local OCR via Ollama for document text extraction (subagent)
|
|
855
875
|
|
|
856
876
|
### **Quick Setup**
|
|
857
877
|
|
|
@@ -923,6 +943,7 @@ Tested on macOS ARM64, all headless, warm daemon:
|
|
|
923
943
|
| **Tracing** | Full API | Built-in CLI | Via Playwright | Via Playwright | No | Via CDP | Via Playwright |
|
|
924
944
|
| **Natural language** | No | No | No | No | LLM extraction | No | Yes |
|
|
925
945
|
| **Self-healing** | No | No | No | No | No | No | Yes |
|
|
946
|
+
| **iOS Simulator** | No | No | No | **Yes** (macOS) | No | No | No |
|
|
926
947
|
| **Maintainer** | Microsoft | Microsoft | Community | Vercel | Community | Community | Browserbase |
|
|
927
948
|
|
|
928
949
|
### Tool Selection
|
|
@@ -936,6 +957,7 @@ Tested on macOS ARM64, all headless, warm daemon:
|
|
|
936
957
|
| **Bulk extraction** | Crawl4AI | Purpose-built, parallel, LLM-ready output |
|
|
937
958
|
| **Self-hosted crawling** | WaterCrawl | Docker deployment, web search, sitemap generation |
|
|
938
959
|
| **CLI/CI/CD** | playwright-cli or agent-browser | No server needed, `--session` isolation |
|
|
960
|
+
| **iOS mobile testing** | agent-browser | Real Safari in iOS Simulator (macOS only) |
|
|
939
961
|
| **Unknown pages** | Stagehand | Natural language, self-healing |
|
|
940
962
|
| **Performance debugging** | Chrome DevTools MCP | Companion tool, pairs with any browser |
|
|
941
963
|
| **Bot detection evasion** | Anti-detect stack | Camoufox (full) or rebrowser-patches (quick) |
|
package/VERSION
CHANGED
|
@@ -1 +1 @@
|
|
|
1
|
-
2.
|
|
1
|
+
2.100.0
|
package/aidevops.sh
CHANGED
package/package.json
CHANGED
package/setup.sh
CHANGED