aidevops 2.99.0 → 2.100.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -707,10 +707,29 @@ The setup script offers to install these tools automatically.
707
707
  - **[DSPyGround](https://dspyground.com/)**: Interactive playground for prompt optimization
708
708
  - **[TOON Format](https://github.com/marcusquinn/aidevops/blob/main/.agent/toon-format.md)**: Token-Oriented Object Notation - 20-60% token reduction for LLM prompts
709
709
 
710
- ### **Document Processing**
710
+ ### **Document Processing & OCR**
711
711
 
712
712
  - **[LibPDF](https://libpdf.dev/)**: PDF form filling, digital signatures (PAdES B-B/T/LT/LTA), encryption, merge/split, text extraction
713
713
  - **[Unstract](https://github.com/Zipstack/unstract)**: LLM-powered structured data extraction from unstructured documents (PDF, images, DOCX)
714
+ - **[GLM-OCR](https://ollama.com/library/glm-ocr)**: Local OCR via Ollama - purpose-built for document text extraction (tables, forms, complex layouts) with zero cloud dependency
715
+
716
+ **OCR Tool Selection:**
717
+
718
+ | Need | Tool | Why |
719
+ |------|------|-----|
720
+ | **Quick text extraction** | GLM-OCR | Local, fast, no API keys, privacy-first |
721
+ | **Structured JSON output** | Unstract | Schema-based extraction, complex documents |
722
+ | **Screen/window OCR** | Peekaboo + GLM-OCR | `peekaboo image --analyze --model ollama/glm-ocr` |
723
+ | **PDF text extraction** | LibPDF | Native PDF parsing, no AI needed |
724
+
725
+ **Quick start:**
726
+
727
+ ```bash
728
+ ollama pull glm-ocr
729
+ ollama run glm-ocr "Extract all text" --images /path/to/document.png
730
+ ```
731
+
732
+ See `.agent/tools/ocr/glm-ocr.md` for batch processing, PDF workflows, and Peekaboo integration.
714
733
 
715
734
  ### **Communications**
716
735
 
@@ -796,7 +815,7 @@ These use direct API calls via curl, avoiding MCP server startup entirely:
796
815
  - [Playwright](https://playwright.dev/) - Fastest engine (0.9s form fill), parallel contexts, extensions, proxy (auto-installed)
797
816
  - [playwright-cli](https://github.com/microsoft/playwright-cli) - Microsoft official CLI for AI agents, `--session` isolation, built-in tracing
798
817
  - [dev-browser](https://github.com/nicholasgriffintn/dev-browser) - Persistent profile, stays logged in, ARIA snapshots, pairs with DevTools
799
- - [agent-browser](https://github.com/vercel-labs/agent-browser) - CLI/CI/CD, `--session` parallel, ref-based element targeting
818
+ - [agent-browser](https://github.com/vercel-labs/agent-browser) - CLI/CI/CD, `--session` parallel, ref-based element targeting, **iOS Simulator support** (macOS)
800
819
  - [Crawl4AI](https://github.com/unclecode/crawl4ai) - Bulk extraction, `arun_many` parallel (1.7x), LLM-ready markdown
801
820
  - [WaterCrawl](https://github.com/watercrawl/watercrawl) - Self-hosted crawling with web search, sitemap generation, JS rendering, proxy support
802
821
  - [Playwriter](https://github.com/nicholasgriffintn/playwriter) - Your browser's extensions/passwords/proxy, already unlocked
@@ -848,10 +867,11 @@ These use direct API calls via curl, avoiding MCP server startup entirely:
848
867
  - [QuickFile](https://github.com/marcusquinn/quickfile-mcp) - Accounting API integration (MCP)
849
868
  - [Amazon Order History](https://github.com/marcusquinn/amazon-order-history-csv-download-mcp) - Order data extraction (MCP)
850
869
 
851
- **Document Processing:**
870
+ **Document Processing & OCR:**
852
871
 
853
872
  - [LibPDF](https://libpdf.dev/) - PDF form filling, digital signatures, encryption, merge/split (via helper script)
854
873
  - [Unstract](https://github.com/Zipstack/unstract) - LLM-powered structured data extraction from PDFs, images, DOCX (MCP)
874
+ - [GLM-OCR](https://ollama.com/library/glm-ocr) - Local OCR via Ollama for document text extraction (subagent)
855
875
 
856
876
  ### **Quick Setup**
857
877
 
@@ -923,6 +943,7 @@ Tested on macOS ARM64, all headless, warm daemon:
923
943
  | **Tracing** | Full API | Built-in CLI | Via Playwright | Via Playwright | No | Via CDP | Via Playwright |
924
944
  | **Natural language** | No | No | No | No | LLM extraction | No | Yes |
925
945
  | **Self-healing** | No | No | No | No | No | No | Yes |
946
+ | **iOS Simulator** | No | No | No | **Yes** (macOS) | No | No | No |
926
947
  | **Maintainer** | Microsoft | Microsoft | Community | Vercel | Community | Community | Browserbase |
927
948
 
928
949
  ### Tool Selection
@@ -936,6 +957,7 @@ Tested on macOS ARM64, all headless, warm daemon:
936
957
  | **Bulk extraction** | Crawl4AI | Purpose-built, parallel, LLM-ready output |
937
958
  | **Self-hosted crawling** | WaterCrawl | Docker deployment, web search, sitemap generation |
938
959
  | **CLI/CI/CD** | playwright-cli or agent-browser | No server needed, `--session` isolation |
960
+ | **iOS mobile testing** | agent-browser | Real Safari in iOS Simulator (macOS only) |
939
961
  | **Unknown pages** | Stagehand | Natural language, self-healing |
940
962
  | **Performance debugging** | Chrome DevTools MCP | Companion tool, pairs with any browser |
941
963
  | **Bot detection evasion** | Anti-detect stack | Camoufox (full) or rebrowser-patches (quick) |
package/VERSION CHANGED
@@ -1 +1 @@
1
- 2.99.0
1
+ 2.100.0
package/aidevops.sh CHANGED
@@ -3,7 +3,7 @@
3
3
  # AI DevOps Framework CLI
4
4
  # Usage: aidevops <command> [options]
5
5
  #
6
- # Version: 2.99.0
6
+ # Version: 2.100.0
7
7
 
8
8
  set -euo pipefail
9
9
 
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "aidevops",
3
- "version": "2.99.0",
3
+ "version": "2.100.0",
4
4
  "description": "AI DevOps Framework - AI-assisted development workflows, code quality, and deployment automation",
5
5
  "type": "module",
6
6
  "main": "index.js",
package/setup.sh CHANGED
@@ -3,7 +3,7 @@
3
3
  # AI Assistant Server Access Framework Setup Script
4
4
  # Helps developers set up the framework for their infrastructure
5
5
  #
6
- # Version: 2.99.0
6
+ # Version: 2.100.0
7
7
  #
8
8
  # Quick Install (one-liner):
9
9
  # bash <(curl -fsSL https://aidevops.dev/install)