npm - task-summary-extractor - Versions diffs - 8.1.0 → 9.0.0 - Mend

task-summary-extractor 8.1.0 → 9.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (52) hide show

package/.env.example +38 -0
package/ARCHITECTURE.md +116 -15
package/EXPLORATION.md +164 -101
package/QUICK_START.md +5 -2
package/README.md +70 -18
package/bin/taskex.js +11 -4
package/package.json +39 -6
package/process_and_upload.js +7 -100
package/prompt.json +199 -131
package/src/config.js +52 -3
package/src/{utils → modes}/deep-dive.js +2 -2
package/src/{utils → modes}/dynamic-mode.js +2 -2
package/src/{utils → modes}/focused-reanalysis.js +4 -3
package/src/{utils → modes}/progress-updater.js +2 -2
package/src/phases/_shared.js +43 -0
package/src/phases/compile.js +101 -0
package/src/phases/deep-dive.js +118 -0
package/src/phases/discover.js +178 -0
package/src/phases/init.js +192 -0
package/src/phases/output.js +238 -0
package/src/phases/process-media.js +633 -0
package/src/phases/services.js +104 -0
package/src/phases/summary.js +86 -0
package/src/pipeline.js +431 -1462
package/src/renderers/docx.js +531 -0
package/src/renderers/html.js +672 -0
package/src/renderers/markdown.js +15 -183
package/src/renderers/pdf.js +90 -0
package/src/renderers/shared.js +211 -0
package/src/schemas/analysis-compiled.schema.json +381 -0
package/src/schemas/analysis-segment.schema.json +380 -0
package/src/services/doc-parser.js +346 -0
package/src/services/gemini.js +105 -48
package/src/services/git.js +0 -29
package/src/services/video.js +123 -8
package/src/utils/adaptive-budget.js +6 -6
package/src/utils/{progress.js → checkpoint.js} +2 -1
package/src/utils/cli.js +161 -113
package/src/utils/colors.js +83 -0
package/src/utils/confidence-filter.js +138 -0
package/src/utils/context-manager.js +0 -4
package/src/utils/diff-engine.js +2 -4
package/src/utils/global-config.js +6 -5
package/src/utils/health-dashboard.js +11 -9
package/src/utils/json-parser.js +5 -3
package/src/utils/learning-loop.js +3 -3
package/src/utils/progress-bar.js +286 -0
package/src/utils/quality-gate.js +4 -8
package/src/utils/retry.js +13 -5
package/src/utils/schema-validator.js +314 -0
package/src/utils/prompt.js +0 -32
/package/src/{utils → modes}/change-detector.js +0 -0

package/.env.example ADDED Viewed

@@ -0,0 +1,38 @@
+# ======================== FIREBASE ========================
+FIREBASE_API_KEY=your_firebase_api_key
+FIREBASE_AUTH_DOMAIN=your-project.firebaseapp.com
+FIREBASE_PROJECT_ID=your-project
+FIREBASE_STORAGE_BUCKET=your-project.appspot.com
+FIREBASE_MESSAGING_SENDER_ID=1234567890
+FIREBASE_APP_ID=1:1234567890:web:abc123
+FIREBASE_MEASUREMENT_ID=G-XXXXXXXXXX
+# ======================== GEMINI AI ========================
+GEMINI_API_KEY=your_gemini_api_key
+GEMINI_MODEL=gemini-2.5-flash
+# ======================== VIDEO PROCESSING ========================
+# Speed multiplier (default: 1.5)
+VIDEO_SPEED=1.5
+# Segment duration in seconds (default: 280)
+VIDEO_SEGMENT_TIME=280
+# ffmpeg preset: ultrafast, superfast, veryfast, faster, fast, medium, slow, slower, veryslow
+VIDEO_PRESET=slow
+# ======================== PIPELINE ========================
+# Log level: debug, info, warn, error (default: info)
+LOG_LEVEL=info
+# Max concurrent uploads (default: 3)
+MAX_PARALLEL_UPLOADS=3
+# Max retries for API calls (default: 3)
+MAX_RETRIES=3
+# Retry base delay in ms (default: 2000)
+RETRY_BASE_DELAY_MS=2000
+# ======================== GEMINI TUNING ========================
+# Thinking token budget per segment analysis (default: 24576)
+THINKING_BUDGET=24576
+# Thinking token budget for final compilation (default: 10240)
+COMPILATION_THINKING_BUDGET=10240
+# Max polling time for Gemini File API processing in ms (default: 300000 = 5 min)
+GEMINI_POLL_TIMEOUT_MS=300000

package/ARCHITECTURE.md CHANGED Viewed

@@ -74,26 +74,34 @@ flowchart TB
         FB["firebase.js"]
         VID["video.js"]
         GIT["git.js"]
+        DP["doc-parser.js"]
     end
-    subgraph Utils["Utilities — 19 modules"]
+    subgraph Utils["Utilities"]
         QG["quality-gate"]
-        FR["focused-reanalysis"]
         LL["learning-loop"]
         DE["diff-engine"]
-        CD["change-detector"]
-        PU["progress-updater"]
         CM["context-manager"]
         JP["json-parser"]
         AB["adaptive-budget"]
         HD["health-dashboard"]
+        OT["+ 7 more"]
+    end
+    subgraph Modes["Modes — AI pipeline phases"]
+        FR["focused-reanalysis"]
+        CD["change-detector"]
+        PU["progress-updater"]
         DD["deep-dive"]
         DM["dynamic-mode"]
-        OT["+ 7 more"]
     end
     subgraph Renderers["Renderers"]
         MD["markdown.js"]
+        HTML["html.js"]
+        PDF["pdf.js"]
+        DOCX["docx.js"]
+        SHARED["shared.js"]
     end
     EP --> Pipeline
@@ -101,6 +109,7 @@ flowchart TB
     P1 -.->|"--dynamic"| DYN
     Pipeline --> Services
     Pipeline --> Utils
+    Pipeline --> Modes
     Pipeline --> Renderers
     UP --> GIT
     UP --> CD
@@ -115,11 +124,11 @@ flowchart TB
 | Phase | Name | What Happens |
 |-------|------|-------------|
 | 1 | **Init** | CLI parsing, interactive folder selection (if no arg), config validation, logger setup, load learning insights, route to dynamic/progress mode |
-| 2 | **Discover** | Find videos, discover documents, resolve user name, check resume state |
+| 2 | **Discover** | Find videos/audio, discover documents, resolve user name, check resume state |
 | 3 | **Services** | Firebase auth, Gemini init, prepare document parts |
 | 4 | **Process** | Compress → Upload → Analyze → Quality Gate → Retry → Focused Pass |
 | 5 | **Compile** | Cross-segment compilation, diff engine comparison |
-| 6 | **Output** | Write JSON, render Markdown, upload to Firebase |
+| 6 | **Output** | Write JSON, render Markdown + HTML, upload to Firebase |
 | 7 | **Health** | Quality metrics dashboard, cost breakdown |
 | 8 | **Summary** | Save learning history, print run summary |
 | 9 | **Deep Dive** | (optional, `--deep-dive`) Topic discovery + explanatory document generation |
@@ -166,6 +175,7 @@ flowchart LR
     subgraph P6["Phase 6: Output"]
         JSON["results.json"]
         MDR["results.md"]
+        HTMLR["results.html"]
         FBU["Firebase upload"]
     end
@@ -294,17 +304,17 @@ flowchart TB
 ## Extraction Schema
-The AI extracts 6 structured categories from each meeting. The categories are content-adaptive — the AI populates whichever fields are relevant to the actual discussion.
+The AI extracts 6 structured categories from any content source (video, audio, documents, or mixed). The prompt auto-detects the input type and adapts: temporal content (video/audio) gets timestamps; document-only content uses section references and null timestamps. All field names remain identical regardless of input type for backward compatibility.
 ### Categories
 | Category | Key Fields | Adapts To |
 |----------|-----------|----------|
-| **Tickets / Items** | `ticket_id`, `title`, `status`, `assignee`, `reviewer`, `video_segments` with timestamps, `speaker_comments`, `details` with priority, confidence | Sprint items, requirements, interview topics, incident items |
-| **Change Requests** | `WHERE` (target: file, system, process, scope), `WHAT` (specific change), `HOW` (approach), `WHY` (justification), `dependencies`, `blocked_by`, confidence | Code changes, requirement changes, process changes, scope adjustments |
-| **References** | `name`, `type`, `role`, cross-refs to tickets & CRs, `context_doc_match` | Files, documents, URLs, tools, systems, resources mentioned |
-| **Action Items** | `description`, `assigned_to`, `status`, `deadline`, `dependencies`, related tickets & CRs, confidence | Any follow-up work discussed |
-| **Blockers** | `description`, `severity`, `owner`, `status`, `proposed_resolution`, confidence | Technical blockers, approval gates, resource constraints |
+| **Tickets / Items** | `ticket_id`, `title`, `status`, `assignee`, `reviewer`, `video_segments` with timestamps (or null for docs), `speaker_comments`, `details` with priority, confidence | Sprint items, requirements, interview topics, incident items, legal matters, deals |
+| **Change Requests** | `WHERE` (target: file, system, process, scope), `WHAT` (specific change), `HOW` (approach), `WHY` (justification), `dependencies`, `blocked_by`, confidence | Code changes, requirement changes, process changes, scope adjustments, contract revisions, policy updates |
+| **References** | `name`, `type`, `role`, cross-refs to tickets & CRs, `context_doc_match` | Files, documents, URLs, tools, systems, resources, contracts, reports mentioned |
+| **Action Items** | `description`, `assigned_to`, `status`, `deadline`, `dependencies`, related tickets & CRs, confidence | Any follow-up work discussed or documented |
+| **Blockers** | `description`, `severity`, `owner`, `status`, `proposed_resolution`, confidence | Technical blockers, approval gates, resource constraints, legal reviews, budget approvals |
 | **Scope Changes** | `type` (added/removed/deferred), `original` vs `new` scope, `decided_by`, `impact`, confidence | Feature scope, project scope, contract scope, training scope |
 ### Personalized Task Section
@@ -500,7 +510,11 @@ taskex --dynamic --request "Document this microservices architecture"
 |-----------|--------|-------------|
 | `.vtt` `.srt` `.txt` `.md` `.csv` | Inline text | Read and passed directly as text parts |
 | `.pdf` | Gemini File API | Uploaded as binary, Gemini processes natively |
-| `.docx` `.doc` | Firebase only | Uploaded for archival, not processable by Gemini |
+| `.mp3` `.wav` `.ogg` `.m4a` | Gemini File API | Uploaded as audio, Gemini processes natively |
+| `.docx` | Doc parser (mammoth) | Converted to plain text, sent as inline text |
+| `.xlsx` `.xls` | Doc parser (xlsx) | Converted to pipe-delimited tables, sent as inline text |
+| `.doc` `.pptx` `.ppt` `.odt` `.odp` `.ods` `.rtf` `.epub` | Doc parser (officeparser) | Converted to plain text, sent as inline text |
+| `.html` `.htm` | Doc parser (built-in) | HTML tags stripped, sent as inline text |
 Directories skipped during recursive discovery: `node_modules`, `.git`, `compressed`, `logs`, `gemini_runs`, `runs`
@@ -541,10 +555,15 @@ JSONL structured format includes phase spans with timing metrics for observabili
 | **Gemini AI** | `@google/genai@^1.42.0` | Video analysis, File API, 1M context window |
 | **Firebase** | `firebase@^12.9.0` | Anonymous auth + Cloud Storage uploads |
 | **dotenv** | `dotenv@^17.3.1` | Environment variable loading |
+| **puppeteer** | `puppeteer` | HTML → PDF conversion for PDF output format |
+| **docx** | `docx` | Programmatic Word document generation for DOCX output format |
+| **mammoth** | `mammoth` | DOCX → plain text conversion |
+| **xlsx** | `xlsx` | Excel spreadsheet parsing (XLSX/XLS) |
+| **officeparser** | `officeparser` | DOC, PPTX, ODT, RTF, EPUB text extraction |
 | **ffmpeg** | System binary | H.264 video compression + segmentation |
 | **Git** | System binary | Change detection for progress tracking |
-**Codebase: 31 files · ~10,300 lines** · npm package: `task-summary-extractor` · CLI: `taskex`
+**Codebase: ~45 files · ~13,000+ lines** · npm package: `task-summary-extractor` · CLI: `taskex`
 ---
@@ -596,6 +615,47 @@ When `usedExternalUrl` is `true`, the `fileUri` contains the Firebase Storage do
 ---
+## JSON Schema Validation
+All AI output is validated against JSON Schema definitions in `src/schemas/`:
+| Schema | File | Purpose |
+|--------|------|---------|
+| Segment analysis | `analysis-segment.schema.json` | Validates each segment's extracted data |
+| Compiled analysis | `analysis-compiled.schema.json` | Validates the final cross-segment compilation |
+Validation is performed by `src/utils/schema-validator.js` using [ajv](https://ajv.js.org/). Validation errors are reported as warnings with contextual hints for the retry/focused-pass cycle — they do not hard-fail the pipeline but are injected as corrective hints when the quality gate triggers a retry.
+---
+## Test Suite
+The project includes a comprehensive test suite using [vitest](https://vitest.dev/):
+| Metric | Value |
+|--------|-------|
+| Test files | 13 |
+| Total tests | 285 |
+| Framework | vitest v4.x |
+| Coverage | `@vitest/coverage-v8` |
+**Test categories:**
+| Directory | What's Tested |
+|-----------|---------------|
+| `tests/utils/` | Utility modules: adaptive-budget, cli, confidence-filter, context-manager, diff-engine, format, json-parser, progress-bar, quality-gate, retry, schema-validator |
+| `tests/renderers/` | Renderer modules: html, markdown |
+**Commands:**
+```bash
+npm test              # Run all tests
+npm run test:watch    # Watch mode
+npm run test:coverage # Coverage report
+```
+---
 ## See Also
 | Doc | What's In It |
@@ -603,3 +663,44 @@ When `usedExternalUrl` is `true`, the `fileUri` contains the Firebase Storage do
 | 📖 [README.md](README.md) | Setup, CLI flags, configuration, features |
 | 📖 [QUICK_START.md](QUICK_START.md) | Step-by-step first-time walkthrough |
 | 🔭 [EXPLORATION.md](EXPLORATION.md) | Module map, line counts, future roadmap |
+---
+## JSON Schema Validation
+All AI output is validated against JSON Schema definitions in `src/schemas/`:
+| Schema | File | Purpose |
+|--------|------|---------|
+| Segment analysis | `analysis-segment.schema.json` | Validates each segment's extracted data |
+| Compiled analysis | `analysis-compiled.schema.json` | Validates the final cross-segment compilation |
+Validation is performed by `src/utils/schema-validator.js` using [ajv](https://ajv.js.org/). Validation errors are reported as warnings with contextual hints for the retry/focused-pass cycle — they do not hard-fail the pipeline but are injected as corrective hints when the quality gate triggers a retry.
+---
+## Test Suite
+The project includes a comprehensive test suite using [vitest](https://vitest.dev/):
+| Metric | Value |
+|--------|-------|
+| Test files | 13 |
+| Total tests | 285 |
+| Framework | vitest v4.x |
+| Coverage | `@vitest/coverage-v8` |
+**Test categories:**
+| Directory | What's Tested |
+|-----------|---------------|
+| `tests/utils/` | Utility modules: adaptive-budget, cli, confidence-filter, context-manager, diff-engine, format, json-parser, progress-bar, quality-gate, retry, schema-validator |
+| `tests/renderers/` | Renderer modules: html, markdown |
+**Commands:**
+```bash
+npm test              # Run all tests
+npm run test:watch    # Watch mode
+npm run test:coverage # Coverage report
+```