npm - vidistill - Versions diffs - 0.2.4 → 0.3.0 - Mend

vidistill 0.2.4 → 0.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3)

package/README.md CHANGED

@@ -50,31 +50,6 @@ vidistill ./demo.mp4 -o ./notes/
 vidistill ./lecture.mp4 --lang zh
 ```
-### Extract
-Pull specific data from a previously processed video or re-run a targeted pass on a video file.
-```
-vidistill extract <type> <source>
-```
-**Arguments:**
-- `type` — what to extract: `code`, `links`, `people`, `transcript`, or `commands`
-- `source` — path to a vidistill output directory or a video/audio file
-**Examples:**
-```bash
-# Extract code from existing output (no API calls)
-vidistill extract code ./vidistill-output/my-video/
-# Extract links from a video file (runs targeted pipeline)
-vidistill extract links ./lecture.mp4
-```
-When pointed at an output directory, extract reads from already-generated files with zero API calls. When pointed at a video file, it runs a minimal pipeline with only the passes needed for the requested data type.
 ## API Key
 vidistill needs a Gemini API key. It checks these sources in order:
@@ -103,12 +78,19 @@ vidistill-output/my-video/
 ├── action-items.md    # tasks and follow-ups
 ├── insights.md        # implicit signals and analysis
 ├── links.md           # all URLs mentioned
+├── prereqs.md         # prerequisite knowledge (when detected)
+├── timeline.html      # interactive visual timeline
 ├── metadata.json      # processing metadata
+├── progress.json      # resume checkpoint (during processing)
 └── raw/               # raw pass outputs
 ```
 Which files are generated depends on the video content — a coding tutorial gets `code/`, a meeting gets `people.md` and `action-items.md`, etc.
+### Resume
+If a run is interrupted (Ctrl+C), progress is saved automatically. Re-running the same command detects the incomplete run and offers to resume from where it left off.
 ## How It Works
 Supported video formats: MP4, MOV, WebM, MKV, AVI, MPEG, FLV, WMV, 3GPP. Supported audio formats: MP3, AAC, WAV, FLAC, OGG, M4A.
@@ -118,7 +100,7 @@ Supported video formats: MP4, MOV, WebM, MKV, AVI, MPEG, FLV, WMV, 3GPP. Support
 3. **Pass 1** — transcript extraction with speaker identification
 4. **Pass 2** — visual content extraction (screen states, diagrams, slides)
 5. **Pass 3** — specialist passes based on video type:
-   - 3c: chat and links (live streams) — per segment
+   - 3c: chat and links (live streams) — per segment, runs 3x with consensus voting
    - 3d: implicit signals (all types) — per segment
    - 3b: people and social dynamics (meetings) — whole video
    - 3a: code reconstruction (coding videos) — whole video, runs 3x with consensus voting and validation
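
Note on "runs 3x with consensus voting": the README does not say how votes are tallied, so the following is only a minimal sketch of one plausible scheme, a majority vote over the items each run returns. The function name `consensus`, the `quorum` parameter, and the example URLs are hypothetical and are not taken from the vidistill source.

```typescript
// Hypothetical sketch of majority voting across repeated runs of one pass.
// Assumption: each run yields a flat list of string items (e.g. URLs).

/** Keep every item that appears in at least `quorum` of the runs. */
function consensus(runs: string[][], quorum: number): string[] {
  // Null-prototype object so keys like "constructor" behave as plain data.
  const counts: Record<string, number> = Object.create(null);
  const order: string[] = []; // first-seen order, for stable output

  for (const run of runs) {
    // Deduplicate within a run so one run cannot vote twice for an item.
    const seen: Record<string, boolean> = Object.create(null);
    for (const item of run) {
      if (seen[item]) continue;
      seen[item] = true;
      if (!(item in counts)) order.push(item);
      counts[item] = (counts[item] ?? 0) + 1;
    }
  }

  return order.filter((item) => (counts[item] ?? 0) >= quorum);
}

// Three runs of a links pass; an item needs 2 of 3 votes to survive.
const exampleRuns: string[][] = [
  ["https://a.example", "https://b.example"],
  ["https://a.example"],
  ["https://a.example", "https://b.example", "https://c.example"],
];
console.log(consensus(exampleRuns, 2));
// prints [ 'https://a.example', 'https://b.example' ]
```

A quorum of 2 out of 3 drops items that only a single run produced, which is the usual reason to repeat a nondeterministic extraction pass. The validation step mentioned for pass 3a is not modeled here.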