alys-akusa 0.1.20 → 0.1.23

Files changed (3)
  1. package/README.md +36 -2
  2. package/dist/index.cjs +2634 -194
  3. package/package.json +1 -1
package/README.md CHANGED
@@ -12,6 +12,8 @@ Or run without installing:
  ```bash
  npx alys-akusa prepare ./company-docs
  npx alys-akusa audit ./knowledge-base
+ npx alys-akusa media-debug ./call-recordings
+ npx alys-akusa media-transcribe-plan ./call-recordings --gcs-input gs://bucket/in --gcs-output gs://bucket/out
  npx alys-akusa simulate-rag ./knowledge-base
  npx alys-akusa improve ./knowledge-base
  npx alys-akusa benchmark ./knowledge-base
@@ -20,8 +22,11 @@ npx alys-akusa finetune-ready ./support-tickets
  ```
 
  Alys turns messy local knowledge into OpenAI fine-tuning JSONL, Anthropic instruction
- records, RAG chunks, QA datasets, eval datasets, embeddings-ready corpora, manifests,
- source lists, and AI readiness reports.
+ records, RAG chunks, QA datasets, eval datasets, embeddings-ready corpora, media
+ manifests, SHA-256 fingerprints, timestamped and redacted segment rows, dataset
+ cards, redaction reviews, and debug reports for transcript-backed audio/video assets,
+ Google Speech-to-Text V2 transcription plans, GCS upload manifests, BatchRecognize
+ request JSON, VTT/SRT import maps, source lists, and AI readiness reports.
 
  ## What It Does
 
@@ -51,9 +56,13 @@ answer support, grounding, hallucination risk, and regressions.
 
  ## Local Development
 
+ Do not publish to npm for normal testing. Use the repo CLI directly first:
+
  ```bash
  pnpm install
  pnpm alys -- audit ./knowledge-base --yes
+ pnpm alys -- media-debug ./call-recordings --yes
+ pnpm alys -- media-transcribe-plan ./call-recordings --google-project my-gcp-project --gcs-input gs://bucket/alys-media-input --gcs-output gs://bucket/alys-media-output --yes
  pnpm alys -- simulate-rag ./knowledge-base --queries 12 --top-k 5 --yes
  pnpm alys -- improve ./knowledge-base --yes
  pnpm alys -- benchmark ./knowledge-base --queries 16 --top-k 5 --yes
@@ -61,3 +70,28 @@ pnpm alys -- prepare ./docs --profile all --yes
  pnpm alys -- ingest ./knowledge-base --profile rag,embeddings,eval --yes
  pnpm alys -- finetune-ready ./tickets --profile openai,anthropic,qa --yes
  ```
+
+ Recommended pre-publish loop:
+
+ ```bash
+ pnpm cli:test
+ pnpm cli:smoke:local
+ pnpm cli:pack
+ ```
+
+ `pnpm cli:smoke:local` does not call the hosted generation API. It audits the repo
+ docs folder and writes disposable artifacts to `./.alys-test`.
+
+ Use the API smoke only when you explicitly want to test the live hosted generation
+ path with a tiny run:
+
+ ```bash
+ pnpm cli:smoke:api
+ ```
+
+ Only after those pass should you publish:
+
+ ```bash
+ cd apps/cli
+ npm publish --access public
+ ```
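
The pre-publish loop added to the README in this version could be wrapped in a small script so that a failing step blocks the publish. The following is a minimal sketch, assuming bash. The function name `prepublish_loop` and the `PNPM_BIN` override are hypothetical, introduced only for illustration, and are not part of alys-akusa; only the three `pnpm cli:*` script names come from the README.

```shell
#!/usr/bin/env bash
# Sketch: run the README's pre-publish checks in order, stopping at the
# first failure. PNPM_BIN is a hypothetical override (not an alys-akusa
# feature) so the sequence can be dry-run with a stub such as `echo`.
prepublish_loop() {
  local pnpm_bin="${PNPM_BIN:-pnpm}"
  local step
  # Script names taken from the README's "Recommended pre-publish loop".
  for step in cli:test cli:smoke:local cli:pack; do
    echo "==> ${pnpm_bin} ${step}"
    "${pnpm_bin}" "${step}" || {
      echo "pre-publish step failed: ${step}" >&2
      return 1
    }
  done
  echo "pre-publish loop passed"
}
```

Setting `PNPM_BIN=echo` gives a dry run that only prints each step. Chaining the function as `prepublish_loop && (cd apps/cli && npm publish --access public)` would keep the publish step behind the checks, matching the order the README recommends.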