PyPI - okb - Versions diffs - 1.1.0__tar.gz → 1.1.0a0__tar.gz - Mend

okb 1.1.0__tar.gz → 1.1.0a0__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (48)

{okb-1.1.0 → okb-1.1.0a0}/PKG-INFO +6 -83
{okb-1.1.0 → okb-1.1.0a0}/README.md +5 -82
{okb-1.1.0 → okb-1.1.0a0}/okb/cli.py +16 -1083
{okb-1.1.0 → okb-1.1.0a0}/okb/config.py +4 -122
{okb-1.1.0 → okb-1.1.0a0}/okb/http_server.py +2 -163
{okb-1.1.0 → okb-1.1.0a0}/okb/llm/providers.py +6 -9
{okb-1.1.0 → okb-1.1.0a0}/okb/mcp_server.py +12 -1036
{okb-1.1.0 → okb-1.1.0a0}/okb/modal_llm.py +8 -26
{okb-1.1.0 → okb-1.1.0a0}/okb/plugins/sources/github.py +5 -5
{okb-1.1.0 → okb-1.1.0a0}/okb/tokens.py +3 -25
{okb-1.1.0 → okb-1.1.0a0}/pyproject.toml +1 -1
okb-1.1.0/okb/llm/analyze.py +0 -524
okb-1.1.0/okb/llm/consolidate.py +0 -685
okb-1.1.0/okb/llm/enrich.py +0 -723
okb-1.1.0/okb/llm/extractors/__init__.py +0 -13
okb-1.1.0/okb/llm/extractors/base.py +0 -44
okb-1.1.0/okb/llm/extractors/cross_doc.py +0 -478
okb-1.1.0/okb/llm/extractors/dedup.py +0 -499
okb-1.1.0/okb/llm/extractors/entity.py +0 -369
okb-1.1.0/okb/llm/extractors/todo.py +0 -149
okb-1.1.0/okb/migrations/0008.enrichment.sql +0 -46
okb-1.1.0/okb/migrations/0009.entity-consolidation.sql +0 -120
okb-1.1.0/okb/migrations/0010.token-id.sql +0 -7
{okb-1.1.0 → okb-1.1.0a0}/okb/__init__.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/data/init.sql +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/ingest.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/llm/__init__.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/llm/base.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/llm/cache.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/llm/filter.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/local_embedder.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/migrate.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/migrations/0001.initial-schema.sql +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/migrations/0002.sync-state.sql +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/migrations/0003.structured-fields.sql +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/migrations/0004.tokens.sql +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/migrations/0005.database-metadata.sql +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/migrations/0006.llm-cache.sql +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/modal_embedder.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/plugins/__init__.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/plugins/base.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/plugins/registry.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/plugins/sources/__init__.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/plugins/sources/dropbox_paper.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/plugins/sources/todoist.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/rescan.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/scripts/__init__.py +0 -0
{okb-1.1.0 → okb-1.1.0a0}/okb/scripts/watch.py +0 -0

{okb-1.1.0 → okb-1.1.0a0}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.3
 Name: okb
-Version: 1.1.0
+Version: 1.1.0a0
 Summary: Personal knowledge base with semantic search for LLMs
 Requires-Python: >=3.11
 Classifier: Programming Language :: Python :: 3
@@ -85,8 +85,6 @@ okb ingest ~/notes ~/docs
 | `okb db start` | Start pgvector database container |
 | `okb db stop` | Stop database container |
 | `okb db status` | Show database status |
-| `okb db migrate [name]` | Apply pending migrations (optionally for specific db) |
-| `okb db list` | List configured databases |
 | `okb db destroy` | Remove container and volume (destructive) |
 | `okb ingest <paths>` | Ingest documents into knowledge base |
 | `okb ingest <paths> --local` | Ingest using local GPU/CPU embedding (no Modal) |
@@ -95,11 +93,10 @@ okb ingest ~/notes ~/docs
 | `okb watch <paths>` | Watch directories for changes |
 | `okb config init` | Create default config file |
 | `okb config show` | Show current configuration |
-| `okb config path` | Print config file path |
 | `okb modal deploy` | Deploy GPU embedder to Modal |
 | `okb token create` | Create API token for HTTP server |
 | `okb token list` | List tokens for a database |
-| `okb token revoke [TOKEN] --id <n>` | Revoke token by full value or ID |
+| `okb token revoke` | Revoke an API token |
 | `okb sync list` | List available API sources (plugins) |
 | `okb sync list-projects <source>` | List projects from source (for config) |
 | `okb sync run <sources>` | Sync data from external APIs |
@@ -111,18 +108,6 @@ okb ingest ~/notes ~/docs
 | `okb llm status` | Show LLM config and connectivity |
 | `okb llm deploy` | Deploy Modal LLM for open model inference |
 | `okb llm clear-cache` | Clear LLM response cache |
-| `okb enrich run` | Extract TODOs and entities from documents |
-| `okb enrich run --dry-run` | Show what would be enriched |
-| `okb enrich pending` | List entities awaiting review |
-| `okb enrich approve <id>` | Approve a pending entity |
-| `okb enrich reject <id>` | Reject a pending entity |
-| `okb enrich analyze` | Analyze database and update description/topics |
-| `okb enrich consolidate` | Run entity consolidation (duplicates, clusters) |
-| `okb enrich merge-proposals` | List pending merge proposals |
-| `okb enrich approve-merge <id>` | Approve an entity merge |
-| `okb enrich reject-merge <id>` | Reject an entity merge |
-| `okb enrich clusters` | List topic clusters |
-| `okb enrich relationships` | List entity relationships |
 ## Configuration
@@ -157,7 +142,7 @@ chunking:
 Use `--db <name>` to target a specific database with any command.
 Environment variables override config file settings:
-- `OKB_DATABASE_URL` - Database connection string
+- `KB_DATABASE_URL` - Database connection string
 - `OKB_DOCKER_PORT` - Docker port mapping
 - `OKB_CONTAINER_NAME` - Docker container name
@@ -178,7 +163,7 @@ Merge: scalars replace, lists extend, dicts deep-merge.
 ### LLM Integration (Optional)
-Enable LLM-based document classification, filtering, and enrichment:
+Enable LLM-based document classification and filtering:
 ```yaml
 llm:
@@ -194,25 +179,11 @@ llm:
 | `claude` | `export ANTHROPIC_API_KEY=...` | ~$0.25/1M tokens |
 | `modal` | `okb llm deploy` | ~$0.02/min GPU |
-**Modal LLM Setup** (no API key needed, runs on Modal's GPUs):
+For Modal (no API key needed):
 ```yaml
 llm:
   provider: modal
-  model: microsoft/Phi-3-mini-4k-instruct  # Recommended: no gating
-```
-Non-gated models (work immediately):
-- `microsoft/Phi-3-mini-4k-instruct` - Good quality, 4K context
-- `Qwen/Qwen2-1.5B-Instruct` - Smaller/faster
-Gated models (require HuggingFace approval + token):
-- `meta-llama/Llama-3.2-3B-Instruct` - Requires accepting license at HuggingFace
-- Setup: `modal secret create huggingface HF_TOKEN=hf_...`
-Deploy after configuring:
-```bash
-okb llm deploy
+  model: meta-llama/Llama-3.2-3B-Instruct
 ```
 **Pre-ingest filtering** - skip low-value content during sync:
@@ -226,36 +197,6 @@ plugins:
         action_on_skip: discard  # or "archive"
 ```
-### Document Enrichment
-Extract TODOs and entities (people, projects, technologies) from documents using LLM:
-```bash
-okb enrich run                      # Enrich un-enriched documents
-okb enrich run --dry-run            # Preview what would be enriched
-okb enrich run --source-type markdown  # Only markdown files
-okb enrich run --query "meeting"    # Filter by semantic search
-```
-Entities are created as pending suggestions for review:
-```bash
-okb enrich pending                  # List pending entities
-okb enrich approve <id>             # Approve → creates entity document
-okb enrich reject <id>              # Reject → hidden from future suggestions
-```
-Configure enrichment behavior:
-```yaml
-enrichment:
-  enabled: true
-  extract_todos: true
-  extract_entities: true
-  auto_create_todos: true       # TODOs created immediately
-  auto_create_entities: false   # Entities go to pending review
-  min_confidence_todo: 0.7
-  min_confidence_entity: 0.8
-```
 CLI commands:
 ```bash
 okb llm status              # Show config and connectivity
@@ -328,20 +269,6 @@ Then configure Claude Code to connect via SSE:
 | `add_todo` | Create a TODO item in the knowledge base |
 | `trigger_sync` | Sync API sources (Todoist, GitHub, Dropbox Paper) |
 | `trigger_rescan` | Check indexed files for changes and re-ingest |
-| `list_sync_sources` | List available API sync sources with status |
-| `enrich_document` | Run LLM enrichment to extract TODOs/entities |
-| `list_pending_entities` | List entities awaiting review |
-| `approve_entity` | Approve a pending entity |
-| `reject_entity` | Reject a pending entity |
-| `analyze_knowledge_base` | Analyze content and generate description/topics |
-| `find_entity_duplicates` | Find potential duplicate entities |
-| `merge_entities` | Merge duplicate entities |
-| `list_pending_merges` | List pending merge proposals |
-| `approve_merge` | Approve a merge proposal |
-| `reject_merge` | Reject a merge proposal |
-| `get_topic_clusters` | Get topic clusters from consolidation |
-| `get_entity_relationships` | Get relationships between entities |
-| `run_consolidation` | Run full entity consolidation pipeline |
 ## Contextual Chunking
@@ -364,10 +291,6 @@ project: student-app
 category: backend
 ---
-# Your Document Title
-Content here...
-```
 ## Plugin System
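
Note on the `@@ -157,7 +142,7 @@` hunk: the documented database variable is `OKB_DATABASE_URL` in 1.1.0 and `KB_DATABASE_URL` in 1.1.0a0, while the README states that environment variables override config file settings. The sketch below shows one plausible reading of that override rule and why the variable name matters when moving between the two versions. It is an illustration only: `resolve_setting`, the URLs, and the fallback behaviour are assumptions, not code taken from `okb/config.py`.

```python
import os
from typing import Mapping


def resolve_setting(
    env_var: str,
    file_value: str,
    environ: Mapping[str, str] = os.environ,
) -> str:
    """Prefer a non-empty environment variable over the config-file value."""
    return environ.get(env_var) or file_value


# Hypothetical values, used only to illustrate the lookup.
file_url = "postgresql://localhost:5433/okb"
env = {"OKB_DATABASE_URL": "postgresql://db.internal/okb"}

# A reader that looks up the 1.1.0 name sees the override.
print(resolve_setting("OKB_DATABASE_URL", file_url, env))

# A reader that looks up the 1.1.0a0 name ignores it and falls back to the file.
print(resolve_setting("KB_DATABASE_URL", file_url, env))
```

Under this reading, an environment that exports only one of the two names would be silently ignored by the version that documents the other.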

{okb-1.1.0 → okb-1.1.0a0}/README.md RENAMED

@@ -36,8 +36,6 @@ okb ingest ~/notes ~/docs
 | `okb db start` | Start pgvector database container |
 | `okb db stop` | Stop database container |
 | `okb db status` | Show database status |
-| `okb db migrate [name]` | Apply pending migrations (optionally for specific db) |
-| `okb db list` | List configured databases |
 | `okb db destroy` | Remove container and volume (destructive) |
 | `okb ingest <paths>` | Ingest documents into knowledge base |
 | `okb ingest <paths> --local` | Ingest using local GPU/CPU embedding (no Modal) |
@@ -46,11 +44,10 @@ okb ingest ~/notes ~/docs
 | `okb watch <paths>` | Watch directories for changes |
 | `okb config init` | Create default config file |
 | `okb config show` | Show current configuration |
-| `okb config path` | Print config file path |
 | `okb modal deploy` | Deploy GPU embedder to Modal |
 | `okb token create` | Create API token for HTTP server |
 | `okb token list` | List tokens for a database |
-| `okb token revoke [TOKEN] --id <n>` | Revoke token by full value or ID |
+| `okb token revoke` | Revoke an API token |
 | `okb sync list` | List available API sources (plugins) |
 | `okb sync list-projects <source>` | List projects from source (for config) |
 | `okb sync run <sources>` | Sync data from external APIs |
@@ -62,18 +59,6 @@ okb ingest ~/notes ~/docs
 | `okb llm status` | Show LLM config and connectivity |
 | `okb llm deploy` | Deploy Modal LLM for open model inference |
 | `okb llm clear-cache` | Clear LLM response cache |
-| `okb enrich run` | Extract TODOs and entities from documents |
-| `okb enrich run --dry-run` | Show what would be enriched |
-| `okb enrich pending` | List entities awaiting review |
-| `okb enrich approve <id>` | Approve a pending entity |
-| `okb enrich reject <id>` | Reject a pending entity |
-| `okb enrich analyze` | Analyze database and update description/topics |
-| `okb enrich consolidate` | Run entity consolidation (duplicates, clusters) |
-| `okb enrich merge-proposals` | List pending merge proposals |
-| `okb enrich approve-merge <id>` | Approve an entity merge |
-| `okb enrich reject-merge <id>` | Reject an entity merge |
-| `okb enrich clusters` | List topic clusters |
-| `okb enrich relationships` | List entity relationships |
 ## Configuration
@@ -108,7 +93,7 @@ chunking:
 Use `--db <name>` to target a specific database with any command.
 Environment variables override config file settings:
-- `OKB_DATABASE_URL` - Database connection string
+- `KB_DATABASE_URL` - Database connection string
 - `OKB_DOCKER_PORT` - Docker port mapping
 - `OKB_CONTAINER_NAME` - Docker container name
@@ -129,7 +114,7 @@ Merge: scalars replace, lists extend, dicts deep-merge.
 ### LLM Integration (Optional)
-Enable LLM-based document classification, filtering, and enrichment:
+Enable LLM-based document classification and filtering:
 ```yaml
 llm:
@@ -145,25 +130,11 @@ llm:
 | `claude` | `export ANTHROPIC_API_KEY=...` | ~$0.25/1M tokens |
 | `modal` | `okb llm deploy` | ~$0.02/min GPU |
-**Modal LLM Setup** (no API key needed, runs on Modal's GPUs):
+For Modal (no API key needed):
 ```yaml
 llm:
   provider: modal
-  model: microsoft/Phi-3-mini-4k-instruct  # Recommended: no gating
-```
-Non-gated models (work immediately):
-- `microsoft/Phi-3-mini-4k-instruct` - Good quality, 4K context
-- `Qwen/Qwen2-1.5B-Instruct` - Smaller/faster
-Gated models (require HuggingFace approval + token):
-- `meta-llama/Llama-3.2-3B-Instruct` - Requires accepting license at HuggingFace
-- Setup: `modal secret create huggingface HF_TOKEN=hf_...`
-Deploy after configuring:
-```bash
-okb llm deploy
+  model: meta-llama/Llama-3.2-3B-Instruct
 ```
 **Pre-ingest filtering** - skip low-value content during sync:
@@ -177,36 +148,6 @@ plugins:
         action_on_skip: discard  # or "archive"
 ```
-### Document Enrichment
-Extract TODOs and entities (people, projects, technologies) from documents using LLM:
-```bash
-okb enrich run                      # Enrich un-enriched documents
-okb enrich run --dry-run            # Preview what would be enriched
-okb enrich run --source-type markdown  # Only markdown files
-okb enrich run --query "meeting"    # Filter by semantic search
-```
-Entities are created as pending suggestions for review:
-```bash
-okb enrich pending                  # List pending entities
-okb enrich approve <id>             # Approve → creates entity document
-okb enrich reject <id>              # Reject → hidden from future suggestions
-```
-Configure enrichment behavior:
-```yaml
-enrichment:
-  enabled: true
-  extract_todos: true
-  extract_entities: true
-  auto_create_todos: true       # TODOs created immediately
-  auto_create_entities: false   # Entities go to pending review
-  min_confidence_todo: 0.7
-  min_confidence_entity: 0.8
-```
 CLI commands:
 ```bash
 okb llm status              # Show config and connectivity
@@ -279,20 +220,6 @@ Then configure Claude Code to connect via SSE:
 | `add_todo` | Create a TODO item in the knowledge base |
 | `trigger_sync` | Sync API sources (Todoist, GitHub, Dropbox Paper) |
 | `trigger_rescan` | Check indexed files for changes and re-ingest |
-| `list_sync_sources` | List available API sync sources with status |
-| `enrich_document` | Run LLM enrichment to extract TODOs/entities |
-| `list_pending_entities` | List entities awaiting review |
-| `approve_entity` | Approve a pending entity |
-| `reject_entity` | Reject a pending entity |
-| `analyze_knowledge_base` | Analyze content and generate description/topics |
-| `find_entity_duplicates` | Find potential duplicate entities |
-| `merge_entities` | Merge duplicate entities |
-| `list_pending_merges` | List pending merge proposals |
-| `approve_merge` | Approve a merge proposal |
-| `reject_merge` | Reject a merge proposal |
-| `get_topic_clusters` | Get topic clusters from consolidation |
-| `get_entity_relationships` | Get relationships between entities |
-| `run_consolidation` | Run full entity consolidation pipeline |
 ## Contextual Chunking
@@ -315,10 +242,6 @@ project: student-app
 category: backend
 ---
-# Your Document Title
-Content here...
-```
 ## Plugin System
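
Note on the `@@ -129,7 +114,7 @@` hunk header: it quotes the README's config merge rule, "scalars replace, lists extend, dicts deep-merge", which is unchanged between the two versions. A minimal sketch of a merge with those three behaviours follows. The function name `merge_config` and the example keys are hypothetical; the package's own implementation in `okb/config.py` is not shown in this diff and may differ.

```python
from copy import deepcopy
from typing import Any


def merge_config(base: dict[str, Any], override: dict[str, Any]) -> dict[str, Any]:
    """Return a new dict: scalars replace, lists extend, dicts deep-merge."""
    merged = deepcopy(base)
    for key, value in override.items():
        current = merged.get(key)
        if isinstance(current, dict) and isinstance(value, dict):
            # Dicts are merged recursively, key by key.
            merged[key] = merge_config(current, value)
        elif isinstance(current, list) and isinstance(value, list):
            # Lists are concatenated, base entries first.
            merged[key] = current + deepcopy(value)
        else:
            # Scalars (and mismatched types) are replaced by the override.
            merged[key] = deepcopy(value)
    return merged


# Hypothetical config fragments, for illustration only.
base = {"chunking": {"size": 512, "overlap": 64}, "paths": ["~/notes"], "provider": "claude"}
local = {"chunking": {"size": 256}, "paths": ["~/docs"], "provider": "modal"}

print(merge_config(base, local))
```

The inputs are copied rather than mutated, so the same base config can be merged with several overrides in turn.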
