PyPI - fableforge-shell-whisperer - Versions diffs - 0.1.0__tar.gz - Mend

fableforge-shell-whisperer 0.1.0__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (21) hide show

fableforge_shell_whisperer-0.1.0/LICENSE ADDED Viewed

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2025 FableForge Contributors
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

fableforge_shell_whisperer-0.1.0/PKG-INFO ADDED Viewed

@@ -0,0 +1,483 @@
+Metadata-Version: 2.4
+Name: fableforge-shell-whisperer
+Version: 0.1.0
+Summary: 1.5B edge-native shell agent — natural language to shell commands, 50ms latency on phone/edge
+Author: FableForge Team
+License: MIT
+Keywords: shell,nlp,edge,onnx,llm,cli
+Classifier: Development Status :: 3 - Alpha
+Classifier: Intended Audience :: Developers
+Classifier: License :: OSI Approved :: MIT License
+Classifier: Programming Language :: Python :: 3
+Classifier: Programming Language :: Python :: 3.10
+Classifier: Programming Language :: Python :: 3.11
+Classifier: Programming Language :: Python :: 3.12
+Classifier: Topic :: System :: Shells
+Requires-Python: >=3.10
+Description-Content-Type: text/markdown
+License-File: LICENSE
+Requires-Dist: rich>=13.0
+Requires-Dist: click>=8.1
+Requires-Dist: pydantic>=2.0
+Requires-Dist: httpx>=0.25
+Requires-Dist: fastapi>=0.110.0
+Requires-Dist: uvicorn>=0.29.0
+Requires-Dist: websockets>=12.0
+Provides-Extra: train
+Requires-Dist: torch>=2.1.0; extra == "train"
+Requires-Dist: transformers>=4.40.0; extra == "train"
+Requires-Dist: peft>=0.10.0; extra == "train"
+Requires-Dist: datasets>=2.18.0; extra == "train"
+Requires-Dist: unsloth>=2024.1; extra == "train"
+Requires-Dist: accelerate>=0.27.0; extra == "train"
+Requires-Dist: trl>=0.8.0; extra == "train"
+Requires-Dist: wandb>=0.16.0; extra == "train"
+Provides-Extra: gpu
+Requires-Dist: bitsandbytes>=0.43.0; extra == "gpu"
+Requires-Dist: xformers>=0.0.23; extra == "gpu"
+Provides-Extra: export
+Requires-Dist: onnx>=1.16.0; extra == "export"
+Requires-Dist: onnxruntime>=1.17.0; extra == "export"
+Requires-Dist: optimum>=1.17.0; extra == "export"
+Provides-Extra: gguf
+Requires-Dist: llama-cpp-python>=0.2.50; extra == "gguf"
+Provides-Extra: all
+Requires-Dist: shell-whisperer[export,gguf,gpu,train]; extra == "all"
+Provides-Extra: dev
+Requires-Dist: pytest>=8.0.0; extra == "dev"
+Requires-Dist: pytest-asyncio>=0.23.0; extra == "dev"
+Requires-Dist: ruff>=0.3.0; extra == "dev"
+Requires-Dist: mypy>=1.9.0; extra == "dev"
+Dynamic: license-file
+# ShellWhisperer
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE) [![Python 3.10+](https://img.shields.io/badge/python-3.10%2B-blue.svg)](https://www.python.org/downloads/) [![Tests](https://img.shields.io/badge/tests-0-yellow.svg)](tests/)
+> **Natural language → shell commands. 50ms on edge.**
+A 1.5B parameter edge-native shell agent fine-tuned from Qwen3-1.5B. Converts natural language descriptions into safe, correct shell commands — designed to run on phones and edge devices via ONNX Runtime or llama.cpp GGUF.
+## Features
+- **Edge-native**: Runs in <50ms on mobile/edge via ONNX or GGUF
+- **Multi-OS**: Linux, macOS, Windows PowerShell prompts
+- **Safety-first**: Built-in guardrails against destructive commands (rm -rf /, fork bombs, pipe-to-shell)
+- **Context-aware**: Uses working directory, OS type, and recent command history
+- **Multiple backends**: HuggingFace Transformers, ONNX Runtime, llama.cpp
+- **Streaming**: WebSocket streaming for real-time output
+- **Fine-tune your own**: LoRA training on Fable5 traces or custom data
+## Quick Start
+### Install
+```bash
+pip install shell-whisperer
+# With training support:
+pip install "shell-whisperer[train]"
+# With GGUF inference:
+pip install "shell-whisperer[gguf]"
+# Everything:
+pip install "shell-whisperer[train,gguf,dev]"
+```
+### One-shot Prediction
+```bash
+# Basic usage
+sw "find all python files over 100 lines"
+# → find . -name "*.py" -exec wc -l {} + | awk '$1 > 100'
+sw "kill the process on port 8080"
+# → lsof -ti:8080 | xargs kill -9
+sw --os-type macos "install ffmpeg"
+# → brew install ffmpeg
+sw --os-type windows "show all listening ports"
+# → Get-NetTCPConnection -State Listen | Format-Table LocalPort, OwningProcess -AutoSize
+```
+### Interactive Mode
+```bash
+sw --interactive
+sw> find all python files over 100 lines
+┌─────────────────────────────────────────────────────────┐
+│ find . -name "*.py" -exec wc -l {} + | awk '$1 > 100'  │
+└─────────────────────────────────────────────────────────┘
+42.5ms
+sw> !os macos
+OS set to: macos
+sw> install ffmpeg
+┌──────────────────────┐
+│ brew install ffmpeg  │
+└──────────────────────┘
+28.1ms
+```
+### Start API Server
+```bash
+sw --serve --port 8000
+# Or specify model:
+sw --serve --model ./models/shell-whisperer-merged --port 8000
+```
+## Fine-Tuning
+### Prepare Training Data
+ShellWhisperer extracts training pairs from Fable5 trace formats:
+```python
+from shell_whisperer.data_extractor import load_training_data
+# Load from JSONL traces (auto-detects format)
+pairs = load_training_data("./traces/glint_data.jsonl", fmt="auto")
+# Or specify format explicitly
+from shell_whisperer.data_extractor import (
+    extract_bash_from_glint,
+    extract_bash_from_armand0e,
+    extract_bash_from_vfable,
+)
+pairs = extract_bash_from_glint("./traces/glint.jsonl")
+pairs = extract_bash_from_armand0e("./traces/armand0e.jsonl")
+pairs = extract_bash_from_vfable("./traces/vfable.jsonl")
+```
+#### Supported Trace Formats
+**Glint** — Command traces with shell intent metadata:
+```jsonl
+{"type": "shell_intent", "intent": "find all python files over 100 lines"}
+{"type": "shell_command", "command": "find . -name '*.py' -exec wc -l {} + | awk '$1 > 100'", "shell": "bash", "exit_code": 0}
+```
+**armand0e** — Structured shell session logs:
+```jsonl
+{"event": "command_executed", "prompt": "show disk usage sorted by size", "command": "du -sh * | sort -rh", "exit_status": 0}
+```
+**v-Fable** — Validated Fable traces with confirmation signals:
+```jsonl
+{"role": "user", "utterance": "find all json files modified recently"}
+{"role": "assistant", "tool_call": {"name": "execute_shell", "arguments": {"command": "find . -name '*.json' -mtime -7"}}, "validation": {"confirmed": true, "exit_code": 0}}
+```
+### Train with LoRA
+```bash
+# LoRA fine-tune (default)
+sw train --data ./traces/data.jsonl --epochs 3
+# Full fine-tune
+sw train --data ./traces/data.jsonl --full-finetune
+# Custom parameters
+sw train \
+  --data ./traces/data.jsonl \
+  --model-name Qwen/Qwen3-1.5B \
+  --output-dir ./models/my-shell-whisperer \
+  --epochs 5 \
+  --lr 1e-4 \
+  --batch-size 8 \
+  --os-type linux
+```
+### Training in Python
+```python
+from shell_whisperer.trainer import TrainConfig, train_lora
+from shell_whisperer.data_extractor import load_training_data
+# Load data
+pairs = load_training_data("./traces/data.jsonl", include_builtin=True)
+print(f"Training with {len(pairs)} pairs")
+# Configure and train
+config = TrainConfig(
+    model_name="Qwen/Qwen3-1.5B",
+    output_dir="./models/shell-whisperer-lora",
+    epochs=3,
+    learning_rate=2e-4,
+    lora_r=16,
+    use_4bit=True,
+    use_unsloth=True,
+)
+adapter_path = train_lora(config=config, training_pairs=pairs)
+# Merge adapter with base model
+from shell_whisperer.trainer import merge_and_save
+merge_and_save(
+    adapter_path=adapter_path,
+    output_dir="./models/shell-whisperer-merged",
+)
+```
+## Export for Edge
+```bash
+# Export to ONNX
+sw export --format onnx --model ./models/shell-whisperer-merged
+# Export to GGUF (for llama.cpp)
+sw export --format gguf --model ./models/shell-whisperer-merged
+# 4-bit quantization (smallest, fastest)
+sw export --format 4bit --model ./models/shell-whisperer-merged
+# 8-bit quantization
+sw export --format 8bit --model ./models/shell-whisperer-merged
+# Export all formats
+sw export --format all --model ./models/shell-whisperer-merged
+```
+### Memory Estimates
+| Format | RAM Required | Latency (edge) |
+|--------|-------------|----------------|
+| FP32   | ~6.0 GB     | ~200ms         |
+| FP16   | ~3.0 GB     | ~100ms         |
+| 8-bit  | ~1.5 GB     | ~60ms          |
+| 4-bit  | ~0.75 GB    | ~50ms          |
+| GGUF Q4_K_M | ~0.84 GB | ~50ms    |
+## Inference
+### Python API
+```python
+from shell_whisperer import ShellWhisperer
+# Load model
+sw = ShellWhisperer(os_type="linux")
+sw.load_model("./models/shell-whisperer-merged")
+# Predict
+result = sw.predict("find all python files over 100 lines")
+print(result.command)
+# → find . -name "*.py" -exec wc -l {} + | awk '$1 > 100'
+# Context-aware prediction
+result = sw.predict(
+    "find config files",
+    working_directory="/etc",
+    recent_history=["ls -la", "cd /etc"],
+    os_type="linux",
+)
+# Batch prediction
+results = sw.predict_batch([
+    "find all python files over 100 lines",
+    "kill the process on port 8080",
+    "show disk usage sorted by size",
+])
+# Streaming
+for token in sw.predict_stream("find all python files"):
+    print(token, end="", flush=True)
+# Safety warnings
+result = sw.predict("delete everything")
+if result.safety_warnings:
+    for warning in result.safety_warnings:
+        print(f"⚠ {warning}")
+sw.unload()
+```
+### REST API
+```bash
+# Start server
+sw --serve --port 8000
+# Predict
+curl -X POST http://localhost:8000/predict \
+  -H "Content-Type: application/json" \
+  -d '{"prompt": "find all python files over 100 lines", "os_type": "linux"}'
+# Batch predict
+curl -X POST http://localhost:8000/predict/batch \
+  -H "Content-Type: application/json" \
+  -d '{"prompts": ["find python files", "kill port 8080"]}'
+# Health check
+curl http://localhost:8000/health
+# Model info
+curl http://localhost:8000/info
+```
+### WebSocket Streaming
+```javascript
+const ws = new WebSocket("ws://localhost:8000/ws/stream");
+ws.onopen = () => {
+  ws.send(JSON.stringify({
+    prompt: "find all python files over 100 lines",
+    os_type: "linux"
+  }));
+};
+ws.onmessage = (event) => {
+  const data = JSON.parse(event.data);
+  if (data.token) {
+    process.stdout.write(data.token);
+  } else if (data.done) {
+    console.log("\nCommand:", data.command);
+    if (data.safety_warnings.length) {
+      console.log("Warnings:", data.safety_warnings);
+    }
+  }
+};
+```
+## Example Training Pairs
+Built-in high-quality pairs from real-world shell usage:
+| Natural Language | Shell Command | Quality |
+|----------------|---------------|---------|
+| find all python files over 100 lines | `find . -name "*.py" -exec wc -l {} + \| awk '$1 > 100'` | 0.85 |
+| kill the process on port 8080 | `lsof -ti:8080 \| xargs kill -9` | 0.80 |
+| show disk usage sorted by size | `du -sh * \| sort -rh` | 0.75 |
+| recursively search for TODO in all python files | `grep -rn "TODO" --include="*.py" .` | 0.83 |
+| rename all .txt files to .md | `for f in *.txt; do mv "$f" "${f%.txt}.md"; done` | 0.84 |
+| remove all stopped docker containers | `docker container prune -f` | 0.73 |
+| show all git commits by the current user this month | `git log --author="$(git config user.name)" --since="$(date +%Y-%m-01)" --oneline` | 0.88 |
+| list all unique IPs that connected via SSH | `grep "Accepted" /var/log/auth.log \| awk '{print $11}' \| sort -u` | 0.84 |
+## Safety System
+ShellWhisperer includes a built-in safety layer that:
+1. **Blocks destructive commands**: `rm -rf /`, fork bombs, `dd` to disk
+2. **Warns on sudo**: Flags commands requiring elevated privileges
+3. **Flags pipe-to-shell**: Warns about `curl | bash` patterns
+4. **Prevents chmod 777**: Warns about insecure permissions
+```python
+result = sw.predict("delete all files")
+# Safety warning: ⚠ SAFETY: Destructive: recursive force-delete
+```
+## Architecture
+```
+┌─────────────────────────────────────────┐
+│           Natural Language Input         │
+└──────────────┬──────────────────────────┘
+               │
+┌──────────────▼──────────────────────────┐
+│         System Prompt (OS-specific)       │
+│    LINUX_PROMPT / MACOS_PROMPT /         │
+│    WINDOWS_PROMPT + Safety Rules          │
+└──────────────┬──────────────────────────┘
+               │
+┌──────────────▼──────────────────────────┐
+│       Qwen3-1.5B (LoRA fine-tuned)       │
+│         1.5B parameters                  │
+└──────────────┬──────────────────────────┘
+               │
+┌──────────────▼──────────────────────────┐
+│          Output Cleaning                  │
+│   - Strip markdown/backticks             │
+│   - Remove model prefixes                │
+│   - Multi-line pipe/chain handling        │
+└──────────────┬──────────────────────────┘
+               │
+┌──────────────▼──────────────────────────┐
+│          Safety Check                     │
+│   - rm -rf protection                    │
+│   - sudo warning                         │
+│   - pipe-to-shell detection              │
+└──────────────┬──────────────────────────┘
+               │
+┌──────────────▼──────────────────────────┐
+│         Shell Command Output              │
+└───────────────────────────────────────────┘
+```
+## Project Structure
+```
+shell-whisperer/
+├── pyproject.toml
+├── README.md
+├── src/shell_whisperer/
+│   ├── __init__.py           # Package init + exports
+│   ├── prompts.py            # OS-specific system prompts + safety rules
+│   ├── data_extractor.py     # Fable5 trace extraction + quality filtering
+│   ├── trainer.py            # LoRA/Full fine-tuning on Qwen3-1.5B
+│   ├── exporter.py           # ONNX, GGUF, 4-bit/8-bit export + memory estimation
+│   ├── inference.py          # Multi-backend inference (Transformers, ONNX, llama.cpp)
+│   ├── server.py             # FastAPI server (REST + WebSocket)
+│   └── cli.py                # CLI: sw predict, train, export, serve
+└── tests/
+    ├── test_data_extractor.py
+    └── test_inference.py
+```
+## Development
+```bash
+# Install dev dependencies
+pip install -e ".[dev]"
+# Run tests
+pytest
+# Lint
+ruff check src/ tests/
+# Type check
+mypy src/
+```
+## License
+MIT
+## Ecosystem
+Part of the [FableForge](../) ecosystem — 21 open-source projects built from 210K real agent traces:
+| Project | Description |
+| --- | --- |
+| **[Anvil](../anvil)** | Self-verified coding agent |
+| **[VerifyLoop](../verifyloop)** | Plan→Execute→Verify→Recover framework |
+| **[ErrorRecovery](../error-recovery)** | Self-healing middleware (3,725 error patterns) |
+| **[FableForge-14B](../fableforge-14b)** | The fine-tuned 14B model (4-stage training) |
+| **[ShellWhisperer](../shell-whisperer)** | 1.5B edge agent (phone/RPi, 50ms) |
+| **[ReasonCritic](../reason-critic)** | Verification model (130 benchmark tasks) |
+| **[TraceCompiler](../trace-compiler)** | Compile traces → LoRA skills |
+| **[AgentRuntime](../agent-runtime)** | Persistent agent daemon (systemd for AI) |
+| **[AgentSwarm](../agent-swarm)** | Multi-agent from real trace transitions |
+| **[AgentTelemetry](../agent-telemetry)** | Datadog for agents (token tracking, costs) |
+| **[BenchAgent](../bench-agent)** | HumanEval for tool-use (107 tasks) |
+| **[AgentDev](../agent-dev)** | VSCode extension with verification |
+| **[TraceViz](../trace-viz)** | Trace replay visualizer (Next.js) |
+| **[AgentSkills](../agent-skills)** | npm for agent behaviors |
+| **[AgentCurriculum](../agent-curriculum)** | 5-stage progressive training |
+| **[AgentFuzzer](../agent-fuzzer)** | Adversarial testing for agents |
+| **[AgentConstitution](../agent-constitution)** | Safety guardrails from traces |
+| **[CostOptimizer](../cost-optimizer)** | Token cost reduction (50-80%) |
+| **[AgentProfiler](../agent-profiler)** | Behavioral fingerprinting |
+| **[TrajectoryDistiller](../trajectory-distiller)** | Trace→training data pipeline |
+| **[Fable5-Dataset](../fable5-dataset)** | HuggingFace dataset release |