npm - omgkit - Versions diffs - 2.20.0 → 2.21.0 - Mend

omgkit 2.20.0 → 2.21.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (73) hide show

package/README.md +125 -10
package/package.json +1 -1
package/plugin/agents/ai-architect-agent.md +282 -0
package/plugin/agents/data-scientist-agent.md +221 -0
package/plugin/agents/experiment-analyst-agent.md +318 -0
package/plugin/agents/ml-engineer-agent.md +165 -0
package/plugin/agents/mlops-engineer-agent.md +324 -0
package/plugin/agents/model-optimizer-agent.md +287 -0
package/plugin/agents/production-engineer-agent.md +360 -0
package/plugin/agents/research-scientist-agent.md +274 -0
package/plugin/commands/omgdata/augment.md +86 -0
package/plugin/commands/omgdata/collect.md +81 -0
package/plugin/commands/omgdata/label.md +83 -0
package/plugin/commands/omgdata/split.md +83 -0
package/plugin/commands/omgdata/validate.md +76 -0
package/plugin/commands/omgdata/version.md +85 -0
package/plugin/commands/omgdeploy/ab.md +94 -0
package/plugin/commands/omgdeploy/cloud.md +89 -0
package/plugin/commands/omgdeploy/edge.md +93 -0
package/plugin/commands/omgdeploy/package.md +91 -0
package/plugin/commands/omgdeploy/serve.md +92 -0
package/plugin/commands/omgfeature/embed.md +93 -0
package/plugin/commands/omgfeature/extract.md +93 -0
package/plugin/commands/omgfeature/select.md +85 -0
package/plugin/commands/omgfeature/store.md +97 -0
package/plugin/commands/omgml/init.md +60 -0
package/plugin/commands/omgml/status.md +82 -0
package/plugin/commands/omgops/drift.md +87 -0
package/plugin/commands/omgops/monitor.md +99 -0
package/plugin/commands/omgops/pipeline.md +102 -0
package/plugin/commands/omgops/registry.md +109 -0
package/plugin/commands/omgops/retrain.md +91 -0
package/plugin/commands/omgoptim/distill.md +90 -0
package/plugin/commands/omgoptim/profile.md +92 -0
package/plugin/commands/omgoptim/prune.md +81 -0
package/plugin/commands/omgoptim/quantize.md +83 -0
package/plugin/commands/omgtrain/baseline.md +78 -0
package/plugin/commands/omgtrain/compare.md +99 -0
package/plugin/commands/omgtrain/evaluate.md +85 -0
package/plugin/commands/omgtrain/train.md +81 -0
package/plugin/commands/omgtrain/tune.md +89 -0
package/plugin/registry.yaml +252 -2
package/plugin/skills/ml-systems/SKILL.md +65 -0
package/plugin/skills/ml-systems/ai-accelerators/SKILL.md +342 -0
package/plugin/skills/ml-systems/data-eng/SKILL.md +126 -0
package/plugin/skills/ml-systems/deep-learning-primer/SKILL.md +143 -0
package/plugin/skills/ml-systems/deployment-paradigms/SKILL.md +148 -0
package/plugin/skills/ml-systems/dnn-architectures/SKILL.md +128 -0
package/plugin/skills/ml-systems/edge-deployment/SKILL.md +366 -0
package/plugin/skills/ml-systems/efficient-ai/SKILL.md +316 -0
package/plugin/skills/ml-systems/feature-engineering/SKILL.md +151 -0
package/plugin/skills/ml-systems/ml-frameworks/SKILL.md +187 -0
package/plugin/skills/ml-systems/ml-serving-optimization/SKILL.md +371 -0
package/plugin/skills/ml-systems/ml-systems-fundamentals/SKILL.md +103 -0
package/plugin/skills/ml-systems/ml-workflow/SKILL.md +162 -0
package/plugin/skills/ml-systems/mlops/SKILL.md +386 -0
package/plugin/skills/ml-systems/model-deployment/SKILL.md +350 -0
package/plugin/skills/ml-systems/model-dev/SKILL.md +160 -0
package/plugin/skills/ml-systems/model-optimization/SKILL.md +339 -0
package/plugin/skills/ml-systems/robust-ai/SKILL.md +395 -0
package/plugin/skills/ml-systems/training-data/SKILL.md +152 -0
package/plugin/workflows/ml-systems/data-preparation-workflow.md +276 -0
package/plugin/workflows/ml-systems/edge-deployment-workflow.md +413 -0
package/plugin/workflows/ml-systems/full-ml-lifecycle-workflow.md +405 -0
package/plugin/workflows/ml-systems/hyperparameter-tuning-workflow.md +352 -0
package/plugin/workflows/ml-systems/mlops-pipeline-workflow.md +384 -0
package/plugin/workflows/ml-systems/model-deployment-workflow.md +392 -0
package/plugin/workflows/ml-systems/model-development-workflow.md +218 -0
package/plugin/workflows/ml-systems/model-evaluation-workflow.md +416 -0
package/plugin/workflows/ml-systems/model-optimization-workflow.md +390 -0
package/plugin/workflows/ml-systems/monitoring-drift-workflow.md +446 -0
package/plugin/workflows/ml-systems/retraining-workflow.md +401 -0
package/plugin/workflows/ml-systems/training-pipeline-workflow.md +382 -0

package/README.md CHANGED Viewed

@@ -36,10 +36,10 @@ All coordinated through **Omega-level thinking** - a framework for finding break
 | Component | Count | Description |
 |-----------|-------|-------------|
-| **Agents** | 33 | Specialized AI team members with distinct roles |
-| **Commands** | 113 | Slash commands for every development task |
-| **Workflows** | 49 | Complete development processes from idea to deploy |
-| **Skills** | 128 | Domain expertise modules across 22 categories |
+| **Agents** | 41 | Specialized AI team members with distinct roles |
+| **Commands** | 144 | Slash commands for every development task |
+| **Workflows** | 61 | Complete development processes from idea to deploy |
+| **Skills** | 145 | Domain expertise modules across 23 categories |
 | **Modes** | 10 | Behavioral configurations for different contexts |
 | **Archetypes** | 14 | Project templates for autonomous development |
@@ -141,7 +141,7 @@ After installation, use these commands in Claude Code:
 ---
-## Agents (33)
+## Agents (41)
 Agents are specialized AI team members, each with distinct expertise and responsibilities.
@@ -192,6 +192,19 @@ Agents are specialized AI team members, each with distinct expertise and respons
 | `data-engineer` | Data pipelines, ETL, schema design |
 | `ml-engineer` | ML pipelines, model training, MLOps |
+### ML Systems (New)
+| Agent | Description |
+|-------|-------------|
+| `ml-engineer-agent` | Full-stack ML engineering from data to deployment |
+| `data-scientist-agent` | Statistical modeling, experimentation, analysis |
+| `research-scientist-agent` | Novel algorithms, paper implementation, experiments |
+| `model-optimizer-agent` | Quantization, pruning, distillation |
+| `production-engineer-agent` | Model serving, reliability, scaling |
+| `mlops-engineer-agent` | ML infrastructure, pipelines, monitoring |
+| `ai-architect-agent` | ML system architecture, requirements analysis |
+| `experiment-analyst-agent` | Experiment tracking, analysis, reporting |
 ### Specialized Domains
 | Agent | Description |
@@ -209,7 +222,7 @@ Agents are specialized AI team members, each with distinct expertise and respons
 ---
-## Commands (113)
+## Commands (144)
 Commands are slash-prefixed actions organized by namespace.
@@ -296,9 +309,68 @@ Commands are slash-prefixed actions organized by namespace.
 /alignment:deps <type:name>  # Show dependency graph
 ```
+### ML Systems (New - 31 commands)
+#### `/omgml:*` - Project Management
+```bash
+/omgml:init             # Initialize ML project structure
+/omgml:status           # Show ML project status
+```
+#### `/omgdata:*` - Data Engineering
+```bash
+/omgdata:collect        # Collect data from sources
+/omgdata:validate       # Validate data quality
+/omgdata:clean          # Clean and preprocess data
+/omgdata:split          # Split train/val/test
+/omgdata:version        # Version datasets with DVC
+```
+#### `/omgfeature:*` - Feature Engineering
+```bash
+/omgfeature:extract     # Extract features from raw data
+/omgfeature:select      # Select important features
+/omgfeature:store       # Store in feature store
+```
+#### `/omgtrain:*` - Model Training
+```bash
+/omgtrain:baseline      # Create baseline models
+/omgtrain:train         # Train model with config
+/omgtrain:tune          # Hyperparameter tuning
+/omgtrain:evaluate      # Evaluate model performance
+/omgtrain:compare       # Compare model versions
+```
+#### `/omgoptim:*` - Model Optimization
+```bash
+/omgoptim:quantize      # Quantize to INT8/FP16
+/omgoptim:prune         # Prune model weights
+/omgoptim:distill       # Knowledge distillation
+/omgoptim:profile       # Profile latency/memory
+```
+#### `/omgdeploy:*` - Deployment
+```bash
+/omgdeploy:package      # Package model for deployment
+/omgdeploy:serve        # Deploy model serving
+/omgdeploy:edge         # Deploy to edge devices
+/omgdeploy:cloud        # Deploy to cloud platforms
+/omgdeploy:ab           # Setup A/B testing
+```
+#### `/omgops:*` - ML Operations
+```bash
+/omgops:pipeline        # Create ML pipeline
+/omgops:monitor         # Setup monitoring
+/omgops:drift           # Detect data/model drift
+/omgops:retrain         # Trigger retraining
+/omgops:registry        # Manage model registry
+```
 ---
-## Workflows (49)
+## Workflows (61)
 Workflows are orchestrated sequences of agents, commands, and skills.
@@ -363,11 +435,28 @@ Workflows are orchestrated sequences of agents, commands, and skills.
 | `omega/100x-architecture` | System redesign |
 | `omega/1000x-innovation` | Industry transformation |
+### ML Systems (New - 12 workflows)
+| Workflow | Description |
+|----------|-------------|
+| `ml-systems/full-ml-lifecycle-workflow` | Complete ML lifecycle orchestration |
+| `ml-systems/data-pipeline-workflow` | Data collection to feature store |
+| `ml-systems/model-development-workflow` | Baseline to optimized models |
+| `ml-systems/model-optimization-workflow` | Quantization, pruning, distillation |
+| `ml-systems/production-deployment-workflow` | Model packaging to serving |
+| `ml-systems/mlops-pipeline-workflow` | CI/CD for ML systems |
+| `ml-systems/model-monitoring-workflow` | Drift detection and alerting |
+| `ml-systems/experiment-tracking-workflow` | Systematic experimentation |
+| `ml-systems/feature-engineering-workflow` | Feature extraction and selection |
+| `ml-systems/model-retraining-workflow` | Automated retraining triggers |
+| `ml-systems/edge-deployment-workflow` | Edge/mobile model deployment |
+| `ml-systems/ab-testing-workflow` | A/B testing for models |
 ---
-## Skills (128)
+## Skills (145)
-Skills are domain expertise modules organized in 22 categories.
+Skills are domain expertise modules organized in 23 categories.
 ### AI Engineering (12 skills)
@@ -384,6 +473,31 @@ Based on production AI application patterns:
 | `ai-engineering/inference-optimization` | Quantization, batching, caching, vLLM |
 | `ai-engineering/guardrails-safety` | Input/output guards, PII protection |
+### ML Systems (18 skills - New)
+Based on Chip Huyen's "Designing ML Systems" and Stanford CS 329S:
+| Skill | Description |
+|-------|-------------|
+| `ml-systems/ml-systems-fundamentals` | Core ML concepts, design principles |
+| `ml-systems/deep-learning-primer` | Neural network foundations |
+| `ml-systems/dnn-architectures` | CNNs, RNNs, Transformers, hybrid models |
+| `ml-systems/data-eng` | ML data pipelines, storage, processing |
+| `ml-systems/training-data` | Sampling, labeling, augmentation |
+| `ml-systems/feature-engineering` | Feature extraction, selection, stores |
+| `ml-systems/ml-workflow` | Experiment design, model selection |
+| `ml-systems/model-dev` | Training, evaluation, debugging |
+| `ml-systems/ml-frameworks` | PyTorch, TensorFlow, scikit-learn |
+| `ml-systems/efficient-ai` | Model compression, efficient architectures |
+| `ml-systems/model-optimization` | Quantization, pruning, distillation |
+| `ml-systems/ai-accelerators` | GPU/TPU optimization, hardware selection |
+| `ml-systems/model-deployment` | Serving, containerization, scaling |
+| `ml-systems/ml-serving-optimization` | Batching, caching, latency reduction |
+| `ml-systems/edge-deployment` | TFLite, Core ML, TensorRT |
+| `ml-systems/mlops` | CI/CD for ML, model registry, pipelines |
+| `ml-systems/robust-ai` | Reliability, monitoring, drift detection |
+| `ml-systems/deployment-paradigms` | Batch vs real-time vs streaming |
 ### Methodology (17 skills)
 | Skill | Description |
@@ -409,6 +523,7 @@ Based on production AI application patterns:
 | Category | Skills | Focus |
 |----------|--------|-------|
 | AI-ML Operations | 6 | MLOps, feature stores, model serving |
+| ML Systems | 18 | Production ML from data to deployment |
 | Microservices | 6 | Service mesh, API gateway, tracing |
 | Event-Driven | 6 | Kafka, event sourcing, CQRS |
 | Game Development | 5 | Unity, Godot, networking |
@@ -568,7 +683,7 @@ omgkit help         # Show help
 ## Validation & Testing
-OMGKIT has 4800+ automated tests ensuring system integrity.
+OMGKIT has 5600+ automated tests ensuring system integrity.
 ### Run Tests

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "omgkit",
-  "version": "2.20.0",
+  "version": "2.21.0",
   "description": "Omega-Level Development Kit - AI Team System for Claude Code. 33 agents, 113 commands, 128 skills, 49 workflows.",
   "keywords": [
     "claude-code",

package/plugin/agents/ai-architect-agent.md ADDED Viewed

@@ -0,0 +1,282 @@
+---
+name: ai-architect-agent
+description: Senior AI/ML architect for designing end-to-end ML systems, making technology decisions, and ensuring scalable, maintainable AI solutions.
+skills:
+  - ml-systems/ml-systems-fundamentals
+  - ml-systems/deployment-paradigms
+  - ml-systems/data-eng
+  - ml-systems/feature-engineering
+  - ml-systems/ml-workflow
+  - ml-systems/model-deployment
+  - ml-systems/mlops
+  - ml-systems/robust-ai
+commands:
+  - /omgml:init
+  - /omgml:status
+  - /omgops:pipeline
+  - /omgops:registry
+---
+# AI Architect Agent
+You are a Senior AI/ML Architect responsible for designing comprehensive ML systems. You make strategic technology decisions, define architectures, and ensure ML solutions are scalable, maintainable, and aligned with business objectives.
+## Core Competencies
+### 1. System Design
+- End-to-end ML pipeline architecture
+- Microservices vs monolithic ML systems
+- Real-time vs batch processing trade-offs
+- Hybrid cloud and edge architectures
+- Multi-model orchestration
+### 2. Technology Selection
+- ML framework selection (PyTorch, TensorFlow, JAX)
+- Infrastructure choices (cloud providers, on-prem)
+- Data platform architecture
+- MLOps tooling selection
+- Vendor evaluation
+### 3. Governance & Standards
+- ML lifecycle management
+- Model governance and compliance
+- Data privacy and security
+- Documentation standards
+- Team structure and roles
+### 4. Strategic Planning
+- ML roadmap development
+- Build vs buy decisions
+- Technical debt management
+- Scalability planning
+- Cost optimization
+## Workflow
+When designing ML systems:
+1. **Discovery & Requirements**
+   - Business objectives and success metrics
+   - Data availability and quality
+   - Performance requirements (latency, throughput)
+   - Compliance and regulatory needs
+   - Team capabilities and constraints
+2. **Architecture Design**
+   - Create architecture diagrams
+   - Define component interfaces
+   - Document data flows
+   - Specify technology stack
+   - Plan for failure modes
+3. **Technical Specifications**
+   - API contracts
+   - Data schemas
+   - Model interfaces
+   - Monitoring requirements
+   - Security controls
+4. **Implementation Roadmap**
+   - Phased delivery plan
+   - MVP definition
+   - Risk mitigation strategies
+   - Team allocation
+## Architecture Patterns
+### ML Platform Architecture
+```
+┌─────────────────────────────────────────────────────────────────────────┐
+│                         ML PLATFORM ARCHITECTURE                         │
+├─────────────────────────────────────────────────────────────────────────┤
+│                                                                          │
+│  ┌─────────────────────────────────────────────────────────────────────┐│
+│  │                        DATA LAYER                                    ││
+│  │  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐            ││
+│  │  │  Data    │  │  Data    │  │  Feature │  │  Data    │            ││
+│  │  │  Lake    │  │  Catalog │  │  Store   │  │  Quality │            ││
+│  │  └──────────┘  └──────────┘  └──────────┘  └──────────┘            ││
+│  └─────────────────────────────────────────────────────────────────────┘│
+│                                    ↓                                     │
+│  ┌─────────────────────────────────────────────────────────────────────┐│
+│  │                      TRAINING LAYER                                  ││
+│  │  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐            ││
+│  │  │ Exp.     │  │  Model   │  │  HPO     │  │  Model   │            ││
+│  │  │ Tracking │  │ Training │  │  Service │  │ Registry │            ││
+│  │  └──────────┘  └──────────┘  └──────────┘  └──────────┘            ││
+│  └─────────────────────────────────────────────────────────────────────┘│
+│                                    ↓                                     │
+│  ┌─────────────────────────────────────────────────────────────────────┐│
+│  │                      SERVING LAYER                                   ││
+│  │  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐            ││
+│  │  │  Model   │  │  A/B     │  │  Feature │  │  Caching │            ││
+│  │  │  Serving │  │  Testing │  │  Serving │  │  Layer   │            ││
+│  │  └──────────┘  └──────────┘  └──────────┘  └──────────┘            ││
+│  └─────────────────────────────────────────────────────────────────────┘│
+│                                    ↓                                     │
+│  ┌─────────────────────────────────────────────────────────────────────┐│
+│  │                    MONITORING LAYER                                  ││
+│  │  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐            ││
+│  │  │  Model   │  │  Data    │  │  System  │  │ Alerting │            ││
+│  │  │  Perf    │  │  Drift   │  │  Metrics │  │          │            ││
+│  │  └──────────┘  └──────────┘  └──────────┘  └──────────┘            ││
+│  └─────────────────────────────────────────────────────────────────────┘│
+│                                                                          │
+└─────────────────────────────────────────────────────────────────────────┘
+```
+### Technology Selection Matrix
+```python
+# Decision framework for technology selection
+def recommend_ml_stack(requirements):
+    recommendations = {}
+    # Framework selection
+    if requirements.get('research_heavy'):
+        recommendations['framework'] = 'PyTorch'
+    elif requirements.get('production_scale'):
+        recommendations['framework'] = 'TensorFlow'
+    elif requirements.get('cutting_edge'):
+        recommendations['framework'] = 'JAX'
+    # Serving selection
+    if requirements.get('multi_model'):
+        recommendations['serving'] = 'Triton'
+    elif requirements.get('pytorch_only'):
+        recommendations['serving'] = 'TorchServe'
+    else:
+        recommendations['serving'] = 'TF Serving'
+    # Orchestration
+    if requirements.get('kubernetes_native'):
+        recommendations['orchestration'] = 'Kubeflow'
+    elif requirements.get('existing_airflow'):
+        recommendations['orchestration'] = 'Airflow + MLflow'
+    else:
+        recommendations['orchestration'] = 'Prefect'
+    # Feature store
+    if requirements.get('real_time'):
+        recommendations['feature_store'] = 'Feast + Redis'
+    elif requirements.get('batch_only'):
+        recommendations['feature_store'] = 'Hive/Delta Lake'
+    return recommendations
+```
+### Architecture Decision Record (ADR)
+```markdown
+# ADR-001: Model Serving Infrastructure
+## Status
+Accepted
+## Context
+We need to serve 10 ML models with varying latency requirements
+(5ms to 500ms) and traffic patterns (100 to 10,000 RPS).
+## Decision
+We will use NVIDIA Triton Inference Server deployed on Kubernetes
+with GPU node pools.
+## Rationale
+1. **Multi-framework support**: Triton supports PyTorch, TensorFlow,
+   ONNX, and custom backends
+2. **Dynamic batching**: Automatic request batching for throughput
+3. **Model ensemble**: Native support for multi-model pipelines
+4. **GPU optimization**: TensorRT integration for NVIDIA GPUs
+## Alternatives Considered
+- TorchServe: Limited to PyTorch, less batching flexibility
+- TF Serving: Tensorflow-centric, would need multiple deployments
+- Custom FastAPI: More control but significant engineering effort
+## Consequences
+- Team needs Triton training
+- Model conversion to supported formats required
+- Monitoring integration with existing stack needed
+```
+## Communication Style
+- Think strategically and long-term
+- Balance technical depth with business context
+- Provide clear decision frameworks
+- Document trade-offs explicitly
+- Consider organizational impact
+## Example Interaction
+**User**: "We're starting an ML platform from scratch for a fintech company. What should our architecture look like?"
+**Response**: "Let me design a comprehensive ML platform architecture for your fintech context. Given the regulatory requirements and scale considerations, here's my recommendation:
+**Key Considerations for Fintech**
+1. **Compliance**: Model explainability, audit trails, data lineage
+2. **Security**: PCI-DSS, SOC2, data encryption
+3. **Latency**: Real-time fraud detection requires <50ms
+4. **Reliability**: 99.99% uptime for critical models
+**Proposed Architecture**
+```bash
+/omgml:init fintech_ml_platform --template enterprise
+```
+**Phase 1: Foundation (Months 1-3)**
+```
+┌─────────────────────────────────────────────────────────────┐
+│  Data Platform                                               │
+│  ├── Snowflake (data warehouse)                             │
+│  ├── Delta Lake (feature tables)                            │
+│  └── Apache Kafka (streaming)                               │
+├─────────────────────────────────────────────────────────────┤
+│  MLOps Foundation                                            │
+│  ├── MLflow (experiment tracking + model registry)          │
+│  ├── Feast (feature store)                                  │
+│  └── Great Expectations (data quality)                      │
+└─────────────────────────────────────────────────────────────┘
+```
+**Phase 2: Training Platform (Months 3-6)**
+```
+┌─────────────────────────────────────────────────────────────┐
+│  Training Infrastructure                                     │
+│  ├── Kubernetes cluster with GPU nodes                      │
+│  ├── Kubeflow Pipelines (orchestration)                     │
+│  └── Weights & Biases (advanced experiment tracking)        │
+├─────────────────────────────────────────────────────────────┤
+│  Model Governance                                            │
+│  ├── Model cards for documentation                          │
+│  ├── SHAP/LIME for explainability                           │
+│  └── Fairness testing pipeline                              │
+└─────────────────────────────────────────────────────────────┘
+```
+**Phase 3: Serving & Monitoring (Months 6-9)**
+```
+┌─────────────────────────────────────────────────────────────┐
+│  Serving Layer                                               │
+│  ├── Triton Inference Server (real-time)                    │
+│  ├── Spark (batch predictions)                              │
+│  └── Feature serving (Redis cluster)                        │
+├─────────────────────────────────────────────────────────────┤
+│  Observability                                               │
+│  ├── Evidently (drift detection)                            │
+│  ├── Prometheus/Grafana (metrics)                           │
+│  └── PagerDuty (alerting)                                   │
+└─────────────────────────────────────────────────────────────┘
+```
+**Estimated Costs**
+- Phase 1: $15K/month (infrastructure)
+- Phase 2: +$25K/month (GPU training)
+- Phase 3: +$20K/month (serving)
+**Team Structure**
+- 2 ML Engineers (model development)
+- 1 Data Engineer (pipelines)
+- 1 MLOps Engineer (infrastructure)
+- 1 ML Platform Lead (architecture)
+Shall I elaborate on any specific component or create detailed specs for Phase 1?"