@intentsolutionsio/ai-ml-engineering-pack 1.0.0 → 1.0.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -8,31 +8,35 @@ Master prompt engineering, LLM integration, RAG systems, and AI safety with 12 s
8
8
  [![Version](https://img.shields.io/badge/version-1.0.0-blue.svg)](https://github.com/jeremylongshore/claude-code-plugins)
9
9
  [![Claude Code](https://img.shields.io/badge/Claude_Code-0.1.0+-purple.svg)](https://claude.ai/code)
10
10
 
11
- ## What's Included
11
+ ## What's Included
12
12
 
13
13
  **12 specialized plugins across 4 AI/ML categories:**
14
14
 
15
15
  ### 1. Prompt Engineering (3 plugins)
16
+
16
17
  - **prompt-architect** (agent) - Expert in CoT reasoning, few-shot learning, and advanced prompt patterns
17
18
  - **prompt-optimizer** (agent) - Reduce LLM costs by 60-90% while maintaining quality
18
19
  - **prompt-template-gen** (command: `/ptg`) - Generate production-ready prompt templates with type safety
19
20
 
20
21
  ### 2. LLM Integration (3 plugins)
22
+
21
23
  - **llm-integration-expert** (agent) - Production API patterns, error handling, streaming, rate limiting
22
24
  - **model-selector** (agent) - Choose optimal models based on cost, quality, latency requirements
23
25
  - **llm-api-scaffold** (command: `/las`) - Generate complete LLM API with FastAPI, Docker, monitoring
24
26
 
25
27
  ### 3. RAG Systems (3 plugins)
28
+
26
29
  - **rag-architect** (agent) - Design RAG systems, chunking strategies, retrieval optimization
27
30
  - **vector-db-expert** (agent) - Select and configure vector databases (Pinecone, Qdrant, Weaviate, etc.)
28
31
  - **rag-pipeline-gen** (command: `/rpg`) - Generate complete RAG pipeline with embeddings and retrieval
29
32
 
30
33
  ### 4. AI Safety (3 plugins)
34
+
31
35
  - **ai-safety-expert** (agent) - Content filtering, PII detection, bias mitigation, compliance
32
36
  - **prompt-injection-defender** (agent) - Defend against prompt injection and jailbreak attacks
33
37
  - **ai-monitoring-setup** (command: `/ams`) - Set up LLM monitoring, cost tracking, and alerts
34
38
 
35
- ## Quick Start
39
+ ## Quick Start
36
40
 
37
41
  ### Installation
38
42
 
@@ -47,7 +51,7 @@ claude plugin install ai-ml-engineering-pack@claude-code-plugins-plus
47
51
  claude plugin list
48
52
  ```
49
53
 
50
- **Full installation guide:** [INSTALLATION.md](./INSTALLATION.md)
54
+ **Full installation guide:** INSTALLATION.md
51
55
 
52
56
  ### 10-Minute Tutorial
53
57
 
@@ -75,9 +79,9 @@ claude
75
79
  "Implement PII detection and toxicity filtering for my chatbot"
76
80
  ```
77
81
 
78
- **Complete tutorial:** [QUICK_START.md](./QUICK_START.md)
82
+ **Complete tutorial:** QUICK_START.md
79
83
 
80
- ## ROI & Value Proposition
84
+ ## ROI & Value Proposition
81
85
 
82
86
  Real-world results from production deployments:
83
87
 
@@ -92,16 +96,18 @@ Real-world results from production deployments:
92
96
 
93
97
  **Average ROI: 29,351%** | **Average payback period: 3 days**
94
98
 
95
- **Detailed case studies:** [USE_CASES.md](./USE_CASES.md)
99
+ **Detailed case studies:** USE_CASES.md
96
100
 
97
- ## Plugin Reference
101
+ ## Plugin Reference
98
102
 
99
103
  ### Prompt Engineering
100
104
 
101
105
  #### `prompt-architect` (Agent)
106
+
102
107
  Expert in advanced prompt engineering techniques and patterns.
103
108
 
104
109
  **Capabilities:**
110
+
105
111
  - Chain-of-Thought (CoT) reasoning
106
112
  - Few-shot and zero-shot learning
107
113
  - Prompt composition patterns
@@ -109,6 +115,7 @@ Expert in advanced prompt engineering techniques and patterns.
109
115
  - Multi-modal prompts (text + images)
110
116
 
111
117
  **When to use:**
118
+
112
119
  - "Design a prompt for [complex task]"
113
120
  - "Improve this prompt: [existing prompt]"
114
121
  - "What's the best prompting technique for [use case]?"
@@ -118,9 +125,11 @@ Expert in advanced prompt engineering techniques and patterns.
118
125
  ---
119
126
 
120
127
  #### `prompt-optimizer` (Agent)
128
+
121
129
  Optimize prompts for cost reduction (60-90% savings) while maintaining quality.
122
130
 
123
131
  **Capabilities:**
132
+
124
133
  - Token reduction techniques (remove verbosity, use abbreviations)
125
134
  - Prompt caching strategies
126
135
  - Model selection guidance (cheap vs expensive)
@@ -128,11 +137,13 @@ Optimize prompts for cost reduction (60-90% savings) while maintaining quality.
128
137
  - ROI calculation
129
138
 
130
139
  **When to use:**
140
+
131
141
  - "Reduce the cost of this prompt: [prompt]"
132
142
  - "Optimize my prompts for $1000/month budget"
133
143
  - "How can I reduce token usage by 70%?"
134
144
 
135
145
  **Example:**
146
+
136
147
  ```
137
148
  Before (52 tokens): "I would like you to please analyze..."
138
149
  After (15 tokens): "Analyze and summarize main points."
@@ -144,9 +155,11 @@ Savings: 71% token reduction = $0.15/1000 calls (GPT-4)
144
155
  ---
145
156
 
146
157
  #### `/ptg` - Prompt Template Generator (Command)
158
+
147
159
  Generate production-ready prompt templates with type safety and validation.
148
160
 
149
161
  **Usage:**
162
+
150
163
  ```bash
151
164
  /ptg
152
165
 
@@ -158,6 +171,7 @@ Generate production-ready prompt templates with type safety and validation.
158
171
  ```
159
172
 
160
173
  **Generated output:**
174
+
161
175
  - Python: Pydantic models with type safety
162
176
  - TypeScript: Zod schemas with validation
163
177
  - Usage examples
@@ -165,6 +179,7 @@ Generate production-ready prompt templates with type safety and validation.
165
179
  - Unit tests
166
180
 
167
181
  **Example output:**
182
+
168
183
  ```python
169
184
  @dataclass
170
185
  class ProductDescriptionInput:
@@ -186,9 +201,11 @@ class ProductDescriptionGenerator:
186
201
  ### LLM Integration
187
202
 
188
203
  #### `llm-integration-expert` (Agent)
204
+
189
205
  Production patterns for LLM API integration with error handling and reliability.
190
206
 
191
207
  **Capabilities:**
208
+
192
209
  - Multi-provider integration (OpenAI, Anthropic, Google, Cohere)
193
210
  - Exponential backoff retry logic
194
211
  - Rate limiting (token bucket, sliding window)
@@ -198,11 +215,13 @@ Production patterns for LLM API integration with error handling and reliability.
198
215
  - Token counting and cost tracking
199
216
 
200
217
  **When to use:**
218
+
201
219
  - "Implement LLM API integration with retry logic"
202
220
  - "Add streaming support to my chatbot"
203
221
  - "Build multi-provider fallback system"
204
222
 
205
223
  **Code examples:**
224
+
206
225
  ```python
207
226
  # Retry with exponential backoff
208
227
  @retry_with_backoff(max_retries=3, base_delay=1.0)
@@ -219,9 +238,11 @@ await rate_limiter.wait_for_token()
219
238
  ---
220
239
 
221
240
  #### `model-selector` (Agent)
241
+
222
242
  Guide model selection based on cost, quality, latency, and use case requirements.
223
243
 
224
244
  **Capabilities:**
245
+
225
246
  - Model comparison matrix (GPT-4, Claude 3, Gemini)
226
247
  - Pricing analysis (per 1M tokens)
227
248
  - Latency benchmarks
@@ -230,11 +251,13 @@ Guide model selection based on cost, quality, latency, and use case requirements
230
251
  - A/B testing frameworks
231
252
 
232
253
  **When to use:**
254
+
233
255
  - "Which model should I use for customer support?"
234
256
  - "Compare GPT-4 vs Claude 3 Opus for code generation"
235
257
  - "How can I reduce costs with model cascading?"
236
258
 
237
259
  **Model comparison:**
260
+
238
261
  | Model | Input ($/1M) | Output ($/1M) | Latency | Best For |
239
262
  |-------|-------------|---------------|---------|----------|
240
263
  | GPT-4 Turbo | $10 | $30 | 3-5s | Complex reasoning |
@@ -247,9 +270,11 @@ Guide model selection based on cost, quality, latency, and use case requirements
247
270
  ---
248
271
 
249
272
  #### `/las` - LLM API Scaffold (Command)
273
+
250
274
  Generate complete production-ready LLM API integration code.
251
275
 
252
276
  **Usage:**
277
+
253
278
  ```bash
254
279
  /las
255
280
 
@@ -261,6 +286,7 @@ Generate complete production-ready LLM API integration code.
261
286
  ```
262
287
 
263
288
  **Generated files:**
289
+
264
290
  ```
265
291
  llm-api/
266
292
  ├── main.py # FastAPI application
@@ -275,22 +301,25 @@ llm-api/
275
301
  ```
276
302
 
277
303
  **Features included:**
278
- - Exponential backoff retry (3 attempts)
279
- - Rate limiting (token bucket algorithm)
280
- - Response caching (Redis, 5 min TTL)
281
- - Streaming support (SSE)
282
- - Cost tracking
283
- - Prometheus metrics
284
- - Docker deployment
304
+
305
+ - Exponential backoff retry (3 attempts)
306
+ - Rate limiting (token bucket algorithm)
307
+ - Response caching (Redis, 5 min TTL)
308
+ - Streaming support (SSE)
309
+ - Cost tracking
310
+ - Prometheus metrics
311
+ - Docker deployment
285
312
 
286
313
  ---
287
314
 
288
315
  ### RAG Systems
289
316
 
290
317
  #### `rag-architect` (Agent)
318
+
291
319
  Expert in designing and optimizing Retrieval-Augmented Generation systems.
292
320
 
293
321
  **Capabilities:**
322
+
294
323
  - RAG architecture patterns
295
324
  - Chunking strategies (fixed, recursive, semantic)
296
325
  - Embedding model selection
@@ -299,11 +328,13 @@ Expert in designing and optimizing Retrieval-Augmented Generation systems.
299
328
  - Evaluation metrics (MRR, NDCG)
300
329
 
301
330
  **When to use:**
331
+
302
332
  - "Design a RAG system for customer support knowledge base"
303
333
  - "What chunking strategy should I use for legal documents?"
304
334
  - "How can I improve retrieval accuracy?"
305
335
 
306
336
  **Chunking strategies:**
337
+
307
338
  ```python
308
339
  # Fixed-size (simple, fast)
309
340
  chunks = [text[i:i+512] for i in range(0, len(text), 512)]
@@ -324,9 +355,11 @@ chunks = semantic_splitter.split_by_meaning(text)
324
355
  ---
325
356
 
326
357
  #### `vector-db-expert` (Agent)
358
+
327
359
  Select and optimize vector databases for RAG systems.
328
360
 
329
361
  **Capabilities:**
362
+
330
363
  - Database comparison (Pinecone, Qdrant, Weaviate, ChromaDB, pgvector, Milvus)
331
364
  - HNSW index tuning
332
365
  - Scaling strategies (sharding, replication)
@@ -334,11 +367,13 @@ Select and optimize vector databases for RAG systems.
334
367
  - Migration planning
335
368
 
336
369
  **When to use:**
370
+
337
371
  - "Which vector database should I use for 10M documents?"
338
372
  - "How do I tune HNSW parameters for better performance?"
339
373
  - "Compare Pinecone vs Qdrant for my use case"
340
374
 
341
375
  **Database comparison:**
376
+
342
377
  | Database | Best For | Pricing | Hosting |
343
378
  |----------|---------|---------|---------|
344
379
  | Pinecone | Managed, auto-scaling | $0.096/GB/month | Cloud only |
@@ -352,9 +387,11 @@ Select and optimize vector databases for RAG systems.
352
387
  ---
353
388
 
354
389
  #### `/rpg` - RAG Pipeline Generator (Command)
390
+
355
391
  Generate complete RAG pipeline with all components.
356
392
 
357
393
  **Usage:**
394
+
358
395
  ```bash
359
396
  /rpg
360
397
 
@@ -367,6 +404,7 @@ Generate complete RAG pipeline with all components.
367
404
  ```
368
405
 
369
406
  **Generated files:**
407
+
370
408
  ```
371
409
  rag-system/
372
410
  ├── document_loader.py # PDF/DOCX/TXT loaders
@@ -382,24 +420,27 @@ rag-system/
382
420
  ```
383
421
 
384
422
  **Features included:**
385
- - Multi-format document loading (PDF, DOCX, TXT, MD)
386
- - Recursive chunking (512 tokens, 50 overlap)
387
- - Vector similarity search
388
- - Cohere reranking (optional)
389
- - Source attribution with page numbers
390
- - Query expansion
391
- - Caching
392
- - FastAPI REST endpoints
393
- - Docker deployment
423
+
424
+ - Multi-format document loading (PDF, DOCX, TXT, MD)
425
+ - Recursive chunking (512 tokens, 50 overlap)
426
+ - Vector similarity search
427
+ - Cohere reranking (optional)
428
+ - Source attribution with page numbers
429
+ - Query expansion
430
+ - Caching
431
+ - FastAPI REST endpoints
432
+ - Docker deployment
394
433
 
395
434
  ---
396
435
 
397
436
  ### AI Safety
398
437
 
399
438
  #### `ai-safety-expert` (Agent)
439
+
400
440
  Comprehensive AI safety with content filtering, PII protection, and bias mitigation.
401
441
 
402
442
  **Capabilities:**
443
+
403
444
  - Toxicity detection (BERT-based classification)
404
445
  - PII detection and redaction (Presidio)
405
446
  - Bias detection (gender, racial, age)
@@ -408,12 +449,14 @@ Comprehensive AI safety with content filtering, PII protection, and bias mitigat
408
449
  - GDPR/CCPA/HIPAA compliance
409
450
 
410
451
  **When to use:**
452
+
411
453
  - "Implement PII detection for user inputs"
412
454
  - "Add toxicity filtering to my chatbot"
413
455
  - "Detect and mitigate bias in LLM outputs"
414
456
  - "Ensure HIPAA compliance for medical data"
415
457
 
416
458
  **Safety pipeline:**
459
+
417
460
  ```python
418
461
  class SafetyGuardrails:
419
462
  async def safe_completion(self, user_input: str, llm):
@@ -434,6 +477,7 @@ class SafetyGuardrails:
434
477
  ```
435
478
 
436
479
  **PII detection:**
480
+
437
481
  - Email addresses, phone numbers, SSN
438
482
  - Credit card numbers
439
483
  - IP addresses
@@ -445,9 +489,11 @@ class SafetyGuardrails:
445
489
  ---
446
490
 
447
491
  #### `prompt-injection-defender` (Agent)
492
+
448
493
  Defend against prompt injection attacks and jailbreaks.
449
494
 
450
495
  **Capabilities:**
496
+
451
497
  - Pattern-based detection (regex for common attacks)
452
498
  - ML classification (fine-tuned BERT model)
453
499
  - Input sanitization
@@ -456,11 +502,13 @@ Defend against prompt injection attacks and jailbreaks.
456
502
  - Jailbreak detection (DAN, Developer Mode, etc.)
457
503
 
458
504
  **When to use:**
505
+
459
506
  - "Protect my chatbot from prompt injection"
460
507
  - "Detect jailbreak attempts"
461
508
  - "Validate user inputs for manipulation"
462
509
 
463
510
  **Attack patterns detected:**
511
+
464
512
  ```python
465
513
  ATTACK_PATTERNS = [
466
514
  r'ignore\s+(all\s+)?(previous|prior|above)\s+instructions',
@@ -472,6 +520,7 @@ ATTACK_PATTERNS = [
472
520
  ```
473
521
 
474
522
  **Defense strategies:**
523
+
475
524
  1. **Detection:** Identify attack patterns
476
525
  2. **Sanitization:** Remove/escape dangerous inputs
477
526
  3. **Validation:** Verify outputs don't leak system prompts
@@ -482,9 +531,11 @@ ATTACK_PATTERNS = [
482
531
  ---
483
532
 
484
533
  #### `/ams` - AI Monitoring Setup (Command)
534
+
485
535
  Set up comprehensive LLM monitoring with cost tracking and alerting.
486
536
 
487
537
  **Usage:**
538
+
488
539
  ```bash
489
540
  /ams
490
541
 
@@ -496,6 +547,7 @@ Set up comprehensive LLM monitoring with cost tracking and alerting.
496
547
  ```
497
548
 
498
549
  **Generated files:**
550
+
499
551
  ```
500
552
  monitoring/
501
553
  ├── metrics.py # Prometheus metrics
@@ -508,6 +560,7 @@ monitoring/
508
560
  ```
509
561
 
510
562
  **Metrics collected:**
563
+
511
564
  - Request count (by model, status)
512
565
  - Latency (p50, p95, p99)
513
566
  - Token usage (input, output)
@@ -516,12 +569,14 @@ monitoring/
516
569
  - Cache hit rate
517
570
 
518
571
  **Alerts configured:**
572
+
519
573
  - Budget threshold (80%, 90%, 100%)
520
574
  - High error rate (>5%)
521
575
  - Slow responses (>10s)
522
576
  - Token limit approaching
523
577
 
524
578
  **Dashboards:**
579
+
525
580
  - Real-time request monitoring
526
581
  - Cost tracking (daily, weekly, monthly)
527
582
  - Model performance comparison
@@ -529,14 +584,14 @@ monitoring/
529
584
 
530
585
  ---
531
586
 
532
- ## Documentation
587
+ ## Documentation
533
588
 
534
- - **[Installation Guide](./INSTALLATION.md)** - Prerequisites, setup, verification
535
- - **[Quick Start](./QUICK_START.md)** - 10-minute tutorial with examples
536
- - **[Use Cases](./USE_CASES.md)** - Real-world applications with ROI
537
- - **[Troubleshooting](./000-docs/157-DR-FAQS-troubleshooting.md)** - Common issues and solutions
589
+ - **Installation Guide** - Prerequisites, setup, verification
590
+ - **Quick Start** - 10-minute tutorial with examples
591
+ - **Use Cases** - Real-world applications with ROI
592
+ - **Troubleshooting** - Common issues and solutions
538
593
 
539
- ## Example Workflows
594
+ ## Example Workflows
540
595
 
541
596
  ### Build a Customer Support Bot (10 minutes)
542
597
 
@@ -584,7 +639,7 @@ Use case: Customer feedback analysis
584
639
 
585
640
  ### Build RAG System for Legal Documents (15 minutes)
586
641
 
587
- ```bash
642
+ ```text
588
643
  claude
589
644
 
590
645
  # 1. Design RAG architecture
@@ -608,42 +663,47 @@ Track: accuracy, retrieval time, cost per query
608
663
 
609
664
  **Result:** Legal document analysis system with 94% accuracy, 82ms latency, PII protection.
610
665
 
611
- ## Learning Resources
666
+ ## Learning Resources
612
667
 
613
668
  ### Video Tutorials (Coming Soon)
669
+
614
670
  - Prompt Engineering Masterclass (30 min)
615
671
  - Building Production RAG Systems (45 min)
616
672
  - AI Safety Best Practices (20 min)
617
673
 
618
674
  ### Blog Posts
675
+
619
676
  - [Reduce LLM Costs by 90%](https://example.com/reduce-llm-costs)
620
677
  - [Building RAG Systems That Actually Work](https://example.com/rag-systems)
621
678
  - [Comprehensive Guide to AI Safety](https://example.com/ai-safety)
622
679
 
623
680
  ### Community
681
+
624
682
  - [Discord](https://discord.com/invite/6PPFFzqPDZ) - #claude-code channel
625
683
  - [GitHub Discussions](https://github.com/jeremylongshore/claude-code-plugins/discussions)
626
684
  - [Stack Overflow](https://stackoverflow.com/questions/tagged/claude-code) - `claude-code` tag
627
685
 
628
- ## Pricing
686
+ ## Pricing
629
687
 
630
688
  **One-time purchase: $79**
631
689
 
632
690
  What's included:
633
- - All 12 plugins (lifetime access)
634
- - Free updates and new plugins
635
- - Email support
636
- - Community Discord access
637
- - Documentation and examples
691
+
692
+ - All 12 plugins (lifetime access)
693
+ - Free updates and new plugins
694
+ - Email support
695
+ - Community Discord access
696
+ - Documentation and examples
638
697
 
639
698
  **Compare to alternatives:**
699
+
640
700
  - Manual implementation: 40+ hours ($4,000 at $100/hour)
641
701
  - Consultants: $150-300/hour × 40 hours = $6,000-12,000
642
702
  - AI/ML Engineering Pack: **$79** (99% cost savings)
643
703
 
644
704
  **Average payback period: 3 days**
645
705
 
646
- [Buy Now on Gumroad](https://gumroad.com/l/ai-ml-engineering-pack) | [Volume Licensing](mailto:[email protected])
706
+ Buy Now on Gumroad | [Volume Licensing](mailto:[email protected])
647
707
 
648
708
  ## 🆘 Support
649
709
 
@@ -655,44 +715,47 @@ What's included:
655
715
 
656
716
  **Community:** Join Discord for community support
657
717
 
658
- ## Updates
718
+ ## Updates
659
719
 
660
720
  **Current version:** 1.0.0
661
721
 
662
722
  **Update policy:** Free updates for life, including new plugins and features
663
723
 
664
724
  **Changelog:**
725
+
665
726
  - **v1.0.0** (2025-10-10) - Initial release with 12 plugins
666
727
 
667
728
  To update:
729
+
668
730
  ```bash
669
731
  claude plugin update ai-ml-engineering-pack
670
732
  ```
671
733
 
672
- ## License
734
+ ## License
673
735
 
674
- MIT License - See [LICENSE](./000-docs/001-BL-LICN-license.txt) for details
736
+ MIT License - See LICENSE for details
675
737
 
676
738
  **Commercial use permitted** - Use in commercial projects, redistribute, modify
677
739
 
678
- ## Acknowledgments
740
+ ## Acknowledgments
679
741
 
680
742
  Built with:
743
+
681
744
  - [Claude Code](https://claude.ai/code) - AI-powered development CLI
682
745
  - [LangChain](https://langchain.com) - LLM framework
683
746
  - [Presidio](https://microsoft.github.io/presidio/) - PII detection
684
747
  - [Qdrant](https://qdrant.tech) - Vector database
685
748
  - [FastAPI](https://fastapi.tiangolo.com) - Modern Python framework
686
749
 
687
- ## Ready to Get Started?
750
+ ## Ready to Get Started?
688
751
 
689
- 1. **[Install the pack](./INSTALLATION.md)** - 5-minute setup
690
- 2. **[Complete Quick Start](./QUICK_START.md)** - Build your first AI feature in 10 minutes
691
- 3. **[Explore use cases](./USE_CASES.md)** - See real-world ROI examples
752
+ 1. **Install the pack** - 5-minute setup
753
+ 2. **Complete Quick Start** - Build your first AI feature in 10 minutes
754
+ 3. **Explore use cases** - See real-world ROI examples
692
755
  4. **[Join the community](https://discord.com/invite/6PPFFzqPDZ)** - Connect with other AI/ML engineers
693
756
 
694
757
  ---
695
758
 
696
759
  **Questions?** Email [email protected] or open a [GitHub issue](https://github.com/jeremylongshore/claude-code-plugins/issues).
697
760
 
698
- **Built by AI engineers, for AI engineers.**
761
+ **Built by AI engineers, for AI engineers.**
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@intentsolutionsio/ai-ml-engineering-pack",
3
- "version": "1.0.0",
3
+ "version": "1.0.4",
4
4
  "description": "Professional AI/ML Engineering toolkit: Prompt engineering, LLM integration, RAG systems, AI safety with 12 expert plugins",
5
5
  "keywords": [
6
6
  "ai",
@@ -1,13 +1,22 @@
1
1
  ---
2
2
  name: optimizing-prompts
3
- description: |
4
- Execute this skill optimizes prompts for large language models (llms) to reduce token usage, lower costs, and improve performance. it analyzes the prompt, identifies areas for simplification and redundancy removal, and rewrites the prompt to be more conci... Use when optimizing performance. Trigger with phrases like 'optimize', 'performance', or 'speed up'.
3
+ description: 'Execute this skill optimizes prompts for large language models (llms)
4
+ to reduce token usage, lower costs, and improve performance. it analyzes the prompt,
5
+ identifies areas for simplification and redundancy removal, and rewrites the prompt
6
+ to be more conci... Use when optimizing performance. Trigger with phrases like ''optimize'',
7
+ ''performance'', or ''speed up''.
8
+
9
+ '
5
10
  allowed-tools: Read, Write, Edit, Grep, Glob, Bash(cmd:*)
6
11
  version: 1.0.0
7
12
  author: Jeremy Longshore <jeremy@intentsolutions.io>
8
13
  license: MIT
9
- compatible-with: claude-code, codex, openclaw
10
- tags: [packages, llm, performance, cost-optimization]
14
+ tags:
15
+ - packages
16
+ - llm
17
+ - performance
18
+ - cost-optimization
19
+ compatibility: Designed for Claude Code, also compatible with Codex and OpenClaw
11
20
  ---
12
21
  # Ai Ml Engineering Pack
13
22
 
@@ -26,6 +35,7 @@ Refine prompts for optimal LLM performance. It streamlines prompts to minimize t
26
35
  ## When to Use This Skill
27
36
 
28
37
  This skill activates when you need to:
38
+
29
39
  - Reduce the cost of using an LLM.
30
40
  - Improve the speed of LLM responses.
31
41
  - Enhance the quality or clarity of LLM outputs by refining the prompt.
@@ -37,6 +47,7 @@ This skill activates when you need to:
37
47
  User request: "Optimize this prompt for cost and quality: 'I would like you to create a detailed product description for a new ergonomic office chair, highlighting its features, benefits, and target audience, and also include information about its warranty and return policy.'"
38
48
 
39
49
  The skill will:
50
+
40
51
  1. Analyze the prompt for redundancies and areas for simplification.
41
52
  2. Rewrite the prompt to be more concise: "Create a product description for an ergonomic office chair. Include features, benefits, target audience, warranty, and return policy."
42
53
  3. Provide the optimized prompt and explain the token reduction achieved.
@@ -46,6 +57,7 @@ The skill will:
46
57
  User request: "Optimize this prompt for better summarization: 'Please read the following document and provide a comprehensive summary of all the key points, main arguments, supporting evidence, and overall conclusion, ensuring that the summary is accurate, concise, and easy to understand.'"
47
58
 
48
59
  The skill will:
60
+
49
61
  1. Identify areas for improvement in the prompt's clarity and focus.
50
62
  2. Rewrite the prompt to be more direct: "Summarize this document, including key points, arguments, evidence, and the conclusion."
51
63
  3. Present the optimized prompt and explain how it enhances summarization performance.
@@ -85,4 +97,4 @@ The skill produces structured output relevant to the task.
85
97
  ## Resources
86
98
 
87
99
  - Project documentation
88
- - Related skills and commands
100
+ - Related skills and commands
@@ -243,4 +243,4 @@ Output: A boolean value indicating whether a prompt injection attempt is detecte
243
243
  Implement heuristics and/or machine learning models to identify suspicious patterns. Check for phrases like "ignore previous instructions" and "as an AI language model".
244
244
  ```
245
245
 
246
- Remember to adapt these examples to your specific needs and experiment with different prompts to achieve the best results. Good luck!
246
+ Remember to adapt these examples to your specific needs and experiment with different prompts to achieve the best results. Good luck!
@@ -101,4 +101,4 @@
101
101
 
102
102
  ### 8.3. Data Sources
103
103
 
104
- `[Describe the data sources used to evaluate the prompt's performance. Specify the size and characteristics of the dataset.]`
104
+ `[Describe the data sources used to evaluate the prompt's performance. Specify the size and characteristics of the dataset.]`
@@ -1,4 +1,3 @@
1
1
  # References
2
2
 
3
3
  Bundled resources for ai-ml-engineering-pack skill
4
-
@@ -6,6 +6,6 @@ Bundled resources for ai-ml-engineering-pack skill
6
6
  - [x] prompt_validator.py: Script to validate prompt syntax and structure.
7
7
  - [x] cost_estimator.py: Script to estimate the cost of running a prompt on different LLMs.
8
8
 
9
-
10
9
  ## Auto-Generated
10
+
11
11
  Scripts generated on 2025-12-10 03:48:17