@intentsolutionsio/ai-ml-engineering-pack 1.0.0 → 1.0.4
- package/README.md +110 -47
- package/package.json +1 -1
- package/skills/optimizing-prompts/SKILL.md +17 -5
- package/skills/optimizing-prompts/assets/example_prompts.md +1 -1
- package/skills/optimizing-prompts/assets/optimization_report_template.md +1 -1
- package/skills/optimizing-prompts/references/README.md +0 -1
- package/skills/optimizing-prompts/scripts/README.md +1 -1
package/README.md
CHANGED
|
@@ -8,31 +8,35 @@ Master prompt engineering, LLM integration, RAG systems, and AI safety with 12 s
|
|
|
8
8
|
[](https://github.com/jeremylongshore/claude-code-plugins)
|
|
9
9
|
[](https://claude.ai/code)
|
|
10
10
|
|
|
11
|
-
##
|
|
11
|
+
## What's Included
|
|
12
12
|
|
|
13
13
|
**12 specialized plugins across 4 AI/ML categories:**
|
|
14
14
|
|
|
15
15
|
### 1. Prompt Engineering (3 plugins)
|
|
16
|
+
|
|
16
17
|
- **prompt-architect** (agent) - Expert in CoT reasoning, few-shot learning, and advanced prompt patterns
|
|
17
18
|
- **prompt-optimizer** (agent) - Reduce LLM costs by 60-90% while maintaining quality
|
|
18
19
|
- **prompt-template-gen** (command: `/ptg`) - Generate production-ready prompt templates with type safety
|
|
19
20
|
|
|
20
21
|
### 2. LLM Integration (3 plugins)
|
|
22
|
+
|
|
21
23
|
- **llm-integration-expert** (agent) - Production API patterns, error handling, streaming, rate limiting
|
|
22
24
|
- **model-selector** (agent) - Choose optimal models based on cost, quality, latency requirements
|
|
23
25
|
- **llm-api-scaffold** (command: `/las`) - Generate complete LLM API with FastAPI, Docker, monitoring
|
|
24
26
|
|
|
25
27
|
### 3. RAG Systems (3 plugins)
|
|
28
|
+
|
|
26
29
|
- **rag-architect** (agent) - Design RAG systems, chunking strategies, retrieval optimization
|
|
27
30
|
- **vector-db-expert** (agent) - Select and configure vector databases (Pinecone, Qdrant, Weaviate, etc.)
|
|
28
31
|
- **rag-pipeline-gen** (command: `/rpg`) - Generate complete RAG pipeline with embeddings and retrieval
|
|
29
32
|
|
|
30
33
|
### 4. AI Safety (3 plugins)
|
|
34
|
+
|
|
31
35
|
- **ai-safety-expert** (agent) - Content filtering, PII detection, bias mitigation, compliance
|
|
32
36
|
- **prompt-injection-defender** (agent) - Defend against prompt injection and jailbreak attacks
|
|
33
37
|
- **ai-monitoring-setup** (command: `/ams`) - Set up LLM monitoring, cost tracking, and alerts
|
|
34
38
|
|
|
35
|
-
##
|
|
39
|
+
## Quick Start
|
|
36
40
|
|
|
37
41
|
### Installation
|
|
38
42
|
|
|
@@ -47,7 +51,7 @@ claude plugin install ai-ml-engineering-pack@claude-code-plugins-plus
|
|
|
47
51
|
claude plugin list
|
|
48
52
|
```
|
|
49
53
|
|
|
50
|
-
**Full installation guide:**
|
|
54
|
+
**Full installation guide:** INSTALLATION.md
|
|
51
55
|
|
|
52
56
|
### 10-Minute Tutorial
|
|
53
57
|
|
|
@@ -75,9 +79,9 @@ claude
|
|
|
75
79
|
"Implement PII detection and toxicity filtering for my chatbot"
|
|
76
80
|
```
|
|
77
81
|
|
|
78
|
-
**Complete tutorial:**
|
|
82
|
+
**Complete tutorial:** QUICK_START.md
|
|
79
83
|
|
|
80
|
-
##
|
|
84
|
+
## ROI & Value Proposition
|
|
81
85
|
|
|
82
86
|
Real-world results from production deployments:
|
|
83
87
|
|
|
@@ -92,16 +96,18 @@ Real-world results from production deployments:
|
|
|
92
96
|
|
|
93
97
|
**Average ROI: 29,351%** | **Average payback period: 3 days**
|
|
94
98
|
|
|
95
|
-
**Detailed case studies:**
|
|
99
|
+
**Detailed case studies:** USE_CASES.md
|
|
96
100
|
|
|
97
|
-
##
|
|
101
|
+
## Plugin Reference
|
|
98
102
|
|
|
99
103
|
### Prompt Engineering
|
|
100
104
|
|
|
101
105
|
#### `prompt-architect` (Agent)
|
|
106
|
+
|
|
102
107
|
Expert in advanced prompt engineering techniques and patterns.
|
|
103
108
|
|
|
104
109
|
**Capabilities:**
|
|
110
|
+
|
|
105
111
|
- Chain-of-Thought (CoT) reasoning
|
|
106
112
|
- Few-shot and zero-shot learning
|
|
107
113
|
- Prompt composition patterns
|
|
@@ -109,6 +115,7 @@ Expert in advanced prompt engineering techniques and patterns.
|
|
|
109
115
|
- Multi-modal prompts (text + images)
|
|
110
116
|
|
|
111
117
|
**When to use:**
|
|
118
|
+
|
|
112
119
|
- "Design a prompt for [complex task]"
|
|
113
120
|
- "Improve this prompt: [existing prompt]"
|
|
114
121
|
- "What's the best prompting technique for [use case]?"
|
|
@@ -118,9 +125,11 @@ Expert in advanced prompt engineering techniques and patterns.
|
|
|
118
125
|
---
|
|
119
126
|
|
|
120
127
|
#### `prompt-optimizer` (Agent)
|
|
128
|
+
|
|
121
129
|
Optimize prompts for cost reduction (60-90% savings) while maintaining quality.
|
|
122
130
|
|
|
123
131
|
**Capabilities:**
|
|
132
|
+
|
|
124
133
|
- Token reduction techniques (remove verbosity, use abbreviations)
|
|
125
134
|
- Prompt caching strategies
|
|
126
135
|
- Model selection guidance (cheap vs expensive)
|
|
@@ -128,11 +137,13 @@ Optimize prompts for cost reduction (60-90% savings) while maintaining quality.
|
|
|
128
137
|
- ROI calculation
|
|
129
138
|
|
|
130
139
|
**When to use:**
|
|
140
|
+
|
|
131
141
|
- "Reduce the cost of this prompt: [prompt]"
|
|
132
142
|
- "Optimize my prompts for $1000/month budget"
|
|
133
143
|
- "How can I reduce token usage by 70%?"
|
|
134
144
|
|
|
135
145
|
**Example:**
|
|
146
|
+
|
|
136
147
|
```
|
|
137
148
|
Before (52 tokens): "I would like you to please analyze..."
|
|
138
149
|
After (15 tokens): "Analyze and summarize main points."
|
|
@@ -144,9 +155,11 @@ Savings: 71% token reduction = $0.15/1000 calls (GPT-4)
|
|
|
144
155
|
---
|
|
145
156
|
|
|
146
157
|
#### `/ptg` - Prompt Template Generator (Command)
|
|
158
|
+
|
|
147
159
|
Generate production-ready prompt templates with type safety and validation.
|
|
148
160
|
|
|
149
161
|
**Usage:**
|
|
162
|
+
|
|
150
163
|
```bash
|
|
151
164
|
/ptg
|
|
152
165
|
|
|
@@ -158,6 +171,7 @@ Generate production-ready prompt templates with type safety and validation.
|
|
|
158
171
|
```
|
|
159
172
|
|
|
160
173
|
**Generated output:**
|
|
174
|
+
|
|
161
175
|
- Python: Pydantic models with type safety
|
|
162
176
|
- TypeScript: Zod schemas with validation
|
|
163
177
|
- Usage examples
|
|
@@ -165,6 +179,7 @@ Generate production-ready prompt templates with type safety and validation.
|
|
|
165
179
|
- Unit tests
|
|
166
180
|
|
|
167
181
|
**Example output:**
|
|
182
|
+
|
|
168
183
|
```python
|
|
169
184
|
@dataclass
|
|
170
185
|
class ProductDescriptionInput:
|
|
@@ -186,9 +201,11 @@ class ProductDescriptionGenerator:
|
|
|
186
201
|
### LLM Integration
|
|
187
202
|
|
|
188
203
|
#### `llm-integration-expert` (Agent)
|
|
204
|
+
|
|
189
205
|
Production patterns for LLM API integration with error handling and reliability.
|
|
190
206
|
|
|
191
207
|
**Capabilities:**
|
|
208
|
+
|
|
192
209
|
- Multi-provider integration (OpenAI, Anthropic, Google, Cohere)
|
|
193
210
|
- Exponential backoff retry logic
|
|
194
211
|
- Rate limiting (token bucket, sliding window)
|
|
@@ -198,11 +215,13 @@ Production patterns for LLM API integration with error handling and reliability.
|
|
|
198
215
|
- Token counting and cost tracking
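
The token-bucket rate limiting named in the capabilities list can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the plugin's generated code: the `TokenBucket` class, its `rate`/`capacity` parameters, and the injectable `clock` are hypothetical names chosen for the example, and the limiter is synchronous and non-blocking rather than the awaitable form hinted at in the README.

```python
import time
from typing import Callable


class TokenBucket:
    """Hypothetical limiter: refills `rate` tokens per second up to `capacity`."""

    def __init__(self, rate: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate
        self.capacity = capacity
        self.clock = clock              # injectable so the logic can be tested
        self.tokens = capacity          # start with a full bucket
        self.last_refill = clock()

    def _refill(self) -> None:
        # Add tokens for the time elapsed since the last refill, capped at capacity.
        now = self.clock()
        elapsed = now - self.last_refill
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last_refill = now

    def try_acquire(self, cost: float = 1.0) -> bool:
        """Take `cost` tokens if available; return False instead of blocking."""
        self._refill()
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

A caller would typically check `try_acquire()` before each API request and back off or queue the request when it returns `False`.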
|
|
199
216
|
|
|
200
217
|
**When to use:**
|
|
218
|
+
|
|
201
219
|
- "Implement LLM API integration with retry logic"
|
|
202
220
|
- "Add streaming support to my chatbot"
|
|
203
221
|
- "Build multi-provider fallback system"
|
|
204
222
|
|
|
205
223
|
**Code examples:**
|
|
224
|
+
|
|
206
225
|
```python
|
|
207
226
|
# Retry with exponential backoff
|
|
208
227
|
@retry_with_backoff(max_retries=3, base_delay=1.0)
|
|
@@ -219,9 +238,11 @@ await rate_limiter.wait_for_token()
|
|
|
219
238
|
---
|
|
220
239
|
|
|
221
240
|
#### `model-selector` (Agent)
|
|
241
|
+
|
|
222
242
|
Guide model selection based on cost, quality, latency, and use case requirements.
|
|
223
243
|
|
|
224
244
|
**Capabilities:**
|
|
245
|
+
|
|
225
246
|
- Model comparison matrix (GPT-4, Claude 3, Gemini)
|
|
226
247
|
- Pricing analysis (per 1M tokens)
|
|
227
248
|
- Latency benchmarks
|
|
@@ -230,11 +251,13 @@ Guide model selection based on cost, quality, latency, and use case requirements
|
|
|
230
251
|
- A/B testing frameworks
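
Per-1M-token pricing converts to a per-request cost by scaling the token counts. The sketch below assumes the GPT-4 Turbo prices quoted in this README's model comparison table ($10 input, $30 output per 1M tokens); the `estimate_cost` helper and the token counts are hypothetical and only illustrate the arithmetic.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_m: float, output_price_per_m: float) -> float:
    """Return the dollar cost of one request given prices per 1M tokens."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Illustrative request: 1,000 input tokens and 500 output tokens
# at $10 / $30 per 1M tokens (the GPT-4 Turbo row in the README table).
cost = estimate_cost(1_000, 500, input_price_per_m=10.0, output_price_per_m=30.0)
print(f"${cost:.4f} per request")
```

Multiplying the per-request figure by expected monthly volume gives a first estimate to compare across models.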
|
|
231
252
|
|
|
232
253
|
**When to use:**
|
|
254
|
+
|
|
233
255
|
- "Which model should I use for customer support?"
|
|
234
256
|
- "Compare GPT-4 vs Claude 3 Opus for code generation"
|
|
235
257
|
- "How can I reduce costs with model cascading?"
|
|
236
258
|
|
|
237
259
|
**Model comparison:**
|
|
260
|
+
|
|
238
261
|
| Model | Input ($/1M) | Output ($/1M) | Latency | Best For |
|
|
239
262
|
|-------|-------------|---------------|---------|----------|
|
|
240
263
|
| GPT-4 Turbo | $10 | $30 | 3-5s | Complex reasoning |
|
|
@@ -247,9 +270,11 @@ Guide model selection based on cost, quality, latency, and use case requirements
|
|
|
247
270
|
---
|
|
248
271
|
|
|
249
272
|
#### `/las` - LLM API Scaffold (Command)
|
|
273
|
+
|
|
250
274
|
Generate complete production-ready LLM API integration code.
|
|
251
275
|
|
|
252
276
|
**Usage:**
|
|
277
|
+
|
|
253
278
|
```bash
|
|
254
279
|
/las
|
|
255
280
|
|
|
@@ -261,6 +286,7 @@ Generate complete production-ready LLM API integration code.
|
|
|
261
286
|
```
|
|
262
287
|
|
|
263
288
|
**Generated files:**
|
|
289
|
+
|
|
264
290
|
```
|
|
265
291
|
llm-api/
|
|
266
292
|
├── main.py # FastAPI application
|
|
@@ -275,22 +301,25 @@ llm-api/
|
|
|
275
301
|
```
|
|
276
302
|
|
|
277
303
|
**Features included:**
|
|
278
|
-
|
|
279
|
-
-
|
|
280
|
-
-
|
|
281
|
-
-
|
|
282
|
-
-
|
|
283
|
-
-
|
|
284
|
-
-
|
|
304
|
+
|
|
305
|
+
- Exponential backoff retry (3 attempts)
|
|
306
|
+
- Rate limiting (token bucket algorithm)
|
|
307
|
+
- Response caching (Redis, 5 min TTL)
|
|
308
|
+
- Streaming support (SSE)
|
|
309
|
+
- Cost tracking
|
|
310
|
+
- Prometheus metrics
|
|
311
|
+
- Docker deployment
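
The "exponential backoff retry (3 attempts)" feature might look roughly like the decorator below. The name `retry_with_backoff` and its `max_retries` and `base_delay` parameters come from the README's own code example; everything else is an assumption made for illustration: the body, the jitter, the `sleep` hook, the synchronous style, and the reading of `max_retries=3` as three attempts in total.

```python
import functools
import random
import time


def retry_with_backoff(max_retries: int = 3, base_delay: float = 1.0,
                       sleep=time.sleep):
    """Retry the wrapped function, doubling the delay after each failure."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(max_retries):
                try:
                    return func(*args, **kwargs)
                except Exception:
                    if attempt == max_retries - 1:
                        raise  # out of attempts: surface the last error
                    # Delay is base_delay * 2^attempt plus up to 10% random jitter.
                    delay = base_delay * (2 ** attempt)
                    sleep(delay + random.uniform(0, 0.1 * delay))
        return wrapper
    return decorator
```

In practice the `except` clause would usually be narrowed to transient errors such as timeouts or rate-limit responses, so that programming errors fail immediately.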
|
|
285
312
|
|
|
286
313
|
---
|
|
287
314
|
|
|
288
315
|
### RAG Systems
|
|
289
316
|
|
|
290
317
|
#### `rag-architect` (Agent)
|
|
318
|
+
|
|
291
319
|
Expert in designing and optimizing Retrieval-Augmented Generation systems.
|
|
292
320
|
|
|
293
321
|
**Capabilities:**
|
|
322
|
+
|
|
294
323
|
- RAG architecture patterns
|
|
295
324
|
- Chunking strategies (fixed, recursive, semantic)
|
|
296
325
|
- Embedding model selection
|
|
@@ -299,11 +328,13 @@ Expert in designing and optimizing Retrieval-Augmented Generation systems.
|
|
|
299
328
|
- Evaluation metrics (MRR, NDCG)
|
|
300
329
|
|
|
301
330
|
**When to use:**
|
|
331
|
+
|
|
302
332
|
- "Design a RAG system for customer support knowledge base"
|
|
303
333
|
- "What chunking strategy should I use for legal documents?"
|
|
304
334
|
- "How can I improve retrieval accuracy?"
|
|
305
335
|
|
|
306
336
|
**Chunking strategies:**
|
|
337
|
+
|
|
307
338
|
```python
|
|
308
339
|
# Fixed-size (simple, fast)
|
|
309
340
|
chunks = [text[i:i+512] for i in range(0, len(text), 512)]
|
|
@@ -324,9 +355,11 @@ chunks = semantic_splitter.split_by_meaning(text)
|
|
|
324
355
|
---
|
|
325
356
|
|
|
326
357
|
#### `vector-db-expert` (Agent)
|
|
358
|
+
|
|
327
359
|
Select and optimize vector databases for RAG systems.
|
|
328
360
|
|
|
329
361
|
**Capabilities:**
|
|
362
|
+
|
|
330
363
|
- Database comparison (Pinecone, Qdrant, Weaviate, ChromaDB, pgvector, Milvus)
|
|
331
364
|
- HNSW index tuning
|
|
332
365
|
- Scaling strategies (sharding, replication)
|
|
@@ -334,11 +367,13 @@ Select and optimize vector databases for RAG systems.
|
|
|
334
367
|
- Migration planning
|
|
335
368
|
|
|
336
369
|
**When to use:**
|
|
370
|
+
|
|
337
371
|
- "Which vector database should I use for 10M documents?"
|
|
338
372
|
- "How do I tune HNSW parameters for better performance?"
|
|
339
373
|
- "Compare Pinecone vs Qdrant for my use case"
|
|
340
374
|
|
|
341
375
|
**Database comparison:**
|
|
376
|
+
|
|
342
377
|
| Database | Best For | Pricing | Hosting |
|
|
343
378
|
|----------|---------|---------|---------|
|
|
344
379
|
| Pinecone | Managed, auto-scaling | $0.096/GB/month | Cloud only |
|
|
@@ -352,9 +387,11 @@ Select and optimize vector databases for RAG systems.
|
|
|
352
387
|
---
|
|
353
388
|
|
|
354
389
|
#### `/rpg` - RAG Pipeline Generator (Command)
|
|
390
|
+
|
|
355
391
|
Generate complete RAG pipeline with all components.
|
|
356
392
|
|
|
357
393
|
**Usage:**
|
|
394
|
+
|
|
358
395
|
```bash
|
|
359
396
|
/rpg
|
|
360
397
|
|
|
@@ -367,6 +404,7 @@ Generate complete RAG pipeline with all components.
|
|
|
367
404
|
```
|
|
368
405
|
|
|
369
406
|
**Generated files:**
|
|
407
|
+
|
|
370
408
|
```
|
|
371
409
|
rag-system/
|
|
372
410
|
├── document_loader.py # PDF/DOCX/TXT loaders
|
|
@@ -382,24 +420,27 @@ rag-system/
|
|
|
382
420
|
```
|
|
383
421
|
|
|
384
422
|
**Features included:**
|
|
385
|
-
|
|
386
|
-
-
|
|
387
|
-
-
|
|
388
|
-
-
|
|
389
|
-
-
|
|
390
|
-
-
|
|
391
|
-
-
|
|
392
|
-
-
|
|
393
|
-
-
|
|
423
|
+
|
|
424
|
+
- Multi-format document loading (PDF, DOCX, TXT, MD)
|
|
425
|
+
- Recursive chunking (512 tokens, 50 overlap)
|
|
426
|
+
- Vector similarity search
|
|
427
|
+
- Cohere reranking (optional)
|
|
428
|
+
- Source attribution with page numbers
|
|
429
|
+
- Query expansion
|
|
430
|
+
- Caching
|
|
431
|
+
- FastAPI REST endpoints
|
|
432
|
+
- Docker deployment
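
The "512 tokens, 50 overlap" setting can be illustrated with a plain sliding window. Note that this is a simpler technique than the recursive chunking the feature list names: it only shows how the overlap parameter works. The `chunk_with_overlap` function is hypothetical, and it assumes the text has already been split into tokens.

```python
from typing import List


def chunk_with_overlap(tokens: List[str], size: int = 512,
                       overlap: int = 50) -> List[List[str]]:
    """Split `tokens` into windows of `size` that share `overlap` tokens
    with the previous window."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    step = size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + size])
        if start + size >= len(tokens):
            break  # this window already reaches the end of the input
    return chunks
```

The overlap keeps sentences that straddle a boundary retrievable from either neighbouring chunk, at the cost of storing some tokens twice.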
|
|
394
433
|
|
|
395
434
|
---
|
|
396
435
|
|
|
397
436
|
### AI Safety
|
|
398
437
|
|
|
399
438
|
#### `ai-safety-expert` (Agent)
|
|
439
|
+
|
|
400
440
|
Comprehensive AI safety with content filtering, PII protection, and bias mitigation.
|
|
401
441
|
|
|
402
442
|
**Capabilities:**
|
|
443
|
+
|
|
403
444
|
- Toxicity detection (BERT-based classification)
|
|
404
445
|
- PII detection and redaction (Presidio)
|
|
405
446
|
- Bias detection (gender, racial, age)
|
|
@@ -408,12 +449,14 @@ Comprehensive AI safety with content filtering, PII protection, and bias mitigat
|
|
|
408
449
|
- GDPR/CCPA/HIPAA compliance
|
|
409
450
|
|
|
410
451
|
**When to use:**
|
|
452
|
+
|
|
411
453
|
- "Implement PII detection for user inputs"
|
|
412
454
|
- "Add toxicity filtering to my chatbot"
|
|
413
455
|
- "Detect and mitigate bias in LLM outputs"
|
|
414
456
|
- "Ensure HIPAA compliance for medical data"
|
|
415
457
|
|
|
416
458
|
**Safety pipeline:**
|
|
459
|
+
|
|
417
460
|
```python
|
|
418
461
|
class SafetyGuardrails:
|
|
419
462
|
async def safe_completion(self, user_input: str, llm):
|
|
@@ -434,6 +477,7 @@ class SafetyGuardrails:
|
|
|
434
477
|
```
|
|
435
478
|
|
|
436
479
|
**PII detection:**
|
|
480
|
+
|
|
437
481
|
- Email addresses, phone numbers, SSN
|
|
438
482
|
- Credit card numbers
|
|
439
483
|
- IP addresses
|
|
@@ -445,9 +489,11 @@ class SafetyGuardrails:
|
|
|
445
489
|
---
|
|
446
490
|
|
|
447
491
|
#### `prompt-injection-defender` (Agent)
|
|
492
|
+
|
|
448
493
|
Defend against prompt injection attacks and jailbreaks.
|
|
449
494
|
|
|
450
495
|
**Capabilities:**
|
|
496
|
+
|
|
451
497
|
- Pattern-based detection (regex for common attacks)
|
|
452
498
|
- ML classification (fine-tuned BERT model)
|
|
453
499
|
- Input sanitization
|
|
@@ -456,11 +502,13 @@ Defend against prompt injection attacks and jailbreaks.
|
|
|
456
502
|
- Jailbreak detection (DAN, Developer Mode, etc.)
|
|
457
503
|
|
|
458
504
|
**When to use:**
|
|
505
|
+
|
|
459
506
|
- "Protect my chatbot from prompt injection"
|
|
460
507
|
- "Detect jailbreak attempts"
|
|
461
508
|
- "Validate user inputs for manipulation"
|
|
462
509
|
|
|
463
510
|
**Attack patterns detected:**
|
|
511
|
+
|
|
464
512
|
```python
|
|
465
513
|
ATTACK_PATTERNS = [
|
|
466
514
|
r'ignore\s+(all\s+)?(previous|prior|above)\s+instructions',
|
|
@@ -472,6 +520,7 @@ ATTACK_PATTERNS = [
|
|
|
472
520
|
```
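
Pattern-based detection of this kind might be wired up as in the sketch below. Only the single regular expression shown in the README excerpt is used; the `looks_like_injection` helper, the `PATTERNS` name, and the case-insensitive flag are assumptions made for illustration.

```python
import re

# Only this first pattern appears in the README excerpt; the list there continues.
# Compiling with re.IGNORECASE is an assumption of this sketch.
PATTERNS = [
    re.compile(r'ignore\s+(all\s+)?(previous|prior|above)\s+instructions',
               re.IGNORECASE),
]


def looks_like_injection(user_input: str) -> bool:
    """Return True if any known attack pattern matches the input."""
    return any(p.search(user_input) for p in PATTERNS)
```

Regex matching is cheap but easy to evade with paraphrasing, which is presumably why the capabilities list pairs it with an ML classifier.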
|
|
473
521
|
|
|
474
522
|
**Defense strategies:**
|
|
523
|
+
|
|
475
524
|
1. **Detection:** Identify attack patterns
|
|
476
525
|
2. **Sanitization:** Remove/escape dangerous inputs
|
|
477
526
|
3. **Validation:** Verify outputs don't leak system prompts
|
|
@@ -482,9 +531,11 @@ ATTACK_PATTERNS = [
|
|
|
482
531
|
---
|
|
483
532
|
|
|
484
533
|
#### `/ams` - AI Monitoring Setup (Command)
|
|
534
|
+
|
|
485
535
|
Set up comprehensive LLM monitoring with cost tracking and alerting.
|
|
486
536
|
|
|
487
537
|
**Usage:**
|
|
538
|
+
|
|
488
539
|
```bash
|
|
489
540
|
/ams
|
|
490
541
|
|
|
@@ -496,6 +547,7 @@ Set up comprehensive LLM monitoring with cost tracking and alerting.
|
|
|
496
547
|
```
|
|
497
548
|
|
|
498
549
|
**Generated files:**
|
|
550
|
+
|
|
499
551
|
```
|
|
500
552
|
monitoring/
|
|
501
553
|
├── metrics.py # Prometheus metrics
|
|
@@ -508,6 +560,7 @@ monitoring/
|
|
|
508
560
|
```
|
|
509
561
|
|
|
510
562
|
**Metrics collected:**
|
|
563
|
+
|
|
511
564
|
- Request count (by model, status)
|
|
512
565
|
- Latency (p50, p95, p99)
|
|
513
566
|
- Token usage (input, output)
|
|
@@ -516,12 +569,14 @@ monitoring/
|
|
|
516
569
|
- Cache hit rate
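
The latency percentiles (p50, p95, p99) can be computed from raw samples with the nearest-rank method, sketched below. The `percentile` helper is hypothetical; a Prometheus-based setup would normally derive these figures from histogram buckets instead.

```python
import math
from typing import Sequence


def percentile(samples: Sequence[float], p: float) -> float:
    """Nearest-rank percentile: the smallest sample with at least p% of
    all samples at or below it."""
    if not samples:
        raise ValueError("no samples")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]
```

Tail percentiles such as p99 are sensitive to sample size, so they are best read over windows with enough requests to be meaningful.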
|
|
517
570
|
|
|
518
571
|
**Alerts configured:**
|
|
572
|
+
|
|
519
573
|
- Budget threshold (80%, 90%, 100%)
|
|
520
574
|
- High error rate (>5%)
|
|
521
575
|
- Slow responses (>10s)
|
|
522
576
|
- Token limit approaching
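
The alert thresholds above could be evaluated against a metrics snapshot roughly as follows. This is a sketch under assumptions: the `evaluate_alerts` function, the alert names, and the choice of p95 latency for the "slow responses" check are all hypothetical, and the "token limit approaching" alert is left out because the README gives no threshold for it.

```python
from typing import List

BUDGET_THRESHOLDS_PCT = (80, 90, 100)   # percent of the budget, from the list above
MAX_ERROR_RATE = 0.05                   # alert above 5% errors
MAX_LATENCY_SECONDS = 10.0              # alert above 10 s


def evaluate_alerts(spend: float, budget: float, errors: int,
                    requests: int, p95_latency: float) -> List[str]:
    """Return the names of the alerts that should fire for this snapshot."""
    alerts = []
    # Report only the highest budget threshold that has been crossed.
    crossed = [t for t in BUDGET_THRESHOLDS_PCT if spend * 100 >= t * budget]
    if crossed:
        alerts.append(f"budget_{max(crossed)}")
    if requests and errors / requests > MAX_ERROR_RATE:
        alerts.append("high_error_rate")
    if p95_latency > MAX_LATENCY_SECONDS:
        alerts.append("slow_responses")
    return alerts
```

For example, `evaluate_alerts(850, 1000, 2, 100, 3.0)` reports only the 80% budget alert, since the error rate (2%) and latency (3 s) are under their limits.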
|
|
523
577
|
|
|
524
578
|
**Dashboards:**
|
|
579
|
+
|
|
525
580
|
- Real-time request monitoring
|
|
526
581
|
- Cost tracking (daily, weekly, monthly)
|
|
527
582
|
- Model performance comparison
|
|
@@ -529,14 +584,14 @@ monitoring/
|
|
|
529
584
|
|
|
530
585
|
---
|
|
531
586
|
|
|
532
|
-
##
|
|
587
|
+
## Documentation
|
|
533
588
|
|
|
534
|
-
- **
|
|
535
|
-
- **
|
|
536
|
-
- **
|
|
537
|
-
- **
|
|
589
|
+
- **Installation Guide** - Prerequisites, setup, verification
|
|
590
|
+
- **Quick Start** - 10-minute tutorial with examples
|
|
591
|
+
- **Use Cases** - Real-world applications with ROI
|
|
592
|
+
- **Troubleshooting** - Common issues and solutions
|
|
538
593
|
|
|
539
|
-
##
|
|
594
|
+
## Example Workflows
|
|
540
595
|
|
|
541
596
|
### Build a Customer Support Bot (10 minutes)
|
|
542
597
|
|
|
@@ -584,7 +639,7 @@ Use case: Customer feedback analysis
|
|
|
584
639
|
|
|
585
640
|
### Build RAG System for Legal Documents (15 minutes)
|
|
586
641
|
|
|
587
|
-
```
|
|
642
|
+
```text
|
|
588
643
|
claude
|
|
589
644
|
|
|
590
645
|
# 1. Design RAG architecture
|
|
@@ -608,42 +663,47 @@ Track: accuracy, retrieval time, cost per query
|
|
|
608
663
|
|
|
609
664
|
**Result:** Legal document analysis system with 94% accuracy, 82ms latency, PII protection.
|
|
610
665
|
|
|
611
|
-
##
|
|
666
|
+
## Learning Resources
|
|
612
667
|
|
|
613
668
|
### Video Tutorials (Coming Soon)
|
|
669
|
+
|
|
614
670
|
- Prompt Engineering Masterclass (30 min)
|
|
615
671
|
- Building Production RAG Systems (45 min)
|
|
616
672
|
- AI Safety Best Practices (20 min)
|
|
617
673
|
|
|
618
674
|
### Blog Posts
|
|
675
|
+
|
|
619
676
|
- [Reduce LLM Costs by 90%](https://example.com/reduce-llm-costs)
|
|
620
677
|
- [Building RAG Systems That Actually Work](https://example.com/rag-systems)
|
|
621
678
|
- [Comprehensive Guide to AI Safety](https://example.com/ai-safety)
|
|
622
679
|
|
|
623
680
|
### Community
|
|
681
|
+
|
|
624
682
|
- [Discord](https://discord.com/invite/6PPFFzqPDZ) - #claude-code channel
|
|
625
683
|
- [GitHub Discussions](https://github.com/jeremylongshore/claude-code-plugins/discussions)
|
|
626
684
|
- [Stack Overflow](https://stackoverflow.com/questions/tagged/claude-code) - `claude-code` tag
|
|
627
685
|
|
|
628
|
-
##
|
|
686
|
+
## Pricing
|
|
629
687
|
|
|
630
688
|
**One-time purchase: $79**
|
|
631
689
|
|
|
632
690
|
What's included:
|
|
633
|
-
|
|
634
|
-
-
|
|
635
|
-
-
|
|
636
|
-
-
|
|
637
|
-
-
|
|
691
|
+
|
|
692
|
+
- All 12 plugins (lifetime access)
|
|
693
|
+
- Free updates and new plugins
|
|
694
|
+
- Email support
|
|
695
|
+
- Community Discord access
|
|
696
|
+
- Documentation and examples
|
|
638
697
|
|
|
639
698
|
**Compare to alternatives:**
|
|
699
|
+
|
|
640
700
|
- Manual implementation: 40+ hours ($4,000 at $100/hour)
|
|
641
701
|
- Consultants: $150-300/hour × 40 hours = $6,000-12,000
|
|
642
702
|
- AI/ML Engineering Pack: **$79** (99% cost savings)
|
|
643
703
|
|
|
644
704
|
**Average payback period: 3 days**
|
|
645
705
|
|
|
646
|
-
|
|
706
|
+
Buy Now on Gumroad | [Volume Licensing](mailto:[email protected])
|
|
647
707
|
|
|
648
708
|
## 🆘 Support
|
|
649
709
|
|
|
@@ -655,44 +715,47 @@ What's included:
|
|
|
655
715
|
|
|
656
716
|
**Community:** Join Discord for community support
|
|
657
717
|
|
|
658
|
-
##
|
|
718
|
+
## Updates
|
|
659
719
|
|
|
660
720
|
**Current version:** 1.0.0
|
|
661
721
|
|
|
662
722
|
**Update policy:** Free updates for life, including new plugins and features
|
|
663
723
|
|
|
664
724
|
**Changelog:**
|
|
725
|
+
|
|
665
726
|
- **v1.0.0** (2025-10-10) - Initial release with 12 plugins
|
|
666
727
|
|
|
667
728
|
To update:
|
|
729
|
+
|
|
668
730
|
```bash
|
|
669
731
|
claude plugin update ai-ml-engineering-pack
|
|
670
732
|
```
|
|
671
733
|
|
|
672
|
-
##
|
|
734
|
+
## License
|
|
673
735
|
|
|
674
|
-
MIT License - See
|
|
736
|
+
MIT License - See LICENSE for details
|
|
675
737
|
|
|
676
738
|
**Commercial use permitted** - Use in commercial projects, redistribute, modify
|
|
677
739
|
|
|
678
|
-
##
|
|
740
|
+
## Acknowledgments
|
|
679
741
|
|
|
680
742
|
Built with:
|
|
743
|
+
|
|
681
744
|
- [Claude Code](https://claude.ai/code) - AI-powered development CLI
|
|
682
745
|
- [LangChain](https://langchain.com) - LLM framework
|
|
683
746
|
- [Presidio](https://microsoft.github.io/presidio/) - PII detection
|
|
684
747
|
- [Qdrant](https://qdrant.tech) - Vector database
|
|
685
748
|
- [FastAPI](https://fastapi.tiangolo.com) - Modern Python framework
|
|
686
749
|
|
|
687
|
-
##
|
|
750
|
+
## Ready to Get Started?
|
|
688
751
|
|
|
689
|
-
1. **
|
|
690
|
-
2. **
|
|
691
|
-
3. **
|
|
752
|
+
1. **Install the pack** - 5-minute setup
|
|
753
|
+
2. **Complete Quick Start** - Build your first AI feature in 10 minutes
|
|
754
|
+
3. **Explore use cases** - See real-world ROI examples
|
|
692
755
|
4. **[Join the community](https://discord.com/invite/6PPFFzqPDZ)** - Connect with other AI/ML engineers
|
|
693
756
|
|
|
694
757
|
---
|
|
695
758
|
|
|
696
759
|
**Questions?** Email [email protected] or open a [GitHub issue](https://github.com/jeremylongshore/claude-code-plugins/issues).
|
|
697
760
|
|
|
698
|
-
**Built by AI engineers, for AI engineers.**
|
|
761
|
+
**Built by AI engineers, for AI engineers.**
|
package/package.json
CHANGED
package/skills/optimizing-prompts/SKILL.md
CHANGED
|
@@ -1,13 +1,22 @@
|
|
|
1
1
|
---
|
|
2
2
|
name: optimizing-prompts
|
|
3
|
-
description:
|
|
4
|
-
|
|
3
|
+
description: 'Execute this skill optimizes prompts for large language models (llms)
|
|
4
|
+
to reduce token usage, lower costs, and improve performance. it analyzes the prompt,
|
|
5
|
+
identifies areas for simplification and redundancy removal, and rewrites the prompt
|
|
6
|
+
to be more conci... Use when optimizing performance. Trigger with phrases like ''optimize'',
|
|
7
|
+
''performance'', or ''speed up''.
|
|
8
|
+
|
|
9
|
+
'
|
|
5
10
|
allowed-tools: Read, Write, Edit, Grep, Glob, Bash(cmd:*)
|
|
6
11
|
version: 1.0.0
|
|
7
12
|
author: Jeremy Longshore <jeremy@intentsolutions.io>
|
|
8
13
|
license: MIT
|
|
9
|
-
|
|
10
|
-
|
|
14
|
+
tags:
|
|
15
|
+
- packages
|
|
16
|
+
- llm
|
|
17
|
+
- performance
|
|
18
|
+
- cost-optimization
|
|
19
|
+
compatibility: Designed for Claude Code, also compatible with Codex and OpenClaw
|
|
11
20
|
---
|
|
12
21
|
# Ai Ml Engineering Pack
|
|
13
22
|
|
|
@@ -26,6 +35,7 @@ Refine prompts for optimal LLM performance. It streamlines prompts to minimize t
|
|
|
26
35
|
## When to Use This Skill
|
|
27
36
|
|
|
28
37
|
This skill activates when you need to:
|
|
38
|
+
|
|
29
39
|
- Reduce the cost of using an LLM.
|
|
30
40
|
- Improve the speed of LLM responses.
|
|
31
41
|
- Enhance the quality or clarity of LLM outputs by refining the prompt.
|
|
@@ -37,6 +47,7 @@ This skill activates when you need to:
|
|
|
37
47
|
User request: "Optimize this prompt for cost and quality: 'I would like you to create a detailed product description for a new ergonomic office chair, highlighting its features, benefits, and target audience, and also include information about its warranty and return policy.'"
|
|
38
48
|
|
|
39
49
|
The skill will:
|
|
50
|
+
|
|
40
51
|
1. Analyze the prompt for redundancies and areas for simplification.
|
|
41
52
|
2. Rewrite the prompt to be more concise: "Create a product description for an ergonomic office chair. Include features, benefits, target audience, warranty, and return policy."
|
|
42
53
|
3. Provide the optimized prompt and explain the token reduction achieved.
|
|
@@ -46,6 +57,7 @@ The skill will:
|
|
|
46
57
|
User request: "Optimize this prompt for better summarization: 'Please read the following document and provide a comprehensive summary of all the key points, main arguments, supporting evidence, and overall conclusion, ensuring that the summary is accurate, concise, and easy to understand.'"
|
|
47
58
|
|
|
48
59
|
The skill will:
|
|
60
|
+
|
|
49
61
|
1. Identify areas for improvement in the prompt's clarity and focus.
|
|
50
62
|
2. Rewrite the prompt to be more direct: "Summarize this document, including key points, arguments, evidence, and the conclusion."
|
|
51
63
|
3. Present the optimized prompt and explain how it enhances summarization performance.
|
|
@@ -85,4 +97,4 @@ The skill produces structured output relevant to the task.
|
|
|
85
97
|
## Resources
|
|
86
98
|
|
|
87
99
|
- Project documentation
|
|
88
|
-
- Related skills and commands
|
|
100
|
+
- Related skills and commands
|
package/skills/optimizing-prompts/assets/example_prompts.md
CHANGED
|
@@ -243,4 +243,4 @@ Output: A boolean value indicating whether a prompt injection attempt is detecte
|
|
|
243
243
|
Implement heuristics and/or machine learning models to identify suspicious patterns. Check for phrases like "ignore previous instructions" and "as an AI language model".
|
|
244
244
|
```
|
|
245
245
|
|
|
246
|
-
Remember to adapt these examples to your specific needs and experiment with different prompts to achieve the best results. Good luck!
|
|
246
|
+
Remember to adapt these examples to your specific needs and experiment with different prompts to achieve the best results. Good luck!
|
package/skills/optimizing-prompts/assets/optimization_report_template.md
CHANGED
|
@@ -101,4 +101,4 @@
|
|
|
101
101
|
|
|
102
102
|
### 8.3. Data Sources
|
|
103
103
|
|
|
104
|
-
`[Describe the data sources used to evaluate the prompt's performance. Specify the size and characteristics of the dataset.]`
|
|
104
|
+
`[Describe the data sources used to evaluate the prompt's performance. Specify the size and characteristics of the dataset.]`
|
package/skills/optimizing-prompts/scripts/README.md
CHANGED
|
@@ -6,6 +6,6 @@ Bundled resources for ai-ml-engineering-pack skill
|
|
|
6
6
|
- [x] prompt_validator.py: Script to validate prompt syntax and structure.
|
|
7
7
|
- [x] cost_estimator.py: Script to estimate the cost of running a prompt on different LLMs.
|
|
8
8
|
|
|
9
|
-
|
|
10
9
|
## Auto-Generated
|
|
10
|
+
|
|
11
11
|
Scripts generated on 2025-12-10 03:48:17
|