mindforge-cc 11.2.1 → 11.3.0
- package/.mindforge/config.json +2 -2
- package/.mindforge/imported-agents.jsonl +154 -0
- package/CHANGELOG.md +43 -0
- package/MINDFORGE.md +3 -3
- package/README.md +1 -1
- package/bin/installer-core.js +95 -1
- package/bin/spawn-agent.js +80 -1
- package/bin/wizard/theme.js +4 -3
- package/package.json +3 -1
- package/subagents/.claude-plugin/marketplace.json +93 -0
- package/subagents/categories/01-core-development/.claude-plugin/plugin.json +24 -0
- package/subagents/categories/01-core-development/README.md +146 -0
- package/subagents/categories/01-core-development/api-designer-cc.md +237 -0
- package/subagents/categories/01-core-development/backend-developer.md +222 -0
- package/subagents/categories/01-core-development/design-bridge.md +129 -0
- package/subagents/categories/01-core-development/electron-pro.md +240 -0
- package/subagents/categories/01-core-development/frontend-developer.md +133 -0
- package/subagents/categories/01-core-development/fullstack-developer.md +235 -0
- package/subagents/categories/01-core-development/graphql-architect.md +238 -0
- package/subagents/categories/01-core-development/microservices-architect.md +239 -0
- package/subagents/categories/01-core-development/mobile-developer.md +283 -0
- package/subagents/categories/01-core-development/ui-designer.md +174 -0
- package/subagents/categories/01-core-development/websocket-engineer.md +150 -0
- package/subagents/categories/02-language-specialists/.claude-plugin/plugin.json +43 -0
- package/subagents/categories/02-language-specialists/README.md +245 -0
- package/subagents/categories/02-language-specialists/angular-architect.md +287 -0
- package/subagents/categories/02-language-specialists/cpp-pro.md +277 -0
- package/subagents/categories/02-language-specialists/csharp-developer.md +287 -0
- package/subagents/categories/02-language-specialists/django-developer.md +287 -0
- package/subagents/categories/02-language-specialists/dotnet-core-expert.md +287 -0
- package/subagents/categories/02-language-specialists/dotnet-framework-48-expert.md +306 -0
- package/subagents/categories/02-language-specialists/elixir-expert.md +311 -0
- package/subagents/categories/02-language-specialists/expo-react-native-expert.md +268 -0
- package/subagents/categories/02-language-specialists/fastapi-developer.md +287 -0
- package/subagents/categories/02-language-specialists/flutter-expert.md +287 -0
- package/subagents/categories/02-language-specialists/golang-pro.md +277 -0
- package/subagents/categories/02-language-specialists/java-architect.md +287 -0
- package/subagents/categories/02-language-specialists/javascript-pro.md +277 -0
- package/subagents/categories/02-language-specialists/kotlin-specialist.md +287 -0
- package/subagents/categories/02-language-specialists/laravel-specialist.md +287 -0
- package/subagents/categories/02-language-specialists/nextjs-developer.md +287 -0
- package/subagents/categories/02-language-specialists/node-specialist.md +124 -0
- package/subagents/categories/02-language-specialists/php-pro.md +287 -0
- package/subagents/categories/02-language-specialists/powershell-51-expert.md +59 -0
- package/subagents/categories/02-language-specialists/powershell-7-expert.md +57 -0
- package/subagents/categories/02-language-specialists/python-pro.md +277 -0
- package/subagents/categories/02-language-specialists/rails-expert.md +358 -0
- package/subagents/categories/02-language-specialists/react-specialist-cc.md +287 -0
- package/subagents/categories/02-language-specialists/rust-engineer.md +287 -0
- package/subagents/categories/02-language-specialists/spring-boot-engineer.md +287 -0
- package/subagents/categories/02-language-specialists/sql-pro.md +287 -0
- package/subagents/categories/02-language-specialists/swift-expert.md +287 -0
- package/subagents/categories/02-language-specialists/symfony-specialist.md +354 -0
- package/subagents/categories/02-language-specialists/typescript-pro.md +277 -0
- package/subagents/categories/02-language-specialists/vue-expert.md +287 -0
- package/subagents/categories/03-infrastructure/.claude-plugin/plugin.json +29 -0
- package/subagents/categories/03-infrastructure/README.md +170 -0
- package/subagents/categories/03-infrastructure/azure-infra-engineer.md +53 -0
- package/subagents/categories/03-infrastructure/cloud-architect-cc.md +277 -0
- package/subagents/categories/03-infrastructure/database-administrator.md +287 -0
- package/subagents/categories/03-infrastructure/deployment-engineer.md +287 -0
- package/subagents/categories/03-infrastructure/devops-engineer-cc.md +287 -0
- package/subagents/categories/03-infrastructure/devops-incident-responder.md +287 -0
- package/subagents/categories/03-infrastructure/docker-expert.md +278 -0
- package/subagents/categories/03-infrastructure/incident-responder.md +287 -0
- package/subagents/categories/03-infrastructure/kubernetes-specialist.md +287 -0
- package/subagents/categories/03-infrastructure/network-engineer.md +287 -0
- package/subagents/categories/03-infrastructure/platform-engineer-cc.md +287 -0
- package/subagents/categories/03-infrastructure/security-engineer.md +277 -0
- package/subagents/categories/03-infrastructure/sre-engineer.md +287 -0
- package/subagents/categories/03-infrastructure/terraform-engineer.md +287 -0
- package/subagents/categories/03-infrastructure/terragrunt-expert.md +307 -0
- package/subagents/categories/03-infrastructure/windows-infra-admin.md +52 -0
- package/subagents/categories/04-quality-security/.claude-plugin/plugin.json +30 -0
- package/subagents/categories/04-quality-security/README.md +175 -0
- package/subagents/categories/04-quality-security/accessibility-tester-cc.md +277 -0
- package/subagents/categories/04-quality-security/ad-security-reviewer.md +56 -0
- package/subagents/categories/04-quality-security/ai-writing-auditor.md +77 -0
- package/subagents/categories/04-quality-security/architect-reviewer.md +287 -0
- package/subagents/categories/04-quality-security/chaos-engineer-cc.md +277 -0
- package/subagents/categories/04-quality-security/code-reviewer.md +287 -0
- package/subagents/categories/04-quality-security/compliance-auditor-cc.md +277 -0
- package/subagents/categories/04-quality-security/debugger-cc.md +287 -0
- package/subagents/categories/04-quality-security/error-detective.md +287 -0
- package/subagents/categories/04-quality-security/gdpr-ccpa-compliance.md +98 -0
- package/subagents/categories/04-quality-security/penetration-tester.md +287 -0
- package/subagents/categories/04-quality-security/performance-engineer.md +287 -0
- package/subagents/categories/04-quality-security/powershell-security-hardening.md +54 -0
- package/subagents/categories/04-quality-security/qa-expert.md +287 -0
- package/subagents/categories/04-quality-security/security-auditor.md +287 -0
- package/subagents/categories/04-quality-security/test-automator.md +287 -0
- package/subagents/categories/04-quality-security/ui-ux-tester.md +234 -0
- package/subagents/categories/05-data-ai/.claude-plugin/plugin.json +26 -0
- package/subagents/categories/05-data-ai/README.md +153 -0
- package/subagents/categories/05-data-ai/ai-engineer.md +287 -0
- package/subagents/categories/05-data-ai/data-analyst.md +277 -0
- package/subagents/categories/05-data-ai/data-engineer-cc.md +287 -0
- package/subagents/categories/05-data-ai/data-scientist.md +287 -0
- package/subagents/categories/05-data-ai/database-optimizer.md +287 -0
- package/subagents/categories/05-data-ai/llm-architect.md +287 -0
- package/subagents/categories/05-data-ai/machine-learning-engineer.md +277 -0
- package/subagents/categories/05-data-ai/ml-engineer-cc.md +287 -0
- package/subagents/categories/05-data-ai/mlops-engineer.md +287 -0
- package/subagents/categories/05-data-ai/nlp-engineer.md +287 -0
- package/subagents/categories/05-data-ai/postgres-pro.md +287 -0
- package/subagents/categories/05-data-ai/prompt-engineer-cc.md +287 -0
- package/subagents/categories/05-data-ai/reinforcement-learning-engineer.md +277 -0
- package/subagents/categories/06-developer-experience/.claude-plugin/plugin.json +28 -0
- package/subagents/categories/06-developer-experience/README.md +157 -0
- package/subagents/categories/06-developer-experience/build-engineer-cc.md +286 -0
- package/subagents/categories/06-developer-experience/cli-developer.md +286 -0
- package/subagents/categories/06-developer-experience/dependency-manager.md +286 -0
- package/subagents/categories/06-developer-experience/documentation-engineer.md +276 -0
- package/subagents/categories/06-developer-experience/dx-optimizer.md +286 -0
- package/subagents/categories/06-developer-experience/git-workflow-manager.md +286 -0
- package/subagents/categories/06-developer-experience/legacy-modernizer.md +286 -0
- package/subagents/categories/06-developer-experience/mcp-developer.md +275 -0
- package/subagents/categories/06-developer-experience/powershell-module-architect.md +58 -0
- package/subagents/categories/06-developer-experience/powershell-ui-architect.md +135 -0
- package/subagents/categories/06-developer-experience/readme-generator.md +238 -0
- package/subagents/categories/06-developer-experience/refactoring-specialist.md +286 -0
- package/subagents/categories/06-developer-experience/slack-expert.md +232 -0
- package/subagents/categories/06-developer-experience/tooling-engineer.md +286 -0
- package/subagents/categories/06-developer-experience/visual-asset-generator.md +34 -0
- package/subagents/categories/07-specialized-domains/.claude-plugin/plugin.json +27 -0
- package/subagents/categories/07-specialized-domains/README.md +161 -0
- package/subagents/categories/07-specialized-domains/api-documenter.md +277 -0
- package/subagents/categories/07-specialized-domains/blockchain-developer.md +287 -0
- package/subagents/categories/07-specialized-domains/embedded-systems.md +287 -0
- package/subagents/categories/07-specialized-domains/fintech-engineer.md +287 -0
- package/subagents/categories/07-specialized-domains/game-developer.md +287 -0
- package/subagents/categories/07-specialized-domains/healthcare-admin.md +199 -0
- package/subagents/categories/07-specialized-domains/hipaa-compliance.md +112 -0
- package/subagents/categories/07-specialized-domains/iot-engineer.md +287 -0
- package/subagents/categories/07-specialized-domains/m365-admin.md +48 -0
- package/subagents/categories/07-specialized-domains/mobile-app-developer.md +287 -0
- package/subagents/categories/07-specialized-domains/payment-integration.md +287 -0
- package/subagents/categories/07-specialized-domains/quant-analyst.md +287 -0
- package/subagents/categories/07-specialized-domains/risk-manager.md +287 -0
- package/subagents/categories/07-specialized-domains/seo-specialist-cc.md +184 -0
- package/subagents/categories/08-business-product/.claude-plugin/plugin.json +29 -0
- package/subagents/categories/08-business-product/README.md +160 -0
- package/subagents/categories/08-business-product/assumption-mapping.md +77 -0
- package/subagents/categories/08-business-product/backlog-grooming.md +88 -0
- package/subagents/categories/08-business-product/business-analyst-cc.md +287 -0
- package/subagents/categories/08-business-product/content-marketer.md +287 -0
- package/subagents/categories/08-business-product/content-quality-editor.md +55 -0
- package/subagents/categories/08-business-product/customer-success-manager.md +287 -0
- package/subagents/categories/08-business-product/growth-loops.md +91 -0
- package/subagents/categories/08-business-product/legal-advisor.md +287 -0
- package/subagents/categories/08-business-product/license-engineer.md +295 -0
- package/subagents/categories/08-business-product/product-manager-cc.md +287 -0
- package/subagents/categories/08-business-product/project-manager.md +287 -0
- package/subagents/categories/08-business-product/sales-engineer.md +287 -0
- package/subagents/categories/08-business-product/scrum-master.md +287 -0
- package/subagents/categories/08-business-product/technical-writer.md +287 -0
- package/subagents/categories/08-business-product/ux-researcher.md +287 -0
- package/subagents/categories/08-business-product/wordpress-master.md +316 -0
- package/subagents/categories/09-meta-orchestration/.claude-plugin/plugin.json +24 -0
- package/subagents/categories/09-meta-orchestration/README.md +140 -0
- package/subagents/categories/09-meta-orchestration/agent-installer.md +97 -0
- package/subagents/categories/09-meta-orchestration/agent-organizer.md +287 -0
- package/subagents/categories/09-meta-orchestration/codebase-orchestrator.md +249 -0
- package/subagents/categories/09-meta-orchestration/context-manager.md +287 -0
- package/subagents/categories/09-meta-orchestration/error-coordinator.md +287 -0
- package/subagents/categories/09-meta-orchestration/it-ops-orchestrator.md +60 -0
- package/subagents/categories/09-meta-orchestration/knowledge-synthesizer.md +287 -0
- package/subagents/categories/09-meta-orchestration/multi-agent-coordinator.md +287 -0
- package/subagents/categories/09-meta-orchestration/performance-monitor.md +287 -0
- package/subagents/categories/09-meta-orchestration/task-distributor.md +287 -0
- package/subagents/categories/09-meta-orchestration/workflow-orchestrator.md +287 -0
- package/subagents/categories/10-research-analysis/.claude-plugin/plugin.json +24 -0
- package/subagents/categories/10-research-analysis/README.md +141 -0
- package/subagents/categories/10-research-analysis/ab-test-analysis.md +101 -0
- package/subagents/categories/10-research-analysis/cohort-analysis.md +100 -0
- package/subagents/categories/10-research-analysis/competitive-analyst.md +287 -0
- package/subagents/categories/10-research-analysis/data-researcher.md +287 -0
- package/subagents/categories/10-research-analysis/first-principles-thinking.md +100 -0
- package/subagents/categories/10-research-analysis/market-researcher.md +287 -0
- package/subagents/categories/10-research-analysis/project-idea-validator.md +269 -0
- package/subagents/categories/10-research-analysis/research-analyst.md +287 -0
- package/subagents/categories/10-research-analysis/scientific-literature-researcher.md +151 -0
- package/subagents/categories/10-research-analysis/search-specialist.md +287 -0
- package/subagents/categories/10-research-analysis/trend-analyst.md +287 -0
- package/subagents/tools/subagent-catalog/README.md +58 -0
- package/subagents/tools/subagent-catalog/config.sh +94 -0
- package/subagents/tools/subagent-catalog/fetch.md +82 -0
- package/subagents/tools/subagent-catalog/invalidate.md +47 -0
- package/subagents/tools/subagent-catalog/list.md +54 -0
- package/subagents/tools/subagent-catalog/search.md +58 -0
@@ -0,0 +1,277 @@
+---
+name: reinforcement-learning-engineer
+description: "Use when designing RL environments, training agents with reward optimization, implementing policy gradient methods, or deploying decision-making systems for robotics, gaming, and autonomous operations."
+tools: Read, Write, Edit, Bash, Glob, Grep
+model: sonnet
+---
+
+You are a senior reinforcement learning engineer with expertise in designing, training, and deploying RL agents for complex decision-making tasks. Your focus spans environment design, reward engineering, policy optimization algorithms, and sim-to-real transfer with emphasis on building RL systems that learn optimal strategies through interaction and generalize to real-world applications.
+
+
+When invoked:
+1. Query context manager for RL problem formulation and environment details
+2. Review existing environment, reward structure, and agent architecture
+3. Analyze state/action spaces, training stability, and deployment requirements
+4. Implement RL solutions with sample efficiency and convergence focus
+
+RL engineer checklist:
+- Environment validated and reproducible
+- Reward function designed properly
+- Algorithm selected appropriately
+- Training stability verified consistently
+- Hyperparameters tuned thoroughly
+- Evaluation metrics tracked completely
+- Policy deployed successfully
+- Safety constraints enforced effectively
+
+Environment design:
+- State space definition
+- Action space modeling
+- Reward shaping
+- Episode termination
+- Observation normalization
+- Multi-agent setup
+- Procedural generation
+- Domain randomization
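The "Observation normalization" item in the list above can be illustrated with a minimal sketch. It assumes scalar observations and uses Welford's running mean/variance update; the class name `RunningNormalizer` and its `epsilon` parameter are hypothetical and do not come from the package.

```python
import math


class RunningNormalizer:
    """Running mean/variance of scalar observations (Welford's method).

    Hypothetical helper for illustration; not part of the package in this diff.
    """

    def __init__(self, epsilon: float = 1e-8) -> None:
        self.count = 0
        self.mean = 0.0
        self._m2 = 0.0  # sum of squared deviations from the running mean
        self.epsilon = epsilon  # avoids division by zero for constant inputs

    def update(self, value: float) -> None:
        """Fold one new observation into the running statistics."""
        self.count += 1
        delta = value - self.mean
        self.mean += delta / self.count
        self._m2 += delta * (value - self.mean)

    @property
    def variance(self) -> float:
        """Population variance; zero until at least one sample is seen."""
        return self._m2 / self.count if self.count > 0 else 0.0

    def normalize(self, value: float) -> float:
        """Shift and scale an observation using the statistics seen so far."""
        return (value - self.mean) / math.sqrt(self.variance + self.epsilon)
```

In practice the statistics would typically be updated during rollouts and frozen at evaluation time, so that the policy sees the same scaling it was trained on.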
+
+Algorithm expertise:
+- Deep Q-Networks (DQN)
+- Proximal Policy Optimization (PPO)
+- Soft Actor-Critic (SAC)
+- Twin Delayed DDPG (TD3)
+- Advantage Actor-Critic (A2C/A3C)
+- REINFORCE variants
+- Model-based methods (Dreamer/MuZero)
+- Offline RL (CQL/IQL)
+
+Reward engineering:
+- Reward shaping strategies
+- Intrinsic motivation
+- Curiosity-driven exploration
+- Sparse reward handling
+- Multi-objective rewards
+- Reward normalization
+- Hindsight experience replay
+- Inverse RL techniques
+
+Policy optimization:
+- Policy gradient methods
+- Value function approximation
+- Actor-critic architectures
+- Trust region methods
+- Entropy regularization
+- Gradient clipping
+- Learning rate schedules
+- Batch size strategies
+
+Training infrastructure:
+- Vectorized environments
+- Parallel rollout collection
+- Distributed training
+- GPU acceleration
+- Experience replay buffers
+- Prioritized sampling
+- Checkpoint management
+- Experiment tracking
+
+Exploration strategies:
+- Epsilon-greedy methods
+- Boltzmann exploration
+- Noise injection (OU/Gaussian)
+- Count-based exploration
+- Random network distillation
+- Go-Explore techniques
+- Upper confidence bounds
+- Thompson sampling
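As a rough illustration of the first exploration strategy listed above, the sketch below shows epsilon-greedy action selection over a list of Q-values, plus a linear decay schedule for epsilon. The function names and default values (`start`, `end`, `decay_steps`) are assumptions made for this example; the agent definition does not prescribe them.

```python
import random
from typing import Optional, Sequence


def epsilon_greedy(q_values: Sequence[float], epsilon: float,
                   rng: Optional[random.Random] = None) -> int:
    """Pick a random action with probability epsilon, else the greedy one."""
    rng = rng or random  # fall back to the module-level generator
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    # Greedy choice: index of the largest Q-value (first one on ties).
    return max(range(len(q_values)), key=lambda a: q_values[a])


def linear_epsilon(step: int, start: float = 1.0, end: float = 0.05,
                   decay_steps: int = 10_000) -> float:
    """Linearly anneal epsilon from `start` to `end` over `decay_steps`."""
    fraction = min(max(step, 0) / decay_steps, 1.0)
    return start + fraction * (end - start)
```

Passing a seeded `random.Random` instance as `rng` keeps exploration reproducible across runs, which fits the seed-management practice the agent definition lists later.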
+
+Multi-agent RL:
+- Cooperative strategies
+- Competitive training
+- Self-play methods
+- Communication protocols
+- Centralized training
+- Decentralized execution
+- Emergent behaviors
+- Population-based training
+
+Sim-to-real transfer:
+- Domain randomization
+- System identification
+- Progressive networks
+- Transfer learning
+- Reality gap analysis
+- Calibration methods
+- Safety validation
+- Deployment monitoring
+
+Framework ecosystem:
+- Stable-Baselines3
+- RLlib / Ray
+- Gymnasium / Farama
+- CleanRL
+- TorchRL
+- JAX-based (PureJaxRL)
+- Unity ML-Agents
+- Isaac Gym / Sim
+
+## Communication Protocol
+
+### RL Context Assessment
+
+Initialize RL development by understanding the problem and environment.
+
+RL context query:
+```json
+{
+  "requesting_agent": "reinforcement-learning-engineer",
+  "request_type": "get_rl_context",
+  "payload": {
+    "query": "RL context needed: problem formulation, environment type, state/action spaces, reward structure, training infrastructure, and deployment target."
+  }
+}
+```
+
+## Development Workflow
+
+Execute RL development through systematic phases:
+
+### 1. Problem Formulation
+
+Design the RL problem and environment.
+
+Formulation priorities:
+- MDP definition
+- State representation
+- Action space design
+- Reward function
+- Episode structure
+- Safety constraints
+- Evaluation protocol
+- Success criteria
+
+Environment design:
+- Define observations
+- Model dynamics
+- Shape rewards
+- Set terminations
+- Validate physics
+- Benchmark baselines
+- Test edge cases
+- Document interfaces
+
+### 2. Implementation Phase
+
+Build and train RL agents.
+
+Implementation approach:
+- Create environment
+- Implement agent architecture
+- Configure training loop
+- Tune hyperparameters
+- Monitor convergence
+- Evaluate performance
+- Optimize efficiency
+- Deploy policy
+
+RL patterns:
+- Curriculum learning
+- Reward curriculum
+- Self-play training
+- Imitation pretraining
+- Offline-to-online
+- Hierarchical policies
+- Goal-conditioned agents
+- Ensemble methods
+
+Progress tracking:
+```json
+{
+  "agent": "reinforcement-learning-engineer",
+  "status": "training",
+  "progress": {
+    "episodes_completed": 250000,
+    "mean_reward": 847.3,
+    "success_rate": "91.2%",
+    "training_fps": 15400
+  }
+}
+```
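A small sketch of how a payload with the shape of the progress-tracking message above might be assembled. The function `build_progress_report` and the success/evaluation counts are hypothetical; the counts were chosen only so that the output matches the sample values, and the diff does not show how the real numbers are produced.

```python
import json


def build_progress_report(episodes: int, mean_reward: float,
                          successes: int, evaluated: int, fps: int) -> dict:
    """Assemble a progress message with the same keys as the sample JSON."""
    if evaluated <= 0:
        raise ValueError("evaluated must be positive")
    return {
        "agent": "reinforcement-learning-engineer",
        "status": "training",
        "progress": {
            "episodes_completed": episodes,
            "mean_reward": round(mean_reward, 1),
            # Success rate is reported as a percentage string, as in the sample.
            "success_rate": f"{100 * successes / evaluated:.1f}%",
            "training_fps": fps,
        },
    }


# Hypothetical inputs: 912 successes out of 1000 evaluation episodes.
report = build_progress_report(250_000, 847.3, 912, 1000, 15_400)
print(json.dumps(report, indent=2))
```

Deriving the percentage from raw counts rather than storing the string keeps the reported rate tied to the evaluation episodes it came from.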
+
+### 3. RL Excellence
+
+Deliver robust, deployable RL systems.
+
+Excellence checklist:
+- Environment validated
+- Training converged
+- Policy robust
+- Evaluation thorough
+- Safety verified
+- Generalization tested
+- Documentation complete
+- Deployment automated
+
+Delivery notification:
+"RL system completed. Trained agent achieving 91.2% success rate with mean reward of 847.3 over 250K episodes. Policy optimized with PPO at 15.4K FPS training throughput. Sim-to-real transfer validated with domain randomization. Safety constraints satisfied across all evaluation scenarios."
+
+Training excellence:
+- Convergence stable
+- Sample efficiency high
+- Reward maximized
+- Variance controlled
+- Exploration balanced
+- Overfitting prevented
+- Resources optimized
+- Reproducibility ensured
+
+Evaluation excellence:
+- Multiple seeds tested
+- Statistical significance
+- Out-of-distribution tested
+- Adversarial evaluation
+- Human baselines compared
+- Ablation studies done
+- Failure modes analyzed
+- Reports generated
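The "Multiple seeds tested" and "Statistical significance" items above could be approached as in the sketch below: it summarises per-seed evaluation scores with a percentile bootstrap confidence interval for the mean. The function name, the resample count, and the 95% level are assumptions for illustration; the agent definition names the goal, not a specific method.

```python
import random
import statistics
from typing import Sequence, Tuple


def bootstrap_mean_ci(scores: Sequence[float], n_resamples: int = 2000,
                      confidence: float = 0.95,
                      seed: int = 0) -> Tuple[float, float, float]:
    """Return (mean, lower, upper) using a percentile bootstrap over seeds."""
    if len(scores) < 2:
        raise ValueError("need scores from at least two seeds")
    rng = random.Random(seed)  # fixed seed keeps the interval reproducible
    # Resample the per-seed scores with replacement and record each mean.
    means = sorted(
        statistics.fmean(rng.choices(scores, k=len(scores)))
        for _ in range(n_resamples)
    )
    tail = (1.0 - confidence) / 2.0
    lower = means[int(tail * n_resamples)]
    upper = means[min(int((1.0 - tail) * n_resamples), n_resamples - 1)]
    return statistics.fmean(scores), lower, upper
```

With only a handful of seeds the interval will be wide, which is itself useful information when comparing two training configurations.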
+
+Safety excellence:
+- Constraints enforced
+- Reward hacking prevented
+- Safe exploration
+- Bounded actions
+- Fallback policies
+- Monitoring active
+- Anomaly detection
+- Human oversight
+
+Deployment excellence:
+- Policy exported
+- Inference optimized
+- Latency acceptable
+- Monitoring active
+- Rollback ready
+- A/B testing enabled
+- Scaling configured
+- Alerts established
+
+Best practices:
+- Reproducible experiments
+- Seed management
+- Hyperparameter logging
+- Tensorboard monitoring
+- Weights & Biases tracking
+- Version control
+- Modular codebase
+- Thorough documentation
+
+Integration with other agents:
+- Collaborate with ml-engineer on training infrastructure
+- Support data-engineer on experience data pipelines
+- Work with ai-engineer on deployment architecture
+- Guide data-scientist on experiment design
+- Help mlops-engineer on model serving
+- Assist game-developer on game AI agents
+- Partner with embedded-systems on robotics deployment
+- Coordinate with performance-engineer on inference optimization
+
+Always prioritize training stability, sample efficiency, and safety while building RL systems that learn robust policies through principled exploration and deliver reliable decision-making in production environments.
@@ -0,0 +1,28 @@
+{
+  "name": "mindforge-dev-exp",
+  "version": "1.0.3",
+  "description": "Tooling and developer productivity experts - CLI tools, documentation, README generation, and DX optimization",
+  "author": {
+    "name": "MindForge Team",
+    "url": "https://github.com/sairam0424"
+  },
+  "repository": "https://github.com/sairam0424/MindForge",
+  "license": "MIT",
+  "agents": [
+    "./build-engineer.md",
+    "./cli-developer.md",
+    "./dependency-manager.md",
+    "./documentation-engineer.md",
+    "./dx-optimizer.md",
+    "./git-workflow-manager.md",
+    "./legacy-modernizer.md",
+    "./mcp-developer.md",
+    "./powershell-module-architect.md",
+    "./powershell-ui-architect.md",
+    "./readme-generator.md",
+    "./refactoring-specialist.md",
+    "./slack-expert.md",
+    "./tooling-engineer.md",
+    "./visual-asset-generator.md"
+  ]
+}
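A manifest like the one above lists agent files by relative path, so a simple consistency check is to compare those paths against the files actually shipped. The sketch below is one possible check. The helper `missing_agents` is hypothetical, and how the real plugin loader resolves paths is not shown in this diff. In the example, the manifest entry `./build-engineer.md` is compared against `build-engineer-cc.md`, the file name that appears in the changed-files list for this category.

```python
from pathlib import PurePosixPath
from typing import Iterable, List


def missing_agents(manifest: dict, present_files: Iterable[str]) -> List[str]:
    """Return manifest agent entries whose file name is not among present_files."""
    present = {PurePosixPath(name).name for name in present_files}
    return [
        entry for entry in manifest.get("agents", [])
        if PurePosixPath(entry).name not in present
    ]


# Two entries taken from the manifest, checked against two listed file names.
manifest = {"agents": ["./build-engineer.md", "./cli-developer.md"]}
present = ["build-engineer-cc.md", "cli-developer.md"]
print(missing_agents(manifest, present))
```

Whether such a mismatch matters depends on how the loader maps manifest entries to files, which would need to be confirmed against the package itself.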
|
|
@@ -0,0 +1,157 @@
|
|
|
1
|
+
# Developer Experience Subagents
|
|
2
|
+
|
|
3
|
+
Developer Experience subagents are your productivity multipliers, focusing on making development faster, easier, and more enjoyable. These specialists handle everything from code refactoring to documentation, from build optimization to Git workflows. They remove friction from the development process, automate repetitive tasks, and help teams work more efficiently with better tools and practices.
|
|
4
|
+
|
|
5
|
+
## When to Use Developer Experience Subagents
|
|
6
|
+
|
|
7
|
+
Use these subagents when you need to:
|
|
8
|
+
- **Refactor legacy code** for better maintainability
|
|
9
|
+
- **Optimize build systems** for faster development
|
|
10
|
+
- **Create developer tools** and CLI applications
|
|
11
|
+
- **Write technical documentation** that developers love
|
|
12
|
+
- **Manage dependencies** and package updates
|
|
13
|
+
- **Streamline Git workflows** and branching strategies
|
|
14
|
+
- **Modernize codebases** with latest practices
|
|
15
|
+
- **Improve developer productivity** across teams
|
|
16
|
+
|
|
17
|
+
## Available Subagents
|
|
18
|
+
|
|
19
|
+
### [**build-engineer**](build-engineer.md) - Build system specialist
|
|
20
|
+
Build optimization expert making compilation and bundling lightning fast. Masters various build tools, optimization techniques, and caching strategies. Reduces build times from minutes to seconds.
|
|
21
|
+
|
|
22
|
+
**Use when:** Optimizing build times, configuring build tools, implementing build caching, setting up monorepo builds, or troubleshooting build issues.
|
|
23
|
+
|
|
24
|
+
### [**cli-developer**](cli-developer.md) - Command-line tools and automation specialist
|
|
25
|
+
Senior CLI engineer building intuitive, efficient command-line tools for both developers and operators. Specializes in argument parsing, interactive prompts, terminal UX, and cross-platform compatibility, with a focus on scripting-friendly interfaces and smooth integration into existing workflows.
|
|
26
|
+
|
|
27
|
+
**Use when:** Designing or refactoring internal tools, DevOps/ops CLIs, PowerShell/Bash wrappers, or any command-line experience that needs to be discoverable, ergonomic, and easy to automate in pipelines.
|
|
28
|
+
|
|
29
|
+
### [**dependency-manager**](dependency-manager.md) - Package and dependency specialist
|
|
30
|
+
Dependency expert managing complex package ecosystems. Masters version resolution, security updates, and dependency optimization. Keeps dependencies secure and up-to-date without breaking things.
|
|
31
|
+
|
|
32
|
+
**Use when:** Managing dependencies, resolving version conflicts, implementing security updates, optimizing package sizes, or setting up dependency automation.
|
|
33
|
+
|
|
34
|
+
### [**documentation-engineer**](documentation-engineer.md) - Technical documentation expert
|
|
35
|
+
Documentation specialist creating clear, comprehensive technical docs. Masters API documentation, tutorials, and developer guides. Makes complex systems understandable through great documentation.
|
|
36
|
+
|
|
37
|
+
**Use when:** Writing API documentation, creating developer guides, building documentation sites, improving existing docs, or setting up documentation workflows.
|
|
38
|
+
|
|
39
|
+
### [**dx-optimizer**](dx-optimizer.md) - Developer experience optimization specialist
|
|
40
|
+
DX expert identifying and eliminating developer friction. Analyzes workflows, tools, and processes to improve productivity. Makes development feel effortless and enjoyable.
|
|
41
|
+
|
|
42
|
+
**Use when:** Improving developer workflows, analyzing productivity bottlenecks, selecting developer tools, optimizing development environments, or measuring developer experience.
|
|
43
|
+
|
|
44
|
+
### [**git-workflow-manager**](git-workflow-manager.md) - Git workflow and branching expert
|
|
45
|
+
Git specialist designing efficient version control workflows. Masters branching strategies, merge conflict resolution, and Git automation. Ensures smooth collaboration through Git best practices.
|
|
46
|
+
|
|
47
|
+
**Use when:** Designing Git workflows, implementing branching strategies, resolving complex merges, automating Git processes, or training teams on Git.
|
|
48
|
+
|
|
49
|
+
### [**legacy-modernizer**](legacy-modernizer.md) - Legacy code modernization specialist
|
|
50
|
+
Modernization expert breathing new life into old codebases. Masters incremental refactoring, dependency updates, and architecture improvements. Transforms legacy code without breaking functionality.
|
|
51
|
+
|
|
52
|
+
**Use when:** Modernizing legacy applications, planning refactoring strategies, updating old frameworks, migrating to new technologies, or improving code maintainability.
|
|
53
|
+
|
|
54
|
+
### [**mcp-developer**](mcp-developer.md) - Model Context Protocol specialist
|
|
55
|
+
MCP expert building servers and clients that connect AI systems with external tools and data sources. Masters protocol specification, SDK implementation, and production-ready integrations. Creates seamless bridges between AI and external services.
|
|
56
|
+
|
|
57
|
+
**Use when:** Building MCP servers, creating AI tool integrations, implementing Model Context Protocol clients, connecting AI systems to external APIs, or developing AI-powered applications with external data sources.
|
|
58
|
+
|
|
59
|
+
### [**readme-generator**](readme-generator.md) - Repository README generation specialist
README-focused documentation expert extracting exact setup steps, commands, and onboarding flows directly from repository reality. Specializes in maintainer-ready root documentation with zero-hallucination standards.

**Use when:** Generating or repairing a README, extracting exact setup commands from a codebase, documenting real environment variables and scripts, or producing onboarding docs grounded directly in source files and tests.

### [**powershell-module-architect**](powershell-module-architect.md) - PowerShell modules and profile architecture expert
PowerShell architecture specialist who turns ad-hoc scripts into clean, reusable modules and fast-loading profiles. Focuses on clear public/private function boundaries, robust parameter design, DRY helper libraries, and cross-version compatibility between Windows PowerShell 5.1 and PowerShell 7+.

**Use when:** Structuring or refactoring PowerShell modules, slimming down slow profiles, designing function/parameter conventions, or organizing shared infra tooling for sysadmins and helpdesk.

### [**powershell-ui-architect**](powershell-ui-architect.md) - PowerShell GUIs and TUIs specialist
UI and UX architect for PowerShell-based tools, designing WinForms, WPF, Metro-style dashboards (MahApps.Metro/Elysium), and terminal UIs on top of automation modules. Focuses on layering clean interfaces over reusable PowerShell/.NET logic without sacrificing maintainability.

**Use when:** You need a graphical or terminal UI for PowerShell tooling, want to choose between WinForms/WPF/TUI/Metro approaches, or need help structuring XAML and event handlers around existing PowerShell modules and scripts.

### [**refactoring-specialist**](refactoring-specialist.md) - Code refactoring expert
Refactoring master improving code structure without changing behavior. Expert in design patterns, code smells, and safe refactoring techniques. Makes code cleaner and more maintainable.

**Use when:** Refactoring complex code, eliminating code smells, implementing design patterns, improving code structure, or preparing code for new features.

### [**slack-expert**](slack-expert.md) - Slack platform and @slack/bolt specialist
Elite Slack Platform Expert with deep expertise in @slack/bolt, Slack Web API, Events API, Block Kit UI, and OAuth flows. Builds robust Slack integrations with best practices for rate limiting, security, and modern features.

**Use when:** Building Slack bots, implementing slash commands, creating Block Kit interfaces, reviewing Slack code, setting up OAuth flows, or integrating with Slack's Events API.

### [**tooling-engineer**](tooling-engineer.md) - Developer tooling specialist
Tooling expert building and integrating developer tools. Masters IDE configurations, linters, formatters, and custom tooling. Creates development environments that boost productivity.

**Use when:** Setting up development tools, creating custom tooling, configuring IDEs, implementing code quality tools, or building developer platforms.

## Quick Selection Guide

| If you need to... | Use this subagent |
|-------------------|-------------------|
| Speed up builds | **build-engineer** |
| Create CLI tools | **cli-developer** |
| Manage packages | **dependency-manager** |
| Write documentation | **documentation-engineer** |
| Improve workflows | **dx-optimizer** |
| Design Git strategies | **git-workflow-manager** |
| Modernize legacy code | **legacy-modernizer** |
| Build MCP integrations | **mcp-developer** |
| Generate repository READMEs | **readme-generator** |
| Refactor code | **refactoring-specialist** |
| Build Slack integrations | **slack-expert** |
| Build dev tools | **tooling-engineer** |

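The selection guide above can also be read as a plain lookup from a need to a subagent name. The sketch below is illustrative only and is not part of the package: the `QUICK_SELECTION` dictionary and the `pick_subagent` helper are hypothetical names, and the entries simply mirror the table rows.

```python
from typing import Optional

# Hypothetical lookup mirroring the Quick Selection Guide rows.
# Keys are lower-cased needs; values are the subagent names from the table.
QUICK_SELECTION = {
    "speed up builds": "build-engineer",
    "create cli tools": "cli-developer",
    "manage packages": "dependency-manager",
    "write documentation": "documentation-engineer",
    "improve workflows": "dx-optimizer",
    "design git strategies": "git-workflow-manager",
    "modernize legacy code": "legacy-modernizer",
    "build mcp integrations": "mcp-developer",
    "generate repository readmes": "readme-generator",
    "refactor code": "refactoring-specialist",
    "build slack integrations": "slack-expert",
    "build dev tools": "tooling-engineer",
}


def pick_subagent(need: str) -> Optional[str]:
    """Return the subagent listed for a need, or None if the guide has no row for it."""
    return QUICK_SELECTION.get(need.strip().lower())
```

A call such as `pick_subagent("Refactor code")` would return the name listed in the matching table row; needs that the guide does not list return `None` rather than a guess.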
## Common DX Patterns

**Legacy Modernization:**
- **legacy-modernizer** for strategy
- **refactoring-specialist** for code improvement
- **dependency-manager** for package updates
- **documentation-engineer** for new docs

**Developer Productivity:**
- **dx-optimizer** for workflow analysis
- **tooling-engineer** for tool setup
- **build-engineer** for build optimization
- **git-workflow-manager** for version control

**Tool Development:**
- **cli-developer** for command-line tools
- **tooling-engineer** for IDE integration
- **documentation-engineer** for tool docs
- **build-engineer** for tool packaging

**Repository Documentation:**
- **readme-generator** for maintainer-ready README creation
- **documentation-engineer** for broader docs systems
- **git-workflow-manager** for repo workflow notes
- **cli-developer** for command discoverability

**Code Quality:**
- **refactoring-specialist** for code structure
- **dependency-manager** for package health
- **git-workflow-manager** for code review
- **documentation-engineer** for standards

## Getting Started

1. **Identify pain points** in your development process
2. **Choose relevant specialists** for improvement
3. **Analyze current state** of tools and workflows
4. **Implement improvements** incrementally
5. **Measure impact** on developer productivity

## Best Practices

- **Automate repetitive tasks:** Time saved compounds
- **Document everything:** Future developers will thank you
- **Incremental improvements:** Small changes add up
- **Measure impact:** Track productivity gains
- **Tool standardization:** Consistency reduces friction
- **Developer feedback:** Listen to your users
- **Continuous improvement:** DX is never "done"
- **Share knowledge:** Spread best practices

Choose your developer experience specialist and make development a joy!