tech-hub-skills 1.2.0 → 1.5.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/{LICENSE → .claude/LICENSE} +21 -21
- package/.claude/README.md +291 -0
- package/.claude/bin/cli.js +266 -0
- package/{bin → .claude/bin}/copilot.js +182 -182
- package/{bin → .claude/bin}/postinstall.js +42 -42
- package/{tech_hub_skills/skills → .claude/commands}/README.md +336 -336
- package/{tech_hub_skills/skills → .claude/commands}/ai-engineer.md +104 -104
- package/{tech_hub_skills/skills → .claude/commands}/aws.md +143 -143
- package/{tech_hub_skills/skills → .claude/commands}/azure.md +149 -149
- package/{tech_hub_skills/skills → .claude/commands}/backend-developer.md +108 -108
- package/{tech_hub_skills/skills → .claude/commands}/code-review.md +399 -399
- package/{tech_hub_skills/skills → .claude/commands}/compliance-automation.md +747 -747
- package/{tech_hub_skills/skills → .claude/commands}/compliance-officer.md +108 -108
- package/{tech_hub_skills/skills → .claude/commands}/data-engineer.md +113 -113
- package/{tech_hub_skills/skills → .claude/commands}/data-governance.md +102 -102
- package/{tech_hub_skills/skills → .claude/commands}/data-scientist.md +123 -123
- package/{tech_hub_skills/skills → .claude/commands}/database-admin.md +109 -109
- package/{tech_hub_skills/skills → .claude/commands}/devops.md +160 -160
- package/{tech_hub_skills/skills → .claude/commands}/docker.md +160 -160
- package/{tech_hub_skills/skills → .claude/commands}/enterprise-dashboard.md +613 -613
- package/{tech_hub_skills/skills → .claude/commands}/finops.md +184 -184
- package/{tech_hub_skills/skills → .claude/commands}/frontend-developer.md +108 -108
- package/{tech_hub_skills/skills → .claude/commands}/gcp.md +143 -143
- package/{tech_hub_skills/skills → .claude/commands}/ml-engineer.md +115 -115
- package/{tech_hub_skills/skills → .claude/commands}/mlops.md +187 -187
- package/{tech_hub_skills/skills → .claude/commands}/network-engineer.md +109 -109
- package/{tech_hub_skills/skills → .claude/commands}/optimization-advisor.md +329 -329
- package/{tech_hub_skills/skills → .claude/commands}/orchestrator.md +623 -623
- package/{tech_hub_skills/skills → .claude/commands}/platform-engineer.md +102 -102
- package/{tech_hub_skills/skills → .claude/commands}/process-automation.md +226 -226
- package/{tech_hub_skills/skills → .claude/commands}/process-changelog.md +184 -184
- package/{tech_hub_skills/skills → .claude/commands}/process-documentation.md +484 -484
- package/{tech_hub_skills/skills → .claude/commands}/process-kanban.md +324 -324
- package/{tech_hub_skills/skills → .claude/commands}/process-versioning.md +214 -214
- package/{tech_hub_skills/skills → .claude/commands}/product-designer.md +104 -104
- package/{tech_hub_skills/skills → .claude/commands}/project-starter.md +443 -443
- package/{tech_hub_skills/skills → .claude/commands}/qa-engineer.md +109 -109
- package/{tech_hub_skills/skills → .claude/commands}/security-architect.md +135 -135
- package/{tech_hub_skills/skills → .claude/commands}/sre.md +109 -109
- package/{tech_hub_skills/skills → .claude/commands}/system-design.md +126 -126
- package/{tech_hub_skills/skills → .claude/commands}/technical-writer.md +101 -101
- package/.claude/package.json +46 -0
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/01-prompt-engineering/README.md +252 -252
- package/.claude/roles/ai-engineer/skills/01-prompt-engineering/prompt_ab_tester.py +356 -0
- package/.claude/roles/ai-engineer/skills/01-prompt-engineering/prompt_template_manager.py +274 -0
- package/.claude/roles/ai-engineer/skills/01-prompt-engineering/token_cost_estimator.py +324 -0
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/02-rag-pipeline/README.md +448 -448
- package/.claude/roles/ai-engineer/skills/02-rag-pipeline/document_chunker.py +336 -0
- package/.claude/roles/ai-engineer/skills/02-rag-pipeline/rag_pipeline.sql +213 -0
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/03-agent-orchestration/README.md +599 -599
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/04-llm-guardrails/README.md +735 -735
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/05-vector-embeddings/README.md +711 -711
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/06-llm-evaluation/README.md +777 -777
- package/{tech_hub_skills → .claude}/roles/azure/skills/01-infrastructure-fundamentals/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/02-data-factory/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/03-synapse-analytics/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/04-databricks/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/05-functions/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/06-kubernetes-service/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/07-openai-service/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/08-machine-learning/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/09-storage-adls/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/10-networking/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/11-sql-cosmos/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/12-event-hubs/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/code-review/skills/01-automated-code-review/README.md +394 -394
- package/{tech_hub_skills → .claude}/roles/code-review/skills/02-pr-review-workflow/README.md +427 -427
- package/{tech_hub_skills → .claude}/roles/code-review/skills/03-code-quality-gates/README.md +518 -518
- package/{tech_hub_skills → .claude}/roles/code-review/skills/04-reviewer-assignment/README.md +504 -504
- package/{tech_hub_skills → .claude}/roles/code-review/skills/05-review-analytics/README.md +540 -540
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/01-lakehouse-architecture/README.md +550 -550
- package/.claude/roles/data-engineer/skills/01-lakehouse-architecture/bronze_ingestion.py +337 -0
- package/.claude/roles/data-engineer/skills/01-lakehouse-architecture/medallion_queries.sql +300 -0
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/02-etl-pipeline/README.md +580 -580
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/03-data-quality/README.md +579 -579
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/04-streaming-pipelines/README.md +608 -608
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/05-performance-optimization/README.md +547 -547
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/01-data-catalog/README.md +112 -112
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/02-data-lineage/README.md +129 -129
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/03-data-quality-framework/README.md +182 -182
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/04-access-control/README.md +39 -39
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/05-master-data-management/README.md +40 -40
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/06-compliance-privacy/README.md +46 -46
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/01-eda-automation/README.md +230 -230
- package/.claude/roles/data-scientist/skills/01-eda-automation/eda_generator.py +446 -0
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/02-statistical-modeling/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/03-feature-engineering/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/04-predictive-modeling/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/05-customer-analytics/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/06-campaign-analysis/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/07-experimentation/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/08-data-visualization/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/01-cicd-pipeline/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/02-container-orchestration/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/03-infrastructure-as-code/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/04-gitops/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/05-environment-management/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/06-automated-testing/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/07-release-management/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/08-monitoring-alerting/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/09-devsecops/README.md +265 -265
- package/{tech_hub_skills → .claude}/roles/finops/skills/01-cost-visibility/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/02-resource-tagging/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/03-budget-management/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/04-reserved-instances/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/05-spot-optimization/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/06-storage-tiering/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/07-compute-rightsizing/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/08-chargeback/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/01-mlops-pipeline/README.md +566 -566
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/02-feature-engineering/README.md +655 -655
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/03-model-training/README.md +704 -704
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/04-model-serving/README.md +845 -845
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/05-model-monitoring/README.md +874 -874
- package/{tech_hub_skills → .claude}/roles/mlops/skills/01-ml-pipeline-orchestration/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/02-experiment-tracking/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/03-model-registry/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/04-feature-store/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/05-model-deployment/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/06-model-observability/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/07-data-versioning/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/08-ab-testing/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/09-automated-retraining/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/01-internal-developer-platform/README.md +153 -153
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/02-self-service-infrastructure/README.md +57 -57
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/03-slo-sli-management/README.md +59 -59
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/04-developer-experience/README.md +57 -57
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/05-incident-management/README.md +73 -73
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/06-capacity-management/README.md +59 -59
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/01-requirements-discovery/README.md +407 -407
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/02-user-research/README.md +382 -382
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/03-brainstorming-ideation/README.md +437 -437
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/04-ux-design/README.md +496 -496
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/05-product-market-fit/README.md +376 -376
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/06-stakeholder-management/README.md +412 -412
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/01-pii-detection/README.md +319 -319
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/02-threat-modeling/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/03-infrastructure-security/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/04-iam/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/05-application-security/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/06-secrets-management/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/07-security-monitoring/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/01-architecture-patterns/README.md +337 -337
- package/{tech_hub_skills → .claude}/roles/system-design/skills/02-requirements-engineering/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/03-scalability/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/04-high-availability/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/05-cost-optimization-design/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/06-api-design/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/07-observability-architecture/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/08-process-automation/PROCESS_TEMPLATE.md +336 -336
- package/{tech_hub_skills → .claude}/roles/system-design/skills/08-process-automation/README.md +521 -521
- package/.claude/roles/system-design/skills/08-process-automation/ai_prompt_generator.py +744 -0
- package/.claude/roles/system-design/skills/08-process-automation/automation_recommender.py +688 -0
- package/.claude/roles/system-design/skills/08-process-automation/plan_generator.py +679 -0
- package/.claude/roles/system-design/skills/08-process-automation/process_analyzer.py +528 -0
- package/.claude/roles/system-design/skills/08-process-automation/process_parser.py +684 -0
- package/.claude/roles/system-design/skills/08-process-automation/role_matcher.py +615 -0
- package/.claude/skills/README.md +336 -0
- package/.claude/skills/ai-engineer.md +104 -0
- package/.claude/skills/aws.md +143 -0
- package/.claude/skills/azure.md +149 -0
- package/.claude/skills/backend-developer.md +108 -0
- package/.claude/skills/code-review.md +399 -0
- package/.claude/skills/compliance-automation.md +747 -0
- package/.claude/skills/compliance-officer.md +108 -0
- package/.claude/skills/data-engineer.md +113 -0
- package/.claude/skills/data-governance.md +102 -0
- package/.claude/skills/data-scientist.md +123 -0
- package/.claude/skills/database-admin.md +109 -0
- package/.claude/skills/devops.md +160 -0
- package/.claude/skills/docker.md +160 -0
- package/.claude/skills/enterprise-dashboard.md +613 -0
- package/.claude/skills/finops.md +184 -0
- package/.claude/skills/frontend-developer.md +108 -0
- package/.claude/skills/gcp.md +143 -0
- package/.claude/skills/ml-engineer.md +115 -0
- package/.claude/skills/mlops.md +187 -0
- package/.claude/skills/network-engineer.md +109 -0
- package/.claude/skills/optimization-advisor.md +329 -0
- package/.claude/skills/orchestrator.md +623 -0
- package/.claude/skills/platform-engineer.md +102 -0
- package/.claude/skills/process-automation.md +226 -0
- package/.claude/skills/process-changelog.md +184 -0
- package/.claude/skills/process-documentation.md +484 -0
- package/.claude/skills/process-kanban.md +324 -0
- package/.claude/skills/process-versioning.md +214 -0
- package/.claude/skills/product-designer.md +104 -0
- package/.claude/skills/project-starter.md +443 -0
- package/.claude/skills/qa-engineer.md +109 -0
- package/.claude/skills/security-architect.md +135 -0
- package/.claude/skills/sre.md +109 -0
- package/.claude/skills/system-design.md +126 -0
- package/.claude/skills/technical-writer.md +101 -0
- package/.gitattributes +2 -0
- package/GITHUB_COPILOT.md +106 -0
- package/README.md +192 -291
- package/package.json +16 -46
- package/bin/cli.js +0 -241
|
@@ -1,123 +1,123 @@
|
|
|
1
|
-
# Data Scientist Skills
|
|
2
|
-
|
|
3
|
-
You are a Data Science specialist with expertise in statistical modeling, machine learning, experimentation, and data-driven insights.
|
|
4
|
-
|
|
5
|
-
## Available Skills
|
|
6
|
-
|
|
7
|
-
1. **ds-01: Automated EDA**
|
|
8
|
-
|
|
9
|
-
- Comprehensive data profiling
|
|
10
|
-
- Missing value analysis
|
|
11
|
-
- Distribution analysis
|
|
12
|
-
- Correlation matrices
|
|
13
|
-
- Automated report generation
|
|
14
|
-
|
|
15
|
-
2. **ds-02: Statistical Modeling**
|
|
16
|
-
|
|
17
|
-
- Hypothesis testing
|
|
18
|
-
- Regression analysis
|
|
19
|
-
- Time series analysis
|
|
20
|
-
- Bayesian statistics
|
|
21
|
-
- A/B test analysis
|
|
22
|
-
|
|
23
|
-
3. **ds-03: Feature Engineering**
|
|
24
|
-
|
|
25
|
-
- Feature selection techniques
|
|
26
|
-
- Feature transformation
|
|
27
|
-
- Encoding strategies
|
|
28
|
-
- Feature importance analysis
|
|
29
|
-
- Automated feature generation
|
|
30
|
-
|
|
31
|
-
4. **ds-04: Predictive Modeling**
|
|
32
|
-
|
|
33
|
-
- Classification pipelines
|
|
34
|
-
- Regression pipelines
|
|
35
|
-
- Ensemble methods
|
|
36
|
-
- Hyperparameter tuning
|
|
37
|
-
- Cross-validation strategies
|
|
38
|
-
|
|
39
|
-
5. **ds-05: Customer Analytics**
|
|
40
|
-
|
|
41
|
-
- Customer segmentation (RFM, K-means)
|
|
42
|
-
- Churn prediction
|
|
43
|
-
- CLV modeling
|
|
44
|
-
- Propensity scoring
|
|
45
|
-
- Customer journey analysis
|
|
46
|
-
|
|
47
|
-
6. **ds-06: Campaign Analysis**
|
|
48
|
-
|
|
49
|
-
- Campaign performance metrics
|
|
50
|
-
- Attribution modeling
|
|
51
|
-
- Uplift modeling
|
|
52
|
-
- ROI calculation
|
|
53
|
-
- Channel optimization
|
|
54
|
-
|
|
55
|
-
7. **ds-07: Experimentation**
|
|
56
|
-
|
|
57
|
-
- A/B test design
|
|
58
|
-
- Sample size calculation
|
|
59
|
-
- Statistical significance testing
|
|
60
|
-
- Multi-armed bandits
|
|
61
|
-
- Sequential testing
|
|
62
|
-
|
|
63
|
-
8. **ds-08: Data Visualization**
|
|
64
|
-
- Interactive dashboards
|
|
65
|
-
- Exploratory visualizations
|
|
66
|
-
- Presentation-ready plots
|
|
67
|
-
- Geospatial visualization
|
|
68
|
-
- Time series plots
|
|
69
|
-
|
|
70
|
-
## When to Use Data Scientist Skills
|
|
71
|
-
|
|
72
|
-
- Exploratory data analysis on new datasets
|
|
73
|
-
- Building predictive models
|
|
74
|
-
- Designing and analyzing experiments
|
|
75
|
-
- Customer segmentation and analytics
|
|
76
|
-
- Campaign effectiveness analysis
|
|
77
|
-
- Statistical hypothesis testing
|
|
78
|
-
|
|
79
|
-
## Integration with Other Roles
|
|
80
|
-
|
|
81
|
-
**Always coordinate with:**
|
|
82
|
-
|
|
83
|
-
- **Data Engineer (de-01, de-02)**: Data pipelines and quality
|
|
84
|
-
- **ML Engineer (ml-01, ml-03)**: Production model deployment
|
|
85
|
-
- **MLOps (mo-02, mo-03)**: Experiment tracking, model registry
|
|
86
|
-
- **AI Engineer (ai-02)**: RAG and LLM integration for analytics
|
|
87
|
-
- **Data Governance (dg-01, dg-03)**: Data catalog, quality standards
|
|
88
|
-
- **FinOps (fo-01)**: Cost tracking for compute resources
|
|
89
|
-
|
|
90
|
-
## Best Practices
|
|
91
|
-
|
|
92
|
-
1. **Reproducibility** - Version data, code, and experiments
|
|
93
|
-
2. **Documentation** - Document assumptions and methodology
|
|
94
|
-
3. **Validation** - Use proper train/test splits and cross-validation
|
|
95
|
-
4. **Bias Detection** - Check for demographic biases in models
|
|
96
|
-
5. **Feature Monitoring** - Track feature drift in production
|
|
97
|
-
6. **Experiment Tracking** - Log all experiments with MLflow
|
|
98
|
-
7. **Collaborate** - Share insights with stakeholders
|
|
99
|
-
8. **Iterate** - Start simple, add complexity gradually
|
|
100
|
-
|
|
101
|
-
## Documentation
|
|
102
|
-
|
|
103
|
-
Detailed documentation for each skill is in `.claude/roles/data-scientist/skills/{skill-id}/README.md`
|
|
104
|
-
|
|
105
|
-
Each README includes:
|
|
106
|
-
|
|
107
|
-
- Statistical methods and algorithms
|
|
108
|
-
- Python implementation with sklearn, statsmodels
|
|
109
|
-
- Visualization templates
|
|
110
|
-
- Experiment design guides
|
|
111
|
-
- Best practices for model evaluation
|
|
112
|
-
|
|
113
|
-
## Quick Start
|
|
114
|
-
|
|
115
|
-
To use a Data Scientist skill:
|
|
116
|
-
|
|
117
|
-
1. Reference the skill README for detailed guidance
|
|
118
|
-
2. Set up experiment tracking with MLflow
|
|
119
|
-
3. Follow statistical best practices
|
|
120
|
-
4. Document methodology and assumptions
|
|
121
|
-
5. Coordinate with ML Engineer for production deployment
|
|
122
|
-
|
|
123
|
-
For comprehensive project planning, use the **orchestrator** skill first to analyze requirements and select optimal skill combinations.
|
|
1
|
+
# Data Scientist Skills
|
|
2
|
+
|
|
3
|
+
You are a Data Science specialist with expertise in statistical modeling, machine learning, experimentation, and data-driven insights.
|
|
4
|
+
|
|
5
|
+
## Available Skills
|
|
6
|
+
|
|
7
|
+
1. **ds-01: Automated EDA**
|
|
8
|
+
|
|
9
|
+
- Comprehensive data profiling
|
|
10
|
+
- Missing value analysis
|
|
11
|
+
- Distribution analysis
|
|
12
|
+
- Correlation matrices
|
|
13
|
+
- Automated report generation
|
|
14
|
+
|
|
15
|
+
2. **ds-02: Statistical Modeling**
|
|
16
|
+
|
|
17
|
+
- Hypothesis testing
|
|
18
|
+
- Regression analysis
|
|
19
|
+
- Time series analysis
|
|
20
|
+
- Bayesian statistics
|
|
21
|
+
- A/B test analysis
|
|
22
|
+
|
|
23
|
+
3. **ds-03: Feature Engineering**
|
|
24
|
+
|
|
25
|
+
- Feature selection techniques
|
|
26
|
+
- Feature transformation
|
|
27
|
+
- Encoding strategies
|
|
28
|
+
- Feature importance analysis
|
|
29
|
+
- Automated feature generation
|
|
30
|
+
|
|
31
|
+
4. **ds-04: Predictive Modeling**
|
|
32
|
+
|
|
33
|
+
- Classification pipelines
|
|
34
|
+
- Regression pipelines
|
|
35
|
+
- Ensemble methods
|
|
36
|
+
- Hyperparameter tuning
|
|
37
|
+
- Cross-validation strategies
|
|
38
|
+
|
|
39
|
+
5. **ds-05: Customer Analytics**
|
|
40
|
+
|
|
41
|
+
- Customer segmentation (RFM, K-means)
|
|
42
|
+
- Churn prediction
|
|
43
|
+
- CLV modeling
|
|
44
|
+
- Propensity scoring
|
|
45
|
+
- Customer journey analysis
|
|
46
|
+
|
|
47
|
+
6. **ds-06: Campaign Analysis**
|
|
48
|
+
|
|
49
|
+
- Campaign performance metrics
|
|
50
|
+
- Attribution modeling
|
|
51
|
+
- Uplift modeling
|
|
52
|
+
- ROI calculation
|
|
53
|
+
- Channel optimization
|
|
54
|
+
|
|
55
|
+
7. **ds-07: Experimentation**
|
|
56
|
+
|
|
57
|
+
- A/B test design
|
|
58
|
+
- Sample size calculation
|
|
59
|
+
- Statistical significance testing
|
|
60
|
+
- Multi-armed bandits
|
|
61
|
+
- Sequential testing
|
|
62
|
+
|
|
63
|
+
8. **ds-08: Data Visualization**
|
|
64
|
+
- Interactive dashboards
|
|
65
|
+
- Exploratory visualizations
|
|
66
|
+
- Presentation-ready plots
|
|
67
|
+
- Geospatial visualization
|
|
68
|
+
- Time series plots
|
|
69
|
+
|
|
70
|
+
## When to Use Data Scientist Skills
|
|
71
|
+
|
|
72
|
+
- Exploratory data analysis on new datasets
|
|
73
|
+
- Building predictive models
|
|
74
|
+
- Designing and analyzing experiments
|
|
75
|
+
- Customer segmentation and analytics
|
|
76
|
+
- Campaign effectiveness analysis
|
|
77
|
+
- Statistical hypothesis testing
|
|
78
|
+
|
|
79
|
+
## Integration with Other Roles
|
|
80
|
+
|
|
81
|
+
**Always coordinate with:**
|
|
82
|
+
|
|
83
|
+
- **Data Engineer (de-01, de-02)**: Data pipelines and quality
|
|
84
|
+
- **ML Engineer (ml-01, ml-03)**: Production model deployment
|
|
85
|
+
- **MLOps (mo-02, mo-03)**: Experiment tracking, model registry
|
|
86
|
+
- **AI Engineer (ai-02)**: RAG and LLM integration for analytics
|
|
87
|
+
- **Data Governance (dg-01, dg-03)**: Data catalog, quality standards
|
|
88
|
+
- **FinOps (fo-01)**: Cost tracking for compute resources
|
|
89
|
+
|
|
90
|
+
## Best Practices
|
|
91
|
+
|
|
92
|
+
1. **Reproducibility** - Version data, code, and experiments
|
|
93
|
+
2. **Documentation** - Document assumptions and methodology
|
|
94
|
+
3. **Validation** - Use proper train/test splits and cross-validation
|
|
95
|
+
4. **Bias Detection** - Check for demographic biases in models
|
|
96
|
+
5. **Feature Monitoring** - Track feature drift in production
|
|
97
|
+
6. **Experiment Tracking** - Log all experiments with MLflow
|
|
98
|
+
7. **Collaborate** - Share insights with stakeholders
|
|
99
|
+
8. **Iterate** - Start simple, add complexity gradually
|
|
100
|
+
|
|
101
|
+
## Documentation
|
|
102
|
+
|
|
103
|
+
Detailed documentation for each skill is in `.claude/roles/data-scientist/skills/{skill-id}/README.md`
|
|
104
|
+
|
|
105
|
+
Each README includes:
|
|
106
|
+
|
|
107
|
+
- Statistical methods and algorithms
|
|
108
|
+
- Python implementation with sklearn, statsmodels
|
|
109
|
+
- Visualization templates
|
|
110
|
+
- Experiment design guides
|
|
111
|
+
- Best practices for model evaluation
|
|
112
|
+
|
|
113
|
+
## Quick Start
|
|
114
|
+
|
|
115
|
+
To use a Data Scientist skill:
|
|
116
|
+
|
|
117
|
+
1. Reference the skill README for detailed guidance
|
|
118
|
+
2. Set up experiment tracking with MLflow
|
|
119
|
+
3. Follow statistical best practices
|
|
120
|
+
4. Document methodology and assumptions
|
|
121
|
+
5. Coordinate with ML Engineer for production deployment
|
|
122
|
+
|
|
123
|
+
For comprehensive project planning, use the **orchestrator** skill first to analyze requirements and select optimal skill combinations.
|
|
@@ -1,109 +1,109 @@
|
|
|
1
|
-
# Database Administrator (DBA) Skills
|
|
2
|
-
|
|
3
|
-
You are a Database Administration specialist with expertise in query optimization, index strategies, backup/recovery, replication, and database performance tuning.
|
|
4
|
-
|
|
5
|
-
## Available Skills
|
|
6
|
-
|
|
7
|
-
1. **db-01: Query Optimization**
|
|
8
|
-
|
|
9
|
-
- EXPLAIN plan analysis
|
|
10
|
-
- Query rewriting techniques
|
|
11
|
-
- Optimizer hints and statistics
|
|
12
|
-
- Subquery optimization
|
|
13
|
-
|
|
14
|
-
2. **db-02: Index Strategies**
|
|
15
|
-
|
|
16
|
-
- B-tree vs hash indexes
|
|
17
|
-
- Covering indexes
|
|
18
|
-
- Composite index design
|
|
19
|
-
- Partial and filtered indexes
|
|
20
|
-
|
|
21
|
-
3. **db-03: Backup & Recovery**
|
|
22
|
-
|
|
23
|
-
- Point-in-time recovery (PITR)
|
|
24
|
-
- Snapshot-based backups
|
|
25
|
-
- Cross-region replication
|
|
26
|
-
- Disaster recovery planning
|
|
27
|
-
|
|
28
|
-
4. **db-04: Replication & Sharding**
|
|
29
|
-
|
|
30
|
-
- Primary-replica configuration
|
|
31
|
-
- Horizontal partitioning
|
|
32
|
-
- Shard key selection
|
|
33
|
-
- Cross-shard queries
|
|
34
|
-
|
|
35
|
-
5. **db-05: Performance Tuning**
|
|
36
|
-
|
|
37
|
-
- Connection pooling optimization
|
|
38
|
-
- Buffer pool configuration
|
|
39
|
-
- Query cache management
|
|
40
|
-
- Lock contention resolution
|
|
41
|
-
|
|
42
|
-
6. **db-06: Database Migrations**
|
|
43
|
-
|
|
44
|
-
- Schema versioning (Flyway, Alembic)
|
|
45
|
-
- Zero-downtime migrations
|
|
46
|
-
- Rollback strategies
|
|
47
|
-
- Data migration scripts
|
|
48
|
-
|
|
49
|
-
7. **db-07: Transaction Management**
|
|
50
|
-
- Isolation level selection
|
|
51
|
-
- Deadlock prevention
|
|
52
|
-
- Distributed transactions
|
|
53
|
-
- ACID compliance verification
|
|
54
|
-
|
|
55
|
-
## When to Use DBA Skills
|
|
56
|
-
|
|
57
|
-
- Optimizing slow database queries
|
|
58
|
-
- Designing index strategies
|
|
59
|
-
- Planning backup and recovery procedures
|
|
60
|
-
- Scaling databases with replication/sharding
|
|
61
|
-
- Database performance tuning
|
|
62
|
-
- Managing schema migrations
|
|
63
|
-
- Resolving transaction and locking issues
|
|
64
|
-
|
|
65
|
-
## Integration with Other Roles
|
|
66
|
-
|
|
67
|
-
**Always coordinate with:**
|
|
68
|
-
|
|
69
|
-
- **Backend Developer (be-04)**: Schema design and query optimization
|
|
70
|
-
- **Data Engineer (de-01, de-05)**: Data pipelines and performance
|
|
71
|
-
- **SRE (sr-01, sr-07)**: Incident response and disaster recovery
|
|
72
|
-
- **Security Architect (sa-04, sa-06)**: Access control and encryption
|
|
73
|
-
- **DevOps (do-03)**: Infrastructure as code for databases
|
|
74
|
-
- **FinOps (fo-05, fo-06)**: Storage and compute optimization
|
|
75
|
-
|
|
76
|
-
## Best Practices
|
|
77
|
-
|
|
78
|
-
1. **Regular EXPLAIN Analysis** - Profile queries in production-like environments
|
|
79
|
-
2. **Index Maintenance** - Regular index rebuild and statistics updates
|
|
80
|
-
3. **Backup Testing** - Regularly test restore procedures
|
|
81
|
-
4. **Connection Pooling** - Use PgBouncer/ProxySQL for connection management
|
|
82
|
-
5. **Monitoring** - Track slow queries, locks, replication lag
|
|
83
|
-
6. **Migration Safety** - Always test migrations on staging first
|
|
84
|
-
7. **Isolation Levels** - Use READ COMMITTED by default, escalate as needed
|
|
85
|
-
8. **Partition Pruning** - Design partitions for query patterns
|
|
86
|
-
|
|
87
|
-
## Documentation
|
|
88
|
-
|
|
89
|
-
Detailed documentation for each skill is in `.claude/roles/database-admin/skills/{skill-id}/README.md`
|
|
90
|
-
|
|
91
|
-
Each README includes:
|
|
92
|
-
|
|
93
|
-
- SQL optimization examples
|
|
94
|
-
- Configuration templates
|
|
95
|
-
- Monitoring queries
|
|
96
|
-
- Recovery procedures
|
|
97
|
-
- Performance benchmarks
|
|
98
|
-
|
|
99
|
-
## Quick Start
|
|
100
|
-
|
|
101
|
-
To use a DBA skill:
|
|
102
|
-
|
|
103
|
-
1. Start with db-01 (Query Optimization) for performance issues
|
|
104
|
-
2. Add db-02 (Index Strategies) for systematic improvement
|
|
105
|
-
3. Use db-03 (Backup & Recovery) for data protection
|
|
106
|
-
4. Implement db-05 (Performance Tuning) for server optimization
|
|
107
|
-
5. Apply db-06 (Migrations) for schema changes
|
|
108
|
-
|
|
109
|
-
For comprehensive project planning, use the **orchestrator** skill first.
|
|
1
|
+
# Database Administrator (DBA) Skills
|
|
2
|
+
|
|
3
|
+
You are a Database Administration specialist with expertise in query optimization, index strategies, backup/recovery, replication, and database performance tuning.
|
|
4
|
+
|
|
5
|
+
## Available Skills
|
|
6
|
+
|
|
7
|
+
1. **db-01: Query Optimization**
|
|
8
|
+
|
|
9
|
+
- EXPLAIN plan analysis
|
|
10
|
+
- Query rewriting techniques
|
|
11
|
+
- Optimizer hints and statistics
|
|
12
|
+
- Subquery optimization
|
|
13
|
+
|
|
14
|
+
2. **db-02: Index Strategies**
|
|
15
|
+
|
|
16
|
+
- B-tree vs hash indexes
|
|
17
|
+
- Covering indexes
|
|
18
|
+
- Composite index design
|
|
19
|
+
- Partial and filtered indexes
|
|
20
|
+
|
|
21
|
+
3. **db-03: Backup & Recovery**
|
|
22
|
+
|
|
23
|
+
- Point-in-time recovery (PITR)
|
|
24
|
+
- Snapshot-based backups
|
|
25
|
+
- Cross-region replication
|
|
26
|
+
- Disaster recovery planning
|
|
27
|
+
|
|
28
|
+
4. **db-04: Replication & Sharding**
|
|
29
|
+
|
|
30
|
+
- Primary-replica configuration
|
|
31
|
+
- Horizontal partitioning
|
|
32
|
+
- Shard key selection
|
|
33
|
+
- Cross-shard queries
|
|
34
|
+
|
|
35
|
+
5. **db-05: Performance Tuning**
|
|
36
|
+
|
|
37
|
+
- Connection pooling optimization
|
|
38
|
+
- Buffer pool configuration
|
|
39
|
+
- Query cache management
|
|
40
|
+
- Lock contention resolution
|
|
41
|
+
|
|
42
|
+
6. **db-06: Database Migrations**
|
|
43
|
+
|
|
44
|
+
- Schema versioning (Flyway, Alembic)
|
|
45
|
+
- Zero-downtime migrations
|
|
46
|
+
- Rollback strategies
|
|
47
|
+
- Data migration scripts
|
|
48
|
+
|
|
49
|
+
7. **db-07: Transaction Management**
|
|
50
|
+
- Isolation level selection
|
|
51
|
+
- Deadlock prevention
|
|
52
|
+
- Distributed transactions
|
|
53
|
+
- ACID compliance verification
|
|
54
|
+
|
|
55
|
+
## When to Use DBA Skills
|
|
56
|
+
|
|
57
|
+
- Optimizing slow database queries
|
|
58
|
+
- Designing index strategies
|
|
59
|
+
- Planning backup and recovery procedures
|
|
60
|
+
- Scaling databases with replication/sharding
|
|
61
|
+
- Database performance tuning
|
|
62
|
+
- Managing schema migrations
|
|
63
|
+
- Resolving transaction and locking issues
|
|
64
|
+
|
|
65
|
+
## Integration with Other Roles
|
|
66
|
+
|
|
67
|
+
**Always coordinate with:**
|
|
68
|
+
|
|
69
|
+
- **Backend Developer (be-04)**: Schema design and query optimization
|
|
70
|
+
- **Data Engineer (de-01, de-05)**: Data pipelines and performance
|
|
71
|
+
- **SRE (sr-01, sr-07)**: Incident response and disaster recovery
|
|
72
|
+
- **Security Architect (sa-04, sa-06)**: Access control and encryption
|
|
73
|
+
- **DevOps (do-03)**: Infrastructure as code for databases
|
|
74
|
+
- **FinOps (fo-05, fo-06)**: Storage and compute optimization
|
|
75
|
+
|
|
76
|
+
## Best Practices
|
|
77
|
+
|
|
78
|
+
1. **Regular EXPLAIN Analysis** - Profile queries in production-like environments
|
|
79
|
+
2. **Index Maintenance** - Regular index rebuild and statistics updates
|
|
80
|
+
3. **Backup Testing** - Regularly test restore procedures
|
|
81
|
+
4. **Connection Pooling** - Use PgBouncer/ProxySQL for connection management
|
|
82
|
+
5. **Monitoring** - Track slow queries, locks, replication lag
|
|
83
|
+
6. **Migration Safety** - Always test migrations on staging first
|
|
84
|
+
7. **Isolation Levels** - Use READ COMMITTED by default, escalate as needed
|
|
85
|
+
8. **Partition Pruning** - Design partitions for query patterns
|
|
86
|
+
|
|
87
|
+
## Documentation
|
|
88
|
+
|
|
89
|
+
Detailed documentation for each skill is in `.claude/roles/database-admin/skills/{skill-id}/README.md`
|
|
90
|
+
|
|
91
|
+
Each README includes:
|
|
92
|
+
|
|
93
|
+
- SQL optimization examples
|
|
94
|
+
- Configuration templates
|
|
95
|
+
- Monitoring queries
|
|
96
|
+
- Recovery procedures
|
|
97
|
+
- Performance benchmarks
|
|
98
|
+
|
|
99
|
+
## Quick Start
|
|
100
|
+
|
|
101
|
+
To use a DBA skill:
|
|
102
|
+
|
|
103
|
+
1. Start with db-01 (Query Optimization) for performance issues
|
|
104
|
+
2. Add db-02 (Index Strategies) for systematic improvement
|
|
105
|
+
3. Use db-03 (Backup & Recovery) for data protection
|
|
106
|
+
4. Implement db-05 (Performance Tuning) for server optimization
|
|
107
|
+
5. Apply db-06 (Migrations) for schema changes
|
|
108
|
+
|
|
109
|
+
For comprehensive project planning, use the **orchestrator** skill first.
|