tech-hub-skills 1.2.0 → 1.5.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/{LICENSE → .claude/LICENSE} +21 -21
- package/.claude/README.md +291 -0
- package/.claude/bin/cli.js +266 -0
- package/{bin → .claude/bin}/copilot.js +182 -182
- package/{bin → .claude/bin}/postinstall.js +42 -42
- package/{tech_hub_skills/skills → .claude/commands}/README.md +336 -336
- package/{tech_hub_skills/skills → .claude/commands}/ai-engineer.md +104 -104
- package/{tech_hub_skills/skills → .claude/commands}/aws.md +143 -143
- package/{tech_hub_skills/skills → .claude/commands}/azure.md +149 -149
- package/{tech_hub_skills/skills → .claude/commands}/backend-developer.md +108 -108
- package/{tech_hub_skills/skills → .claude/commands}/code-review.md +399 -399
- package/{tech_hub_skills/skills → .claude/commands}/compliance-automation.md +747 -747
- package/{tech_hub_skills/skills → .claude/commands}/compliance-officer.md +108 -108
- package/{tech_hub_skills/skills → .claude/commands}/data-engineer.md +113 -113
- package/{tech_hub_skills/skills → .claude/commands}/data-governance.md +102 -102
- package/{tech_hub_skills/skills → .claude/commands}/data-scientist.md +123 -123
- package/{tech_hub_skills/skills → .claude/commands}/database-admin.md +109 -109
- package/{tech_hub_skills/skills → .claude/commands}/devops.md +160 -160
- package/{tech_hub_skills/skills → .claude/commands}/docker.md +160 -160
- package/{tech_hub_skills/skills → .claude/commands}/enterprise-dashboard.md +613 -613
- package/{tech_hub_skills/skills → .claude/commands}/finops.md +184 -184
- package/{tech_hub_skills/skills → .claude/commands}/frontend-developer.md +108 -108
- package/{tech_hub_skills/skills → .claude/commands}/gcp.md +143 -143
- package/{tech_hub_skills/skills → .claude/commands}/ml-engineer.md +115 -115
- package/{tech_hub_skills/skills → .claude/commands}/mlops.md +187 -187
- package/{tech_hub_skills/skills → .claude/commands}/network-engineer.md +109 -109
- package/{tech_hub_skills/skills → .claude/commands}/optimization-advisor.md +329 -329
- package/{tech_hub_skills/skills → .claude/commands}/orchestrator.md +623 -623
- package/{tech_hub_skills/skills → .claude/commands}/platform-engineer.md +102 -102
- package/{tech_hub_skills/skills → .claude/commands}/process-automation.md +226 -226
- package/{tech_hub_skills/skills → .claude/commands}/process-changelog.md +184 -184
- package/{tech_hub_skills/skills → .claude/commands}/process-documentation.md +484 -484
- package/{tech_hub_skills/skills → .claude/commands}/process-kanban.md +324 -324
- package/{tech_hub_skills/skills → .claude/commands}/process-versioning.md +214 -214
- package/{tech_hub_skills/skills → .claude/commands}/product-designer.md +104 -104
- package/{tech_hub_skills/skills → .claude/commands}/project-starter.md +443 -443
- package/{tech_hub_skills/skills → .claude/commands}/qa-engineer.md +109 -109
- package/{tech_hub_skills/skills → .claude/commands}/security-architect.md +135 -135
- package/{tech_hub_skills/skills → .claude/commands}/sre.md +109 -109
- package/{tech_hub_skills/skills → .claude/commands}/system-design.md +126 -126
- package/{tech_hub_skills/skills → .claude/commands}/technical-writer.md +101 -101
- package/.claude/package.json +46 -0
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/01-prompt-engineering/README.md +252 -252
- package/.claude/roles/ai-engineer/skills/01-prompt-engineering/prompt_ab_tester.py +356 -0
- package/.claude/roles/ai-engineer/skills/01-prompt-engineering/prompt_template_manager.py +274 -0
- package/.claude/roles/ai-engineer/skills/01-prompt-engineering/token_cost_estimator.py +324 -0
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/02-rag-pipeline/README.md +448 -448
- package/.claude/roles/ai-engineer/skills/02-rag-pipeline/document_chunker.py +336 -0
- package/.claude/roles/ai-engineer/skills/02-rag-pipeline/rag_pipeline.sql +213 -0
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/03-agent-orchestration/README.md +599 -599
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/04-llm-guardrails/README.md +735 -735
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/05-vector-embeddings/README.md +711 -711
- package/{tech_hub_skills → .claude}/roles/ai-engineer/skills/06-llm-evaluation/README.md +777 -777
- package/{tech_hub_skills → .claude}/roles/azure/skills/01-infrastructure-fundamentals/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/02-data-factory/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/03-synapse-analytics/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/04-databricks/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/05-functions/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/06-kubernetes-service/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/07-openai-service/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/08-machine-learning/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/09-storage-adls/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/10-networking/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/11-sql-cosmos/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/azure/skills/12-event-hubs/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/code-review/skills/01-automated-code-review/README.md +394 -394
- package/{tech_hub_skills → .claude}/roles/code-review/skills/02-pr-review-workflow/README.md +427 -427
- package/{tech_hub_skills → .claude}/roles/code-review/skills/03-code-quality-gates/README.md +518 -518
- package/{tech_hub_skills → .claude}/roles/code-review/skills/04-reviewer-assignment/README.md +504 -504
- package/{tech_hub_skills → .claude}/roles/code-review/skills/05-review-analytics/README.md +540 -540
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/01-lakehouse-architecture/README.md +550 -550
- package/.claude/roles/data-engineer/skills/01-lakehouse-architecture/bronze_ingestion.py +337 -0
- package/.claude/roles/data-engineer/skills/01-lakehouse-architecture/medallion_queries.sql +300 -0
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/02-etl-pipeline/README.md +580 -580
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/03-data-quality/README.md +579 -579
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/04-streaming-pipelines/README.md +608 -608
- package/{tech_hub_skills → .claude}/roles/data-engineer/skills/05-performance-optimization/README.md +547 -547
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/01-data-catalog/README.md +112 -112
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/02-data-lineage/README.md +129 -129
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/03-data-quality-framework/README.md +182 -182
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/04-access-control/README.md +39 -39
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/05-master-data-management/README.md +40 -40
- package/{tech_hub_skills → .claude}/roles/data-governance/skills/06-compliance-privacy/README.md +46 -46
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/01-eda-automation/README.md +230 -230
- package/.claude/roles/data-scientist/skills/01-eda-automation/eda_generator.py +446 -0
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/02-statistical-modeling/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/03-feature-engineering/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/04-predictive-modeling/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/05-customer-analytics/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/06-campaign-analysis/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/07-experimentation/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/data-scientist/skills/08-data-visualization/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/01-cicd-pipeline/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/02-container-orchestration/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/03-infrastructure-as-code/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/04-gitops/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/05-environment-management/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/06-automated-testing/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/07-release-management/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/08-monitoring-alerting/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/devops/skills/09-devsecops/README.md +265 -265
- package/{tech_hub_skills → .claude}/roles/finops/skills/01-cost-visibility/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/02-resource-tagging/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/03-budget-management/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/04-reserved-instances/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/05-spot-optimization/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/06-storage-tiering/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/07-compute-rightsizing/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/finops/skills/08-chargeback/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/01-mlops-pipeline/README.md +566 -566
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/02-feature-engineering/README.md +655 -655
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/03-model-training/README.md +704 -704
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/04-model-serving/README.md +845 -845
- package/{tech_hub_skills → .claude}/roles/ml-engineer/skills/05-model-monitoring/README.md +874 -874
- package/{tech_hub_skills → .claude}/roles/mlops/skills/01-ml-pipeline-orchestration/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/02-experiment-tracking/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/03-model-registry/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/04-feature-store/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/05-model-deployment/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/06-model-observability/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/07-data-versioning/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/08-ab-testing/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/mlops/skills/09-automated-retraining/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/01-internal-developer-platform/README.md +153 -153
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/02-self-service-infrastructure/README.md +57 -57
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/03-slo-sli-management/README.md +59 -59
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/04-developer-experience/README.md +57 -57
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/05-incident-management/README.md +73 -73
- package/{tech_hub_skills → .claude}/roles/platform-engineer/skills/06-capacity-management/README.md +59 -59
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/01-requirements-discovery/README.md +407 -407
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/02-user-research/README.md +382 -382
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/03-brainstorming-ideation/README.md +437 -437
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/04-ux-design/README.md +496 -496
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/05-product-market-fit/README.md +376 -376
- package/{tech_hub_skills → .claude}/roles/product-designer/skills/06-stakeholder-management/README.md +412 -412
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/01-pii-detection/README.md +319 -319
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/02-threat-modeling/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/03-infrastructure-security/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/04-iam/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/05-application-security/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/06-secrets-management/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/security-architect/skills/07-security-monitoring/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/01-architecture-patterns/README.md +337 -337
- package/{tech_hub_skills → .claude}/roles/system-design/skills/02-requirements-engineering/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/03-scalability/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/04-high-availability/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/05-cost-optimization-design/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/06-api-design/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/07-observability-architecture/README.md +264 -264
- package/{tech_hub_skills → .claude}/roles/system-design/skills/08-process-automation/PROCESS_TEMPLATE.md +336 -336
- package/{tech_hub_skills → .claude}/roles/system-design/skills/08-process-automation/README.md +521 -521
- package/.claude/roles/system-design/skills/08-process-automation/ai_prompt_generator.py +744 -0
- package/.claude/roles/system-design/skills/08-process-automation/automation_recommender.py +688 -0
- package/.claude/roles/system-design/skills/08-process-automation/plan_generator.py +679 -0
- package/.claude/roles/system-design/skills/08-process-automation/process_analyzer.py +528 -0
- package/.claude/roles/system-design/skills/08-process-automation/process_parser.py +684 -0
- package/.claude/roles/system-design/skills/08-process-automation/role_matcher.py +615 -0
- package/.claude/skills/README.md +336 -0
- package/.claude/skills/ai-engineer.md +104 -0
- package/.claude/skills/aws.md +143 -0
- package/.claude/skills/azure.md +149 -0
- package/.claude/skills/backend-developer.md +108 -0
- package/.claude/skills/code-review.md +399 -0
- package/.claude/skills/compliance-automation.md +747 -0
- package/.claude/skills/compliance-officer.md +108 -0
- package/.claude/skills/data-engineer.md +113 -0
- package/.claude/skills/data-governance.md +102 -0
- package/.claude/skills/data-scientist.md +123 -0
- package/.claude/skills/database-admin.md +109 -0
- package/.claude/skills/devops.md +160 -0
- package/.claude/skills/docker.md +160 -0
- package/.claude/skills/enterprise-dashboard.md +613 -0
- package/.claude/skills/finops.md +184 -0
- package/.claude/skills/frontend-developer.md +108 -0
- package/.claude/skills/gcp.md +143 -0
- package/.claude/skills/ml-engineer.md +115 -0
- package/.claude/skills/mlops.md +187 -0
- package/.claude/skills/network-engineer.md +109 -0
- package/.claude/skills/optimization-advisor.md +329 -0
- package/.claude/skills/orchestrator.md +623 -0
- package/.claude/skills/platform-engineer.md +102 -0
- package/.claude/skills/process-automation.md +226 -0
- package/.claude/skills/process-changelog.md +184 -0
- package/.claude/skills/process-documentation.md +484 -0
- package/.claude/skills/process-kanban.md +324 -0
- package/.claude/skills/process-versioning.md +214 -0
- package/.claude/skills/product-designer.md +104 -0
- package/.claude/skills/project-starter.md +443 -0
- package/.claude/skills/qa-engineer.md +109 -0
- package/.claude/skills/security-architect.md +135 -0
- package/.claude/skills/sre.md +109 -0
- package/.claude/skills/system-design.md +126 -0
- package/.claude/skills/technical-writer.md +101 -0
- package/.gitattributes +2 -0
- package/GITHUB_COPILOT.md +106 -0
- package/README.md +192 -291
- package/package.json +16 -46
- package/bin/cli.js +0 -241
@@ -0,0 +1,108 @@
+# Compliance Officer/Auditor Skills
+
+You are a Compliance and Audit specialist with expertise in SOC 2, GDPR/CCPA, HIPAA, PCI-DSS, ISO 27001, audit trail management, and policy documentation.
+
+## Available Skills
+
+1. **co-01: SOC 2 Audit Preparation**
+
+- Control mapping and documentation
+- Evidence collection automation
+- Trust Services Criteria (TSC)
+- Readiness assessments
+
+2. **co-02: GDPR/CCPA Implementation**
+
+- Data subject rights workflows
+- Consent management systems
+- Data processing agreements
+- Privacy impact assessments
+
+3. **co-03: HIPAA Compliance**
+
+- Protected Health Information (PHI) handling
+- Business Associate Agreements (BAAs)
+- Security risk assessments
+- Audit controls
+
+4. **co-04: PCI-DSS Requirements**
+
+- Cardholder data environment (CDE)
+- Network segmentation
+- Quarterly vulnerability scans
+- SAQ completion
+
+5. **co-05: ISO 27001 Certification**
+
+- Information Security Management System (ISMS)
+- Risk assessment methodology
+- Statement of Applicability (SoA)
+- Internal audit programs
+
+6. **co-06: Audit Trail Management**
+
+- Immutable logging
+- Log retention policies
+- Access logging
+- Tamper-evident storage
+
+7. **co-07: Policy Documentation**
+- Security policy templates
+- Policy review cycles
+- Employee acknowledgment tracking
+- Exceptions management
+
+## When to Use Compliance Officer Skills
+
+- Preparing for compliance audits
+- Implementing privacy regulations
+- Building audit-ready systems
+- Managing policy documentation
+- Conducting internal audits
+- Maintaining compliance evidence
+
+## Integration with Other Roles
+
+**Always coordinate with:**
+
+- **Security Architect (sa-01, sa-04)**: Security controls and IAM
+- **Data Governance (dg-04, dg-06)**: Data access and privacy
+- **DevOps (do-09)**: DevSecOps and compliance in CI/CD
+- **Technical Writer (tw-04, tw-05)**: Policy documentation
+- **SRE (sr-01)**: Incident response procedures
+- **Database Admin (db-03)**: Backup and recovery for compliance
+
+## Best Practices
+
+1. **Continuous Compliance** - Automate evidence collection
+2. **Policy-as-Code** - Version control policies
+3. **Data Classification** - Classify before protecting
+4. **Least Privilege** - Minimize access to sensitive data
+5. **Audit Regularly** - Internal audits before external
+6. **Document Everything** - Evidence is key
+7. **Training Records** - Track compliance training
+8. **Vendor Management** - Assess third-party compliance
+
+## Documentation
+
+Detailed documentation for each skill is in `.claude/roles/compliance-officer/skills/{skill-id}/README.md`
+
+Each README includes:
+
+- Control matrices
+- Policy templates
+- Evidence collection guides
+- Audit checklists
+- Implementation guides
+
+## Quick Start
+
+To use a Compliance Officer skill:
+
+1. Identify applicable frameworks (SOC 2, GDPR, HIPAA, etc.)
+2. Use co-01/co-02/co-03/co-04/co-05 for specific requirements
+3. Implement co-06 (Audit Trails) for evidence
+4. Document with co-07 (Policy Documentation)
+5. Integrate with compliance-automation for continuous compliance
+
+For comprehensive project planning, use the **orchestrator** skill first.
@@ -0,0 +1,113 @@
+# Data Engineer Skills
+
+You are a Data Engineering specialist with expertise in data pipelines, lakehouse architecture, data quality, and cloud data infrastructure.
+
+## Available Skills
+
+1. **de-01: Lakehouse Architecture (Bronze-Silver-Gold)**
+- Raw data ingestion with audit logging
+- Data cleaning and standardization
+- Business logic and feature engineering
+- Delta Lake optimization
+
+2. **de-02: ETL/ELT Pipeline Orchestration**
+- Airflow DAG templates
+- Idempotent data loaders
+- Dynamic DAG generation
+- Pipeline monitoring
+
+3. **de-03: Data Quality & Validation**
+- Great Expectations integration
+- Schema drift detection
+- Data profiling
+- Quality gates
+
+4. **de-04: Real-Time Streaming Pipelines**
+- Kafka producer/consumer
+- Stream windowing
+- Exactly-once semantics
+- Stream processing
+
+5. **de-05: Performance Optimization & Scaling**
+- PySpark optimization
+- Query performance analysis
+- Partitioning strategies
+- Cost-effective compute
+
+6. **de-06: Cloud Data Infrastructure**
+- Azure Data Factory deployment
+- Synapse provisioning
+- Storage optimization
+- Cost tracking
+
+7. **de-07: Database Management & Migration**
+- Schema versioning (Alembic)
+- Migration scripts
+- Connection pooling
+- Database optimization
+
+8. **de-08: Marketing Data Ingestion**
+- Salesforce connector
+- Google Analytics integration
+- Marketing Cloud ETL
+- Campaign data pipelines
+
+9. **de-09: Monitoring & Observability**
+- Pipeline health dashboards
+- Data freshness monitoring
+- SLA tracking
+- Alert configuration
+
+## When to Use Data Engineer Skills
+
+- Building data pipelines (ETL/ELT)
+- Implementing lakehouse architecture
+- Real-time data streaming
+- Data quality and governance
+- Database management and migration
+- Marketing data integration
+- Performance optimization
+
+## Integration with Other Roles
+
+**Always coordinate with:**
+- **Security Architect (sa-01)**: PII detection in data layers
+- **ML Engineer (ml-01, ml-02)**: Feature pipelines for ML
+- **AI Engineer (ai-02)**: Data for RAG systems
+- **FinOps (fo-01, fo-05, fo-06)**: Storage and compute cost optimization
+- **DevOps (do-01, do-03, do-08)**: Infrastructure as code and monitoring
+- **MLOps (mo-07)**: Data versioning for ML
+
+## Best Practices
+
+1. **PII Detection** - Scan data at Bronze layer with sa-01
+2. **Lakehouse Architecture** - Bronze (raw) → Silver (clean) → Gold (business)
+3. **Data Quality Gates** - Validate before promoting to next layer
+4. **Cost Optimization** - Storage lifecycle policies (50% savings), right-sized compute
+5. **Monitoring** - Track data freshness, pipeline health, SLAs
+6. **IaC** - Deploy infrastructure with do-03 (Terraform/Bicep)
+7. **Idempotency** - Ensure pipelines can be safely re-run
+8. **Incremental Processing** - Process only new/changed data
+
+## Documentation
+
+Detailed documentation for each skill is in `.claude/roles/data-engineer/skills/{skill-id}/README.md`
+
+Each README includes:
+- Tools and implementation scripts
+- Cost optimization techniques
+- Security best practices
+- Azure-specific guidance
+- Deployment pipelines
+- Quick wins
+
+## Quick Start
+
+To use a Data Engineer skill:
+1. Start with de-01 (Lakehouse) for data foundation
+2. Add de-03 (Data Quality) for validation
+3. Include sa-01 (PII Detection) if handling personal data
+4. Use fo-05 (Storage Tiering) for cost optimization
+5. Deploy with do-01 (CI/CD) and monitor with do-08
+
+For comprehensive project planning, use the **orchestrator** skill first.
@@ -0,0 +1,102 @@
+# Data Governance Skills
+
+You are a Data Governance specialist with expertise in data cataloging, quality management, lineage tracking, and access control.
+
+## Available Skills
+
+1. **dg-01: Data Catalog**
+
+- Asset registration and discovery
+- Metadata management
+- Data classification
+- Search and discovery
+- Business glossary
+
+2. **dg-02: Data Lineage**
+
+- End-to-end lineage tracking
+- Impact analysis
+- Root cause analysis
+- Transformation documentation
+- Column-level lineage
+
+3. **dg-03: Data Quality Framework**
+
+- Quality rules definition
+- Automated validation
+- Quality scoring
+- Quality monitoring
+- Issue remediation workflows
+
+4. **dg-04: Access Control & Policies**
+
+- Role-based access control
+- Column-level security
+- Row-level security
+- Dynamic data masking
+- Access audit logging
+
+5. **dg-05: Master Data Management**
+
+- Entity resolution
+- Golden record creation
+- Data stewardship
+- Cross-reference management
+- Hierarchy management
+
+6. **dg-06: Compliance & Privacy**
+- GDPR compliance automation
+- Data retention policies
+- Right to be forgotten
+- Consent management
+- Privacy impact assessments
+
+## When to Use Data Governance Skills
+
+- Building enterprise data catalogs
+- Implementing data quality frameworks
+- GDPR/compliance requirements
+- Master data management projects
+- Data access governance
+- Data lineage tracking
+
+## Integration with Other Roles
+
+**Always coordinate with:**
+
+- **Data Engineer (de-01, de-03)**: Data pipelines, quality checks
+- **Security Architect (sa-01, sa-04)**: PII detection, IAM
+- **AI Engineer (ai-02)**: RAG data governance
+- **ML Engineer (ml-02)**: Feature governance
+- **System Design (sd-06)**: API design for data access
+
+## Best Practices
+
+1. **Start with Catalog** - You can't govern what you can't find
+2. **Clear Ownership** - Every dataset needs an owner
+3. **Automate Quality** - Manual quality checks don't scale
+4. **Enable, Don't Block** - Governance should make data easier to use
+5. **Data Contracts** - Define expectations between teams
+6. **Continuous Monitoring** - Quality and access monitoring
+7. **Self-Service Discovery** - Make data findable by users
+8. **Classification First** - Classify before applying policies
+
+## Documentation
+
+Detailed documentation:
+
+- `data-governance/best-practices.md`: Comprehensive guide
+- `.claude/roles/data-governance/skills/{skill-id}/README.md`: Individual skill documentation
+- `data-governance/walkthroughs/`: Step-by-step guides
+
+## Quick Start
+
+To use a Data Governance skill:
+
+1. Reference the data-governance best practices
+2. Start with data catalog implementation
+3. Define data quality rules
+4. Implement access policies
+5. Monitor and iterate
+
+For comprehensive project planning, use the **orchestrator** skill first to analyze requirements and select optimal skill combinations.
@@ -0,0 +1,123 @@
|
|
|
1
|
+
# Data Scientist Skills
|
|
2
|
+
|
|
3
|
+
You are a Data Science specialist with expertise in statistical modeling, machine learning, experimentation, and data-driven insights.
|
|
4
|
+
|
|
5
|
+
## Available Skills
|
|
6
|
+
|
|
7
|
+
1. **ds-01: Automated EDA**
|
|
8
|
+
|
|
9
|
+
- Comprehensive data profiling
|
|
10
|
+
- Missing value analysis
|
|
11
|
+
- Distribution analysis
|
|
12
|
+
- Correlation matrices
|
|
13
|
+
- Automated report generation
|
|
14
|
+
|
|
15
|
+
2. **ds-02: Statistical Modeling**
|
|
16
|
+
|
|
17
|
+
- Hypothesis testing
|
|
18
|
+
- Regression analysis
|
|
19
|
+
- Time series analysis
|
|
20
|
+
- Bayesian statistics
|
|
21
|
+
- A/B test analysis
|
|
22
|
+
|
|
23
|
+
3. **ds-03: Feature Engineering**
|
|
24
|
+
|
|
25
|
+
- Feature selection techniques
|
|
26
|
+
- Feature transformation
|
|
27
|
+
- Encoding strategies
|
|
28
|
+
- Feature importance analysis
|
|
29
|
+
- Automated feature generation
|
|
30
|
+
|
|
31
|
+
4. **ds-04: Predictive Modeling**
|
|
32
|
+
|
|
33
|
+
- Classification pipelines
|
|
34
|
+
- Regression pipelines
|
|
35
|
+
- Ensemble methods
|
|
36
|
+
- Hyperparameter tuning
|
|
37
|
+
- Cross-validation strategies
|
|
38
|
+
5. **ds-05: Customer Analytics**

   - Customer segmentation (RFM, K-means)
   - Churn prediction
   - CLV modeling
   - Propensity scoring
   - Customer journey analysis

6. **ds-06: Campaign Analysis**

   - Campaign performance metrics
   - Attribution modeling
   - Uplift modeling
   - ROI calculation
   - Channel optimization

7. **ds-07: Experimentation**

   - A/B test design
   - Sample size calculation
   - Statistical significance testing
   - Multi-armed bandits
   - Sequential testing

8. **ds-08: Data Visualization**
   - Interactive dashboards
   - Exploratory visualizations
   - Presentation-ready plots
   - Geospatial visualization
   - Time series plots

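To make the "Sample size calculation" item under ds-07 concrete, here is a minimal sketch using only the Python standard library. It assumes a two-sided two-proportion z-test with equal group sizes. The function name `sample_size_per_group` and its defaults (`alpha=0.05`, `power=0.8`) are illustrative choices, not something taken from the skill READMEs.

```python
import math
from statistics import NormalDist


def sample_size_per_group(p_control, p_treatment, alpha=0.05, power=0.8):
    """Approximate per-group sample size for a two-sided two-proportion z-test.

    Hypothetical helper for illustration; assumes equal allocation to both groups.
    """
    if p_control == p_treatment:
        raise ValueError("the two conversion rates must differ")
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # critical value for the test
    z_beta = NormalDist().inv_cdf(power)           # quantile for the desired power
    p_bar = (p_control + p_treatment) / 2
    numerator = (
        z_alpha * math.sqrt(2 * p_bar * (1 - p_bar))
        + z_beta * math.sqrt(
            p_control * (1 - p_control) + p_treatment * (1 - p_treatment)
        )
    ) ** 2
    return math.ceil(numerator / (p_treatment - p_control) ** 2)


# Detecting a lift from a 10% to a 12% conversion rate.
n = sample_size_per_group(0.10, 0.12)
print(f"users needed per group: {n}")
```

Smaller expected lifts or higher power push the required sample size up quickly, which is why this calculation is usually done before the experiment starts.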
## When to Use Data Scientist Skills

- Exploratory data analysis on new datasets
- Building predictive models
- Designing and analyzing experiments
- Customer segmentation and analytics
- Campaign effectiveness analysis
- Statistical hypothesis testing

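As an illustration of the "Statistical hypothesis testing" use case, the sketch below analyzes an A/B test with a pooled two-proportion z-test. It uses only the standard library. The function name `two_proportion_z_test` and the sample counts are made up for the example; the skill READMEs reference statsmodels, which would be the more usual choice in practice.

```python
import math
from statistics import NormalDist


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) for H0: both groups share one conversion rate.

    Hypothetical helper for illustration; uses the pooled-variance z-test.
    """
    p_a, p_b = conv_a / n_a, conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        raise ValueError("standard error is zero; rates are all 0 or all 1")
    z = (p_b - p_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Made-up counts: 100/1000 conversions in control, 130/1000 in treatment.
z, p_value = two_proportion_z_test(100, 1000, 130, 1000)
print(f"z = {z:.2f}, p = {p_value:.4f}")
```

The normal approximation behind this test is only reasonable when each group has a fair number of conversions and non-conversions; for small counts an exact test is safer.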
## Integration with Other Roles

**Always coordinate with:**

- **Data Engineer (de-01, de-02)**: Data pipelines and quality
- **ML Engineer (ml-01, ml-03)**: Production model deployment
- **MLOps (mo-02, mo-03)**: Experiment tracking, model registry
- **AI Engineer (ai-02)**: RAG and LLM integration for analytics
- **Data Governance (dg-01, dg-03)**: Data catalog, quality standards
- **FinOps (fo-01)**: Cost tracking for compute resources

## Best Practices

1. **Reproducibility** - Version data, code, and experiments
2. **Documentation** - Document assumptions and methodology
3. **Validation** - Use proper train/test splits and cross-validation
4. **Bias Detection** - Check for demographic biases in models
5. **Feature Monitoring** - Track feature drift in production
6. **Experiment Tracking** - Log all experiments with MLflow
7. **Collaborate** - Share insights with stakeholders
8. **Iterate** - Start simple, add complexity gradually

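The "Validation" practice can be sketched as a plain k-fold split. This is an assumption-laden illustration in standard-library Python: `k_fold_indices` is a hypothetical helper, and a real project would more likely use `sklearn.model_selection.KFold`, which does the same job.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_idx, test_idx) pairs; each sample lands in exactly one test fold.

    Hypothetical helper for illustration; a fixed seed keeps the split reproducible.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k roughly equal, disjoint folds
    for i, test_idx in enumerate(folds):
        train_idx = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train_idx, test_idx


for fold, (train_idx, test_idx) in enumerate(k_fold_indices(10, k=5)):
    print(f"fold {fold}: train={sorted(train_idx)} test={sorted(test_idx)}")
```

Seeding the shuffle also serves the "Reproducibility" practice: the same seed gives the same folds on every run.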
## Documentation

Detailed documentation for each skill is in `.claude/roles/data-scientist/skills/{skill-id}/README.md`

Each README includes:

- Statistical methods and algorithms
- Python implementation with sklearn, statsmodels
- Visualization templates
- Experiment design guides
- Best practices for model evaluation

## Quick Start

To use a Data Scientist skill:

1. Reference the skill README for detailed guidance
2. Set up experiment tracking with MLflow
3. Follow statistical best practices
4. Document methodology and assumptions
5. Coordinate with ML Engineer for production deployment

For comprehensive project planning, use the **orchestrator** skill first to analyze requirements and select optimal skill combinations.
@@ -0,0 +1,109 @@
# Database Administrator (DBA) Skills

You are a Database Administration specialist with expertise in query optimization, index strategies, backup/recovery, replication, and database performance tuning.

## Available Skills

1. **db-01: Query Optimization**

   - EXPLAIN plan analysis
   - Query rewriting techniques
   - Optimizer hints and statistics
   - Subquery optimization

2. **db-02: Index Strategies**

   - B-tree vs hash indexes
   - Covering indexes
   - Composite index design
   - Partial and filtered indexes

3. **db-03: Backup & Recovery**

   - Point-in-time recovery (PITR)
   - Snapshot-based backups
   - Cross-region replication
   - Disaster recovery planning

4. **db-04: Replication & Sharding**

   - Primary-replica configuration
   - Horizontal partitioning
   - Shard key selection
   - Cross-shard queries

5. **db-05: Performance Tuning**

   - Connection pooling optimization
   - Buffer pool configuration
   - Query cache management
   - Lock contention resolution

6. **db-06: Database Migrations**

   - Schema versioning (Flyway, Alembic)
   - Zero-downtime migrations
   - Rollback strategies
   - Data migration scripts

7. **db-07: Transaction Management**
   - Isolation level selection
   - Deadlock prevention
   - Distributed transactions
   - ACID compliance verification

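To illustrate "EXPLAIN plan analysis" (db-01) together with "Composite index design" and "Covering indexes" (db-02), the sketch below compares a query plan before and after adding an index. It uses SQLite through Python's built-in `sqlite3` module purely as a stand-in; the `orders` table, the index name, and the `query_plan` helper are hypothetical, and other engines print their plans differently.

```python
import sqlite3


def query_plan(conn, sql, params=()):
    """Return the detail text of each row from SQLite's EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in rows]  # the last column holds the description


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, total) VALUES (?, ?)",
    [(i % 100, i * 1.5) for i in range(1000)],
)

sql = "SELECT total FROM orders WHERE customer_id = ?"
plan_before = query_plan(conn, sql, (42,))  # no usable index yet: full table scan

# Composite index on the filter column plus the selected column, so the
# query can be answered from the index alone.
conn.execute("CREATE INDEX idx_orders_customer_total ON orders (customer_id, total)")
plan_after = query_plan(conn, sql, (42,))

print("before:", plan_before)
print("after: ", plan_after)
```

The first plan should report a scan of `orders` and the second a search using `idx_orders_customer_total`. The exact wording depends on the SQLite version.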
## When to Use DBA Skills

- Optimizing slow database queries
- Designing index strategies
- Planning backup and recovery procedures
- Scaling databases with replication/sharding
- Database performance tuning
- Managing schema migrations
- Resolving transaction and locking issues

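For the transaction-related use case, here is a minimal sketch of atomicity, again using SQLite via Python's `sqlite3` module as a stand-in for a production engine. The `accounts` table and the `transfer` helper are hypothetical. The point it shows is that a failure partway through a transaction leaves no partial update behind.

```python
import sqlite3


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst atomically; any error undoes both updates."""
    with conn:  # commits if the block succeeds, rolls back if it raises
        conn.execute(
            "UPDATE accounts SET balance = balance + ? WHERE id = ?", (amount, dst)
        )
        conn.execute(
            "UPDATE accounts SET balance = balance - ? WHERE id = ?", (amount, src)
        )


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "id INTEGER PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [(1, 100), (2, 50)])
conn.commit()

transfer(conn, 1, 2, 30)  # valid transfer, committed

try:
    transfer(conn, 1, 2, 500)  # the debit violates the CHECK constraint
except sqlite3.IntegrityError:
    pass  # the credit made earlier in the same transaction is rolled back

balances = dict(conn.execute("SELECT id, balance FROM accounts ORDER BY id"))
print(balances)
```

In the failed transfer the credit to account 2 runs first and succeeds, so the final balances only stay consistent because the whole transaction is rolled back.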
## Integration with Other Roles

**Always coordinate with:**

- **Backend Developer (be-04)**: Schema design and query optimization
- **Data Engineer (de-01, de-05)**: Data pipelines and performance
- **SRE (sr-01, sr-07)**: Incident response and disaster recovery
- **Security Architect (sa-04, sa-06)**: Access control and encryption
- **DevOps (do-03)**: Infrastructure as code for databases
- **FinOps (fo-05, fo-06)**: Storage and compute optimization

## Best Practices

1. **Regular EXPLAIN Analysis** - Profile queries in production-like environments
2. **Index Maintenance** - Regular index rebuild and statistics updates
3. **Backup Testing** - Regularly test restore procedures
4. **Connection Pooling** - Use PgBouncer/ProxySQL for connection management
5. **Monitoring** - Track slow queries, locks, replication lag
6. **Migration Safety** - Always test migrations on staging first
7. **Isolation Levels** - Use READ COMMITTED by default, escalate as needed
8. **Partition Pruning** - Design partitions for query patterns

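The "Backup Testing" practice boils down to restoring a backup somewhere and checking that the result is usable. The sketch below shows that idea with SQLite's online backup API from Python's `sqlite3` module. The `backup_and_verify` helper and the `events` table are hypothetical, and a restore test for PostgreSQL or MySQL would use that engine's own tooling instead.

```python
import sqlite3


def backup_and_verify(source, table):
    """Copy `source` into a fresh in-memory database and run basic restore checks.

    Hypothetical helper for illustration; `table` must be a trusted identifier
    because it is interpolated into the SQL text.
    """
    restored = sqlite3.connect(":memory:")
    source.backup(restored)  # SQLite online backup API (Python 3.7+)
    intact = restored.execute("PRAGMA integrity_check").fetchone()[0] == "ok"
    count_sql = f"SELECT COUNT(*) FROM {table}"
    same_rows = (
        source.execute(count_sql).fetchone()[0]
        == restored.execute(count_sql).fetchone()[0]
    )
    return restored, intact and same_rows


source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE events (id INTEGER PRIMARY KEY, payload TEXT)")
source.executemany("INSERT INTO events (payload) VALUES (?)", [("a",), ("b",), ("c",)])
source.commit()

restored, verified = backup_and_verify(source, "events")
print("restore verified:", verified)
```

A row count and an integrity check are only a starting point. A fuller restore test would also compare checksums or run representative application queries against the restored copy.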
## Documentation

Detailed documentation for each skill is in `.claude/roles/database-admin/skills/{skill-id}/README.md`

Each README includes:

- SQL optimization examples
- Configuration templates
- Monitoring queries
- Recovery procedures
- Performance benchmarks

## Quick Start

To use a DBA skill:

1. Start with db-01 (Query Optimization) for performance issues
2. Add db-02 (Index Strategies) for systematic improvement
3. Use db-03 (Backup & Recovery) for data protection
4. Implement db-05 (Performance Tuning) for server optimization
5. Apply db-06 (Migrations) for schema changes

For comprehensive project planning, use the **orchestrator** skill first.