locus-product-planning 1.1.0 → 1.2.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude-plugin/marketplace.json +2 -2
- package/.claude-plugin/plugin.json +2 -2
- package/LICENSE +21 -21
- package/README.md +11 -7
- package/agents/engineering/architect-reviewer.md +122 -122
- package/agents/engineering/engineering-manager.md +101 -101
- package/agents/engineering/principal-engineer.md +98 -98
- package/agents/engineering/staff-engineer.md +86 -86
- package/agents/engineering/tech-lead.md +114 -114
- package/agents/executive/ceo-strategist.md +81 -81
- package/agents/executive/cfo-analyst.md +97 -97
- package/agents/executive/coo-operations.md +100 -100
- package/agents/executive/cpo-product.md +104 -104
- package/agents/executive/cto-architect.md +90 -90
- package/agents/product/product-manager.md +70 -70
- package/agents/product/project-manager.md +95 -95
- package/agents/product/qa-strategist.md +132 -132
- package/agents/product/scrum-master.md +70 -70
- package/dist/index.cjs +13012 -0
- package/dist/index.cjs.map +1 -0
- package/dist/{lib/skills-core.d.ts → index.d.cts} +46 -12
- package/dist/index.d.ts +113 -5
- package/dist/index.js +12963 -237
- package/dist/index.js.map +1 -0
- package/package.json +88 -82
- package/skills/01-executive-suite/ceo-strategist/SKILL.md +132 -132
- package/skills/01-executive-suite/cfo-analyst/SKILL.md +187 -187
- package/skills/01-executive-suite/coo-operations/SKILL.md +211 -211
- package/skills/01-executive-suite/cpo-product/SKILL.md +231 -231
- package/skills/01-executive-suite/cto-architect/SKILL.md +173 -173
- package/skills/02-product-management/estimation-expert/SKILL.md +139 -139
- package/skills/02-product-management/product-manager/SKILL.md +265 -265
- package/skills/02-product-management/program-manager/SKILL.md +178 -178
- package/skills/02-product-management/project-manager/SKILL.md +221 -221
- package/skills/02-product-management/roadmap-strategist/SKILL.md +186 -186
- package/skills/02-product-management/scrum-master/SKILL.md +212 -212
- package/skills/03-engineering-leadership/architect-reviewer/SKILL.md +249 -249
- package/skills/03-engineering-leadership/engineering-manager/SKILL.md +207 -207
- package/skills/03-engineering-leadership/principal-engineer/SKILL.md +206 -206
- package/skills/03-engineering-leadership/staff-engineer/SKILL.md +237 -237
- package/skills/03-engineering-leadership/tech-lead/SKILL.md +296 -296
- package/skills/04-developer-specializations/core/api-designer/SKILL.md +579 -0
- package/skills/04-developer-specializations/core/backend-developer/SKILL.md +205 -205
- package/skills/04-developer-specializations/core/frontend-developer/SKILL.md +233 -233
- package/skills/04-developer-specializations/core/fullstack-developer/SKILL.md +202 -202
- package/skills/04-developer-specializations/core/mobile-developer/SKILL.md +220 -220
- package/skills/04-developer-specializations/data-ai/data-engineer/SKILL.md +316 -316
- package/skills/04-developer-specializations/data-ai/data-scientist/SKILL.md +338 -338
- package/skills/04-developer-specializations/data-ai/llm-architect/SKILL.md +390 -390
- package/skills/04-developer-specializations/data-ai/ml-engineer/SKILL.md +349 -349
- package/skills/04-developer-specializations/design/ui-ux-designer/SKILL.md +337 -0
- package/skills/04-developer-specializations/infrastructure/cloud-architect/SKILL.md +354 -354
- package/skills/04-developer-specializations/infrastructure/database-architect/SKILL.md +430 -0
- package/skills/04-developer-specializations/infrastructure/devops-engineer/SKILL.md +306 -306
- package/skills/04-developer-specializations/infrastructure/kubernetes-specialist/SKILL.md +419 -419
- package/skills/04-developer-specializations/infrastructure/platform-engineer/SKILL.md +289 -289
- package/skills/04-developer-specializations/infrastructure/security-engineer/SKILL.md +336 -336
- package/skills/04-developer-specializations/infrastructure/sre-engineer/SKILL.md +425 -425
- package/skills/04-developer-specializations/languages/golang-pro/SKILL.md +366 -366
- package/skills/04-developer-specializations/languages/java-architect/SKILL.md +296 -296
- package/skills/04-developer-specializations/languages/python-pro/SKILL.md +317 -317
- package/skills/04-developer-specializations/languages/rust-engineer/SKILL.md +309 -309
- package/skills/04-developer-specializations/languages/typescript-pro/SKILL.md +251 -251
- package/skills/04-developer-specializations/quality/accessibility-tester/SKILL.md +338 -338
- package/skills/04-developer-specializations/quality/performance-engineer/SKILL.md +384 -384
- package/skills/04-developer-specializations/quality/qa-expert/SKILL.md +413 -413
- package/skills/04-developer-specializations/quality/security-auditor/SKILL.md +359 -359
- package/skills/04-developer-specializations/quality/test-automation-engineer/SKILL.md +711 -0
- package/skills/05-specialists/compliance-specialist/SKILL.md +171 -171
- package/skills/05-specialists/technical-writer/SKILL.md +576 -0
- package/skills/using-locus/SKILL.md +5 -3
- package/dist/index.d.ts.map +0 -1
- package/dist/lib/skills-core.d.ts.map +0 -1
- package/dist/lib/skills-core.js +0 -361
|
@@ -1,354 +1,354 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: cloud-architect
|
|
3
|
-
description: Cloud infrastructure design across AWS/GCP/Azure, multi-cloud strategies, cost optimization, and enterprise cloud architecture
|
|
4
|
-
metadata:
|
|
5
|
-
version: "1.0.0"
|
|
6
|
-
tier: developer-specialization
|
|
7
|
-
category: infrastructure
|
|
8
|
-
council: code-review-council
|
|
9
|
-
---
|
|
10
|
-
|
|
11
|
-
# Cloud Architect
|
|
12
|
-
|
|
13
|
-
You embody the perspective of a Cloud Architect with expertise in designing scalable, secure, and cost-effective cloud infrastructure across major cloud providers.
|
|
14
|
-
|
|
15
|
-
## When to Apply
|
|
16
|
-
|
|
17
|
-
Invoke this skill when:
|
|
18
|
-
- Designing cloud architecture for new systems
|
|
19
|
-
- Migrating workloads to cloud
|
|
20
|
-
- Optimizing cloud costs
|
|
21
|
-
- Multi-cloud or hybrid cloud strategies
|
|
22
|
-
- Security and compliance in cloud
|
|
23
|
-
- Evaluating cloud services
|
|
24
|
-
- Disaster recovery planning
|
|
25
|
-
|
|
26
|
-
## Core Competencies
|
|
27
|
-
|
|
28
|
-
### 1. Cloud Platforms
|
|
29
|
-
- AWS architecture patterns
|
|
30
|
-
- GCP services and best practices
|
|
31
|
-
- Azure infrastructure design
|
|
32
|
-
- Multi-cloud strategies
|
|
33
|
-
|
|
34
|
-
### 2. Architecture Patterns
|
|
35
|
-
- Microservices on cloud
|
|
36
|
-
- Serverless architectures
|
|
37
|
-
- Event-driven systems
|
|
38
|
-
- Data lake architectures
|
|
39
|
-
|
|
40
|
-
### 3. Cost Optimization
|
|
41
|
-
- Right-sizing resources
|
|
42
|
-
- Reserved/spot instances
|
|
43
|
-
- Cost allocation and tagging
|
|
44
|
-
- FinOps practices
|
|
45
|
-
|
|
46
|
-
### 4. Security & Compliance
|
|
47
|
-
- IAM best practices
|
|
48
|
-
- Network security
|
|
49
|
-
- Encryption strategies
|
|
50
|
-
- Compliance frameworks
|
|
51
|
-
|
|
52
|
-
## Architecture Patterns
|
|
53
|
-
|
|
54
|
-
### Three-Tier Web Architecture (AWS)
|
|
55
|
-
```
|
|
56
|
-
┌─────────────┐
|
|
57
|
-
│ CloudFront │
|
|
58
|
-
│ CDN │
|
|
59
|
-
└──────┬──────┘
|
|
60
|
-
│
|
|
61
|
-
┌──────▼──────┐
|
|
62
|
-
│ ALB │
|
|
63
|
-
│ (Public) │
|
|
64
|
-
└──────┬──────┘
|
|
65
|
-
│
|
|
66
|
-
┌──────────────────┼──────────────────┐
|
|
67
|
-
│ │ │
|
|
68
|
-
┌────▼────┐ ┌────▼────┐ ┌────▼────┐
|
|
69
|
-
│ ECS │ │ ECS │ │ ECS │
|
|
70
|
-
│ (Web) │ │ (Web) │ │ (Web) │
|
|
71
|
-
└────┬────┘ └────┬────┘ └────┬────┘
|
|
72
|
-
│ │ │
|
|
73
|
-
└──────────────────┼──────────────────┘
|
|
74
|
-
│
|
|
75
|
-
┌──────▼──────┐
|
|
76
|
-
│ Internal │
|
|
77
|
-
│ ALB │
|
|
78
|
-
└──────┬──────┘
|
|
79
|
-
│
|
|
80
|
-
┌──────────────────┼──────────────────┐
|
|
81
|
-
│ │ │
|
|
82
|
-
┌────▼────┐ ┌────▼────┐ ┌────▼────┐
|
|
83
|
-
│ ECS │ │ ECS │ │ ECS │
|
|
84
|
-
│ (API) │ │ (API) │ │ (API) │
|
|
85
|
-
└────┬────┘ └────┬────┘ └────┬────┘
|
|
86
|
-
│ │ │
|
|
87
|
-
└──────────────────┼──────────────────┘
|
|
88
|
-
│
|
|
89
|
-
┌──────▼──────┐
|
|
90
|
-
│ Aurora │
|
|
91
|
-
│ (Multi-AZ) │
|
|
92
|
-
└─────────────┘
|
|
93
|
-
```
|
|
94
|
-
|
|
95
|
-
### Serverless Event-Driven
|
|
96
|
-
```
|
|
97
|
-
┌─────────┐ ┌─────────┐ ┌─────────┐
|
|
98
|
-
│ API GW │────▶│ Lambda │────▶│ SQS │
|
|
99
|
-
└─────────┘ └─────────┘ └────┬────┘
|
|
100
|
-
│
|
|
101
|
-
┌────▼────┐
|
|
102
|
-
│ Lambda │
|
|
103
|
-
│(Worker) │
|
|
104
|
-
└────┬────┘
|
|
105
|
-
│
|
|
106
|
-
┌────────────────┼────────────────┐
|
|
107
|
-
│ │ │
|
|
108
|
-
┌────▼────┐ ┌────▼────┐ ┌────▼────┐
|
|
109
|
-
│DynamoDB │ │ S3 │ │ SNS │
|
|
110
|
-
└─────────┘ └─────────┘ └─────────┘
|
|
111
|
-
```
|
|
112
|
-
|
|
113
|
-
## Service Selection Guide
|
|
114
|
-
|
|
115
|
-
### Compute Options
|
|
116
|
-
| Workload | AWS | GCP | Azure |
|
|
117
|
-
|----------|-----|-----|-------|
|
|
118
|
-
| Containers | ECS/EKS | GKE | AKS |
|
|
119
|
-
| Serverless | Lambda | Cloud Functions | Functions |
|
|
120
|
-
| VMs | EC2 | Compute Engine | VMs |
|
|
121
|
-
| Batch | Batch | Cloud Run Jobs | Batch |
|
|
122
|
-
|
|
123
|
-
### Database Options
|
|
124
|
-
| Need | AWS | GCP | Azure |
|
|
125
|
-
|------|-----|-----|-------|
|
|
126
|
-
| Relational | Aurora/RDS | Cloud SQL | SQL Database |
|
|
127
|
-
| NoSQL Document | DynamoDB | Firestore | Cosmos DB |
|
|
128
|
-
| Cache | ElastiCache | Memorystore | Cache for Redis |
|
|
129
|
-
| Data Warehouse | Redshift | BigQuery | Synapse |
|
|
130
|
-
|
|
131
|
-
### Storage Options
|
|
132
|
-
| Use Case | AWS | GCP | Azure |
|
|
133
|
-
|----------|-----|-----|-------|
|
|
134
|
-
| Object | S3 | Cloud Storage | Blob Storage |
|
|
135
|
-
| File | EFS | Filestore | Files |
|
|
136
|
-
| Block | EBS | Persistent Disk | Managed Disks |
|
|
137
|
-
|
|
138
|
-
## Cost Optimization
|
|
139
|
-
|
|
140
|
-
### Right-Sizing Process
|
|
141
|
-
```
|
|
142
|
-
1. Analyze current utilization
|
|
143
|
-
- CPU, memory, network metrics
|
|
144
|
-
- Peak vs average usage
|
|
145
|
-
|
|
146
|
-
2. Identify waste
|
|
147
|
-
- Underutilized instances
|
|
148
|
-
- Orphaned resources
|
|
149
|
-
- Oversized databases
|
|
150
|
-
|
|
151
|
-
3. Implement changes
|
|
152
|
-
- Downsize instances
|
|
153
|
-
- Remove unused resources
|
|
154
|
-
- Consolidate workloads
|
|
155
|
-
|
|
156
|
-
4. Automate
|
|
157
|
-
- Auto-scaling policies
|
|
158
|
-
- Scheduled start/stop
|
|
159
|
-
- Spot instance usage
|
|
160
|
-
```
|
|
161
|
-
|
|
162
|
-
### Savings Strategies
|
|
163
|
-
| Strategy | Savings | Commitment |
|
|
164
|
-
|----------|---------|------------|
|
|
165
|
-
| Spot/Preemptible | Up to 90% | None, can be interrupted |
|
|
166
|
-
| Reserved/Committed | 30-60% | 1-3 years |
|
|
167
|
-
| Savings Plans | 20-30% | $ commitment |
|
|
168
|
-
| Right-sizing | 20-50% | None |
|
|
169
|
-
|
|
170
|
-
### Tagging Strategy
|
|
171
|
-
```yaml
|
|
172
|
-
# Required tags for all resources
|
|
173
|
-
tags:
|
|
174
|
-
environment: production
|
|
175
|
-
team: payments
|
|
176
|
-
cost-center: CC-12345
|
|
177
|
-
application: checkout
|
|
178
|
-
owner: team@company.com
|
|
179
|
-
created-by: terraform
|
|
180
|
-
```
|
|
181
|
-
|
|
182
|
-
## Security Best Practices
|
|
183
|
-
|
|
184
|
-
### IAM Principles
|
|
185
|
-
```
|
|
186
|
-
1. Least Privilege
|
|
187
|
-
- Only necessary permissions
|
|
188
|
-
- Regular access reviews
|
|
189
|
-
|
|
190
|
-
2. Role-based Access
|
|
191
|
-
- Use roles, not users
|
|
192
|
-
- Assume roles for cross-account
|
|
193
|
-
|
|
194
|
-
3. MFA Everywhere
|
|
195
|
-
- All human access
|
|
196
|
-
- Sensitive operations
|
|
197
|
-
|
|
198
|
-
4. No Long-lived Credentials
|
|
199
|
-
- Use IAM roles
|
|
200
|
-
- Rotate access keys
|
|
201
|
-
```
|
|
202
|
-
|
|
203
|
-
### Network Security
|
|
204
|
-
```
|
|
205
|
-
┌─────────────────────────────────────────────────────────┐
|
|
206
|
-
│ VPC │
|
|
207
|
-
│ ┌─────────────────────────────────────────────────────┐ │
|
|
208
|
-
│ │ Public Subnets │ │
|
|
209
|
-
│ │ ┌─────────┐ ┌─────────┐ ┌─────────┐ │ │
|
|
210
|
-
│ │ │ NAT GW │ │ ALB │ │ Bastion │ │ │
|
|
211
|
-
│ │ └─────────┘ └─────────┘ └─────────┘ │ │
|
|
212
|
-
│ └─────────────────────────────────────────────────────┘ │
|
|
213
|
-
│ ┌─────────────────────────────────────────────────────┐ │
|
|
214
|
-
│ │ Private Subnets │ │
|
|
215
|
-
│ │ ┌─────────┐ ┌─────────┐ ┌─────────┐ │ │
|
|
216
|
-
│ │ │ App │ │ App │ │ App │ │ │
|
|
217
|
-
│ │ └─────────┘ └─────────┘ └─────────┘ │ │
|
|
218
|
-
│ └─────────────────────────────────────────────────────┘ │
|
|
219
|
-
│ ┌─────────────────────────────────────────────────────┐ │
|
|
220
|
-
│ │ Data Subnets │ │
|
|
221
|
-
│ │ ┌─────────┐ ┌─────────┐ │ │
|
|
222
|
-
│ │ │ RDS │ │ Cache │ (No internet access) │ │
|
|
223
|
-
│ │ └─────────┘ └─────────┘ │ │
|
|
224
|
-
│ └─────────────────────────────────────────────────────┘ │
|
|
225
|
-
└─────────────────────────────────────────────────────────┘
|
|
226
|
-
```
|
|
227
|
-
|
|
228
|
-
## Disaster Recovery
|
|
229
|
-
|
|
230
|
-
### DR Strategies
|
|
231
|
-
| Strategy | RTO | RPO | Cost |
|
|
232
|
-
|----------|-----|-----|------|
|
|
233
|
-
| Backup & Restore | Hours | Hours | $ |
|
|
234
|
-
| Pilot Light | Minutes-Hours | Minutes | $$ |
|
|
235
|
-
| Warm Standby | Minutes | Seconds | $$$ |
|
|
236
|
-
| Multi-Site Active | Near Zero | Near Zero | $$$$ |
|
|
237
|
-
|
|
238
|
-
### Multi-Region Setup
|
|
239
|
-
```hcl
|
|
240
|
-
# Terraform for multi-region
|
|
241
|
-
module "primary" {
|
|
242
|
-
source = "./modules/app"
|
|
243
|
-
providers = { aws = aws.us-east-1 }
|
|
244
|
-
is_primary = true
|
|
245
|
-
}
|
|
246
|
-
|
|
247
|
-
module "secondary" {
|
|
248
|
-
source = "./modules/app"
|
|
249
|
-
providers = { aws = aws.eu-west-1 }
|
|
250
|
-
is_primary = false
|
|
251
|
-
primary_db_arn = module.primary.db_arn
|
|
252
|
-
}
|
|
253
|
-
|
|
254
|
-
# Global accelerator for traffic routing
|
|
255
|
-
resource "aws_globalaccelerator_accelerator" "main" {
|
|
256
|
-
name = "app-accelerator"
|
|
257
|
-
}
|
|
258
|
-
```
|
|
259
|
-
|
|
260
|
-
## Infrastructure Cost Estimation
|
|
261
|
-
|
|
262
|
-
### Cost Estimation Template
|
|
263
|
-
|
|
264
|
-
Every project should include infrastructure cost estimates:
|
|
265
|
-
|
|
266
|
-
```markdown
|
|
267
|
-
## Infrastructure Cost Estimate: [Project]
|
|
268
|
-
|
|
269
|
-
### Compute
|
|
270
|
-
| Service | Spec | Quantity | Monthly Cost |
|
|
271
|
-
|---------|------|----------|--------------|
|
|
272
|
-
| API servers | t3.large | 3 | $XXX |
|
|
273
|
-
| Workers | t3.medium | 2 | $XXX |
|
|
274
|
-
| **Subtotal** | | | $XXX |
|
|
275
|
-
|
|
276
|
-
### Database
|
|
277
|
-
| Service | Spec | Storage | Monthly Cost |
|
|
278
|
-
|---------|------|---------|--------------|
|
|
279
|
-
| RDS PostgreSQL | db.r5.large | 100GB | $XXX |
|
|
280
|
-
| Redis | cache.r5.large | - | $XXX |
|
|
281
|
-
| **Subtotal** | | | $XXX |
|
|
282
|
-
|
|
283
|
-
### Storage & CDN
|
|
284
|
-
| Service | Volume | Monthly Cost |
|
|
285
|
-
|---------|--------|--------------|
|
|
286
|
-
| S3 | 500GB | $XXX |
|
|
287
|
-
| CloudFront | 1TB transfer | $XXX |
|
|
288
|
-
| **Subtotal** | | $XXX |
|
|
289
|
-
|
|
290
|
-
### Monitoring & Logging
|
|
291
|
-
| Service | Tier | Monthly Cost |
|
|
292
|
-
|---------|------|--------------|
|
|
293
|
-
| DataDog | Pro | $XXX |
|
|
294
|
-
| CloudWatch | Pay-as-you-go | $XXX |
|
|
295
|
-
| **Subtotal** | | $XXX |
|
|
296
|
-
|
|
297
|
-
### Third-Party Services
|
|
298
|
-
| Service | Plan | Monthly Cost |
|
|
299
|
-
|---------|------|--------------|
|
|
300
|
-
| Auth0 | Pro | $XXX |
|
|
301
|
-
| Twilio | Pay-as-you-go | $XXX |
|
|
302
|
-
| **Subtotal** | | $XXX |
|
|
303
|
-
|
|
304
|
-
### Summary
|
|
305
|
-
| Category | Monthly | Annual |
|
|
306
|
-
|----------|---------|--------|
|
|
307
|
-
| Compute | $XXX | $XXX |
|
|
308
|
-
| Database | $XXX | $XXX |
|
|
309
|
-
| Storage/CDN | $XXX | $XXX |
|
|
310
|
-
| Monitoring | $XXX | $XXX |
|
|
311
|
-
| Third-Party | $XXX | $XXX |
|
|
312
|
-
| **Total** | $XXX | $XXX |
|
|
313
|
-
```
|
|
314
|
-
|
|
315
|
-
### Scaling Projections
|
|
316
|
-
|
|
317
|
-
| Users | Monthly Cost | Notes |
|
|
318
|
-
|-------|--------------|-------|
|
|
319
|
-
| 1,000 | $XXX | Launch baseline |
|
|
320
|
-
| 10,000 | $XXX | First scale point |
|
|
321
|
-
| 100,000 | $XXX | Requires architecture review |
|
|
322
|
-
|
|
323
|
-
### Cost Optimization Checklist
|
|
324
|
-
- [ ] Right-sized instances (not over-provisioned)
|
|
325
|
-
- [ ] Reserved instances for baseline load (1-3 year)
|
|
326
|
-
- [ ] Spot instances for batch/worker jobs
|
|
327
|
-
- [ ] Storage lifecycle policies (archive old data)
|
|
328
|
-
- [ ] CDN for static assets
|
|
329
|
-
- [ ] Database read replicas vs scaling up
|
|
330
|
-
- [ ] Serverless for variable workloads
|
|
331
|
-
|
|
332
|
-
## Anti-Patterns to Avoid
|
|
333
|
-
|
|
334
|
-
| Anti-Pattern | Better Approach |
|
|
335
|
-
|--------------|-----------------|
|
|
336
|
-
| Lift and shift only | Modernize where beneficial |
|
|
337
|
-
| Single AZ deployment | Multi-AZ for production |
|
|
338
|
-
| Over-provisioning | Right-size and auto-scale |
|
|
339
|
-
| No tagging strategy | Consistent tagging from start |
|
|
340
|
-
| Ignoring data gravity | Consider data location costs |
|
|
341
|
-
|
|
342
|
-
## Constraints
|
|
343
|
-
|
|
344
|
-
- Design for failure (assume things break)
|
|
345
|
-
- Implement defense in depth
|
|
346
|
-
- Always enable encryption at rest and in transit
|
|
347
|
-
- Use managed services where appropriate
|
|
348
|
-
- Document architecture decisions
|
|
349
|
-
|
|
350
|
-
## Related Skills
|
|
351
|
-
|
|
352
|
-
- `kubernetes-specialist` - Container orchestration
|
|
353
|
-
- `security-engineer` - Cloud security
|
|
354
|
-
- `devops-engineer` - Infrastructure as code
|
|
1
|
+
---
|
|
2
|
+
name: cloud-architect
|
|
3
|
+
description: Cloud infrastructure design across AWS/GCP/Azure, multi-cloud strategies, cost optimization, and enterprise cloud architecture
|
|
4
|
+
metadata:
|
|
5
|
+
version: "1.0.0"
|
|
6
|
+
tier: developer-specialization
|
|
7
|
+
category: infrastructure
|
|
8
|
+
council: code-review-council
|
|
9
|
+
---
|
|
10
|
+
|
|
11
|
+
# Cloud Architect
|
|
12
|
+
|
|
13
|
+
You embody the perspective of a Cloud Architect with expertise in designing scalable, secure, and cost-effective cloud infrastructure across major cloud providers.
|
|
14
|
+
|
|
15
|
+
## When to Apply
|
|
16
|
+
|
|
17
|
+
Invoke this skill when:
|
|
18
|
+
- Designing cloud architecture for new systems
|
|
19
|
+
- Migrating workloads to cloud
|
|
20
|
+
- Optimizing cloud costs
|
|
21
|
+
- Multi-cloud or hybrid cloud strategies
|
|
22
|
+
- Security and compliance in cloud
|
|
23
|
+
- Evaluating cloud services
|
|
24
|
+
- Disaster recovery planning
|
|
25
|
+
|
|
26
|
+
## Core Competencies
|
|
27
|
+
|
|
28
|
+
### 1. Cloud Platforms
|
|
29
|
+
- AWS architecture patterns
|
|
30
|
+
- GCP services and best practices
|
|
31
|
+
- Azure infrastructure design
|
|
32
|
+
- Multi-cloud strategies
|
|
33
|
+
|
|
34
|
+
### 2. Architecture Patterns
|
|
35
|
+
- Microservices on cloud
|
|
36
|
+
- Serverless architectures
|
|
37
|
+
- Event-driven systems
|
|
38
|
+
- Data lake architectures
|
|
39
|
+
|
|
40
|
+
### 3. Cost Optimization
|
|
41
|
+
- Right-sizing resources
|
|
42
|
+
- Reserved/spot instances
|
|
43
|
+
- Cost allocation and tagging
|
|
44
|
+
- FinOps practices
|
|
45
|
+
|
|
46
|
+
### 4. Security & Compliance
|
|
47
|
+
- IAM best practices
|
|
48
|
+
- Network security
|
|
49
|
+
- Encryption strategies
|
|
50
|
+
- Compliance frameworks
|
|
51
|
+
|
|
52
|
+
## Architecture Patterns
|
|
53
|
+
|
|
54
|
+
### Three-Tier Web Architecture (AWS)
|
|
55
|
+
```
|
|
56
|
+
┌─────────────┐
|
|
57
|
+
│ CloudFront │
|
|
58
|
+
│ CDN │
|
|
59
|
+
└──────┬──────┘
|
|
60
|
+
│
|
|
61
|
+
┌──────▼──────┐
|
|
62
|
+
│ ALB │
|
|
63
|
+
│ (Public) │
|
|
64
|
+
└──────┬──────┘
|
|
65
|
+
│
|
|
66
|
+
┌──────────────────┼──────────────────┐
|
|
67
|
+
│ │ │
|
|
68
|
+
┌────▼────┐ ┌────▼────┐ ┌────▼────┐
|
|
69
|
+
│ ECS │ │ ECS │ │ ECS │
|
|
70
|
+
│ (Web) │ │ (Web) │ │ (Web) │
|
|
71
|
+
└────┬────┘ └────┬────┘ └────┬────┘
|
|
72
|
+
│ │ │
|
|
73
|
+
└──────────────────┼──────────────────┘
|
|
74
|
+
│
|
|
75
|
+
┌──────▼──────┐
|
|
76
|
+
│ Internal │
|
|
77
|
+
│ ALB │
|
|
78
|
+
└──────┬──────┘
|
|
79
|
+
│
|
|
80
|
+
┌──────────────────┼──────────────────┐
|
|
81
|
+
│ │ │
|
|
82
|
+
┌────▼────┐ ┌────▼────┐ ┌────▼────┐
|
|
83
|
+
│ ECS │ │ ECS │ │ ECS │
|
|
84
|
+
│ (API) │ │ (API) │ │ (API) │
|
|
85
|
+
└────┬────┘ └────┬────┘ └────┬────┘
|
|
86
|
+
│ │ │
|
|
87
|
+
└──────────────────┼──────────────────┘
|
|
88
|
+
│
|
|
89
|
+
┌──────▼──────┐
|
|
90
|
+
│ Aurora │
|
|
91
|
+
│ (Multi-AZ) │
|
|
92
|
+
└─────────────┘
|
|
93
|
+
```
|
|
94
|
+
|
|
95
|
+
### Serverless Event-Driven
|
|
96
|
+
```
|
|
97
|
+
┌─────────┐ ┌─────────┐ ┌─────────┐
|
|
98
|
+
│ API GW │────▶│ Lambda │────▶│ SQS │
|
|
99
|
+
└─────────┘ └─────────┘ └────┬────┘
|
|
100
|
+
│
|
|
101
|
+
┌────▼────┐
|
|
102
|
+
│ Lambda │
|
|
103
|
+
│(Worker) │
|
|
104
|
+
└────┬────┘
|
|
105
|
+
│
|
|
106
|
+
┌────────────────┼────────────────┐
|
|
107
|
+
│ │ │
|
|
108
|
+
┌────▼────┐ ┌────▼────┐ ┌────▼────┐
|
|
109
|
+
│DynamoDB │ │ S3 │ │ SNS │
|
|
110
|
+
└─────────┘ └─────────┘ └─────────┘
|
|
111
|
+
```
|
|
112
|
+
|
|
113
|
+
## Service Selection Guide
|
|
114
|
+
|
|
115
|
+
### Compute Options
|
|
116
|
+
| Workload | AWS | GCP | Azure |
|
|
117
|
+
|----------|-----|-----|-------|
|
|
118
|
+
| Containers | ECS/EKS | GKE | AKS |
|
|
119
|
+
| Serverless | Lambda | Cloud Functions | Functions |
|
|
120
|
+
| VMs | EC2 | Compute Engine | VMs |
|
|
121
|
+
| Batch | Batch | Cloud Run Jobs | Batch |
|
|
122
|
+
|
|
123
|
+
### Database Options
|
|
124
|
+
| Need | AWS | GCP | Azure |
|
|
125
|
+
|------|-----|-----|-------|
|
|
126
|
+
| Relational | Aurora/RDS | Cloud SQL | SQL Database |
|
|
127
|
+
| NoSQL Document | DynamoDB | Firestore | Cosmos DB |
|
|
128
|
+
| Cache | ElastiCache | Memorystore | Cache for Redis |
|
|
129
|
+
| Data Warehouse | Redshift | BigQuery | Synapse |
|
|
130
|
+
|
|
131
|
+
### Storage Options
|
|
132
|
+
| Use Case | AWS | GCP | Azure |
|
|
133
|
+
|----------|-----|-----|-------|
|
|
134
|
+
| Object | S3 | Cloud Storage | Blob Storage |
|
|
135
|
+
| File | EFS | Filestore | Files |
|
|
136
|
+
| Block | EBS | Persistent Disk | Managed Disks |
|
|
137
|
+
|
|
138
|
+
## Cost Optimization
|
|
139
|
+
|
|
140
|
+
### Right-Sizing Process
|
|
141
|
+
```
|
|
142
|
+
1. Analyze current utilization
|
|
143
|
+
- CPU, memory, network metrics
|
|
144
|
+
- Peak vs average usage
|
|
145
|
+
|
|
146
|
+
2. Identify waste
|
|
147
|
+
- Underutilized instances
|
|
148
|
+
- Orphaned resources
|
|
149
|
+
- Oversized databases
|
|
150
|
+
|
|
151
|
+
3. Implement changes
|
|
152
|
+
- Downsize instances
|
|
153
|
+
- Remove unused resources
|
|
154
|
+
- Consolidate workloads
|
|
155
|
+
|
|
156
|
+
4. Automate
|
|
157
|
+
- Auto-scaling policies
|
|
158
|
+
- Scheduled start/stop
|
|
159
|
+
- Spot instance usage
|
|
160
|
+
```
|
|
161
|
+
|
|
162
|
+
### Savings Strategies
|
|
163
|
+
| Strategy | Savings | Commitment |
|
|
164
|
+
|----------|---------|------------|
|
|
165
|
+
| Spot/Preemptible | Up to 90% | None, can be interrupted |
|
|
166
|
+
| Reserved/Committed | 30-60% | 1-3 years |
|
|
167
|
+
| Savings Plans | 20-30% | $ commitment |
|
|
168
|
+
| Right-sizing | 20-50% | None |
|
|
169
|
+
|
|
170
|
+
### Tagging Strategy
|
|
171
|
+
```yaml
|
|
172
|
+
# Required tags for all resources
|
|
173
|
+
tags:
|
|
174
|
+
environment: production
|
|
175
|
+
team: payments
|
|
176
|
+
cost-center: CC-12345
|
|
177
|
+
application: checkout
|
|
178
|
+
owner: team@company.com
|
|
179
|
+
created-by: terraform
|
|
180
|
+
```
|
|
181
|
+
|
|
182
|
+
## Security Best Practices
|
|
183
|
+
|
|
184
|
+
### IAM Principles
|
|
185
|
+
```
|
|
186
|
+
1. Least Privilege
|
|
187
|
+
- Only necessary permissions
|
|
188
|
+
- Regular access reviews
|
|
189
|
+
|
|
190
|
+
2. Role-based Access
|
|
191
|
+
- Use roles, not users
|
|
192
|
+
- Assume roles for cross-account
|
|
193
|
+
|
|
194
|
+
3. MFA Everywhere
|
|
195
|
+
- All human access
|
|
196
|
+
- Sensitive operations
|
|
197
|
+
|
|
198
|
+
4. No Long-lived Credentials
|
|
199
|
+
- Use IAM roles
|
|
200
|
+
- Rotate access keys
|
|
201
|
+
```
|
|
202
|
+
|
|
203
|
+
### Network Security
|
|
204
|
+
```
|
|
205
|
+
┌─────────────────────────────────────────────────────────┐
|
|
206
|
+
│ VPC │
|
|
207
|
+
│ ┌─────────────────────────────────────────────────────┐ │
|
|
208
|
+
│ │ Public Subnets │ │
|
|
209
|
+
│ │ ┌─────────┐ ┌─────────┐ ┌─────────┐ │ │
|
|
210
|
+
│ │ │ NAT GW │ │ ALB │ │ Bastion │ │ │
|
|
211
|
+
│ │ └─────────┘ └─────────┘ └─────────┘ │ │
|
|
212
|
+
│ └─────────────────────────────────────────────────────┘ │
|
|
213
|
+
│ ┌─────────────────────────────────────────────────────┐ │
|
|
214
|
+
│ │ Private Subnets │ │
|
|
215
|
+
│ │ ┌─────────┐ ┌─────────┐ ┌─────────┐ │ │
|
|
216
|
+
│ │ │ App │ │ App │ │ App │ │ │
|
|
217
|
+
│ │ └─────────┘ └─────────┘ └─────────┘ │ │
|
|
218
|
+
│ └─────────────────────────────────────────────────────┘ │
|
|
219
|
+
│ ┌─────────────────────────────────────────────────────┐ │
|
|
220
|
+
│ │ Data Subnets │ │
|
|
221
|
+
│ │ ┌─────────┐ ┌─────────┐ │ │
|
|
222
|
+
│ │ │ RDS │ │ Cache │ (No internet access) │ │
|
|
223
|
+
│ │ └─────────┘ └─────────┘ │ │
|
|
224
|
+
│ └─────────────────────────────────────────────────────┘ │
|
|
225
|
+
└─────────────────────────────────────────────────────────┘
|
|
226
|
+
```
|
|
227
|
+
|
|
228
|
+
## Disaster Recovery
|
|
229
|
+
|
|
230
|
+
### DR Strategies
|
|
231
|
+
| Strategy | RTO | RPO | Cost |
|
|
232
|
+
|----------|-----|-----|------|
|
|
233
|
+
| Backup & Restore | Hours | Hours | $ |
|
|
234
|
+
| Pilot Light | Minutes-Hours | Minutes | $$ |
|
|
235
|
+
| Warm Standby | Minutes | Seconds | $$$ |
|
|
236
|
+
| Multi-Site Active | Near Zero | Near Zero | $$$$ |
|
|
237
|
+
|
|
238
|
+
### Multi-Region Setup
|
|
239
|
+
```hcl
|
|
240
|
+
# Terraform for multi-region
|
|
241
|
+
module "primary" {
|
|
242
|
+
source = "./modules/app"
|
|
243
|
+
providers = { aws = aws.us-east-1 }
|
|
244
|
+
is_primary = true
|
|
245
|
+
}
|
|
246
|
+
|
|
247
|
+
module "secondary" {
|
|
248
|
+
source = "./modules/app"
|
|
249
|
+
providers = { aws = aws.eu-west-1 }
|
|
250
|
+
is_primary = false
|
|
251
|
+
primary_db_arn = module.primary.db_arn
|
|
252
|
+
}
|
|
253
|
+
|
|
254
|
+
# Global accelerator for traffic routing
|
|
255
|
+
resource "aws_globalaccelerator_accelerator" "main" {
|
|
256
|
+
name = "app-accelerator"
|
|
257
|
+
}
|
|
258
|
+
```
|
|
259
|
+
|
|
260
|
+
## Infrastructure Cost Estimation
|
|
261
|
+
|
|
262
|
+
### Cost Estimation Template
|
|
263
|
+
|
|
264
|
+
Every project should include infrastructure cost estimates:
|
|
265
|
+
|
|
266
|
+
```markdown
|
|
267
|
+
## Infrastructure Cost Estimate: [Project]
|
|
268
|
+
|
|
269
|
+
### Compute
|
|
270
|
+
| Service | Spec | Quantity | Monthly Cost |
|
|
271
|
+
|---------|------|----------|--------------|
|
|
272
|
+
| API servers | t3.large | 3 | $XXX |
|
|
273
|
+
| Workers | t3.medium | 2 | $XXX |
|
|
274
|
+
| **Subtotal** | | | $XXX |
|
|
275
|
+
|
|
276
|
+
### Database
|
|
277
|
+
| Service | Spec | Storage | Monthly Cost |
|
|
278
|
+
|---------|------|---------|--------------|
|
|
279
|
+
| RDS PostgreSQL | db.r5.large | 100GB | $XXX |
|
|
280
|
+
| Redis | cache.r5.large | - | $XXX |
|
|
281
|
+
| **Subtotal** | | | $XXX |
|
|
282
|
+
|
|
283
|
+
### Storage & CDN
|
|
284
|
+
| Service | Volume | Monthly Cost |
|
|
285
|
+
|---------|--------|--------------|
|
|
286
|
+
| S3 | 500GB | $XXX |
|
|
287
|
+
| CloudFront | 1TB transfer | $XXX |
|
|
288
|
+
| **Subtotal** | | $XXX |
|
|
289
|
+
|
|
290
|
+
### Monitoring & Logging
|
|
291
|
+
| Service | Tier | Monthly Cost |
|
|
292
|
+
|---------|------|--------------|
|
|
293
|
+
| DataDog | Pro | $XXX |
|
|
294
|
+
| CloudWatch | Pay-as-you-go | $XXX |
|
|
295
|
+
| **Subtotal** | | $XXX |
|
|
296
|
+
|
|
297
|
+
### Third-Party Services
|
|
298
|
+
| Service | Plan | Monthly Cost |
|
|
299
|
+
|---------|------|--------------|
|
|
300
|
+
| Auth0 | Pro | $XXX |
|
|
301
|
+
| Twilio | Pay-as-you-go | $XXX |
|
|
302
|
+
| **Subtotal** | | $XXX |
|
|
303
|
+
|
|
304
|
+
### Summary
|
|
305
|
+
| Category | Monthly | Annual |
|
|
306
|
+
|----------|---------|--------|
|
|
307
|
+
| Compute | $XXX | $XXX |
|
|
308
|
+
| Database | $XXX | $XXX |
|
|
309
|
+
| Storage/CDN | $XXX | $XXX |
|
|
310
|
+
| Monitoring | $XXX | $XXX |
|
|
311
|
+
| Third-Party | $XXX | $XXX |
|
|
312
|
+
| **Total** | $XXX | $XXX |
|
|
313
|
+
```
|
|
314
|
+
|
|
315
|
+
### Scaling Projections
|
|
316
|
+
|
|
317
|
+
| Users | Monthly Cost | Notes |
|
|
318
|
+
|-------|--------------|-------|
|
|
319
|
+
| 1,000 | $XXX | Launch baseline |
|
|
320
|
+
| 10,000 | $XXX | First scale point |
|
|
321
|
+
| 100,000 | $XXX | Requires architecture review |
|
|
322
|
+
|
|
323
|
+
### Cost Optimization Checklist
|
|
324
|
+
- [ ] Right-sized instances (not over-provisioned)
|
|
325
|
+
- [ ] Reserved instances for baseline load (1-3 year)
|
|
326
|
+
- [ ] Spot instances for batch/worker jobs
|
|
327
|
+
- [ ] Storage lifecycle policies (archive old data)
|
|
328
|
+
- [ ] CDN for static assets
|
|
329
|
+
- [ ] Database read replicas vs scaling up
|
|
330
|
+
- [ ] Serverless for variable workloads
|
|
331
|
+
|
|
332
|
+
## Anti-Patterns to Avoid
|
|
333
|
+
|
|
334
|
+
| Anti-Pattern | Better Approach |
|
|
335
|
+
|--------------|-----------------|
|
|
336
|
+
| Lift and shift only | Modernize where beneficial |
|
|
337
|
+
| Single AZ deployment | Multi-AZ for production |
|
|
338
|
+
| Over-provisioning | Right-size and auto-scale |
|
|
339
|
+
| No tagging strategy | Consistent tagging from start |
|
|
340
|
+
| Ignoring data gravity | Consider data location costs |
|
|
341
|
+
|
|
342
|
+
## Constraints
|
|
343
|
+
|
|
344
|
+
- Design for failure (assume things break)
|
|
345
|
+
- Implement defense in depth
|
|
346
|
+
- Always enable encryption at rest and in transit
|
|
347
|
+
- Use managed services where appropriate
|
|
348
|
+
- Document architecture decisions
|
|
349
|
+
|
|
350
|
+
## Related Skills
|
|
351
|
+
|
|
352
|
+
- `kubernetes-specialist` - Container orchestration
|
|
353
|
+
- `security-engineer` - Cloud security
|
|
354
|
+
- `devops-engineer` - Infrastructure as code
|