npm - opencode-skills-collection - Versions diffs - 3.0.45 → 3.0.47 - Mend

opencode-skills-collection 3.0.45 → 3.0.47

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (71)

package/bundled-skills/monopoly/patterns/SKILL.md ADDED

@@ -0,0 +1,331 @@
+---
+name: patterns
+description: Reference document for monopoly patterns.
+risk: safe
+reports-to: monopoly
+---
+# MONOPOLY — Design Patterns Deep Dive
+## Table of Contents
+1. CQRS
+2. Event Sourcing
+3. Saga Pattern
+4. Circuit Breaker
+5. Bulkhead
+6. Strangler Fig
+7. Outbox Pattern
+8. Consistent Hashing
+9. Backpressure
+10. Leader Election
+11. Two-Phase Commit
+12. Read-Through / Write-Through / Write-Behind Cache
+---
+## 1. CQRS (Command Query Responsibility Segregation)
+**What it is:** Separate the read model (Query) from the write model (Command) into distinct services, databases, or code paths.
+**When to use:**
+- Read load is 10×+ write load (most web apps)
+- Read queries are complex aggregations over write data
+- Need to optimize read and write paths independently
+- Domain model is complex (DDD contexts)
+**Implementation:**
+```
+Write Path:  Client → Command API → Write DB (normalized, PostgreSQL)
+Read Path:   Client → Query API  → Read DB (denormalized, Redis / Elasticsearch)
+Sync:        Write DB → CDC (Debezium) → Message Queue → Read DB updater
+```
+**Trade-offs:**
+- ✅ Independent scaling of read and write
+- ✅ Optimized schemas for each operation type
+- ❌ Eventual consistency between write and read models
+- ❌ Increased complexity; two models to maintain
+**Real-world users:** Amazon (order service), LinkedIn (feed)
+---
+## 2. Event Sourcing
+**What it is:** Store state as a sequence of immutable events rather than current state. Rebuild current state by replaying events.
+**When to use:**
+- Full audit trail is a regulatory requirement (fintech, healthcare)
+- Need to replay history for debugging or analytics
+- Complex domain with many state transitions
+- Need to derive multiple read projections from same data
+**Implementation:**
+```
+Event Store: append-only log (Kafka, EventStoreDB)
+Snapshots:   periodic snapshots to speed up state rebuild
+Projections: consumers build read models from events
+```
+**Trade-offs:**
+- ✅ Complete audit history; perfect for compliance
+- ✅ Replay and time-travel debugging
+- ❌ Querying current state requires projection maintenance
+- ❌ Event schema evolution is hard
+- ❌ High storage overhead over time
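The replay and snapshot ideas above can be illustrated with a minimal sketch. The `Event` type, the event names `Deposited` and `Withdrawn`, and the bank-balance domain are hypothetical choices for illustration; a real event store such as Kafka or EventStoreDB would replace the in-memory list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """An immutable fact; the event names used here are hypothetical."""
    kind: str
    amount: int


def apply(balance: int, event: Event) -> int:
    """Pure transition: current state plus one event gives the next state."""
    if event.kind == "Deposited":
        return balance + event.amount
    if event.kind == "Withdrawn":
        return balance - event.amount
    raise ValueError(f"unknown event kind: {event.kind}")


def replay(events, snapshot: int = 0) -> int:
    """Rebuild state by folding events over a starting snapshot."""
    balance = snapshot
    for event in events:
        balance = apply(balance, event)
    return balance


# Append-only log standing in for the event store.
log = [Event("Deposited", 100), Event("Withdrawn", 30), Event("Deposited", 5)]

current = replay(log)                       # full replay from the beginning
snapshot = replay(log[:2])                  # periodic snapshot after two events
from_snapshot = replay(log[2:], snapshot)   # replay only the tail
```

Replaying from the snapshot gives the same state as a full replay, which is why snapshots speed up rebuilds without changing the result.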
+---
+## 3. Saga Pattern
+**What it is:** Manage distributed transactions across microservices via a sequence of local transactions, each publishing an event. If a step fails, compensating transactions undo previous steps.
+**Two variants:**
+- **Choreography:** Services react to events autonomously (decentralized)
+- **Orchestration:** A central Saga Orchestrator coordinates steps (centralized)
+**When to use:**
+- Multi-service workflows where ACID across services is impossible
+- Long-running business transactions (order → payment → inventory → shipping)
+- Need rollback across service boundaries
+**Choreography Example:**
+```
+OrderService creates order →
+  [event: OrderCreated] →
+    PaymentService charges card →
+      [event: PaymentProcessed] →
+        InventoryService reserves stock →
+          [event: StockReserved] →
+            ShippingService books courier
+```
+**Compensating Transactions (on failure):**
+```
+ShippingService fails →
+  [event: ShippingFailed] →
+    InventoryService releases stock →
+      PaymentService refunds card →
+        OrderService marks order failed
+```
+**Trade-offs:**
+- ✅ No distributed locking; high availability
+- ✅ Scales well across services
+- ❌ Hard to debug; distributed trace required
+- ❌ Compensating transactions are complex to implement correctly
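An orchestrated saga with compensation can be sketched roughly as follows. The `run_saga` helper and the step names are assumptions made for illustration; the callables stand in for service calls and do not model retries, persistence of saga state, or failures inside a compensation.

```python
def run_saga(steps):
    """Run (action, compensation) pairs in order.

    If an action raises, run the compensations of the steps that already
    completed, in reverse order, and report failure.
    """
    completed = []
    for action, compensate in steps:
        try:
            action()
        except Exception:
            for undo in reversed(completed):
                undo()
            return False
        completed.append(compensate)
    return True


trace = []


def fail_shipping():
    raise RuntimeError("no courier available")


steps = [
    (lambda: trace.append("charge card"), lambda: trace.append("refund card")),
    (lambda: trace.append("reserve stock"), lambda: trace.append("release stock")),
    (fail_shipping, lambda: trace.append("cancel courier")),
]

ok = run_saga(steps)
```

The failed step's own compensation is not run, because that step never completed. Only the earlier steps are undone, newest first.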
+---
+## 4. Circuit Breaker
+**What it is:** A proxy that monitors calls to a service. If the failure rate exceeds a threshold, the circuit "opens" and calls fail fast instead of waiting for a timeout.
+**States:**
+```
+CLOSED  → calls pass through; monitor failure rate
+OPEN    → calls fail immediately; no calls to downstream
+HALF-OPEN → let a probe call through; if it succeeds, close; if it fails, re-open
+```
+**When to use:**
+- Calling any external service (payment gateway, SMS, email)
+- Microservices calling each other
+- Preventing timeout cascade when downstream is slow
+**Implementation tools:** Hystrix (in maintenance mode, no longer actively developed), Resilience4j, Polly (.NET), Envoy proxy
+**Thresholds (starting point):**
+- Open after 50% failure rate over 10 requests
+- Stay open for 30 seconds
+- Half-open: allow 1 probe request
+**Trade-offs:**
+- ✅ Prevents cascade failures
+- ✅ Gives downstream time to recover
+- ❌ Adds latency overhead for monitoring
+- ❌ Requires fallback behavior when circuit is open
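The three states and the starting thresholds above can be sketched as a small single-threaded class. This is an illustrative sketch, not the API of Resilience4j or any other library listed here; the `CircuitBreaker` and `CircuitOpenError` names and the injectable `clock` parameter are assumptions, and a production version would need locking.

```python
import time
from collections import deque


class CircuitOpenError(Exception):
    """Raised when the breaker rejects a call without trying it."""


class CircuitBreaker:
    def __init__(self, window=10, failure_ratio=0.5, open_seconds=30.0,
                 clock=time.monotonic):
        self.window = window
        self.failure_ratio = failure_ratio
        self.open_seconds = open_seconds
        self.clock = clock
        self.results = deque(maxlen=window)  # True marks a failed call
        self.state = "CLOSED"
        self.opened_at = 0.0

    def _trip(self):
        self.state = "OPEN"
        self.opened_at = self.clock()
        self.results.clear()

    def call(self, fn):
        if self.state == "OPEN":
            if self.clock() - self.opened_at < self.open_seconds:
                raise CircuitOpenError("failing fast")
            self.state = "HALF_OPEN"  # cool-down elapsed: allow one probe
        try:
            result = fn()
        except Exception:
            if self.state == "HALF_OPEN":
                self._trip()  # probe failed: re-open
            else:
                self.results.append(True)
                window_full = len(self.results) == self.window
                if window_full and sum(self.results) / self.window >= self.failure_ratio:
                    self._trip()
            raise
        if self.state == "HALF_OPEN":
            self.state = "CLOSED"  # probe succeeded: resume normal traffic
            self.results.clear()
        else:
            self.results.append(False)
        return result
```

Callers would typically catch `CircuitOpenError` and return a fallback response, which is the fallback requirement noted in the trade-offs.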
+---
+## 5. Bulkhead
+**What it is:** Isolate components so a failure in one doesn't consume resources of others. Named after the watertight compartments in ship hulls.
+**Types:**
+- **Thread Pool Bulkhead:** Separate thread pools per service call
+- **Semaphore Bulkhead:** Limit concurrent calls per service
+- **Process Bulkhead:** Separate processes/containers per service type
+**When to use:**
+- Multiple tenants sharing infrastructure (SaaS)
+- One slow service consuming all connection pool slots
+- Protecting critical services from being starved by non-critical ones
+**Example:**
+```
+Without bulkhead:
+  [Recommendation Service hangs] → fills shared thread pool → [Payment Service starves]
+With bulkhead:
+  [Recommendation Service hangs] → fills its own thread pool (10 threads) → [Payment Service unaffected, has its own 50 threads]
+```
+---
+## 6. Strangler Fig Pattern
+**What it is:** Incrementally replace a legacy monolith by routing new functionality to new microservices, while keeping the monolith alive for unchanged features.
+**Migration steps:**
+```
+Phase 1: Deploy proxy in front of monolith (no user impact)
+Phase 2: Route one feature to new microservice
+Phase 3: Verify; deprecate that feature in monolith
+Phase 4: Repeat for each feature
+Phase 5: Monolith is empty; decommission
+```
+**When to use:**
+- Migrating legacy monolith to microservices
+- Can't do a big-bang rewrite (too risky)
+- Need to ship new features during migration
+**Trade-offs:**
+- ✅ Zero downtime migration
+- ✅ Incremental risk
+- ❌ Dual maintenance burden during migration (monolith + new services)
+- ❌ Proxy adds latency; must be managed carefully
+---
+## 7. Outbox Pattern
+**What it is:** Solve the dual-write problem (write to DB AND publish to queue atomically) by writing the event to an "outbox" table in the same DB transaction, then having a separate process relay it to the queue.
+**Problem it solves:**
+```
+❌ WRONG (dual-write race):
+  BEGIN;
+  UPDATE orders SET status='paid';
+  COMMIT;
+  // Crash here → event never published, DB and queue are inconsistent
+  publish(PaymentProcessed);
+```
+```
+✅ CORRECT (outbox):
+  BEGIN;
+  UPDATE orders SET status='paid';
+  INSERT INTO outbox (event_type, payload) VALUES ('PaymentProcessed', {...});
+  COMMIT;
+  // Relay process reads outbox and publishes to Kafka
+  // At-least-once delivery guaranteed; make consumers idempotent
+```
+**Relay options:** Debezium (CDC), polling relay, transaction log tailing
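A polling relay can be sketched with SQLite standing in for the service database. The table layout, the `published` flag, and the `mark_paid` and `relay_once` helpers are assumptions made for illustration; `publish` is a placeholder for a real queue client.

```python
import json
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT NOT NULL);
    CREATE TABLE outbox (
        id INTEGER PRIMARY KEY AUTOINCREMENT,
        event_type TEXT NOT NULL,
        payload TEXT NOT NULL,
        published INTEGER NOT NULL DEFAULT 0
    );
    INSERT INTO orders (id, status) VALUES (1, 'pending');
""")


def mark_paid(order_id):
    # The state change and the event row commit in ONE transaction.
    with conn:
        conn.execute("UPDATE orders SET status = 'paid' WHERE id = ?", (order_id,))
        conn.execute(
            "INSERT INTO outbox (event_type, payload) VALUES (?, ?)",
            ("PaymentProcessed", json.dumps({"order_id": order_id})),
        )


def relay_once(publish):
    """Publish unpublished outbox rows in order; return how many were sent."""
    rows = conn.execute(
        "SELECT id, event_type, payload FROM outbox WHERE published = 0 ORDER BY id"
    ).fetchall()
    for row_id, event_type, payload in rows:
        # A crash between publish and the UPDATE below causes a re-send,
        # which is why delivery is at-least-once.
        publish(event_type, json.loads(payload))
        with conn:
            conn.execute("UPDATE outbox SET published = 1 WHERE id = ?", (row_id,))
    return len(rows)


sent = []
mark_paid(1)
relay_once(lambda kind, body: sent.append((kind, body)))
```

Marking a row as published only after `publish` returns trades possible duplicates for no lost events, so consumers still need to be idempotent.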
+---
+## 8. Consistent Hashing
+**What it is:** A hashing scheme where adding or removing a node requires remapping only about K/N keys on average (K = keys, N = nodes), instead of remapping nearly all keys as with `hash(key) mod N`.
+**When to use:**
+- Distributing cache keys across Redis cluster nodes
+- Routing requests to servers in a distributed system
+- Partitioning data across database nodes
+**Virtual nodes:** Assign multiple positions per physical node on the hash ring to ensure even distribution even with few nodes.
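A hash ring with virtual nodes can be sketched in a few lines. The `HashRing` class, the choice of MD5 as the hash, and the default of 100 virtual nodes per physical node are assumptions for illustration, not a recommendation for any particular system.

```python
import bisect
import hashlib


def _hash(key: str) -> int:
    """Map a string to a 64-bit position on the ring."""
    return int.from_bytes(hashlib.md5(key.encode()).digest()[:8], "big")


class HashRing:
    def __init__(self, nodes=(), vnodes=100):
        self.vnodes = vnodes
        self._ring = []  # sorted list of (position, node)
        for node in nodes:
            self.add(node)

    def add(self, node):
        # Each physical node gets `vnodes` positions on the ring.
        for i in range(self.vnodes):
            bisect.insort(self._ring, (_hash(f"{node}#{i}"), node))

    def remove(self, node):
        self._ring = [(pos, n) for pos, n in self._ring if n != node]

    def get(self, key):
        if not self._ring:
            raise LookupError("ring is empty")
        # First virtual node at or after the key's position, wrapping around.
        idx = bisect.bisect_left(self._ring, (_hash(key), ""))
        return self._ring[idx % len(self._ring)][1]


ring = HashRing(["cache-a", "cache-b", "cache-c"])
owner = ring.get("user:42")
```

When a node is added, the only keys that change owner are the ones that move to the new node. Keys never shuffle between the existing nodes.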
+---
+## 9. Backpressure
+**What it is:** A mechanism for consumers to signal producers to slow down when they can't keep up, preventing memory exhaustion and cascade failures.
+**Strategies:**
+- **Drop:** Discard overflow messages (acceptable for metrics, logs)
+- **Buffer:** Queue up to a limit, then block or drop
+- **Block:** Producer waits until consumer catches up (simplest, may cause timeout)
+- **Rate Limit:** Throttle producers at ingestion point
+**When to use:**
+- Message queue consumers are slower than producers
+- Real-time data pipeline ingestion spikes
+- API rate limiting for upstream clients
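The drop and block strategies can be sketched with a bounded queue from the standard library. The `offer` helper and its `strategy` parameter are hypothetical names introduced for illustration.

```python
import queue


def offer(buffer, item, strategy="drop", timeout=None):
    """Try to enqueue an item; return True if the buffer accepted it."""
    try:
        if strategy == "drop":
            buffer.put_nowait(item)            # full buffer: discard at once
        elif strategy == "block":
            buffer.put(item, timeout=timeout)  # full buffer: wait, then give up
        else:
            raise ValueError(f"unknown strategy: {strategy}")
        return True
    except queue.Full:
        return False


buffer = queue.Queue(maxsize=3)  # the bound is what creates backpressure
accepted = [offer(buffer, n) for n in range(5)]
```

With no consumer draining the queue, the first three offers succeed and the remaining two are rejected. The producer can use that signal to slow down or shed load.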
+---
+## 10. Leader Election
+**What it is:** In a distributed system, elect a single node to perform a privileged task (e.g., writing to DB, sending scheduled jobs, coordinating work).
+**Algorithms:**
+- **Raft:** Used by etcd, CockroachDB, Consul. Practical and well-understood.
+- **ZooKeeper (ZAB):** Used by HBase and by Kafka deployments that predate KRaft mode. Mature but operationally heavy.
+- **Bully Algorithm:** Simple; highest ID wins. Not fault-tolerant.
+**When to use:**
+- Scheduled jobs that should only run once (cron replacement)
+- Primary/replica database failover coordination
+- Distributed lock management
+**Tools:** etcd, ZooKeeper, Consul, Redis (Redlock — use with caution)
+---
+## 11. Two-Phase Commit (2PC)
+**What it is:** A distributed algorithm that ensures all participants in a transaction either all commit or all abort.
+**Phases:**
+```
+Phase 1 (Prepare): Coordinator asks all participants "can you commit?"
+  All say YES → proceed to Phase 2
+  Any says NO → abort
+Phase 2 (Commit): Coordinator tells all participants to commit
+```
+**When to use (sparingly):**
+- Strong consistency is an absolute requirement across services
+- Data loss is catastrophic (financial settlements)
+**Why to avoid:**
+- Coordinator is a SPOF
+- Blocks on participant failure
+- Very low throughput under contention
+- Prefer Saga Pattern in most microservice architectures
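The two phases can be sketched with in-memory participants. The `Participant` class and `two_phase_commit` function are hypothetical names for illustration. The sketch ignores timeouts, coordinator crashes, and durable logging, which are exactly the hard parts listed above.

```python
class Participant:
    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit
        self.state = "INIT"

    def prepare(self):
        """Phase 1: vote YES (and hold resources) or vote NO."""
        self.state = "PREPARED" if self.can_commit else "ABORTED"
        return self.can_commit

    def commit(self):
        self.state = "COMMITTED"

    def abort(self):
        self.state = "ABORTED"


def two_phase_commit(participants):
    """Coordinator: commit only if every participant votes YES."""
    votes = [p.prepare() for p in participants]
    if all(votes):
        for p in participants:
            p.commit()
        return True
    for p in participants:
        p.abort()
    return False


orders, payments = Participant("orders"), Participant("payments")
committed = two_phase_commit([orders, payments])
```

A participant that has voted YES must wait in `PREPARED` until the coordinator decides, which is where the blocking behaviour comes from.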
+---
+## 12. Read-Through / Write-Through / Write-Behind Cache
+**Read-Through:**
+```
+Client → Cache (miss) → Cache fetches from DB → Returns to client
+```
+Cache is always populated on miss. Simple for clients. Risk: cold start.
+**Write-Through:**
+```
+Client → Cache → Cache writes to DB synchronously → Confirms
+```
+Cache and DB stay consistent. Higher write latency. Good for read-heavy workloads that need consistency.
+**Write-Behind (Write-Back):**
+```
+Client → Cache → Confirms immediately → Async flush to DB
+```
+Very low write latency. Risk of data loss if cache fails before flush. Good for high-throughput counters, analytics.
+**Cache-Aside (Lazy Loading):**
+```
+Client → Cache (miss) → Client fetches from DB → Client writes to Cache
+```
+Most common. Application owns cache logic. Risk: thundering herd on cold start.
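Cache-aside can be sketched with a dictionary standing in for the cache. The `CacheAside` class, its `db_reads` counter, and the invalidate-on-write choice are assumptions for illustration; a real deployment would use Redis or Memcached and add TTLs.

```python
class CacheAside:
    """The application owns the cache logic: check cache, fall back to DB, fill cache."""

    def __init__(self, load_from_db):
        self._cache = {}
        self._load = load_from_db
        self.db_reads = 0  # counts cache misses that reached the DB

    def get(self, key):
        if key in self._cache:
            return self._cache[key]      # hit
        self.db_reads += 1
        value = self._load(key)          # miss: the client fetches from the DB
        self._cache[key] = value         # ...and writes the result to the cache
        return value

    def update(self, key, value, write_to_db):
        write_to_db(key, value)
        self._cache.pop(key, None)       # invalidate so the next read reloads


db = {"user:1": "Ada"}
cache = CacheAside(lambda key: db[key])
first = cache.get("user:1")    # miss, loads from db
second = cache.get("user:1")   # hit, no db read
```

Invalidating on write rather than updating the cache in place is one common way to avoid serving a stale value after a concurrent write.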
+## Limitations
+- This is a reference document and may not cover all edge cases. Always verify architectures before production.

package/bundled-skills/monopoly/scale-benchmarks/SKILL.md ADDED

@@ -0,0 +1,174 @@
+---
+name: scale-benchmarks
+description: Reference document for monopoly scale-benchmarks.
+risk: safe
+reports-to: monopoly
+---
+# MONOPOLY — Scale Benchmarks & Estimation Formulas
+## Quick Estimation Formulas
+### User → RPS Conversion
+```
+Requests per second (avg) = DAU × avg_requests_per_user_per_day / 86400
+Requests per second (peak) = avg_RPS × peak_multiplier
+Peak multipliers by app type:
+  Social media:      5–10×
+  E-commerce:        3–5× (higher during sales)
+  News / media:      10–20× (breaking news spike)
+  B2B SaaS:          2–3× (business hours spike)
+  Gaming:            5–15× (event-driven)
+```
+### Storage Estimation
+```
+Storage per day    = requests_per_day × avg_payload_size
+Storage per year   = storage_per_day × 365
+With replication   = storage_per_year × replication_factor (3× typical)
+With CDN/cache     = reduce by cache_hit_ratio (80% hit = 20% origin load)
+Common payload sizes:
+  Tweet / short text:    500B
+  Social post with text: 2KB
+  Profile data:          5KB
+  Image (compressed):    200KB–2MB
+  Video (per minute):    50MB (720p), 150MB (1080p)
+  API JSON response:     1–20KB
+```
+### Bandwidth Estimation
+```
+Inbound bandwidth  = avg_request_size × RPS
+Outbound bandwidth = avg_response_size × RPS
+Convert: 1 Gbps = 125 MB/s
+         10 Gbps = 1.25 GB/s
+```
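The three formulas can be combined in a short worked example. The inputs (1M DAU, 100 requests per user per day, a 5× peak multiplier, 500 B stored per request, 10 KB responses) are illustrative assumptions, and the units are decimal (1 KB = 1,000 bytes).

```python
SECONDS_PER_DAY = 86_400


def avg_rps(dau, requests_per_user_per_day):
    """Average requests per second over a full day."""
    return dau * requests_per_user_per_day / SECONDS_PER_DAY


def storage_per_year_bytes(requests_per_day, payload_bytes, replication_factor=3):
    """Yearly storage including replication."""
    return requests_per_day * payload_bytes * 365 * replication_factor


def bandwidth_gbps(payload_bytes, rps):
    """Bytes per second converted to gigabits per second."""
    return payload_bytes * rps * 8 / 1e9


dau, per_user = 1_000_000, 100
average = avg_rps(dau, per_user)                        # about 1,157 RPS
peak = average * 5                                      # about 5,787 RPS
storage = storage_per_year_bytes(dau * per_user, 500)   # 54.75 TB with 3x replication
outbound = bandwidth_gbps(10_000, peak)                 # about 0.46 Gbps at peak
```

These are order-of-magnitude estimates. Changing any single assumption, such as the payload size, scales the result linearly.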
+---
+## Known Scale Limits of Common Technologies
+### Databases
+| Technology | Single Node Writes | Reads (with replicas) | Recommended Shard/Cluster Trigger |
+|------------|-------------------|----------------------|----------------------------------|
+| PostgreSQL | ~5K–20K writes/s | ~50K–200K reads/s | >5TB data or >20K writes/s |
+| MySQL | ~10K–25K writes/s | ~60K–250K reads/s | >5TB or >25K writes/s |
+| MongoDB | ~20K–50K writes/s | ~50K–100K reads/s | >100GB or >50K writes/s |
+| Cassandra | ~200K–1M writes/s | ~200K–500K reads/s | Almost never needs explicit sharding |
+| DynamoDB | Unlimited (managed) | Unlimited (managed) | Use provisioned capacity mode |
+| Redis | ~500K–1M ops/s | Same | >50GB data or cluster needed |
+| Elasticsearch | ~10K–50K docs/s | ~1K–10K queries/s | >100M documents per index |
+### Queues / Streams
+| Technology | Max Throughput | Max Consumers | Retention |
+|------------|----------------|---------------|-----------|
+| Kafka | 1M+ msgs/s per cluster | Unlimited consumer groups | Configurable (days–forever) |
+| RabbitMQ | ~50K–100K msgs/s | Limited by connections | Until consumed |
+| SQS Standard | Unlimited (AWS-managed) | Unlimited | Up to 14 days (default 4 days) |
+| SQS FIFO | 3K msgs/s per queue with batching (300 API calls/s without) | Per group | Up to 14 days (default 4 days) |
+| Redis Pub/Sub | ~1M msgs/s | Limited by subscribers | None (fire-and-forget) |
+### Caching
+| Technology | Max Memory (single) | Max Throughput | Latency |
+|------------|--------------------|--------------|----|
+| Redis | ~1TB RAM | ~1M ops/s | <1ms |
+| Memcached | ~64GB RAM | ~1M ops/s | <1ms |
+| In-process (Caffeine/Guava) | JVM heap | Unlimited (local) | <0.1ms |
+---
+## Capacity Planning by User Scale
+### 1K DAU
+```
+Avg RPS:       ~1–5 RPS
+Peak RPS:      ~10–50 RPS
+DB size/year:  ~10–50GB
+Infra needed:  Single server, managed DB (RDS t3.medium), basic CDN
+Monthly cost:  $50–200
+```
+### 10K DAU
+```
+Avg RPS:       ~10–50 RPS
+Peak RPS:      ~100–500 RPS
+DB size/year:  ~100–500GB
+Infra needed:  2–4 app servers, RDS r5.large, Redis t3.medium, CDN
+Monthly cost:  $300–800
+```
+### 100K DAU
+```
+Avg RPS:       ~100–500 RPS
+Peak RPS:      ~1K–5K RPS
+DB size/year:  ~1–5TB
+Infra needed:  ASG (5–10 app servers), RDS r5.xlarge + 2 replicas, Redis cluster, CDN, ALB
+Monthly cost:  $2K–8K
+```
+### 1M DAU
+```
+Avg RPS:       ~1K–5K RPS
+Peak RPS:      ~10K–50K RPS
+DB size/year:  ~10–50TB
+Infra needed:  ASG (20–50 servers), DB sharding or Aurora, Redis cluster, Kafka, CDN, WAF
+Monthly cost:  $20K–80K
+```
+### 10M DAU
+```
+Avg RPS:       ~10K–50K RPS
+Peak RPS:      ~100K–500K RPS
+DB size/year:  ~100–500TB
+Infra needed:  Multi-region, microservices, distributed DB (Cassandra/CockroachDB), full CDN, dedicated SRE
+Monthly cost:  $200K–2M+
+```
+---
+## Common SLO Targets
+| Availability | Tier | Monthly Downtime Allowed |
+|------|-------------|--------------------------|
+| 99% | Basic | 7.3 hours/month |
+| 99.9% (three nines) | Standard production | 43.8 minutes/month |
+| 99.95% | Important services | 21.9 minutes/month |
+| 99.99% (four nines) | Critical services | 4.38 minutes/month |
+| 99.999% (five nines) | Telecom / payments | 26 seconds/month |
+**Achieving four nines requires:** Multi-AZ deployment, automated failover, zero-downtime deploys, chaos engineering, 24/7 on-call.
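The downtime column follows from a single formula. The sketch below assumes an average month of 365.25 / 12 days, which is the convention the 43.8-minute figure for 99.9% corresponds to; the function name is hypothetical.

```python
AVG_DAYS_PER_MONTH = 365.25 / 12  # about 30.44 days


def allowed_downtime_minutes(availability_percent, days=AVG_DAYS_PER_MONTH):
    """Minutes of downtime per month permitted by an availability target."""
    return (1 - availability_percent / 100) * days * 24 * 60


three_nines = allowed_downtime_minutes(99.9)    # about 43.8 minutes
four_nines = allowed_downtime_minutes(99.99)    # about 4.38 minutes
```

Passing `days=30` instead gives slightly smaller budgets, so it helps to state which month length an SLO uses.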
+---
+## Latency Budget Guidelines
+```
+User perceived latency targets:
+  < 100ms  → Feels instant
+  100–300ms → Acceptable for most interactions
+  300ms–1s → Noticeable; optimize if possible
+  > 1s     → Frustrating; unacceptable for critical paths
+Network latency by distance (approximate):
+  Same datacenter:    0.5ms
+  Same region (cross-AZ): 1–2ms
+  Cross-region US:    30–60ms
+  US to Europe:       80–120ms
+  US to Asia:         150–250ms
+Database query targets:
+  Simple key-value:   < 1ms (cache)
+  Simple DB query:    < 5ms
+  Complex query:      < 50ms
+  Reporting query:    < 500ms (async if > 1s)
+```
+## Limitations
+- This is a reference document and may not cover all edge cases. Always verify architectures before production.

package/bundled-skills/monopoly/security-checklist/SKILL.md ADDED

@@ -0,0 +1,69 @@
+---
+name: security-checklist
+description: Reference document for monopoly security-checklist.
+risk: safe
+reports-to: monopoly
+---
+# MONOPOLY — Security Hardening Checklist
+## Network Security
+- [ ] All services inside private VPC; only LB/API GW exposed publicly
+- [ ] Security groups follow least-privilege (deny all, allow specific ports/CIDRs)
+- [ ] NACLs as secondary defense layer
+- [ ] WAF enabled with OWASP top 10 ruleset
+- [ ] DDoS protection (Cloudflare / AWS Shield Standard minimum)
+- [ ] VPN or Private Link for inter-service communication in multi-region
+## Authentication & Authorization
+- [ ] JWT tokens with short expiry (15 min access, 7 day refresh)
+- [ ] OAuth 2.0 / OIDC for third-party auth
+- [ ] MFA enforced for admin accounts
+- [ ] RBAC or ABAC for authorization
+- [ ] No secrets in JWT payload (use opaque references)
+- [ ] Token revocation strategy (Redis blocklist or short TTL)
+## API Security
+- [ ] Rate limiting at API gateway (per user, per IP, per endpoint)
+- [ ] Input validation and sanitization on all endpoints
+- [ ] SQL injection prevention (parameterized queries, ORM)
+- [ ] XSS prevention (output encoding, CSP headers)
+- [ ] CSRF protection (SameSite cookies, CSRF tokens)
+- [ ] CORS policy locked down (not wildcard `*`)
+- [ ] HTTP security headers (HSTS, X-Frame-Options, X-Content-Type-Options)
+## Data Security
+- [ ] Encryption in transit (TLS 1.2+ everywhere, TLS 1.3 preferred)
+- [ ] Encryption at rest (AES-256 for DBs, S3 SSE)
+- [ ] PII data identified, minimized, and encrypted at field level where needed
+- [ ] Database backups encrypted
+- [ ] No sensitive data in logs (PII, passwords, tokens, card numbers)
+## Secrets Management
+- [ ] No secrets in code or environment variables in plain text
+- [ ] Secrets manager in use (HashiCorp Vault, AWS Secrets Manager, GCP Secret Manager)
+- [ ] Secrets rotation automated
+- [ ] IAM roles for service-to-service auth (not static credentials)
+## Supply Chain & Dependencies
+- [ ] Dependency scanning (Snyk, Dependabot, npm audit)
+- [ ] Container image scanning (Trivy, ECR scanning)
+- [ ] Pin dependency versions in production
+- [ ] SBOM (Software Bill of Materials) generated for compliance
+## Incident Response
+- [ ] Audit logs for all admin actions and data access
+- [ ] Alerting on anomalous access patterns
+- [ ] Incident response runbook documented
+- [ ] Data breach notification process defined (GDPR 72-hour rule)
+- [ ] Regular penetration testing scheduled
+## Compliance (as applicable)
+- [ ] GDPR: data residency, right to deletion, consent tracking
+- [ ] PCI-DSS: if handling card data — never store raw PANs
+- [ ] HIPAA: if health data — encryption, audit logs, BAA with vendors
+- [ ] SOC 2 Type II: access control, availability, confidentiality evidence
+## Limitations
+- This is a reference document and may not cover all edge cases. Always verify architectures before production.