bobo-ai-cli 3.0.3 → 3.0.5
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/LICENSE +21 -21
- package/README.md +259 -259
- package/bundled-skills/CORE_SKILLS.txt +18 -18
- package/bundled-skills/backend-expert/SKILL.md +97 -97
- package/bundled-skills/code-review/SKILL.md +280 -280
- package/bundled-skills/code-review-expert/SKILL.md +85 -85
- package/bundled-skills/context-budget-analyzer/SKILL.md +76 -76
- package/bundled-skills/context-compressor/SKILL.md +75 -75
- package/bundled-skills/context-optimization-suite/SKILL.md +162 -162
- package/bundled-skills/frontend-expert/SKILL.md +93 -93
- package/bundled-skills/github/SKILL.md +12 -12
- package/bundled-skills/high-agency/SKILL.md +473 -473
- package/bundled-skills/high-agency/references/builder-patterns.md +126 -126
- package/bundled-skills/high-agency/references/recovery-playbook.md +298 -298
- package/bundled-skills/memory-manager/SKILL.md +214 -214
- package/bundled-skills/memory-manager/references/advanced-config.md +65 -65
- package/bundled-skills/orchestrator/SKILL.md +681 -681
- package/bundled-skills/planning-with-files/SKILL.md +193 -193
- package/bundled-skills/skill-creator/SKILL.md +220 -220
- package/bundled-skills/testing-expert/SKILL.md +99 -99
- package/bundled-skills/verify/SKILL.md +15 -15
- package/dist/agent.d.ts +5 -0
- package/dist/agent.js +11 -1
- package/dist/agent.js.map +1 -1
- package/dist/agents/catalog.d.ts +47 -0
- package/dist/agents/catalog.js +63 -5
- package/dist/agents/catalog.js.map +1 -1
- package/dist/agents/router.d.ts +12 -1
- package/dist/agents/router.js +43 -3
- package/dist/agents/router.js.map +1 -1
- package/dist/agents/spawn.js +36 -18
- package/dist/agents/spawn.js.map +1 -1
- package/dist/autonomous.js +5 -5
- package/dist/cli.js +23 -21
- package/dist/cli.js.map +1 -1
- package/dist/compactor.js +39 -39
- package/dist/dream.js +29 -29
- package/dist/image-input.d.ts +44 -0
- package/dist/image-input.js +161 -0
- package/dist/image-input.js.map +1 -0
- package/dist/memory.js +13 -13
- package/dist/project.js +15 -15
- package/dist/repl.js +88 -0
- package/dist/repl.js.map +1 -1
- package/dist/skills.js +54 -54
- package/dist/sub-agents.js +65 -65
- package/dist/tools/browser.js +21 -21
- package/dist/tools/claude-code.js +10 -10
- package/dist/web.js +7 -7
- package/dist/wiki-commands.d.ts +2 -0
- package/dist/wiki-commands.js +249 -0
- package/dist/wiki-commands.js.map +1 -0
- package/dist/wiki.d.ts +90 -0
- package/dist/wiki.js +614 -0
- package/dist/wiki.js.map +1 -0
- package/knowledge/advanced-patterns.md +70 -70
- package/knowledge/agent-directives.md +74 -74
- package/knowledge/api-integration-patterns.md +102 -0
- package/knowledge/code-review-protocol.md +69 -0
- package/knowledge/dream.md +36 -36
- package/knowledge/engineering.md +52 -46
- package/knowledge/error-catalog.md +38 -33
- package/knowledge/event-driven-architecture.md +43 -0
- package/knowledge/external-alignment.md +47 -0
- package/knowledge/high-agency.md +73 -0
- package/knowledge/image-generation.md +48 -0
- package/knowledge/index.json +194 -169
- package/knowledge/llm-wiki-pattern.md +71 -0
- package/knowledge/long-task-management.md +79 -0
- package/knowledge/memory/cache-optimization-and-skill-integration.md +102 -102
- package/knowledge/memory/engineering-patterns.md +134 -134
- package/knowledge/memory/feedback_root_structure.md +15 -15
- package/knowledge/memory/project-contexts.md +69 -69
- package/knowledge/memory/tools-and-services.md +85 -85
- package/knowledge/memory-management.md +72 -0
- package/knowledge/rules/advisor-strategy.md +204 -0
- package/knowledge/rules/agents.md +62 -62
- package/knowledge/rules/blocking-rules.md +323 -323
- package/knowledge/rules/cache-management.md +379 -379
- package/knowledge/rules/capability-evolution.md +132 -132
- package/knowledge/rules/coding.md +126 -126
- package/knowledge/rules/engineering-workflows.md +225 -225
- package/knowledge/rules/evomap-content-guidelines.md +354 -354
- package/knowledge/rules/evomap-guide.md +224 -224
- package/knowledge/rules/external-alignment.md +22 -0
- package/knowledge/rules/git.md +31 -31
- package/knowledge/rules/hooks.md +106 -106
- package/knowledge/rules/performance.md +101 -101
- package/knowledge/rules/remotion-auto-production.md +1120 -1120
- package/knowledge/rules/security.md +46 -46
- package/knowledge/rules/testing.md +32 -32
- package/knowledge/rules/work-mode.md +208 -208
- package/knowledge/rules.md +62 -62
- package/knowledge/self-evolution.md +78 -0
- package/knowledge/self-rationalization-guard.md +52 -0
- package/knowledge/skills/Skill_Seekers.md +1722 -1722
- package/knowledge/skills/ab-test-setup.md +557 -557
- package/knowledge/skills/agent-sdk-dev.md +238 -238
- package/knowledge/skills/agent-tools.md +136 -136
- package/knowledge/skills/analytics-tracking.md +597 -597
- package/knowledge/skills/artifacts-builder.md +89 -89
- package/knowledge/skills/asana.md +12 -12
- package/knowledge/skills/backend-expert.md +97 -97
- package/knowledge/skills/brand-voice.md +481 -481
- package/knowledge/skills/browser-use.md +419 -419
- package/knowledge/skills/cache-optimization-skill.md +179 -179
- package/knowledge/skills/canvas-design.md +147 -147
- package/knowledge/skills/citation-validator.md +203 -203
- package/knowledge/skills/clangd-lsp.md +52 -52
- package/knowledge/skills/code-review-expert.md +85 -85
- package/knowledge/skills/code-review.md +280 -280
- package/knowledge/skills/code-simplifier.md +12 -12
- package/knowledge/skills/commit-commands.md +258 -258
- package/knowledge/skills/competitor-alternatives.md +795 -795
- package/knowledge/skills/content-atomizer.md +910 -910
- package/knowledge/skills/content-research-writer.md +605 -605
- package/knowledge/skills/context-optimization-suite.md +162 -162
- package/knowledge/skills/context7.md +12 -12
- package/knowledge/skills/copy-editing.md +494 -494
- package/knowledge/skills/copywriting.md +510 -510
- package/knowledge/skills/csharp-lsp.md +40 -40
- package/knowledge/skills/decision-making-framework.md +154 -154
- package/knowledge/skills/developer-growth-analysis.md +335 -335
- package/knowledge/skills/direct-response-copy.md +2336 -2336
- package/knowledge/skills/docker-expert.md +229 -229
- package/knowledge/skills/document-skills.md +12 -12
- package/knowledge/skills/documentation-expert.md +126 -126
- package/knowledge/skills/email-sequence.md +1061 -1061
- package/knowledge/skills/email-sequences.md +910 -910
- package/knowledge/skills/example-plugin.md +72 -72
- package/knowledge/skills/explanatory-output-style.md +82 -82
- package/knowledge/skills/feature-dev.md +458 -458
- package/knowledge/skills/file-organizer.md +466 -466
- package/knowledge/skills/firebase.disabled.md +12 -12
- package/knowledge/skills/form-cro.md +488 -488
- package/knowledge/skills/free-tool-strategy.md +636 -636
- package/knowledge/skills/frontend-design-offical.md +55 -55
- package/knowledge/skills/frontend-design.md +41 -41
- package/knowledge/skills/frontend-expert.md +93 -93
- package/knowledge/skills/github.md +12 -12
- package/knowledge/skills/gitlab.md +12 -12
- package/knowledge/skills/gopls-lsp.md +32 -32
- package/knowledge/skills/got-controller.md +218 -218
- package/knowledge/skills/greptile.md +72 -72
- package/knowledge/skills/hookify.md +376 -376
- package/knowledge/skills/image-editor.md +189 -189
- package/knowledge/skills/image-enhancer.md +109 -109
- package/knowledge/skills/jdtls-lsp.md +49 -49
- package/knowledge/skills/json-canvas.md +654 -654
- package/knowledge/skills/keyword-research.md +559 -559
- package/knowledge/skills/kotlin-lsp.md +28 -28
- package/knowledge/skills/laravel-boost.md +12 -12
- package/knowledge/skills/launch-strategy.md +394 -394
- package/knowledge/skills/lead-magnet.md +393 -393
- package/knowledge/skills/learning-output-style.md +106 -106
- package/knowledge/skills/linear.md +12 -12
- package/knowledge/skills/lua-lsp.md +47 -47
- package/knowledge/skills/marketing-ideas.md +720 -720
- package/knowledge/skills/marketing-psychology.md +534 -534
- package/knowledge/skills/mcp-builder.md +369 -369
- package/knowledge/skills/meeting-insights-analyzer.md +347 -347
- package/knowledge/skills/memory-evolution-system.md +172 -172
- package/knowledge/skills/multi-lens-thinking.md +407 -407
- package/knowledge/skills/nano-banana-pro.md +116 -116
- package/knowledge/skills/newsletter.md +736 -736
- package/knowledge/skills/notebooklm.md +296 -296
- package/knowledge/skills/obsidian-bases.md +634 -634
- package/knowledge/skills/obsidian-markdown.md +651 -651
- package/knowledge/skills/onboarding-cro.md +494 -494
- package/knowledge/skills/orchestrator.md +681 -681
- package/knowledge/skills/page-cro.md +379 -379
- package/knowledge/skills/paid-ads.md +624 -624
- package/knowledge/skills/paywall-upgrade-cro.md +651 -651
- package/knowledge/skills/php-lsp.md +36 -36
- package/knowledge/skills/planning-with-files.md +193 -193
- package/knowledge/skills/playwright.md +12 -12
- package/knowledge/skills/plugin-dev.md +434 -434
- package/knowledge/skills/popup-cro.md +520 -520
- package/knowledge/skills/positioning-angles.md +330 -330
- package/knowledge/skills/pr-review-toolkit.md +359 -359
- package/knowledge/skills/pricing-strategy.md +777 -777
- package/knowledge/skills/programmatic-seo.md +714 -714
- package/knowledge/skills/pyright-lsp.md +43 -43
- package/knowledge/skills/quality-assurance-framework.md +168 -168
- package/knowledge/skills/question-refiner.md +160 -160
- package/knowledge/skills/ralph-loop.md +205 -205
- package/knowledge/skills/refactoring-expert.md +103 -103
- package/knowledge/skills/referral-program.md +668 -668
- package/knowledge/skills/research-executor.md +164 -164
- package/knowledge/skills/review-with-security.md +12 -12
- package/knowledge/skills/rust-analyzer-lsp.md +50 -50
- package/knowledge/skills/schema-markup.md +647 -647
- package/knowledge/skills/security-audit-expert.md +124 -124
- package/knowledge/skills/security-expert.md +140 -140
- package/knowledge/skills/security-guidance.md +12 -12
- package/knowledge/skills/seedance-prompt.md +139 -139
- package/knowledge/skills/self-evolution.md +1160 -1160
- package/knowledge/skills/seo-audit.md +432 -432
- package/knowledge/skills/seo-content.md +787 -787
- package/knowledge/skills/serena.md +12 -12
- package/knowledge/skills/signup-flow-cro.md +409 -409
- package/knowledge/skills/skill-creator.md +220 -220
- package/knowledge/skills/skill-manager.md +226 -226
- package/knowledge/skills/skill-share.md +98 -98
- package/knowledge/skills/slack.md +12 -12
- package/knowledge/skills/social-content.md +878 -878
- package/knowledge/skills/spec-flow-skill.md +124 -124
- package/knowledge/skills/stripe.md +12 -12
- package/knowledge/skills/supabase.md +12 -12
- package/knowledge/skills/swift-lsp.md +40 -40
- package/knowledge/skills/synthesizer.md +236 -236
- package/knowledge/skills/template-skill.md +16 -16
- package/knowledge/skills/testing-expert.md +99 -99
- package/knowledge/skills/theme-factory.md +72 -72
- package/knowledge/skills/tiktok-research.md +208 -208
- package/knowledge/skills/typescript-lsp.md +36 -36
- package/knowledge/skills/ui-ux-pro-max.md +247 -247
- package/knowledge/skills/verify.md +15 -15
- package/knowledge/skills/visual-prompt-engineer.md +102 -102
- package/knowledge/skills/webapp-testing.md +111 -111
- package/knowledge/skills/wide-research.md +191 -191
- package/knowledge/system.md +93 -93
- package/knowledge/task-router.md +46 -37
- package/knowledge/verification.md +38 -38
- package/knowledge/worker-prompt-craft.md +66 -0
- package/knowledge/workflows/3d-viz.md +47 -47
- package/knowledge/workflows/data-pipeline.md +47 -47
- package/knowledge/workflows/db-migration.md +51 -51
- package/knowledge/workflows/feature-dev.md +41 -41
- package/knowledge/workflows/tdd-flow.md +52 -52
- package/knowledge/workflows/ui-verify.md +51 -51
- package/package.json +74 -74
- package/dist/claude-bridge.d.ts +0 -18
- package/dist/claude-bridge.js +0 -91
- package/dist/claude-bridge.js.map +0 -1
- package/dist/tools/claude-bridge-tool.d.ts +0 -4
- package/dist/tools/claude-bridge-tool.js +0 -44
- package/dist/tools/claude-bridge-tool.js.map +0 -1
|
@@ -1,379 +1,379 @@
|
|
|
1
|
-
---
|
|
2
|
-
id: "cache-management"
|
|
3
|
-
title: "Cache Management & Token Optimization"
|
|
4
|
-
category: "domain"
|
|
5
|
-
tags: ["🔥 token 燃烧根本原因", "✅ 解决方案矩阵", "🎯 优先级规则", "📊 监控和验证", "每个会话开始时运行", "输出示例", "周期性审计", "🔧 集成到工作流", "1. 检查 cc-cache-fix 是否安装", "2. 运行缓存预热"]
|
|
6
|
-
source: "E:/Bobo's Coding cache/.claude/rules/domain/cache-management.md"
|
|
7
|
-
---
|
|
8
|
-
|
|
9
|
-
# Cache Management & Token Optimization
|
|
10
|
-
|
|
11
|
-
> **版本**: v1.0 (2026-04-01)
|
|
12
|
-
> **来源**: cc-cache-fix 集成 + Claude Code 缓存分析
|
|
13
|
-
> **目标**: 减少 60-70% token 燃烧,提升缓存命中率
|
|
14
|
-
|
|
15
|
-
---
|
|
16
|
-
|
|
17
|
-
## 🔥 Token 燃烧根本原因
|
|
18
|
-
|
|
19
|
-
### 问题1: Prompt Cache 利用率极低
|
|
20
|
-
- **症状**: `cacheCreationInputTokens` = 0, `cacheReadInputTokens` << total input
|
|
21
|
-
- **原因**: Delta 附件丢失、Hash 不稳定、TTL 过短
|
|
22
|
-
- **影响**: 每次对话重新计算,浪费 70% tokens
|
|
23
|
-
|
|
24
|
-
### 问题2: 会话恢复缓存丢失
|
|
25
|
-
- **症状**: 恢复会话后,缓存完全失效
|
|
26
|
-
- **原因**: `deferred_tools_delta` 和 `mcp_instructions_delta` 未持久化
|
|
27
|
-
- **影响**: 长期项目中每次恢复都是冷启动
|
|
28
|
-
|
|
29
|
-
### 问题3: Hash 不稳定
|
|
30
|
-
- **症状**: 相同内容不同 turn 的缓存键不同
|
|
31
|
-
- **原因**: 注入的元数据(时间戳、ID)影响 hash
|
|
32
|
-
- **影响**: 缓存失效率 40-50%
|
|
33
|
-
|
|
34
|
-
### 问题4: TTL 过短
|
|
35
|
-
- **症状**: 5分钟后缓存过期,需要重新计算
|
|
36
|
-
- **原因**: Claude Code 默认 TTL 设置
|
|
37
|
-
- **影响**: 中等长度会话(>5分钟)缓存无效
|
|
38
|
-
|
|
39
|
-
---
|
|
40
|
-
|
|
41
|
-
## ✅ 解决方案矩阵
|
|
42
|
-
|
|
43
|
-
### 方案A: 使用 cc-cache-fix(快速修复)
|
|
44
|
-
|
|
45
|
-
**安装**:
|
|
46
|
-
```bash
|
|
47
|
-
git clone https://github.com/Rangizingo/cc-cache-fix.git
|
|
48
|
-
cd cc-cache-fix
|
|
49
|
-
./install.sh
|
|
50
|
-
```
|
|
51
|
-
|
|
52
|
-
**使用**:
|
|
53
|
-
```bash
|
|
54
|
-
claude-patched # 替代 claude 命令
|
|
55
|
-
```
|
|
56
|
-
|
|
57
|
-
**效果**:
|
|
58
|
-
- ✅ 补丁1: 会话恢复缓存保留 → +30% 缓存命中
|
|
59
|
-
- ✅ 补丁2: Hash 稳定性 → +20% 缓存命中
|
|
60
|
-
- ✅ 补丁3: TTL 1小时 → +15% 缓存命中
|
|
61
|
-
- **总计**: ~60% token 节省(相对基准)
|
|
62
|
-
|
|
63
|
-
**验证**:
|
|
64
|
-
```bash
|
|
65
|
-
python test_cache.py # 检查缓存健康度
|
|
66
|
-
python usage_audit.py # 审计读取效率
|
|
67
|
-
```
|
|
68
|
-
|
|
69
|
-
---
|
|
70
|
-
|
|
71
|
-
### 方案B: Context 压缩(根本优化)
|
|
72
|
-
|
|
73
|
-
**触发条件**: 任何会话开始时
|
|
74
|
-
|
|
75
|
-
**执行步骤**:
|
|
76
|
-
|
|
77
|
-
1. **加载必需文档**(按需加载,不全量)
|
|
78
|
-
```
|
|
79
|
-
Layer 0: CLAUDE.md (5KB) + 核心规则 (15KB)
|
|
80
|
-
Layer 1: Task Router (3KB)
|
|
81
|
-
Layer 2: 相关能力文档 (15-30KB)
|
|
82
|
-
Layer 3: 具体案例 (按需)
|
|
83
|
-
```
|
|
84
|
-
|
|
85
|
-
2. **压缩 CLAUDE.md**
|
|
86
|
-
- 删除重复内容
|
|
87
|
-
- 提取关键规则到 `rules/domain/`
|
|
88
|
-
- 保留索引,不保留详细内容
|
|
89
|
-
|
|
90
|
-
3. **分离项目级规则**
|
|
91
|
-
- 全局规则: `~/.claude/rules/`
|
|
92
|
-
- 项目规则: `.claude/rules/`
|
|
93
|
-
- 不混合加载
|
|
94
|
-
|
|
95
|
-
4. **使用 Context Manager**
|
|
96
|
-
- 自动识别任务类型
|
|
97
|
-
- 按需加载相关文档
|
|
98
|
-
- 保持 context 清洁
|
|
99
|
-
|
|
100
|
-
**效果**: 减少 40-50% 初始 context 大小
|
|
101
|
-
|
|
102
|
-
---
|
|
103
|
-
|
|
104
|
-
### 方案C: 会话分割(长任务优化)
|
|
105
|
-
|
|
106
|
-
**触发条件**: 任务预计 >1小时
|
|
107
|
-
|
|
108
|
-
**执行步骤**:
|
|
109
|
-
|
|
110
|
-
1. **分割策略**
|
|
111
|
-
```
|
|
112
|
-
长任务 (>1小时)
|
|
113
|
-
↓
|
|
114
|
-
分成 3-5 个短会话 (15-20分钟)
|
|
115
|
-
↓
|
|
116
|
-
每个会话独立缓存
|
|
117
|
-
↓
|
|
118
|
-
总 token 节省 30-40%
|
|
119
|
-
```
|
|
120
|
-
|
|
121
|
-
2. **会话间数据传递**
|
|
122
|
-
- 使用 `task_plan.md` 保存进度
|
|
123
|
-
- 使用 `notes.md` 保存发现
|
|
124
|
-
- 下一个会话加载这两个文件
|
|
125
|
-
|
|
126
|
-
3. **缓存预热**
|
|
127
|
-
- 会话开始时加载前一个会话的关键上下文
|
|
128
|
-
- 避免重复计算
|
|
129
|
-
|
|
130
|
-
**效果**: 长任务 token 节省 30-40%
|
|
131
|
-
|
|
132
|
-
---
|
|
133
|
-
|
|
134
|
-
### 方案D: 缓存预热(会话开始优化)
|
|
135
|
-
|
|
136
|
-
**执行时机**: 每个会话开始
|
|
137
|
-
|
|
138
|
-
**步骤**:
|
|
139
|
-
|
|
140
|
-
1. **加载常用上下文**
|
|
141
|
-
```typescript
|
|
142
|
-
// 会话开始时自动加载
|
|
143
|
-
const warmupContext = [
|
|
144
|
-
'CLAUDE.md', // 核心规则
|
|
145
|
-
'rules/core/', // 核心规则
|
|
146
|
-
'memory/MEMORY.md', // 持久化记忆
|
|
147
|
-
'task_plan.md', // 当前任务计划
|
|
148
|
-
];
|
|
149
|
-
```
|
|
150
|
-
|
|
151
|
-
2. **预计算常见操作**
|
|
152
|
-
- 加载项目结构
|
|
153
|
-
- 初始化工具链
|
|
154
|
-
- 预加载常用代码片段
|
|
155
|
-
|
|
156
|
-
3. **缓存验证**
|
|
157
|
-
```bash
|
|
158
|
-
# 会话开始时运行
|
|
159
|
-
python usage_audit.py --check-warmup
|
|
160
|
-
```
|
|
161
|
-
|
|
162
|
-
**效果**: 会话启动时间 -50%, 首次操作 token -30%
|
|
163
|
-
|
|
164
|
-
---
|
|
165
|
-
|
|
166
|
-
## 🎯 优先级规则
|
|
167
|
-
|
|
168
|
-
### 立即应用(所有会话)
|
|
169
|
-
|
|
170
|
-
```
|
|
171
|
-
1. 使用 claude-patched(cc-cache-fix)
|
|
172
|
-
↓
|
|
173
|
-
2. 按需加载 context(不全量)
|
|
174
|
-
↓
|
|
175
|
-
3. 会话开始时缓存预热
|
|
176
|
-
↓
|
|
177
|
-
4. 长任务自动分割
|
|
178
|
-
```
|
|
179
|
-
|
|
180
|
-
### 按任务类型应用
|
|
181
|
-
|
|
182
|
-
| 任务类型 | 推荐方案 | 预期节省 |
|
|
183
|
-
|---------|---------|---------|
|
|
184
|
-
| 简单任务 (<15分钟) | A + B | 40-50% |
|
|
185
|
-
| 中等任务 (15-60分钟) | A + B + D | 50-60% |
|
|
186
|
-
| 长任务 (>1小时) | A + B + C + D | 60-70% |
|
|
187
|
-
| 复杂多文件 | A + B + D + 分割 | 60-70% |
|
|
188
|
-
|
|
189
|
-
---
|
|
190
|
-
|
|
191
|
-
## 📊 监控和验证
|
|
192
|
-
|
|
193
|
-
### 缓存健康检查
|
|
194
|
-
|
|
195
|
-
```bash
|
|
196
|
-
# 每个会话开始时运行
|
|
197
|
-
python cc-cache-fix/test_cache.py
|
|
198
|
-
|
|
199
|
-
# 输出示例
|
|
200
|
-
✅ Attachment persistence: PASS
|
|
201
|
-
✅ Hash stability: PASS
|
|
202
|
-
✅ TTL extension: PASS (1h)
|
|
203
|
-
📊 Cache hit rate: 78%
|
|
204
|
-
💾 Token saved: 2,340 / 3,200 (73%)
|
|
205
|
-
```
|
|
206
|
-
|
|
207
|
-
### 使用审计
|
|
208
|
-
|
|
209
|
-
```bash
|
|
210
|
-
# 周期性审计
|
|
211
|
-
python cc-cache-fix/usage_audit.py
|
|
212
|
-
|
|
213
|
-
# 输出示例
|
|
214
|
-
Session: 2026-04-01 10:00
|
|
215
|
-
├─ Total input tokens: 3,200
|
|
216
|
-
├─ Cache read tokens: 2,340 (73%)
|
|
217
|
-
├─ Cache creation tokens: 860 (27%)
|
|
218
|
-
├─ Efficiency: 73% ✅
|
|
219
|
-
└─ Recommendation: Maintain current strategy
|
|
220
|
-
```
|
|
221
|
-
|
|
222
|
-
### 告警阈值
|
|
223
|
-
|
|
224
|
-
| 指标 | 正常 | 警告 | 严重 |
|
|
225
|
-
|------|------|------|------|
|
|
226
|
-
| Cache hit rate | >70% | 50-70% | <50% |
|
|
227
|
-
| Token efficiency | >65% | 45-65% | <45% |
|
|
228
|
-
| TTL utilization | >80% | 60-80% | <60% |
|
|
229
|
-
|
|
230
|
-
---
|
|
231
|
-
|
|
232
|
-
## 🔧 集成到工作流
|
|
233
|
-
|
|
234
|
-
### 会话开始 Hook
|
|
235
|
-
|
|
236
|
-
```bash
|
|
237
|
-
# ~/.claude/hooks/session-start-cache-warmup.sh
|
|
238
|
-
|
|
239
|
-
#!/bin/bash
|
|
240
|
-
|
|
241
|
-
# 1. 检查 cc-cache-fix 是否安装
|
|
242
|
-
if ! command -v claude-patched &> /dev/null; then
|
|
243
|
-
echo "⚠️ cc-cache-fix not installed. Install with:"
|
|
244
|
-
echo " git clone https://github.com/Rangizingo/cc-cache-fix.git && cd cc-cache-fix && ./install.sh"
|
|
245
|
-
fi
|
|
246
|
-
|
|
247
|
-
# 2. 运行缓存预热
|
|
248
|
-
python cc-cache-fix/test_cache.py --warmup
|
|
249
|
-
|
|
250
|
-
# 3. 加载持久化记忆
|
|
251
|
-
if [ -f "~/.claude/projects/e--bobo-s-coding-cache/memory/MEMORY.md" ]; then
|
|
252
|
-
echo "✅ Memory loaded"
|
|
253
|
-
fi
|
|
254
|
-
|
|
255
|
-
# 4. 检查 context 大小
|
|
256
|
-
CONTEXT_SIZE=$(wc -c < "CLAUDE.md")
|
|
257
|
-
if [ $CONTEXT_SIZE -gt 50000 ]; then
|
|
258
|
-
echo "⚠️ CLAUDE.md too large ($CONTEXT_SIZE bytes). Consider splitting."
|
|
259
|
-
fi
|
|
260
|
-
```
|
|
261
|
-
|
|
262
|
-
### 任务执行 Hook
|
|
263
|
-
|
|
264
|
-
```bash
|
|
265
|
-
# ~/.claude/hooks/task-execution-cache-monitor.sh
|
|
266
|
-
|
|
267
|
-
#!/bin/bash
|
|
268
|
-
|
|
269
|
-
# 监控任务执行期间的缓存效率
|
|
270
|
-
python cc-cache-fix/usage_audit.py --monitor --interval 5m
|
|
271
|
-
```
|
|
272
|
-
|
|
273
|
-
---
|
|
274
|
-
|
|
275
|
-
## 📝 最佳实践清单
|
|
276
|
-
|
|
277
|
-
### 每个会话
|
|
278
|
-
|
|
279
|
-
- [ ] 使用 `claude-patched` 而非 `claude`
|
|
280
|
-
- [ ] 会话开始时运行 `test_cache.py --warmup`
|
|
281
|
-
- [ ] 加载 `memory/MEMORY.md`
|
|
282
|
-
- [ ] 检查 CLAUDE.md 大小 (<50KB)
|
|
283
|
-
|
|
284
|
-
### 每个任务
|
|
285
|
-
|
|
286
|
-
- [ ] 预估任务时长
|
|
287
|
-
- [ ] 如果 >1小时,分割成多个会话
|
|
288
|
-
- [ ] 使用 `task_plan.md` 保存进度
|
|
289
|
-
- [ ] 使用 `notes.md` 保存发现
|
|
290
|
-
|
|
291
|
-
### 每周
|
|
292
|
-
|
|
293
|
-
- [ ] 运行 `usage_audit.py` 审计
|
|
294
|
-
- [ ] 检查缓存命中率趋势
|
|
295
|
-
- [ ] 优化低效会话
|
|
296
|
-
- [ ] 更新 CLAUDE.md(删除过期内容)
|
|
297
|
-
|
|
298
|
-
### 每月
|
|
299
|
-
|
|
300
|
-
- [ ] 审视 context 结构
|
|
301
|
-
- [ ] 合并重复规则
|
|
302
|
-
- [ ] 更新技能索引
|
|
303
|
-
- [ ] 评估 token 节省效果
|
|
304
|
-
|
|
305
|
-
---
|
|
306
|
-
|
|
307
|
-
## 🚀 快速开始
|
|
308
|
-
|
|
309
|
-
### 1分钟快速设置
|
|
310
|
-
|
|
311
|
-
```bash
|
|
312
|
-
# 1. 安装 cc-cache-fix
|
|
313
|
-
git clone https://github.com/Rangizingo/cc-cache-fix.git
|
|
314
|
-
cd cc-cache-fix
|
|
315
|
-
./install.sh
|
|
316
|
-
|
|
317
|
-
# 2. 验证安装
|
|
318
|
-
claude-patched --version
|
|
319
|
-
|
|
320
|
-
# 3. 测试缓存
|
|
321
|
-
python test_cache.py
|
|
322
|
-
|
|
323
|
-
# 4. 从现在开始使用 claude-patched
|
|
324
|
-
alias claude=claude-patched
|
|
325
|
-
```
|
|
326
|
-
|
|
327
|
-
### 验证效果
|
|
328
|
-
|
|
329
|
-
```bash
|
|
330
|
-
# 运行两个相同的会话,对比 token 使用
|
|
331
|
-
|
|
332
|
-
# 会话1(冷启动)
|
|
333
|
-
claude-patched
|
|
334
|
-
# 输入相同的任务
|
|
335
|
-
# 记录 input tokens
|
|
336
|
-
|
|
337
|
-
# 会话2(热启动)
|
|
338
|
-
claude-patched
|
|
339
|
-
# 输入相同的任务
|
|
340
|
-
# 记录 input tokens
|
|
341
|
-
|
|
342
|
-
# 对比:会话2 应该节省 40-60% tokens
|
|
343
|
-
```
|
|
344
|
-
|
|
345
|
-
---
|
|
346
|
-
|
|
347
|
-
## ⚠️ 已知限制
|
|
348
|
-
|
|
349
|
-
### cc-cache-fix 的限制
|
|
350
|
-
|
|
351
|
-
- ✅ 解决 Delta 附件丢失
|
|
352
|
-
- ✅ 解决 Hash 不稳定
|
|
353
|
-
- ✅ 解决 TTL 过短
|
|
354
|
-
- ❌ 不能解决 Prompt Cache 设计缺陷
|
|
355
|
-
- ❌ 不能解决 MCP 指令重复加载
|
|
356
|
-
- ❌ 不能解决跨会话缓存共享
|
|
357
|
-
|
|
358
|
-
### 需要手动优化的
|
|
359
|
-
|
|
360
|
-
- Context 压缩(按需加载)
|
|
361
|
-
- 会话分割(长任务)
|
|
362
|
-
- 缓存预热(会话开始)
|
|
363
|
-
- 规则去重(定期维护)
|
|
364
|
-
|
|
365
|
-
---
|
|
366
|
-
|
|
367
|
-
## 📚 相关文档
|
|
368
|
-
|
|
369
|
-
- `performance.md` - 模型选择和性能优化
|
|
370
|
-
- `engineering-workflows.md` - 工程化工作流
|
|
371
|
-
- `context-budget-analyzer` - Context 预算分析工具
|
|
372
|
-
- `context-compressor` - Context 压缩工具
|
|
373
|
-
|
|
374
|
-
---
|
|
375
|
-
|
|
376
|
-
**版本**: v1.0
|
|
377
|
-
**创建**: 2026-04-01
|
|
378
|
-
**状态**: Active
|
|
379
|
-
**下一步**: 集成到 performance.md 和 engineering-workflows.md
|
|
1
|
+
---
|
|
2
|
+
id: "cache-management"
|
|
3
|
+
title: "Cache Management & Token Optimization"
|
|
4
|
+
category: "domain"
|
|
5
|
+
tags: ["🔥 token 燃烧根本原因", "✅ 解决方案矩阵", "🎯 优先级规则", "📊 监控和验证", "每个会话开始时运行", "输出示例", "周期性审计", "🔧 集成到工作流", "1. 检查 cc-cache-fix 是否安装", "2. 运行缓存预热"]
|
|
6
|
+
source: "E:/Bobo's Coding cache/.claude/rules/domain/cache-management.md"
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
# Cache Management & Token Optimization
|
|
10
|
+
|
|
11
|
+
> **版本**: v1.0 (2026-04-01)
|
|
12
|
+
> **来源**: cc-cache-fix 集成 + Claude Code 缓存分析
|
|
13
|
+
> **目标**: 减少 60-70% token 燃烧,提升缓存命中率
|
|
14
|
+
|
|
15
|
+
---
|
|
16
|
+
|
|
17
|
+
## 🔥 Token 燃烧根本原因
|
|
18
|
+
|
|
19
|
+
### 问题1: Prompt Cache 利用率极低
|
|
20
|
+
- **症状**: `cacheCreationInputTokens` = 0, `cacheReadInputTokens` << total input
|
|
21
|
+
- **原因**: Delta 附件丢失、Hash 不稳定、TTL 过短
|
|
22
|
+
- **影响**: 每次对话重新计算,浪费 70% tokens
|
|
23
|
+
|
|
24
|
+
### 问题2: 会话恢复缓存丢失
|
|
25
|
+
- **症状**: 恢复会话后,缓存完全失效
|
|
26
|
+
- **原因**: `deferred_tools_delta` 和 `mcp_instructions_delta` 未持久化
|
|
27
|
+
- **影响**: 长期项目中每次恢复都是冷启动
|
|
28
|
+
|
|
29
|
+
### 问题3: Hash 不稳定
|
|
30
|
+
- **症状**: 相同内容不同 turn 的缓存键不同
|
|
31
|
+
- **原因**: 注入的元数据(时间戳、ID)影响 hash
|
|
32
|
+
- **影响**: 缓存失效率 40-50%
|
|
33
|
+
|
|
34
|
+
### 问题4: TTL 过短
|
|
35
|
+
- **症状**: 5分钟后缓存过期,需要重新计算
|
|
36
|
+
- **原因**: Claude Code 默认 TTL 设置
|
|
37
|
+
- **影响**: 中等长度会话(>5分钟)缓存无效
|
|
38
|
+
|
|
39
|
+
---
|
|
40
|
+
|
|
41
|
+
## ✅ 解决方案矩阵
|
|
42
|
+
|
|
43
|
+
### 方案A: 使用 cc-cache-fix(快速修复)
|
|
44
|
+
|
|
45
|
+
**安装**:
|
|
46
|
+
```bash
|
|
47
|
+
git clone https://github.com/Rangizingo/cc-cache-fix.git
|
|
48
|
+
cd cc-cache-fix
|
|
49
|
+
./install.sh
|
|
50
|
+
```
|
|
51
|
+
|
|
52
|
+
**使用**:
|
|
53
|
+
```bash
|
|
54
|
+
claude-patched # 替代 claude 命令
|
|
55
|
+
```
|
|
56
|
+
|
|
57
|
+
**效果**:
|
|
58
|
+
- ✅ 补丁1: 会话恢复缓存保留 → +30% 缓存命中
|
|
59
|
+
- ✅ 补丁2: Hash 稳定性 → +20% 缓存命中
|
|
60
|
+
- ✅ 补丁3: TTL 1小时 → +15% 缓存命中
|
|
61
|
+
- **总计**: ~60% token 节省(相对基准)
|
|
62
|
+
|
|
63
|
+
**验证**:
|
|
64
|
+
```bash
|
|
65
|
+
python test_cache.py # 检查缓存健康度
|
|
66
|
+
python usage_audit.py # 审计读取效率
|
|
67
|
+
```
|
|
68
|
+
|
|
69
|
+
---
|
|
70
|
+
|
|
71
|
+
### 方案B: Context 压缩(根本优化)
|
|
72
|
+
|
|
73
|
+
**触发条件**: 任何会话开始时
|
|
74
|
+
|
|
75
|
+
**执行步骤**:
|
|
76
|
+
|
|
77
|
+
1. **加载必需文档**(按需加载,不全量)
|
|
78
|
+
```
|
|
79
|
+
Layer 0: CLAUDE.md (5KB) + 核心规则 (15KB)
|
|
80
|
+
Layer 1: Task Router (3KB)
|
|
81
|
+
Layer 2: 相关能力文档 (15-30KB)
|
|
82
|
+
Layer 3: 具体案例 (按需)
|
|
83
|
+
```
|
|
84
|
+
|
|
85
|
+
2. **压缩 CLAUDE.md**
|
|
86
|
+
- 删除重复内容
|
|
87
|
+
- 提取关键规则到 `rules/domain/`
|
|
88
|
+
- 保留索引,不保留详细内容
|
|
89
|
+
|
|
90
|
+
3. **分离项目级规则**
|
|
91
|
+
- 全局规则: `~/.claude/rules/`
|
|
92
|
+
- 项目规则: `.claude/rules/`
|
|
93
|
+
- 不混合加载
|
|
94
|
+
|
|
95
|
+
4. **使用 Context Manager**
|
|
96
|
+
- 自动识别任务类型
|
|
97
|
+
- 按需加载相关文档
|
|
98
|
+
- 保持 context 清洁
|
|
99
|
+
|
|
100
|
+
**效果**: 减少 40-50% 初始 context 大小
|
|
101
|
+
|
|
102
|
+
---
|
|
103
|
+
|
|
104
|
+
### 方案C: 会话分割(长任务优化)
|
|
105
|
+
|
|
106
|
+
**触发条件**: 任务预计 >1小时
|
|
107
|
+
|
|
108
|
+
**执行步骤**:
|
|
109
|
+
|
|
110
|
+
1. **分割策略**
|
|
111
|
+
```
|
|
112
|
+
长任务 (>1小时)
|
|
113
|
+
↓
|
|
114
|
+
分成 3-5 个短会话 (15-20分钟)
|
|
115
|
+
↓
|
|
116
|
+
每个会话独立缓存
|
|
117
|
+
↓
|
|
118
|
+
总 token 节省 30-40%
|
|
119
|
+
```
|
|
120
|
+
|
|
121
|
+
2. **会话间数据传递**
|
|
122
|
+
- 使用 `task_plan.md` 保存进度
|
|
123
|
+
- 使用 `notes.md` 保存发现
|
|
124
|
+
- 下一个会话加载这两个文件
|
|
125
|
+
|
|
126
|
+
3. **缓存预热**
|
|
127
|
+
- 会话开始时加载前一个会话的关键上下文
|
|
128
|
+
- 避免重复计算
|
|
129
|
+
|
|
130
|
+
**效果**: 长任务 token 节省 30-40%
|
|
131
|
+
|
|
132
|
+
---
|
|
133
|
+
|
|
134
|
+
### 方案D: 缓存预热(会话开始优化)
|
|
135
|
+
|
|
136
|
+
**执行时机**: 每个会话开始
|
|
137
|
+
|
|
138
|
+
**步骤**:
|
|
139
|
+
|
|
140
|
+
1. **加载常用上下文**
|
|
141
|
+
```typescript
|
|
142
|
+
// 会话开始时自动加载
|
|
143
|
+
const warmupContext = [
|
|
144
|
+
'CLAUDE.md', // 核心规则
|
|
145
|
+
'rules/core/', // 核心规则
|
|
146
|
+
'memory/MEMORY.md', // 持久化记忆
|
|
147
|
+
'task_plan.md', // 当前任务计划
|
|
148
|
+
];
|
|
149
|
+
```
|
|
150
|
+
|
|
151
|
+
2. **预计算常见操作**
|
|
152
|
+
- 加载项目结构
|
|
153
|
+
- 初始化工具链
|
|
154
|
+
- 预加载常用代码片段
|
|
155
|
+
|
|
156
|
+
3. **缓存验证**
|
|
157
|
+
```bash
|
|
158
|
+
# 会话开始时运行
|
|
159
|
+
python usage_audit.py --check-warmup
|
|
160
|
+
```
|
|
161
|
+
|
|
162
|
+
**效果**: 会话启动时间 -50%, 首次操作 token -30%
|
|
163
|
+
|
|
164
|
+
---
|
|
165
|
+
|
|
166
|
+
## 🎯 优先级规则
|
|
167
|
+
|
|
168
|
+
### 立即应用(所有会话)
|
|
169
|
+
|
|
170
|
+
```
|
|
171
|
+
1. 使用 claude-patched(cc-cache-fix)
|
|
172
|
+
↓
|
|
173
|
+
2. 按需加载 context(不全量)
|
|
174
|
+
↓
|
|
175
|
+
3. 会话开始时缓存预热
|
|
176
|
+
↓
|
|
177
|
+
4. 长任务自动分割
|
|
178
|
+
```
|
|
179
|
+
|
|
180
|
+
### 按任务类型应用
|
|
181
|
+
|
|
182
|
+
| 任务类型 | 推荐方案 | 预期节省 |
|
|
183
|
+
|---------|---------|---------|
|
|
184
|
+
| 简单任务 (<15分钟) | A + B | 40-50% |
|
|
185
|
+
| 中等任务 (15-60分钟) | A + B + D | 50-60% |
|
|
186
|
+
| 长任务 (>1小时) | A + B + C + D | 60-70% |
|
|
187
|
+
| 复杂多文件 | A + B + D + 分割 | 60-70% |
|
|
188
|
+
|
|
189
|
+
---
|
|
190
|
+
|
|
191
|
+
## 📊 监控和验证
|
|
192
|
+
|
|
193
|
+
### 缓存健康检查
|
|
194
|
+
|
|
195
|
+
```bash
|
|
196
|
+
# 每个会话开始时运行
|
|
197
|
+
python cc-cache-fix/test_cache.py
|
|
198
|
+
|
|
199
|
+
# 输出示例
|
|
200
|
+
✅ Attachment persistence: PASS
|
|
201
|
+
✅ Hash stability: PASS
|
|
202
|
+
✅ TTL extension: PASS (1h)
|
|
203
|
+
📊 Cache hit rate: 78%
|
|
204
|
+
💾 Token saved: 2,340 / 3,200 (73%)
|
|
205
|
+
```
|
|
206
|
+
|
|
207
|
+
### 使用审计
|
|
208
|
+
|
|
209
|
+
```bash
|
|
210
|
+
# 周期性审计
|
|
211
|
+
python cc-cache-fix/usage_audit.py
|
|
212
|
+
|
|
213
|
+
# 输出示例
|
|
214
|
+
Session: 2026-04-01 10:00
|
|
215
|
+
├─ Total input tokens: 3,200
|
|
216
|
+
├─ Cache read tokens: 2,340 (73%)
|
|
217
|
+
├─ Cache creation tokens: 860 (27%)
|
|
218
|
+
├─ Efficiency: 73% ✅
|
|
219
|
+
└─ Recommendation: Maintain current strategy
|
|
220
|
+
```
|
|
221
|
+
|
|
222
|
+
### 告警阈值
|
|
223
|
+
|
|
224
|
+
| 指标 | 正常 | 警告 | 严重 |
|
|
225
|
+
|------|------|------|------|
|
|
226
|
+
| Cache hit rate | >70% | 50-70% | <50% |
|
|
227
|
+
| Token efficiency | >65% | 45-65% | <45% |
|
|
228
|
+
| TTL utilization | >80% | 60-80% | <60% |
|
|
229
|
+
|
|
230
|
+
---
|
|
231
|
+
|
|
232
|
+
## 🔧 集成到工作流
|
|
233
|
+
|
|
234
|
+
### 会话开始 Hook
|
|
235
|
+
|
|
236
|
+
```bash
|
|
237
|
+
# ~/.claude/hooks/session-start-cache-warmup.sh
|
|
238
|
+
|
|
239
|
+
#!/bin/bash
|
|
240
|
+
|
|
241
|
+
# 1. 检查 cc-cache-fix 是否安装
|
|
242
|
+
if ! command -v claude-patched &> /dev/null; then
|
|
243
|
+
echo "⚠️ cc-cache-fix not installed. Install with:"
|
|
244
|
+
echo " git clone https://github.com/Rangizingo/cc-cache-fix.git && cd cc-cache-fix && ./install.sh"
|
|
245
|
+
fi
|
|
246
|
+
|
|
247
|
+
# 2. 运行缓存预热
|
|
248
|
+
python cc-cache-fix/test_cache.py --warmup
|
|
249
|
+
|
|
250
|
+
# 3. 加载持久化记忆
|
|
251
|
+
if [ -f "~/.claude/projects/e--bobo-s-coding-cache/memory/MEMORY.md" ]; then
|
|
252
|
+
echo "✅ Memory loaded"
|
|
253
|
+
fi
|
|
254
|
+
|
|
255
|
+
# 4. 检查 context 大小
|
|
256
|
+
CONTEXT_SIZE=$(wc -c < "CLAUDE.md")
|
|
257
|
+
if [ $CONTEXT_SIZE -gt 50000 ]; then
|
|
258
|
+
echo "⚠️ CLAUDE.md too large ($CONTEXT_SIZE bytes). Consider splitting."
|
|
259
|
+
fi
|
|
260
|
+
```
|
|
261
|
+
|
|
262
|
+
### 任务执行 Hook
|
|
263
|
+
|
|
264
|
+
```bash
|
|
265
|
+
# ~/.claude/hooks/task-execution-cache-monitor.sh
|
|
266
|
+
|
|
267
|
+
#!/bin/bash
|
|
268
|
+
|
|
269
|
+
# 监控任务执行期间的缓存效率
|
|
270
|
+
python cc-cache-fix/usage_audit.py --monitor --interval 5m
|
|
271
|
+
```
|
|
272
|
+
|
|
273
|
+
---
|
|
274
|
+
|
|
275
|
+
## 📝 最佳实践清单
|
|
276
|
+
|
|
277
|
+
### 每个会话
|
|
278
|
+
|
|
279
|
+
- [ ] 使用 `claude-patched` 而非 `claude`
|
|
280
|
+
- [ ] 会话开始时运行 `test_cache.py --warmup`
|
|
281
|
+
- [ ] 加载 `memory/MEMORY.md`
|
|
282
|
+
- [ ] 检查 CLAUDE.md 大小 (<50KB)
|
|
283
|
+
|
|
284
|
+
### 每个任务
|
|
285
|
+
|
|
286
|
+
- [ ] 预估任务时长
|
|
287
|
+
- [ ] 如果 >1小时,分割成多个会话
|
|
288
|
+
- [ ] 使用 `task_plan.md` 保存进度
|
|
289
|
+
- [ ] 使用 `notes.md` 保存发现
|
|
290
|
+
|
|
291
|
+
### 每周
|
|
292
|
+
|
|
293
|
+
- [ ] 运行 `usage_audit.py` 审计
|
|
294
|
+
- [ ] 检查缓存命中率趋势
|
|
295
|
+
- [ ] 优化低效会话
|
|
296
|
+
- [ ] 更新 CLAUDE.md(删除过期内容)
|
|
297
|
+
|
|
298
|
+
### 每月
|
|
299
|
+
|
|
300
|
+
- [ ] 审视 context 结构
|
|
301
|
+
- [ ] 合并重复规则
|
|
302
|
+
- [ ] 更新技能索引
|
|
303
|
+
- [ ] 评估 token 节省效果
|
|
304
|
+
|
|
305
|
+
---
|
|
306
|
+
|
|
307
|
+
## 🚀 快速开始
|
|
308
|
+
|
|
309
|
+
### 1分钟快速设置
|
|
310
|
+
|
|
311
|
+
```bash
|
|
312
|
+
# 1. 安装 cc-cache-fix
|
|
313
|
+
git clone https://github.com/Rangizingo/cc-cache-fix.git
|
|
314
|
+
cd cc-cache-fix
|
|
315
|
+
./install.sh
|
|
316
|
+
|
|
317
|
+
# 2. 验证安装
|
|
318
|
+
claude-patched --version
|
|
319
|
+
|
|
320
|
+
# 3. 测试缓存
|
|
321
|
+
python test_cache.py
|
|
322
|
+
|
|
323
|
+
# 4. 从现在开始使用 claude-patched
|
|
324
|
+
alias claude=claude-patched
|
|
325
|
+
```
|
|
326
|
+
|
|
327
|
+
### 验证效果
|
|
328
|
+
|
|
329
|
+
```bash
|
|
330
|
+
# 运行两个相同的会话,对比 token 使用
|
|
331
|
+
|
|
332
|
+
# 会话1(冷启动)
|
|
333
|
+
claude-patched
|
|
334
|
+
# 输入相同的任务
|
|
335
|
+
# 记录 input tokens
|
|
336
|
+
|
|
337
|
+
# 会话2(热启动)
|
|
338
|
+
claude-patched
|
|
339
|
+
# 输入相同的任务
|
|
340
|
+
# 记录 input tokens
|
|
341
|
+
|
|
342
|
+
# 对比:会话2 应该节省 40-60% tokens
|
|
343
|
+
```
|
|
344
|
+
|
|
345
|
+
---
|
|
346
|
+
|
|
347
|
+
## ⚠️ 已知限制
|
|
348
|
+
|
|
349
|
+
### cc-cache-fix 的限制
|
|
350
|
+
|
|
351
|
+
- ✅ 解决 Delta 附件丢失
|
|
352
|
+
- ✅ 解决 Hash 不稳定
|
|
353
|
+
- ✅ 解决 TTL 过短
|
|
354
|
+
- ❌ 不能解决 Prompt Cache 设计缺陷
|
|
355
|
+
- ❌ 不能解决 MCP 指令重复加载
|
|
356
|
+
- ❌ 不能解决跨会话缓存共享
|
|
357
|
+
|
|
358
|
+
### 需要手动优化的
|
|
359
|
+
|
|
360
|
+
- Context 压缩(按需加载)
|
|
361
|
+
- 会话分割(长任务)
|
|
362
|
+
- 缓存预热(会话开始)
|
|
363
|
+
- 规则去重(定期维护)
|
|
364
|
+
|
|
365
|
+
---
|
|
366
|
+
|
|
367
|
+
## 📚 相关文档
|
|
368
|
+
|
|
369
|
+
- `performance.md` - 模型选择和性能优化
|
|
370
|
+
- `engineering-workflows.md` - 工程化工作流
|
|
371
|
+
- `context-budget-analyzer` - Context 预算分析工具
|
|
372
|
+
- `context-compressor` - Context 压缩工具
|
|
373
|
+
|
|
374
|
+
---
|
|
375
|
+
|
|
376
|
+
**版本**: v1.0
|
|
377
|
+
**创建**: 2026-04-01
|
|
378
|
+
**状态**: Active
|
|
379
|
+
**下一步**: 集成到 performance.md 和 engineering-workflows.md
|