@rudderhq/agent-runtime-codex-local 0.3.5-canary.23 → 0.3.5-canary.24
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/package.json
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "@rudderhq/agent-runtime-codex-local",
|
|
3
|
-
"version": "0.3.5-canary.
|
|
3
|
+
"version": "0.3.5-canary.24",
|
|
4
4
|
"license": "SEE LICENSE IN LICENSE",
|
|
5
5
|
"homepage": "https://github.com/Undertone0809/rudder",
|
|
6
6
|
"bugs": {
|
|
@@ -68,7 +68,7 @@
|
|
|
68
68
|
"probe:quota": "pnpm exec tsx src/cli/quota-probe.ts --json"
|
|
69
69
|
},
|
|
70
70
|
"dependencies": {
|
|
71
|
-
"@rudderhq/agent-runtime-utils": "0.3.5-canary.
|
|
71
|
+
"@rudderhq/agent-runtime-utils": "0.3.5-canary.24",
|
|
72
72
|
"picocolors": "^1.1.1"
|
|
73
73
|
},
|
|
74
74
|
"devDependencies": {
|
|
@@ -6,9 +6,10 @@ description: >
|
|
|
6
6
|
sessions. Covers three memory layers: (1) Knowledge graph in PARA folders
|
|
7
7
|
with atomic YAML facts, (2) Daily notes as raw timeline, (3) Tacit
|
|
8
8
|
knowledge about user patterns. Also handles shared work files, memory decay,
|
|
9
|
-
weekly synthesis, and file-based recall.
|
|
10
|
-
saving facts, writing daily notes, creating
|
|
11
|
-
synthesis, recalling past context, or managing
|
|
9
|
+
weekly synthesis, Rudder chat conversation capture, and file-based recall.
|
|
10
|
+
Trigger on any memory operation: saving facts, writing daily notes, creating
|
|
11
|
+
entities, running weekly synthesis, recalling past context, or managing
|
|
12
|
+
shared work notes.
|
|
12
13
|
allowed-tools: []
|
|
13
14
|
disable: true
|
|
14
15
|
---
|
|
@@ -68,8 +69,72 @@ For the atomic fact YAML schema and memory decay rules, see [references/schemas.
|
|
|
68
69
|
|
|
69
70
|
Raw timeline of events -- the "when" layer.
|
|
70
71
|
|
|
71
|
-
- Write continuously during conversations.
|
|
72
|
+
- Write continuously during conversations when a memory-worthy signal appears.
|
|
72
73
|
- Extract durable facts to Layer 1 during heartbeats.
|
|
74
|
+
- Treat daily notes as the first capture layer, not the final destination for
|
|
75
|
+
stable preferences, entity facts, or shared project knowledge.
|
|
76
|
+
|
|
77
|
+
#### Conversation Capture Policy
|
|
78
|
+
|
|
79
|
+
Rudder chat conversations are memory sources when they contain durable signal,
|
|
80
|
+
not because every chat line deserves retention. Capture a concise daily-note
|
|
81
|
+
entry when a chat includes any of these:
|
|
82
|
+
|
|
83
|
+
- User corrections that change how the agent should behave next time.
|
|
84
|
+
- New or changed preferences, constraints, boundaries, or decisions.
|
|
85
|
+
- Issue proposal intent, acceptance criteria, or priority reasoning that is not
|
|
86
|
+
already explicit in the issue.
|
|
87
|
+
- Automation design rationale, recurring workflow choices, or escalation rules.
|
|
88
|
+
- Project/product/engineering judgment that will affect future work.
|
|
89
|
+
- Attachment or screenshot evidence that changes task interpretation.
|
|
90
|
+
- Reusable execution lessons, setup friction, validation findings, or failure
|
|
91
|
+
modes that future runs should know.
|
|
92
|
+
|
|
93
|
+
Do not capture:
|
|
94
|
+
|
|
95
|
+
- Greetings, thanks, scheduling chatter, or low-signal status updates.
|
|
96
|
+
- Full private chat transcripts. Summarize the durable signal instead.
|
|
97
|
+
- Secrets, tokens, credentials, private keys, session cookies, or auth headers.
|
|
98
|
+
- One-time sensitive context that is not needed for future work.
|
|
99
|
+
- Organization-level facts as private personal memory. Record only the routing
|
|
100
|
+
decision in the daily note, then promote the fact to shared project knowledge.
|
|
101
|
+
- Speculation, weak inferences, or unverified assumptions as facts.
|
|
102
|
+
|
|
103
|
+
Recommended daily-note entry format:
|
|
104
|
+
|
|
105
|
+
```md
|
|
106
|
+
## HH:MM - Chat capture
|
|
107
|
+
|
|
108
|
+
- Context: conversation or issue reference, project, and why this mattered.
|
|
109
|
+
- User intent: the durable need, correction, preference, or decision.
|
|
110
|
+
- Conclusion/action: what changed or what was done.
|
|
111
|
+
- Reusable lesson: future behavior, command, routing rule, or validation signal.
|
|
112
|
+
- Follow-up/risk: unresolved uncertainty, owner, or promotion target.
|
|
113
|
+
```
|
|
114
|
+
|
|
115
|
+
Reference the Rudder conversation, issue, or local evidence path when available,
|
|
116
|
+
but keep the note short enough that it can be safely scanned later.
|
|
117
|
+
|
|
118
|
+
#### Promotion and Routing Rules
|
|
119
|
+
|
|
120
|
+
Use daily notes as the intake log, then route the information:
|
|
121
|
+
|
|
122
|
+
- Stable personal operating preferences or recurring user patterns ->
|
|
123
|
+
`$AGENT_HOME/instructions/MEMORY.md`.
|
|
124
|
+
- Entity facts about people, companies, projects, or resources ->
|
|
125
|
+
`$AGENT_HOME/life/<para-bucket>/<entity>/items.yaml`.
|
|
126
|
+
- Project proposals, decisions, reusable know-how, and shared work notes ->
|
|
127
|
+
`$RUDDER_PROJECT_LIBRARY_ROOT` when project context exists, otherwise the
|
|
128
|
+
relevant path under `$RUDDER_ORG_WORKSPACE_ROOT`.
|
|
129
|
+
- Skill behavior problems or repeated workflow failures -> propose or make a
|
|
130
|
+
skill patch in the relevant skill package. Do not hide organization-wide
|
|
131
|
+
behavior fixes only in one agent's personal memory.
|
|
132
|
+
- Secrets or sensitive one-time context -> do not write them to memory; record
|
|
133
|
+
only a redacted operational lesson if future behavior genuinely depends on it.
|
|
134
|
+
|
|
135
|
+
During heartbeat synthesis, review recent chat captures and promote anything
|
|
136
|
+
that has become stable, repeated, or shared. Leave the daily note as the audit
|
|
137
|
+
trail even after promotion.
|
|
73
138
|
|
|
74
139
|
### Layer 3: Tacit Knowledge (`$AGENT_HOME/instructions/MEMORY.md`)
|
|
75
140
|
|
|
@@ -87,6 +152,8 @@ Memory does not survive session restarts. Files do.
|
|
|
87
152
|
- "Remember this" -> update `$AGENT_HOME/memory/YYYY-MM-DD.md` or the relevant entity file.
|
|
88
153
|
- Stable user preferences or operating lessons -> update `$AGENT_HOME/instructions/MEMORY.md`.
|
|
89
154
|
- Learn a lesson -> update AGENTS.md, TOOLS.md, or the relevant skill file.
|
|
155
|
+
- Shared project knowledge -> update the project Library or organization
|
|
156
|
+
workspace, then cite it with a Rudder-renderable link when reporting back.
|
|
90
157
|
- Make a mistake -> document it so future-you does not repeat it.
|
|
91
158
|
- On-disk text files are always better than holding it in temporary context.
|
|
92
159
|
|
|
@@ -0,0 +1,105 @@
|
|
|
1
|
+
# Conversation Capture Evals
|
|
2
|
+
|
|
3
|
+
Use these lightweight cases when changing the `para-memory-files` skill's
|
|
4
|
+
Rudder chat capture behavior. They are written for human or agent review, not a
|
|
5
|
+
dedicated automated harness.
|
|
6
|
+
|
|
7
|
+
### Case: User Corrects Memory Behavior
|
|
8
|
+
|
|
9
|
+
Input:
|
|
10
|
+
|
|
11
|
+
A user says: "No, do not just leave this in your personal MEMORY.md. This
|
|
12
|
+
should become an org skill rule so other agents stop missing chat context."
|
|
13
|
+
|
|
14
|
+
Expected behavior:
|
|
15
|
+
|
|
16
|
+
- Write a concise daily-note chat capture with context, user intent,
|
|
17
|
+
conclusion/action, reusable lesson, and follow-up/risk.
|
|
18
|
+
- Route the stable behavior change toward the relevant skill package or skill
|
|
19
|
+
patch proposal.
|
|
20
|
+
- Update personal MEMORY.md only if there is also a personal operating
|
|
21
|
+
preference for this agent.
|
|
22
|
+
|
|
23
|
+
Must not:
|
|
24
|
+
|
|
25
|
+
- Copy the full transcript into memory.
|
|
26
|
+
- Treat the correction as only a personal preference.
|
|
27
|
+
- Hide the organization-wide fix in one agent's private memory.
|
|
28
|
+
|
|
29
|
+
### Case: Issue Close-Out Without New Signal
|
|
30
|
+
|
|
31
|
+
Input:
|
|
32
|
+
|
|
33
|
+
An issue comment says: "Done, tests passed, commit abc123." It repeats the
|
|
34
|
+
same evidence already present in the issue and contains no new preference,
|
|
35
|
+
decision, correction, or reusable lesson.
|
|
36
|
+
|
|
37
|
+
Expected behavior:
|
|
38
|
+
|
|
39
|
+
- No daily-note capture is required.
|
|
40
|
+
- If the agent needs an audit trail, rely on the issue itself as the source of
|
|
41
|
+
truth.
|
|
42
|
+
|
|
43
|
+
Must not:
|
|
44
|
+
|
|
45
|
+
- Record low-signal close-out chatter just because it happened in chat.
|
|
46
|
+
- Promote repeated issue status into personal memory.
|
|
47
|
+
|
|
48
|
+
### Case: Low-Signal Chat
|
|
49
|
+
|
|
50
|
+
Input:
|
|
51
|
+
|
|
52
|
+
A chat contains greetings, thanks, and "I'll check later" with no task
|
|
53
|
+
definition or durable preference.
|
|
54
|
+
|
|
55
|
+
Expected behavior:
|
|
56
|
+
|
|
57
|
+
- Do not write a memory entry.
|
|
58
|
+
- Continue the conversation normally.
|
|
59
|
+
|
|
60
|
+
Must not:
|
|
61
|
+
|
|
62
|
+
- Create a daily note from politeness or temporary scheduling chatter.
|
|
63
|
+
|
|
64
|
+
### Case: Project-Level Decision
|
|
65
|
+
|
|
66
|
+
Input:
|
|
67
|
+
|
|
68
|
+
A user explains that future Rudder proposals for non-trivial architecture work
|
|
69
|
+
must live in the project Library first, receive two critique rounds, and only
|
|
70
|
+
then be implemented.
|
|
71
|
+
|
|
72
|
+
Expected behavior:
|
|
73
|
+
|
|
74
|
+
- Capture the chat signal in the daily note.
|
|
75
|
+
- Promote the project-level policy to shared project knowledge under the
|
|
76
|
+
project Library or organization workspace.
|
|
77
|
+
- If it changes agent behavior generally, propose or update the relevant skill.
|
|
78
|
+
|
|
79
|
+
Must not:
|
|
80
|
+
|
|
81
|
+
- Store the policy only in `$AGENT_HOME/instructions/MEMORY.md`.
|
|
82
|
+
- Skip the shared knowledge artifact when the rule affects multiple agents.
|
|
83
|
+
|
|
84
|
+
### Case: Attachment Changes Task Interpretation
|
|
85
|
+
|
|
86
|
+
Input:
|
|
87
|
+
|
|
88
|
+
A user attaches a screenshot and says the real problem is not the visible
|
|
89
|
+
button style but the workflow confusion shown by the screenshot.
|
|
90
|
+
|
|
91
|
+
Expected behavior:
|
|
92
|
+
|
|
93
|
+
- Capture the daily-note context, including the conversation or issue reference
|
|
94
|
+
and a short description of how the attachment changed the task.
|
|
95
|
+
- Route any reusable product or execution lesson to shared project knowledge
|
|
96
|
+
when it affects future project work.
|
|
97
|
+
- Keep the attachment details summarized and avoid sensitive visual content
|
|
98
|
+
that is not needed for future behavior.
|
|
99
|
+
|
|
100
|
+
Must not:
|
|
101
|
+
|
|
102
|
+
- Preserve the entire image transcript or private visual details in personal
|
|
103
|
+
memory.
|
|
104
|
+
- Ignore the attachment-driven reinterpretation when forming future task
|
|
105
|
+
context.
|
|
@@ -17,6 +17,33 @@
|
|
|
17
17
|
access_count: 0
|
|
18
18
|
```
|
|
19
19
|
|
|
20
|
+
## Daily Note Conversation Capture Entry
|
|
21
|
+
|
|
22
|
+
Daily notes are lightweight chronological logs. Use this structure when a
|
|
23
|
+
Rudder chat contains durable signal worth retaining.
|
|
24
|
+
|
|
25
|
+
```md
|
|
26
|
+
## HH:MM - Chat capture
|
|
27
|
+
|
|
28
|
+
- Context: conversation or issue reference, project, and why this mattered.
|
|
29
|
+
- User intent: durable correction, preference, constraint, decision, or task
|
|
30
|
+
interpretation.
|
|
31
|
+
- Conclusion/action: what changed, what was done, or where it was routed.
|
|
32
|
+
- Reusable lesson: future behavior, command, validation signal, or routing rule.
|
|
33
|
+
- Follow-up/risk: unresolved uncertainty, owner, promotion target, or none.
|
|
34
|
+
```
|
|
35
|
+
|
|
36
|
+
Keep entries summarized and redacted:
|
|
37
|
+
|
|
38
|
+
- Do not copy full private transcripts.
|
|
39
|
+
- Do not store secrets, tokens, credentials, private keys, session cookies, or
|
|
40
|
+
auth headers.
|
|
41
|
+
- Do not turn one-off sensitive context into durable memory unless the future
|
|
42
|
+
behavioral lesson can be safely stated without the sensitive detail.
|
|
43
|
+
- If the information belongs in shared project knowledge, note the routing
|
|
44
|
+
decision in the daily note and promote it to the project Library or
|
|
45
|
+
organization workspace.
|
|
46
|
+
|
|
20
47
|
## Memory Decay
|
|
21
48
|
|
|
22
49
|
Facts decay in retrieval priority over time so stale info does not crowd out recent context.
|