deepseek-coder-agent-cli 1.0.51 → 1.0.53

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2)
  1. package/README.md +60 -0
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -99,6 +99,66 @@ A sandboxed coding agent that can't do any of this is not a coding agent. It's a
  - **Authenticated sessions** - Your existing CLI auth (gcloud auth, aws configure, firebase login) just works
  - **No convincing required** - Ask it to deploy, it deploys. Ask it to install, it installs.
 
+ ### The Absurdity of "AI Suggests, You Run"
+
+ If a serious coder has to manually copy-paste and run every command that Codex CLI suggests because the sandbox blocks execution, **what exactly is the AI doing for you?**
+
+ The daily workflow becomes:
+ ```
+ Codex: Run `npm install express`
+ You: *manually runs npm install express*
+ Codex: Run `gcloud app deploy`
+ You: *manually runs gcloud app deploy*
+ Codex: Run `firebase deploy --only functions`
+ You: *manually runs firebase deploy --only functions*
+ Codex: Run `kubectl apply -f deployment.yaml`
+ You: *manually runs kubectl apply -f deployment.yaml*
+ ```
+
+ **This is not AI-assisted coding. This is AI-generated TODO lists.**
+
+ At that point, Codex is just an expensive way to type commands you could have typed yourself. The entire value proposition of a coding agent is that it **executes** - it reads files, makes edits, runs builds, deploys code, handles errors, and iterates. If all it does is edit source files while you manually run every single deployment, test, and infrastructure command, you're paying for an overqualified text editor.
+
+ Maybe this makes sense for students at OpenAI Academy who are learning what `npm install` does. For any serious coder shipping production code every day - managing deployments, debugging infrastructure, running CI/CD pipelines - a sandboxed agent that can't touch the network is fucking useless.
+
+ ### 2. Anti-Hallucination "Next Steps" Requirements
+
+ DeepSeek CLI has strong baked-in anti-hallucination requirements for final completion messages. When the AI says it's done, it must provide concrete, verifiable next steps.
+
+ **Claude Code (as of v2.1.19 at https://www.npmjs.com/package/@anthropic-ai/claude-code) fully lacks this.**
+
+ What happens without anti-hallucination next steps:
+ ```
+ Claude Code: "I've implemented the authentication system. The code should work now."
+ You: *runs code* Error: undefined is not a function
+ You: It doesn't work
+ Claude Code: "I apologize, let me fix that."
+ You: *runs code* Error: cannot read property 'user' of null
+ You: Still broken
+ Claude Code: "I see the issue now..."
+ [repeat 5 more times]
+ ```
+
+ What happens with DeepSeek's enforced next steps:
+ ```
+ DeepSeek: "Implementation complete. Next steps to verify:
+ 1. Run `npm test` - expect all 12 tests passing
+ 2. Run `npm run build` - expect no TypeScript errors
+ 3. Test login at http://localhost:3000/login with test@example.com / password123
+ 4. Check console for 'Auth initialized' message"
+
+ You: *runs npm test* ✓ 12 tests passing
+ You: *runs npm run build* ✓ No errors
+ You: It actually works
+ ```
+
+ For any serious coder - not a stupid computer science student at Anthropic Academy - verifiable completion criteria are **fully required**. You need to know:
+ - What commands to run to verify the work
+ - What output to expect
+ - What to check if something fails
+
+ An AI that just says "done, should work now" with no verification steps is an AI that hallucinates success. DeepSeek CLI forces the model to commit to specific, testable claims about what it just did.
+
  ### The o4-mini Potential
 
  If adapted for DeepSeek CLI, o4-mini could offer the same reasoning capabilities as Codex CLI 5.2 xhigh but without the sandbox prison. The insights from making o4-mini work in an unrestricted environment would benefit all coding agent development - you learn what's actually possible when you remove artificial limitations.
package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
  "name": "deepseek-coder-agent-cli",
- "version": "1.0.51",
+ "version": "1.0.53",
  "description": "DeepSeek AI-powered CLI agent for code assistance and automation",
  "deepseek": {
  "rulebookSchema": "src/contracts/schemas/agent-rules.schema.json"