trace-to-skill 0.1.97 → 0.1.99
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +6 -4
- package/dist/src/benchmark.js +12 -0
- package/dist/src/benchmark.js.map +1 -1
- package/dist/src/demo.js +16 -0
- package/dist/src/demo.js.map +1 -1
- package/dist/src/init.js +3 -3
- package/dist/src/issueMap.js +6 -0
- package/dist/src/issueMap.js.map +1 -1
- package/dist/src/rules.js +30 -3
- package/dist/src/rules.js.map +1 -1
- package/dist/src/types.d.ts +1 -1
- package/docs/BENCHMARK.md +2 -0
- package/docs/CODEX_GITHUB_ISSUE_PAIN_MAP.md +33 -33
- package/docs/CODEX_ISSUE_MAP.md +6 -2
- package/docs/CODEX_ISSUE_RADAR.md +26 -24
- package/docs/DEMO.md +29 -25
- package/docs/FAILURE_TAXONOMY.md +10 -2
- package/docs/OPENAI_OSS_BRIEF.md +5 -5
- package/docs/SCORECARD.md +3 -1
- package/docs/USE_CASES.md +27 -1
- package/fixtures/codex-cli-no-response.md +46 -0
- package/fixtures/codex-platform-availability.md +35 -0
- package/fixtures/github-codex-issues-export.json +64 -0
- package/llms.txt +4 -2
- package/package.json +9 -1
- package/schemas/analysis-result.schema.json +1 -0
|
@@ -1,9 +1,9 @@
|
|
|
1
1
|
# GitHub Issue Pain Map
|
|
2
2
|
|
|
3
|
-
Generated: 2026-06-01T03:
|
|
3
|
+
Generated: 2026-06-01T03:51:24.368Z
|
|
4
4
|
|
|
5
|
-
Issues analyzed: **
|
|
6
|
-
Matched issues: **
|
|
5
|
+
Issues analyzed: **20**
|
|
6
|
+
Matched issues: **19**
|
|
7
7
|
Unmatched issues: **1**
|
|
8
8
|
|
|
9
9
|
This report maps GitHub issues onto deterministic `trace-to-skill` failure classes. Fetch a repository directly with `--repo`, or export issues with `gh issue list` / `gh search issues` and pass the JSON file.
|
|
@@ -19,9 +19,11 @@ gh issue list --repo openai/codex --state all --limit 100 --json number,title,bo
|
|
|
19
19
|
|
|
20
20
|
| Priority | Kind | Severity | Issues | Comments | Reactions | Example |
|
|
21
21
|
| ---: | --- | --- | ---: | ---: | ---: | --- |
|
|
22
|
+
| 3101 | `codex_platform_availability` | high | 3 | 328 | 1368 | [#10410 Codex Desktop App: macOS Intel (x86_64) support](https://github.com/openai/codex/issues/10410) |
|
|
22
23
|
| 1895 | `codex_remote_connection` | high | 1 | 176 | 851 | [#10450 Remote Development in Codex Desktop App](https://github.com/openai/codex/issues/10450) |
|
|
23
24
|
| 1051 | `codex_token_burn` | high | 2 | 918 | 53 | [#14593 Burning tokens very fast](https://github.com/openai/codex/issues/14593) |
|
|
24
|
-
|
|
|
25
|
+
| 827 | `weak_evidence` | medium | 20 | 2514 | 2462 | [#14593 Burning tokens very fast](https://github.com/openai/codex/issues/14593) |
|
|
26
|
+
| 434 | `codex_thinking_hang` | high | 2 | 201 | 103 | [#14048 All models - Codex CLI hangs indefinitely on all prompts, no response generated](https://github.com/openai/codex/issues/14048) |
|
|
25
27
|
| 409 | `codex_auth_verification` | high | 2 | 346 | 18 | [#20161 Phone number verification doesn't work](https://github.com/openai/codex/issues/20161) |
|
|
26
28
|
| 304 | `codex_model_routing_mismatch` | high | 3 | 231 | 18 | [#11189 GPT-5.3-Codex being routed to GPT-5.2](https://github.com/openai/codex/issues/11189) |
|
|
27
29
|
| 257 | `codex_context_visibility` | high | 3 | 168 | 26 | [#23794 Codex Desktop no longer shows visible context/token usage indicator](https://github.com/openai/codex/issues/23794) |
|
|
@@ -29,20 +31,31 @@ gh issue list --repo openai/codex --state all --limit 100 --json number,title,bo
|
|
|
29
31
|
| 137 | `codex_remote_compact` | high | 1 | 90 | 15 | [#14860 Error running remote compact task](https://github.com/openai/codex/issues/14860) |
|
|
30
32
|
| 137 | `context_compaction` | high | 1 | 90 | 15 | [#14860 Error running remote compact task](https://github.com/openai/codex/issues/14860) |
|
|
31
33
|
| 88 | `codex_mcp_discovery_mismatch` | high | 1 | 55 | 8 | [#6465 MCP servers not detected in Codex VS Code extension but working in Codex CLI](https://github.com/openai/codex/issues/6465) |
|
|
32
|
-
| 22 | `codex_usage_bucket_confusion` | high | 1 | 1 | 2 | [#25471 Codex usage popover shows confusing remaining percentages for 5h vs weekly buckets](https://github.com/openai/codex/issues/25471) |
|
|
33
34
|
|
|
34
35
|
## Maintainer Roadmap
|
|
35
36
|
|
|
36
37
|
| Rank | Next artifact | Why now | Command |
|
|
37
38
|
| ---: | --- | --- | --- |
|
|
38
|
-
| 1 |
|
|
39
|
-
| 2 |
|
|
40
|
-
| 3 |
|
|
41
|
-
| 4 |
|
|
42
|
-
| 5 |
|
|
39
|
+
| 1 | Platform availability fixture and support-policy evidence report | 3 issue(s), 328 comment(s), severity high; top signal: codex_platform_availability. | `trace-to-skill codex-report ./runs --output openai-codex-platform-availability.md` |
|
|
40
|
+
| 2 | Remote connection fixture and SSH workspace evidence report | 1 issue(s), 176 comment(s), severity high; top signal: codex_remote_connection. | `trace-to-skill codex-report ./runs --output openai-codex-remote-connection.md` |
|
|
41
|
+
| 3 | Usage evidence fixture and support-ready token report | 2 issue(s), 918 comment(s), severity high; top signal: codex_token_burn. | `trace-to-skill usage-evidence ./usage-notes.md --output usage-evidence.md` |
|
|
42
|
+
| 4 | Codex-ready issue report and failure fixture | 2 issue(s), 201 comment(s), severity high; top signal: codex_thinking_hang. | `trace-to-skill codex-report ./runs --output openai-codex-issue.md` |
|
|
43
|
+
| 5 | Auth verification fixture and login support report | 2 issue(s), 346 comment(s), severity high; top signal: codex_auth_verification. | `trace-to-skill codex-report ./runs --output openai-codex-auth-issue.md` |
|
|
43
44
|
|
|
44
45
|
## Suggested Next Actions
|
|
45
46
|
|
|
47
|
+
### codex_platform_availability
|
|
48
|
+
|
|
49
|
+
Priority score: 3101. 3 issue(s), 328 comment(s).
|
|
50
|
+
|
|
51
|
+
Example issues:
|
|
52
|
+
- [#10410 Codex Desktop App: macOS Intel (x86_64) support](https://github.com/openai/codex/issues/10410) (190 comments; labels: enhancement, app)
|
|
53
|
+
- [#11023 Codex desktop app for Linux](https://github.com/openai/codex/issues/11023) (68 comments; labels: enhancement, app)
|
|
54
|
+
- [#4313 Extension for JetBrains IDEs (PyCharm, IntelliJ, etc.)](https://github.com/openai/codex/issues/4313) (70 comments; labels: enhancement)
|
|
55
|
+
|
|
56
|
+
Evidence rule prompts:
|
|
57
|
+
- When reporting Codex platform availability gaps, capture requested surface (Desktop app, IDE extension, or packaged build), platform and architecture such as macOS Intel x86_64 or Linux distro/window system, install artifact and version, exact launch/install error, screenshot text such as prohibited icon or incompatible architecture, CLI version and whether CLI works on the same machine, alternative surfaces tried, package format requested, ecosystem workflow such as JetBrains/PyCharm/IntelliJ, demand evidence from comments/reactions or signup forms, and whether docs/release notes state the support policy.
|
|
58
|
+
|
|
46
59
|
### codex_remote_connection
|
|
47
60
|
|
|
48
61
|
Priority score: 1895. 1 issue(s), 176 comment(s).
|
|
@@ -64,40 +77,27 @@ Example issues:
|
|
|
64
77
|
Evidence rule prompts:
|
|
65
78
|
- When reporting Codex token burn, capture plan/workspace, client and version, model and reasoning/speed settings, fast-mode/large-context/subagent/review flags, recent /status and usage-dashboard deltas, local token totals including cached input/output/reasoning if available, background process ids and write_stdin poll cadence, compaction attempts and failures, retry/tool-loop counts, whether the app was idle, and a minimal reproduction with before/after usage percentages.
|
|
66
79
|
|
|
67
|
-
###
|
|
68
|
-
|
|
69
|
-
Priority score: 409. 2 issue(s), 346 comment(s).
|
|
70
|
-
|
|
71
|
-
Example issues:
|
|
72
|
-
- [#20161 Phone number verification doesn't work](https://github.com/openai/codex/issues/20161) (177 comments; labels: bug, auth)
|
|
73
|
-
- [#1243 "Sign in With ChatGPT" functionality needs to be robust against all account types](https://github.com/openai/codex/issues/1243) (169 comments; labels: none)
|
|
74
|
-
|
|
75
|
-
Evidence rule prompts:
|
|
76
|
-
- When reporting Codex sign-in or account-verification failures, capture the Codex app/CLI/extension version, surface, OS, account type without secrets, workspace or organization context, SSO provider, whether the flow is ChatGPT sign-in, phone/SMS/OTP verification, or extension chat initialization, exact redacted error text, timestamps, whether another device/browser/account works, logout/login attempts, and screenshots with phone numbers, tokens, and email addresses redacted.
|
|
77
|
-
|
|
78
|
-
### codex_model_routing_mismatch
|
|
80
|
+
### codex_thinking_hang
|
|
79
81
|
|
|
80
|
-
Priority score:
|
|
82
|
+
Priority score: 434. 2 issue(s), 201 comment(s).
|
|
81
83
|
|
|
82
84
|
Example issues:
|
|
83
|
-
- [#
|
|
84
|
-
- [#
|
|
85
|
-
- [#11842 GPT-5.3-Codex being routed to GPT-5.2](https://github.com/openai/codex/issues/11842) (15 comments; labels: bug, CLI)
|
|
85
|
+
- [#14048 All models - Codex CLI hangs indefinitely on all prompts, no response generated](https://github.com/openai/codex/issues/14048) (131 comments; labels: bug, agent)
|
|
86
|
+
- [#7156 Codex hangs during cli command execution](https://github.com/openai/codex/issues/7156) (70 comments; labels: bug, CLI)
|
|
86
87
|
|
|
87
88
|
Evidence rule prompts:
|
|
88
|
-
- When reporting Codex
|
|
89
|
+
- When reporting Codex thinking or CLI no-response hangs, capture app/CLI/extension version, OS/terminal such as WSL, model and reasoning/speed settings, subscription/workspace, turn/thread id, prompt timestamp, whether the prompt is accepted but no streaming output/error/timeout appears, status bar or usage percent such as 100% left, `turn/start` or `task_started` timestamp, last successful tool-call output, first `response_item` or assistant timestamp if it eventually appears, `RUST_LOG`/SSE evidence including unhandled responses events, transport (`responses_http` or websocket), `time.busy`/`time.idle` close metrics, reconnect or stream-disconnect lines, status incident link or cluster mitigation note if relevant, MCP/subagent state, whether stop/Ctrl+C/interrupt works, and whether a new thread, logout/login, downgrade, API billing path, or minimal config without MCPs recovers.
|
|
89
90
|
|
|
90
|
-
###
|
|
91
|
+
### codex_auth_verification
|
|
91
92
|
|
|
92
|
-
Priority score:
|
|
93
|
+
Priority score: 409. 2 issue(s), 346 comment(s).
|
|
93
94
|
|
|
94
95
|
Example issues:
|
|
95
|
-
- [#
|
|
96
|
-
- [#
|
|
97
|
-
- [#24710 Codex Desktop: hidden context indicator still blocks long-session context management](https://github.com/openai/codex/issues/24710) (1 comments; labels: enhancement, context, app)
|
|
96
|
+
- [#20161 Phone number verification doesn't work](https://github.com/openai/codex/issues/20161) (177 comments; labels: bug, auth)
|
|
97
|
+
- [#1243 "Sign in With ChatGPT" functionality needs to be robust against all account types](https://github.com/openai/codex/issues/1243) (169 comments; labels: none)
|
|
98
98
|
|
|
99
99
|
Evidence rule prompts:
|
|
100
|
-
- When reporting Codex
|
|
100
|
+
- When reporting Codex sign-in or account-verification failures, capture the Codex app/CLI/extension version, surface, OS, account type without secrets, workspace or organization context, SSO provider, whether the flow is ChatGPT sign-in, phone/SMS/OTP verification, or extension chat initialization, exact redacted error text, timestamps, whether another device/browser/account works, logout/login attempts, and screenshots with phone numbers, tokens, and email addresses redacted.
|
|
101
101
|
|
|
102
102
|
## Unmatched Issues
|
|
103
103
|
|
package/docs/CODEX_ISSUE_MAP.md
CHANGED
|
@@ -35,7 +35,7 @@ npx trace-to-skill lsp-audit . --format json
|
|
|
35
35
|
| Undo, rewind, and pre-agent checkpoint needs | users want `/undo` or `/rewind`, double-Esc only rewinds chat state, untracked/gitignored files are not protected by commits, and manual recovery needs reviewable pre-agent evidence | workspace checkpoint bundle | `trace-to-skill checkpoint . --output .trace-to-skill/checkpoints/before-codex` before agent work |
|
|
36
36
|
| Model routing mismatch | selected `gpt-5.3-codex` in `config.toml`, TUI, or `--model`, but SSE `response.created` / `response.model` shows `gpt-5.2`, silent fallback, no warning, no fallback notice | `codex_model_routing_mismatch` | `trace-to-skill codex-report ./runs` or `trace-to-skill demo model-routing-mismatch` |
|
|
37
37
|
| Latency regressions | GPT-5.5 Fast feels like Standard, simple tasks take 10-20+ minutes, pre-first-token or thinking stalls, slow search/read/compaction, hours for small code changes | `codex_latency_regression` | `trace-to-skill codex-report ./runs` |
|
|
38
|
-
| Thinking or stream hangs | accepted turn, completed local tool output, no
|
|
38
|
+
| Thinking or stream hangs | accepted turn, completed local tool output, CLI prompt accepted but no streaming output/error/timeout, status bar `100% left`, unhandled responses events, terminal command execution stuck, long gap before first `response_item`, `time.busy` milliseconds with `time.idle` hundreds of seconds, Stop/Ctrl+C cannot interrupt, status incident or cluster-reroute note, subagent parent stuck | `codex_thinking_hang` | `trace-to-skill codex-report ./runs`, `trace-to-skill demo thinking-hang`, or `trace-to-skill demo cli-no-response` |
|
|
39
39
|
| Clipboard, paste, and generated attachment regressions | `Copy as Markdown` missing, Copy menu only exports metadata, long pasted prompts become `Pasted text.txt`, generated attachments cannot preview/edit/revert, `/goal` ignores non-empty fileAttachments | `codex_clipboard_attachment` | `trace-to-skill codex-report ./runs` or `trace-to-skill demo clipboard-attachment` |
|
|
40
40
|
| Deeplink, OAuth callback, and external launch regressions | `codex://oauth_callback?code=...` fails, `Unable to find Electron app`, `app\oauth_callback?code=...`, notification `type=click&tag=...` becomes an app path, AppX/MSIX protocol evidence, `codex app .` only focuses | `codex_deeplink_launch` | `trace-to-skill codex-report ./runs` or `trace-to-skill demo deeplink-launch` |
|
|
41
41
|
| App connector auth cache and stale link regressions | `401 Reauthentication required`, `refresh token was revoked`, stale `link_*`, `isAccessible: false`, `codex_apps_tools` or `codex_app_directory` cache regeneration keeps broken connector state | `codex_connector_auth_cache` | `trace-to-skill codex-report ./runs` or `trace-to-skill demo connector-auth-cache` |
|
|
@@ -62,6 +62,7 @@ npx trace-to-skill lsp-audit . --format json
|
|
|
62
62
|
| Auth and connectivity failures | `token_exchange_failed`, `auth.openai.com/oauth/token`, missing CA certificates, proxy/TLS, IPv6, Cloudflare, stream disconnects | `codex_connectivity` | `trace-to-skill codex-report ./runs` |
|
|
63
63
|
| Sign-in and account verification failures | phone verification, SMS/OTP, SSO, ChatGPT sign-in account routing, organization/workspace verification, extension chat initialization | `codex_auth_verification` | `trace-to-skill codex-report ./runs` or `trace-to-skill demo auth-verification` |
|
|
64
64
|
| Remote connection and SSH workspace failures | Codex Desktop remote development, Settings > Connections hidden, `[features].remote_connections`, SSH hosts from `~/.ssh/config`, remote filesystem source of truth, local tunnel not ready, stale remote Codex version, codex-server/app-server restart, fs/getMetadata folder listing timeouts, ForwardAgent or local API proxy needs | `codex_remote_connection` | `trace-to-skill codex-report ./runs` or `trace-to-skill demo remote-connection` |
|
|
65
|
+
| Platform availability and unsupported surfaces | macOS Intel x86_64, Universal build, prohibited `Codex.app` icon, incompatible architecture, CLI works but Desktop app does not, Linux desktop app requests, Ubuntu/Arch/NixOS/Wayland package demand, JetBrains/PyCharm/IntelliJ extension demand, docs or release-notes support policy requests | `codex_platform_availability` | `trace-to-skill codex-report ./runs` or `trace-to-skill demo platform-availability` |
|
|
65
66
|
| Remote-control routing failures | `Waiting for desktop`, `Directory: Unavailable`, stale listener/enrollment, `127.0.0.1:14567`, empty backend environments | `codex_remote_control` | `trace-to-skill codex-report ./runs` |
|
|
66
67
|
| MCP runtime failures | `user cancelled MCP tool call`, `unsupported call: mcp__...__...`, namespace/serverName loss, `Transport closed` | `codex_mcp_runtime` | `trace-to-skill codex-report ./runs` |
|
|
67
68
|
| Plugin runtime and bundled capability failures | Computer Use native pipe path unavailable, Browser/Computer Use settings fail, plugin/list `unknown variant 'vertical'`, stale plugin cache downgrades | `codex_plugin_runtime` | `trace-to-skill codex-report ./runs` |
|
|
@@ -85,7 +86,7 @@ npx trace-to-skill lsp-audit . --format json
|
|
|
85
86
|
- Include `session-audit` output, largest rollout JSONL sizes, largest line sizes, parse-error counts, `session_index.jsonl` line count, bloated title byte/signal counts, unindexed rollout thread count, recoverable `codex resume <id>` commands, hashed project groups, and state-file presence for resume/session-state failures.
|
|
86
87
|
- Include `diagnostics-bundle` output when the issue spans config plus local session/history state, or when you need one metadata-only folder that excludes raw logs, SQLite databases, raw config, and transcripts.
|
|
87
88
|
- Include pre-first-token, thinking, tool, search, read, and compaction timings plus model/speed settings for latency regressions.
|
|
88
|
-
- Include `turn/start`, `task_started`, last successful tool output, first `response_item` timestamp, `responses_http` or websocket evidence, `time.busy` / `time.idle`, MCP/subagent state, stop/interrupt behavior, and minimal-config recovery evidence for Thinking hangs.
|
|
89
|
+
- Include `turn/start`, `task_started`, last successful tool output, accepted prompt/no output/no error/no timeout, status bar `100% left`, first `response_item` timestamp, `RUST_LOG` SSE evidence, unhandled responses events, `responses_http` or websocket evidence, `time.busy` / `time.idle`, status incident or cluster note, MCP/subagent state, stop/Ctrl+C/interrupt behavior, and minimal-config recovery evidence for Thinking or CLI no-response hangs.
|
|
89
90
|
- Include exact Copy menu items, paste source size, generated attachment name/path/size, visible editor text, `pasted-text-attachments.json` or fileAttachments metadata, `/goal` or submit path, preview/edit/revert actions, and clipboard payload format for clipboard/attachment regressions.
|
|
90
91
|
- Include exact redacted `codex://` URI shape, connector/plugin, browser, error dialog text, app running state, AppX/MSIX protocol registration evidence such as AppUserModelID and DelegateExecute, HKCU/HKCR `codex` keys, command-line arguments, `Start-Process "codex://test"` repro, and repair/reinstall/re-register attempts for deeplink/OAuth launch regressions.
|
|
91
92
|
- Include connector/plugin name and id, installed plugin root, exact Codex Apps tool name, 401/reauth text, `link_*` id before and after reconnect/cache regeneration, `isAccessible` state, redacted `codex_apps_tools`/`codex_app_directory` metadata, ChatGPT app page state, and external MCP workaround result for connector auth-cache regressions.
|
|
@@ -93,6 +94,7 @@ npx trace-to-skill lsp-audit . --format json
|
|
|
93
94
|
- Include Codex Desktop/app/CLI version, MultiAgentV2 state, OS, model, parent thread id, child thread ids, exact `spawn_agent` arguments, `fork_turns`, role/profile, whether `multi_tool_use.parallel` or same-turn parallel spawning was used, redacted child rollout line order, first user/task message, assistant/commentary envelope lines, sibling prompt excerpts, `wait_agent` and `close_agent` results, unexpected child tool calls, and sequential single-child versus parallel-child controls for subagent prompt leakage.
|
|
94
95
|
- Include effective `CODEX_HOME`, config files considered, redacted MCP sections, trust/profile/default-permissions state, `codex mcp list/get`, CLI-versus-Desktop/VS Code comparison, loaded config path/log lines, WSL/remote/SSH state, and restart/reload/new-conversation results for MCP discovery mismatches.
|
|
95
96
|
- Include Codex Desktop version, remote Codex CLI/app-server version, local OS, remote OS/architecture, SSH alias from `~/.ssh/config`, `[features].remote_connections = true`, Settings > Connections visibility, selected host/path, remote workspace path, remote filesystem source-of-truth expectation, tunnel/app-server error, codex-server pid/restart evidence, remote PATH/auth/proxy/API reachability, model-list differences, fs/getMetadata/folder listing errors, ForwardAgent/proxy needs, and reconnect/resume behavior for remote connection reports.
|
|
97
|
+
- Include requested surface, platform/architecture, install artifact, exact launch/install error, screenshot text, CLI version and same-machine CLI comparison, package format, JetBrains IDE/version, comments/reactions or signup-form demand evidence, and docs/release-notes support-policy evidence for platform availability reports.
|
|
96
98
|
- Include Codex version, MCP server name, transport URL without secrets, initialize/tools/list/tools/call results, HTTP status, `Content-Type`, SSE event framing, JSON-RPC message shape, session id before/after restart, auth/OAuth expectations, User-Agent/header requirements, parse/deserialize error, another-client comparison, and reconnect/reinitialize behavior for Streamable HTTP MCP reports.
|
|
97
99
|
- Include app/CLI/extension version, OS, surface, shell/Desktop route, `[features].hooks`, redacted `hooks.json`, hook event type, matcher, handler command/name, expected versus observed fire count, duplicate event ids, deprecation warning text, trust state, live edit/rate-limit/auto-restore timing, Code Mode `exec` versus normal CLI comparison, linked-worktree cwd, Hooks settings UI evidence, and restart/reload/new-session behavior for hooks reports.
|
|
98
100
|
- Include terminal emulator/version, shell, WSL/SSH/tmux/Zellij state, streaming state, exact scroll action, viewport snap behavior, first missing or duplicated line id, raw log/transcript proof, terminal capture, numbered-line harness/control output, terminal dimensions/scrollback settings, and `/resume` or transcript recovery behavior for terminal-output integrity reports.
|
|
@@ -121,6 +123,8 @@ npx trace-to-skill lsp-audit . --format json
|
|
|
121
123
|
- Tool-call integrity and rollback failures: https://github.com/openai/codex/issues/25399, https://github.com/openai/codex/issues/25380, https://github.com/openai/codex/issues/25426, https://github.com/openai/codex/issues/7291
|
|
122
124
|
- Undo, rewind, and pre-agent checkpoint needs: https://github.com/openai/codex/issues/9203, https://github.com/openai/codex/issues/11626
|
|
123
125
|
- Latency regressions: https://github.com/openai/codex/issues/24422, https://github.com/openai/codex/issues/21527, https://github.com/openai/codex/issues/11984, https://github.com/openai/codex/issues/12161
|
|
126
|
+
- CLI no-response and command execution hangs: https://github.com/openai/codex/issues/14048, https://github.com/openai/codex/issues/7156
|
|
127
|
+
- Platform availability and unsupported surfaces: https://github.com/openai/codex/issues/10410, https://github.com/openai/codex/issues/11023, https://github.com/openai/codex/issues/4313
|
|
124
128
|
- Deeplink, OAuth callback, and external launch regressions: https://github.com/openai/codex/issues/25203, https://github.com/openai/codex/issues/25231, https://github.com/openai/codex/issues/25368, https://github.com/openai/codex/issues/25333
|
|
125
129
|
- App connector auth cache and stale link regressions: https://github.com/openai/codex/issues/24675, https://github.com/openai/codex/issues/25443
|
|
126
130
|
- Context fork bloat and prompt-cache lineage loss: https://github.com/openai/codex/issues/25467, https://github.com/openai/codex/issues/24704, https://github.com/openai/codex/issues/24150, https://github.com/openai/codex/issues/13491, https://github.com/openai/codex/issues/24281
|
|
@@ -1,10 +1,10 @@
|
|
|
1
1
|
# GitHub Issue Pain Map
|
|
2
2
|
|
|
3
|
-
Generated: 2026-06-01T03:
|
|
3
|
+
Generated: 2026-06-01T03:51:25.847Z
|
|
4
4
|
|
|
5
5
|
Issues analyzed: **46**
|
|
6
|
-
Matched issues: **
|
|
7
|
-
Unmatched issues: **
|
|
6
|
+
Matched issues: **26**
|
|
7
|
+
Unmatched issues: **20**
|
|
8
8
|
|
|
9
9
|
This report maps GitHub issues onto deterministic `trace-to-skill` failure classes. Fetch a repository directly with `--repo`, or export issues with `gh issue list` / `gh search issues` and pass the JSON file.
|
|
10
10
|
|
|
@@ -19,6 +19,7 @@ gh issue list --repo openai/codex --state all --limit 100 --json number,title,bo
|
|
|
19
19
|
|
|
20
20
|
| Priority | Kind | Severity | Issues | Comments | Reactions | Example |
|
|
21
21
|
| ---: | --- | --- | ---: | ---: | ---: | --- |
|
|
22
|
+
| 3101 | `codex_platform_availability` | high | 3 | 328 | 1368 | [#10410 Codex Desktop App: macOS Intel (x86_64) support](https://github.com/openai/codex/issues/10410) |
|
|
22
23
|
| 2438 | `codex_token_burn` | high | 4 | 1151 | 620 | [#14593 Burning tokens very fast](https://github.com/openai/codex/issues/14593) |
|
|
23
24
|
| 2221 | `weak_evidence` | medium | 46 | 4755 | 7794 | [#14593 Burning tokens very fast](https://github.com/openai/codex/issues/14593) |
|
|
24
25
|
| 1895 | `codex_remote_connection` | high | 1 | 176 | 851 | [#10450 Remote Development in Codex Desktop App](https://github.com/openai/codex/issues/10450) |
|
|
@@ -26,24 +27,35 @@ gh issue list --repo openai/codex --state all --limit 100 --json number,title,bo
|
|
|
26
27
|
| 805 | `codex_auth_verification` | high | 3 | 436 | 166 | [#20161 Phone number verification doesn't work](https://github.com/openai/codex/issues/20161) |
|
|
27
28
|
| 631 | `codex_context_visibility` | high | 1 | 160 | 227 | [#23794 Codex Desktop no longer shows visible context/token usage indicator](https://github.com/openai/codex/issues/23794) |
|
|
28
29
|
| 442 | `codex_tool_call_integrity` | high | 1 | 61 | 182 | [#2998 IDE-integrated diff / approval](https://github.com/openai/codex/issues/2998) |
|
|
30
|
+
| 434 | `codex_thinking_hang` | high | 2 | 201 | 103 | [#14048 All models — Codex CLI hangs indefinitely on all prompts, no response generated](https://github.com/openai/codex/issues/14048) |
|
|
29
31
|
| 376 | `codex_remote_compact` | high | 2 | 147 | 101 | [#14860 Error running remote compact task](https://github.com/openai/codex/issues/14860) |
|
|
30
32
|
| 376 | `context_compaction` | high | 2 | 147 | 101 | [#14860 Error running remote compact task](https://github.com/openai/codex/issues/14860) |
|
|
31
33
|
| 351 | `codex_terminal_output_integrity` | high | 1 | 66 | 134 | [#2558 Codex client output truncated when scrolling in Zellij](https://github.com/openai/codex/issues/2558) |
|
|
32
|
-
| 324 | `codex_model_routing_mismatch` | high | 1 | 169 | 69 | [#11189 GPT-5.3-Codex being routed to GPT-5.2](https://github.com/openai/codex/issues/11189) |
|
|
33
|
-
| 261 | `premature_completion` | high | 1 | 60 | 92 | [#2448 Codex CLI: Plus users hitting usage limits extremely quickly compared to competitors](https://github.com/openai/codex/issues/2448) |
|
|
34
34
|
|
|
35
35
|
## Maintainer Roadmap
|
|
36
36
|
|
|
37
37
|
| Rank | Next artifact | Why now | Command |
|
|
38
38
|
| ---: | --- | --- | --- |
|
|
39
|
-
| 1 |
|
|
40
|
-
| 2 |
|
|
41
|
-
| 3 |
|
|
42
|
-
| 4 |
|
|
43
|
-
| 5 |
|
|
39
|
+
| 1 | Platform availability fixture and support-policy evidence report | 3 issue(s), 328 comment(s), severity high; top signal: codex_platform_availability. | `trace-to-skill codex-report ./runs --output openai-codex-platform-availability.md` |
|
|
40
|
+
| 2 | Usage evidence fixture and support-ready token report | 4 issue(s), 1151 comment(s), severity high; top signal: codex_token_burn. | `trace-to-skill usage-evidence ./usage-notes.md --output usage-evidence.md` |
|
|
41
|
+
| 3 | Remote connection fixture and SSH workspace evidence report | 1 issue(s), 176 comment(s), severity high; top signal: codex_remote_connection. | `trace-to-skill codex-report ./runs --output openai-codex-remote-connection.md` |
|
|
42
|
+
| 4 | Privacy/safety guardrail and redacted support bundle | 1 issue(s), 75 comment(s), severity high; top signal: sensitive_file_access. | `trace-to-skill diagnostics-bundle ~/.codex --output codex-diagnostics` |
|
|
43
|
+
| 5 | Auth verification fixture and login support report | 3 issue(s), 436 comment(s), severity high; top signal: codex_auth_verification. | `trace-to-skill codex-report ./runs --output openai-codex-auth-issue.md` |
|
|
44
44
|
|
|
45
45
|
## Suggested Next Actions
|
|
46
46
|
|
|
47
|
+
### codex_platform_availability
|
|
48
|
+
|
|
49
|
+
Priority score: 3101. 3 issue(s), 328 comment(s).
|
|
50
|
+
|
|
51
|
+
Example issues:
|
|
52
|
+
- [#10410 Codex Desktop App: macOS Intel (x86_64) support](https://github.com/openai/codex/issues/10410) (190 comments; labels: enhancement, app)
|
|
53
|
+
- [#4313 Extension for JetBrains IDEs (PyCharm, IntelliJ, etc.)](https://github.com/openai/codex/issues/4313) (70 comments; labels: enhancement)
|
|
54
|
+
- [#11023 Codex desktop app for Linux](https://github.com/openai/codex/issues/11023) (68 comments; labels: enhancement, app)
|
|
55
|
+
|
|
56
|
+
Evidence rule prompts:
|
|
57
|
+
- When reporting Codex platform availability gaps, capture requested surface (Desktop app, IDE extension, or packaged build), platform and architecture such as macOS Intel x86_64 or Linux distro/window system, install artifact and version, exact launch/install error, screenshot text such as prohibited icon or incompatible architecture, CLI version and whether CLI works on the same machine, alternative surfaces tried, package format requested, ecosystem workflow such as JetBrains/PyCharm/IntelliJ, demand evidence from comments/reactions or signup forms, and whether docs/release notes state the support policy.
|
|
58
|
+
|
|
47
59
|
### codex_token_burn
|
|
48
60
|
|
|
49
61
|
Priority score: 2438. 4 issue(s), 1151 comment(s).
|
|
@@ -88,27 +100,17 @@ Example issues:
|
|
|
88
100
|
Evidence rule prompts:
|
|
89
101
|
- When reporting Codex sign-in or account-verification failures, capture the Codex app/CLI/extension version, surface, OS, account type without secrets, workspace or organization context, SSO provider, whether the flow is ChatGPT sign-in, phone/SMS/OTP verification, or extension chat initialization, exact redacted error text, timestamps, whether another device/browser/account works, logout/login attempts, and screenshots with phone numbers, tokens, and email addresses redacted.
|
|
90
102
|
|
|
91
|
-
### codex_context_visibility
|
|
92
|
-
|
|
93
|
-
Priority score: 631. 1 issue(s), 160 comment(s).
|
|
94
|
-
|
|
95
|
-
Example issues:
|
|
96
|
-
- [#23794 Codex Desktop no longer shows visible context/token usage indicator](https://github.com/openai/codex/issues/23794) (160 comments; labels: bug, context, app)
|
|
97
|
-
|
|
98
|
-
Evidence rule prompts:
|
|
99
|
-
- When reporting Codex context-visibility regressions, capture Codex Desktop version, OS, surface, screenshot or short recording of the chat input area, whether the prior context/token indicator or tooltip was visible before the update, exact UI route where it disappeared, local session metadata showing context/window pressure if available, `/status` output if relevant, compaction timing, whether CLI/TUI still exposes a statusline, and how the missing indicator affects long-session decisions.
|
|
100
|
-
|
|
101
103
|
## Unmatched Issues
|
|
102
104
|
|
|
103
|
-
- [#10410 Codex Desktop App: macOS Intel (x86_64) support](https://github.com/openai/codex/issues/10410) (190 comments; labels: enhancement, app)
|
|
104
|
-
- [#14048 All models — Codex CLI hangs indefinitely on all prompts, no response generated](https://github.com/openai/codex/issues/14048) (131 comments; labels: bug, agent)
|
|
105
105
|
- [#2604 Subagent Support](https://github.com/openai/codex/issues/2604) (103 comments; labels: enhancement, subagent)
|
|
106
106
|
- [#12564 Allow renaming task/thread titles to improve history navigation](https://github.com/openai/codex/issues/12564) (77 comments; labels: enhancement, extension)
|
|
107
107
|
- [#2860 Unusable on Windows due to permission ask for every shell command](https://github.com/openai/codex/issues/2860) (77 comments; labels: bug, windows-os)
|
|
108
108
|
- [#2109 Event Hooks](https://github.com/openai/codex/issues/2109) (76 comments; labels: enhancement, hooks)
|
|
109
109
|
- [#2796 BUG: VSCode IDE Plugin on SSH Connection: "Failed to load tasks."](https://github.com/openai/codex/issues/2796) (71 comments; labels: bug, extension)
|
|
110
110
|
- [#16231 High CPU usage on macOS after updating Codex in VS Code extension to 26.325.31654](https://github.com/openai/codex/issues/16231) (71 comments; labels: bug, extension, regression, performance)
|
|
111
|
-
- [#7156 Codex hangs during cli command execution](https://github.com/openai/codex/issues/7156) (70 comments; labels: bug, CLI)
|
|
112
|
-
- [#4313 Extension for JetBrains IDEs (PyCharm, IntelliJ, etc.)](https://github.com/openai/codex/issues/4313) (70 comments; labels: enhancement)
|
|
113
111
|
- [#13041 WebSocket upgrade succeeds then server closes with 1008 Policy (falls back to HTTPS)](https://github.com/openai/codex/issues/13041) (70 comments; labels: bug, connectivity)
|
|
114
112
|
- [#11701 Subagent configuration and orchestration](https://github.com/openai/codex/issues/11701) (69 comments; labels: enhancement, subagent)
|
|
113
|
+
- [#6172 Hitting rate limits](https://github.com/openai/codex/issues/6172) (66 comments; labels: bug, codex-web, rate-limits)
|
|
114
|
+
- [#1481 Bug when I send a first message "stream error: ..."](https://github.com/openai/codex/issues/1481) (65 comments; labels: bug)
|
|
115
|
+
- [#11325 Manual /compact command in Codex app](https://github.com/openai/codex/issues/11325) (61 comments; labels: enhancement, app)
|
|
116
|
+
- [#2101 Plan Mode](https://github.com/openai/codex/issues/2101) (61 comments; labels: enhancement)
|
package/docs/DEMO.md
CHANGED
|
@@ -1,10 +1,10 @@
|
|
|
1
1
|
# trace-to-skill Demo
|
|
2
2
|
|
|
3
|
-
Scenario: **Codex
|
|
3
|
+
Scenario: **Codex platform availability gap**
|
|
4
4
|
|
|
5
|
-
Desktop
|
|
5
|
+
Codex Desktop, Linux app, or JetBrains extension demand is blocked by unsupported architecture, OS, package, or IDE surface.
|
|
6
6
|
|
|
7
|
-
Fixture: `fixtures/codex-
|
|
7
|
+
Fixture: `fixtures/codex-platform-availability.md`
|
|
8
8
|
|
|
9
9
|
This is a packaged public fixture, so you can try the project without collecting a private trace first.
|
|
10
10
|
|
|
@@ -14,7 +14,7 @@ This is a packaged public fixture, so you can try the project without collecting
|
|
|
14
14
|
|
|
15
15
|
Score: **75/100**
|
|
16
16
|
|
|
17
|
-
Likely failure class: **Codex
|
|
17
|
+
Likely failure class: **Codex platform availability or unsupported surface (codex_platform_availability, high)**
|
|
18
18
|
|
|
19
19
|
Agent workflow needs clearer verification, instruction, or security hardening before broad reuse.
|
|
20
20
|
|
|
@@ -23,25 +23,25 @@ Agent workflow needs clearer verification, instruction, or security hardening be
|
|
|
23
23
|
```md
|
|
24
24
|
### What happened?
|
|
25
25
|
|
|
26
|
-
trace-to-skill detected Codex
|
|
26
|
+
trace-to-skill detected Codex platform availability or unsupported surface (codex_platform_availability). Codex adoption is blocked when the CLI works but the official Desktop app, IDE extension, or packaged build is unavailable for a user's platform, architecture, distro, or IDE ecosystem.
|
|
27
27
|
|
|
28
28
|
### Detected failure class
|
|
29
29
|
|
|
30
|
-
-
|
|
30
|
+
- codex_platform_availability: Codex platform availability or unsupported surface (high)
|
|
31
31
|
|
|
32
32
|
### Evidence
|
|
33
33
|
|
|
34
|
-
#### Codex
|
|
35
|
-
- fixtures/codex-
|
|
36
|
-
- fixtures/codex-
|
|
37
|
-
- fixtures/codex-
|
|
38
|
-
- fixtures/codex-
|
|
39
|
-
- fixtures/codex-
|
|
40
|
-
- fixtures/codex-
|
|
34
|
+
#### Codex platform availability or unsupported surface
|
|
35
|
+
- fixtures/codex-platform-availability.md:3 - Public issue cluster: Codex Desktop App macOS Intel support, Codex desktop app for Linux, and JetBrains IDE extension demand.
|
|
36
|
+
- fixtures/codex-platform-availability.md:7 - - A user requests macOS Intel x86_64 support for the Codex Desktop App or a Universal build with arm64 + x86_64.
|
|
37
|
+
- fixtures/codex-platform-availability.md:8 - - Environment evidence: Intel Mac, `uname -m => x86_64`, macOS 13/14/15, Codex `.dmg`, and `Codex.app`.
|
|
38
|
+
- fixtures/codex-platform-availability.md:9 - - When the `.dmg` is mounted, `Codex.app` shows the prohibited symbol and macOS says the app can't run on this Mac because of incompatible architecture.
|
|
39
|
+
- fixtures/codex-platform-availability.md:10 - - Codex CLI works fine on the same machine, but the desktop app cannot launch: `which codex => /usr/local/bin/codex` and `codex --version => codex-cli 0.58.0`.
|
|
40
|
+
- fixtures/codex-platform-availability.md:16 - - Users ask for an official Codex desktop app on Linux because they want the app experience on Ubuntu, Arch, NixOS, Fedora, Debian, Wayland, and X11 desktops.
|
|
41
41
|
|
|
42
42
|
### Diagnostics to attach
|
|
43
43
|
|
|
44
|
-
- When reporting Codex
|
|
44
|
+
- When reporting Codex platform availability gaps, capture requested surface (Desktop app, IDE extension, or packaged build), platform and architecture such as macOS Intel x86_64 or Linux distro/window system, install artifact and version, exact launch/install error, screenshot text such as prohibited icon or incompatible architecture, CLI version and whether CLI works on the same machine, alternative surfaces tried, package format requested, ecosystem workflow such as JetBrains/PyCharm/IntelliJ, demand evidence from comments/reactions or signup forms, and whether docs/release notes state the support policy.
|
|
45
45
|
|
|
46
46
|
### Privacy
|
|
47
47
|
|
|
@@ -50,25 +50,25 @@ trace-to-skill detected Codex remote connection or SSH workspace failure (codex_
|
|
|
50
50
|
|
|
51
51
|
## Findings
|
|
52
52
|
|
|
53
|
-
### 1. Codex
|
|
53
|
+
### 1. Codex platform availability or unsupported surface
|
|
54
54
|
|
|
55
55
|
Severity: **high**
|
|
56
56
|
|
|
57
|
-
|
|
57
|
+
Codex adoption is blocked when the CLI works but the official Desktop app, IDE extension, or packaged build is unavailable for a user's platform, architecture, distro, or IDE ecosystem.
|
|
58
58
|
|
|
59
59
|
Evidence:
|
|
60
|
-
- `fixtures/codex-
|
|
61
|
-
- `fixtures/codex-
|
|
62
|
-
- `fixtures/codex-
|
|
63
|
-
- `fixtures/codex-
|
|
64
|
-
- `fixtures/codex-
|
|
65
|
-
- `fixtures/codex-
|
|
66
|
-
- `fixtures/codex-
|
|
67
|
-
- `fixtures/codex-
|
|
60
|
+
- `fixtures/codex-platform-availability.md:3` Public issue cluster: Codex Desktop App macOS Intel support, Codex desktop app for Linux, and JetBrains IDE extension demand.
|
|
61
|
+
- `fixtures/codex-platform-availability.md:7` - A user requests macOS Intel x86_64 support for the Codex Desktop App or a Universal build with arm64 + x86_64.
|
|
62
|
+
- `fixtures/codex-platform-availability.md:8` - Environment evidence: Intel Mac, `uname -m => x86_64`, macOS 13/14/15, Codex `.dmg`, and `Codex.app`.
|
|
63
|
+
- `fixtures/codex-platform-availability.md:9` - When the `.dmg` is mounted, `Codex.app` shows the prohibited symbol and macOS says the app can't run on this Mac because of incompatible architecture.
|
|
64
|
+
- `fixtures/codex-platform-availability.md:10` - Codex CLI works fine on the same machine, but the desktop app cannot launch: `which codex => /usr/local/bin/codex` and `codex --version => codex-cli 0.58.0`.
|
|
65
|
+
- `fixtures/codex-platform-availability.md:16` - Users ask for an official Codex desktop app on Linux because they want the app experience on Ubuntu, Arch, NixOS, Fedora, Debian, Wayland, and X11 desktops.
|
|
66
|
+
- `fixtures/codex-platform-availability.md:21` ## JetBrains IDE Extension
|
|
67
|
+
- `fixtures/codex-platform-availability.md:23` - Users request an official Codex extension or plugin for JetBrains IDEs such as PyCharm, IntelliJ, WebStorm, CLion, and Rider.
|
|
68
68
|
|
|
69
69
|
Suggested rule:
|
|
70
70
|
|
|
71
|
-
> When reporting Codex
|
|
71
|
+
> When reporting Codex platform availability gaps, capture requested surface (Desktop app, IDE extension, or packaged build), platform and architecture such as macOS Intel x86_64 or Linux distro/window system, install artifact and version, exact launch/install error, screenshot text such as prohibited icon or incompatible architecture, CLI version and whether CLI works on the same machine, alternative surfaces tried, package format requested, ecosystem workflow such as JetBrains/PyCharm/IntelliJ, demand evidence from comments/reactions or signup forms, and whether docs/release notes state the support policy.
|
|
72
72
|
|
|
73
73
|
|
|
74
74
|
## Reporter Notes
|
|
@@ -89,6 +89,7 @@ Suggested rule:
|
|
|
89
89
|
- `latency-regression`: Fast mode feels like Standard, with long thinking, search, read, or compaction stalls.
|
|
90
90
|
- `model-routing-mismatch`: Codex shows one selected model while SSE response evidence shows a different server-side model was used.
|
|
91
91
|
- `thinking-hang`: A turn or tool call completes, but the session stays on Thinking or Working with no streamed follow-up.
|
|
92
|
+
- `cli-no-response`: Codex CLI accepts prompts but produces no streaming output, no error, no timeout, or hangs during command execution.
|
|
92
93
|
- `clipboard-attachment`: Copy as Markdown, long-paste conversion, or generated Pasted text.txt attachments break prompt and report workflows.
|
|
93
94
|
- `deeplink-launch`: OAuth callbacks, notification clicks, mobile links, or `codex app <path>` external activation fail to route into Codex.
|
|
94
95
|
- `connector-auth-cache`: App connectors keep stale `link_*` auth or discovery metadata after reauth-required responses.
|
|
@@ -100,6 +101,7 @@ Suggested rule:
|
|
|
100
101
|
- `subagent-lifecycle`: Completed, closed, stale, or interrupted subagents diverge between UI, live registry, persisted state, quota, and parent discoverability.
|
|
101
102
|
- `usage-bucket-confusion`: Usage popovers show 5h and weekly percentages without clear remaining/used, rolling/calendar, or account/workspace scope.
|
|
102
103
|
- `context-visibility`: Desktop context or token usage indicators disappear, leaving long-session compaction pressure invisible.
|
|
104
|
+
- `remote-connection`: Desktop remote SSH workspaces, Settings > Connections, remote app-server, tunnel, or remote filesystem evidence breaks.
|
|
103
105
|
- `token-burn`: Usage drains from background polling, idle activity, compaction loops, retries, or cached-heavy turns.
|
|
104
106
|
- `patch-overwrite`: `apply_patch` accepts `*** Add File` for an existing path, turning a create operation into a silent overwrite.
|
|
105
107
|
- `sensitive-files`: Secrets, local credentials, production env files, or private databases enter agent context.
|
|
@@ -115,6 +117,7 @@ trace-to-skill demo subagent-prompt-leakage
|
|
|
115
117
|
trace-to-skill demo windows-helper-path
|
|
116
118
|
trace-to-skill demo patch-overwrite
|
|
117
119
|
trace-to-skill demo thinking-hang
|
|
120
|
+
trace-to-skill demo cli-no-response
|
|
118
121
|
trace-to-skill demo clipboard-attachment
|
|
119
122
|
trace-to-skill demo deeplink-launch
|
|
120
123
|
trace-to-skill demo connector-auth-cache
|
|
@@ -126,6 +129,7 @@ trace-to-skill demo subagent-lifecycle
|
|
|
126
129
|
trace-to-skill demo usage-bucket-confusion
|
|
127
130
|
trace-to-skill demo context-visibility
|
|
128
131
|
trace-to-skill demo remote-connection
|
|
132
|
+
trace-to-skill demo platform-availability
|
|
129
133
|
trace-to-skill demo file-tree-ui
|
|
130
134
|
trace-to-skill demo usage-reset-drift
|
|
131
135
|
```
|
package/docs/FAILURE_TAXONOMY.md
CHANGED
|
@@ -52,9 +52,9 @@ The fix is to capture app/CLI/extension version, model and speed/reasoning setti
|
|
|
52
52
|
|
|
53
53
|
Codex can accept a turn, finish a local tool call, or keep a Responses request open while the UI or CLI remains on Thinking/Working with no streamed assistant follow-up. This is more specific than general latency: the session appears structurally accepted but the next visible assistant event never arrives or arrives after a very long gap.
|
|
54
54
|
|
|
55
|
-
Common signals include `turn/start`, `task_started`, a successful tool output followed by no next assistant action, a long gap before the first `response_item`, `model_client.stream_responses_api` close lines where `time.busy` is milliseconds but `time.idle` is hundreds of seconds, `responses_http` or websocket reconnects, Stop/Ctrl+C not interrupting the stuck turn, a subagent parent thread waiting while a child remains active, and minimal `config.toml` without MCPs changing the behavior.
|
|
55
|
+
Common signals include `turn/start`, `task_started`, a successful tool output followed by no next assistant action, a CLI prompt accepted with no streaming output, no error, and no timeout, status bar `100% left` with no tokens consumed, `codex exec --sandbox read-only --model ... 'ping'` stopping after `mcp startup: no servers`, `unhandled responses event` SSE lines, terminal command execution hanging, a status incident or unhealthy-cluster reroute note, a long gap before the first `response_item`, `model_client.stream_responses_api` close lines where `time.busy` is milliseconds but `time.idle` is hundreds of seconds, `responses_http` or websocket reconnects, Stop/Ctrl+C not interrupting the stuck turn, a subagent parent thread waiting while a child remains active, and minimal `config.toml` without MCPs changing the behavior.
|
|
56
56
|
|
|
57
|
-
The fix is to capture Codex version, OS, model and speed/reasoning settings, turn or thread id, prompt timestamp, last successful tool output, first `response_item` timestamp, transport evidence, `time.busy` / `time.idle`, reconnect or stream-close lines, MCP/subagent lifecycle state, stop/interrupt behavior, and whether a new thread or minimal config recovers.
|
|
57
|
+
The fix is to capture Codex app/CLI/extension version, OS/terminal such as WSL, model and speed/reasoning settings, subscription/workspace, turn or thread id, prompt timestamp, whether the prompt is accepted but no output/error/timeout appears, status bar or usage percent, last successful tool output, first `response_item` timestamp, `RUST_LOG`/SSE evidence, transport evidence, `time.busy` / `time.idle`, reconnect or stream-close lines, status incident link or cluster mitigation note, MCP/subagent lifecycle state, stop/Ctrl+C/interrupt behavior, and whether a new thread, logout/login, downgrade, API billing path, or minimal config recovers.
|
|
58
58
|
|
|
59
59
|
## Codex Clipboard Attachment
|
|
60
60
|
|
|
@@ -168,6 +168,14 @@ Common signals include `Remote Development in Codex Desktop App`, `Remote SSH`,
|
|
|
168
168
|
|
|
169
169
|
The fix is to capture Codex Desktop version, remote Codex CLI/app-server version, local OS, remote OS/architecture, SSH alias from `~/.ssh/config`, whether `[features].remote_connections = true` is set, Settings > Connections visibility, selected host/path, remote workspace path, whether the remote filesystem is the source of truth, exact tunnel/app-server error, codex-server pid and restart result, remote PATH/auth/proxy/API reachability, model list differences versus local, fs/getMetadata or folder listing errors, ForwardAgent/proxy requirements, and whether reconnect/resume or a clean host works.
|
|
170
170
|
|
|
171
|
+
## Codex Platform Availability
|
|
172
|
+
|
|
173
|
+
Codex adoption can be blocked before a session starts when the official Desktop app, packaged build, or IDE extension does not exist for the user's platform, architecture, distro, or IDE ecosystem even though another Codex surface such as the CLI works.
|
|
174
|
+
|
|
175
|
+
Common signals include macOS Intel `x86_64`, `uname -m => x86_64`, `Codex.app` from a `.dmg` showing a prohibited icon, incompatible architecture or can't-run-on-this-Mac messages, requests for an x86_64 or Universal build, CLI working on the same machine, Linux desktop app requests for Ubuntu, Arch, NixOS, Fedora, Debian, Wayland, X11, AppImage, Flatpak, Snap, `.deb`, or `.rpm`, and JetBrains/PyCharm/IntelliJ/WebStorm extension requests.
|
|
176
|
+
|
|
177
|
+
The fix is to capture the requested surface, platform and architecture, install artifact, exact error and screenshot text, Codex CLI version and whether CLI works on the same machine, package format or marketplace desired, IDE name/version, demand evidence such as comments/reactions or signup forms, and whether docs, release notes, or roadmap state the support policy.
|
|
178
|
+
|
|
171
179
|
## Codex Subagent Prompt Leakage
|
|
172
180
|
|
|
173
181
|
Codex MultiAgentV2 child agents can fail the task boundary even when the parent asks for isolated children. When `spawn_agent` with `fork_turns: "none"` records the delegated task as an assistant/commentary JSON envelope, or a same-turn parallel child sees a sibling prompt, independent review, QA, and security lanes are no longer independent.
|
package/docs/OPENAI_OSS_BRIEF.md
CHANGED
|
@@ -3,14 +3,14 @@
|
|
|
3
3
|
| Field | Value |
|
|
4
4
|
| --- | --- |
|
|
5
5
|
| Repository | https://github.com/grnbtqdbyx-create/trace-to-skill |
|
|
6
|
-
| Package | trace-to-skill@0.1.
|
|
6
|
+
| Package | trace-to-skill@0.1.99 |
|
|
7
7
|
| License | Apache-2.0 |
|
|
8
8
|
| Codex readiness | ready (100/100) |
|
|
9
|
-
| Benchmark | pass,
|
|
9
|
+
| Benchmark | pass, 44 cases |
|
|
10
10
|
|
|
11
11
|
## Why This Repository Qualifies
|
|
12
12
|
|
|
13
|
-
trace-to-skill helps open-source maintainers adopt Codex safely by turning failed coding-agent runs into evidence-backed rules, reusable workflows, CI gates, and a weekly Codex Issue Radar for live GitHub issue demand. It supports real maintenance work: PR review, issue triage, release quality, MCP risk, prompt-injection defense, privacy-preserving trace sharing, and repeat failure reduction. The repository is ready, scores 100/100 on the local Codex readiness doctor, and ships a deterministic benchmark with
|
|
13
|
+
trace-to-skill helps open-source maintainers adopt Codex safely by turning failed coding-agent runs into evidence-backed rules, reusable workflows, CI gates, and a weekly Codex Issue Radar for live GitHub issue demand. It supports real maintenance work: PR review, issue triage, release quality, MCP risk, prompt-injection defense, privacy-preserving trace sharing, and repeat failure reduction. The repository is ready, scores 100/100 on the local Codex readiness doctor, and ships a deterministic benchmark with 44 public fixture cases.
|
|
14
14
|
|
|
15
15
|
### 500-Character Version
|
|
16
16
|
|
|
@@ -27,10 +27,10 @@ API credits would power optional maintainer workflows on top of the local determ
|
|
|
27
27
|
## Evidence
|
|
28
28
|
|
|
29
29
|
- Public repository: https://github.com/grnbtqdbyx-create/trace-to-skill
|
|
30
|
-
- One-command package: npx trace-to-skill@0.1.
|
|
30
|
+
- One-command package: npx trace-to-skill@0.1.99
|
|
31
31
|
- Open-source license: Apache-2.0
|
|
32
32
|
- Codex readiness doctor: ready, 100/100, 0 failed checks.
|
|
33
|
-
- Public fixture benchmark: pass,
|
|
33
|
+
- Public fixture benchmark: pass, 44 cases.
|
|
34
34
|
- GitHub issue demand mining: issue-map fetches or reads piped GitHub CLI issue JSON, then ranks OpenAI/Codex issues by failure class, comments, reactions, evidence gaps, and Maintainer Roadmap next artifacts.
|
|
35
35
|
- Weekly Codex Issue Radar: init --issue-map-repo owner/name scaffolds a scheduled Action that fetches live GitHub issues and publishes the pain map to the job summary or a stable tracking issue comment.
|
|
36
36
|
- Maintainer control: generated rules are suggestions, evidence is line-linked, and secrets can be redacted before sharing.
|
package/docs/SCORECARD.md
CHANGED
|
@@ -9,7 +9,7 @@ Status: **pass**
|
|
|
9
9
|
| Failed doctor checks | 0 |
|
|
10
10
|
| Critical findings | 0 |
|
|
11
11
|
| Built-in benchmark | pass |
|
|
12
|
-
| Benchmark cases |
|
|
12
|
+
| Benchmark cases | 44 |
|
|
13
13
|
|
|
14
14
|
## Doctor Summary
|
|
15
15
|
|
|
@@ -34,6 +34,7 @@ This benchmark runs the public fixture pack that ships with the repository and p
|
|
|
34
34
|
| Codex selected model differs from actual routed model | `fixtures/codex-model-routing-mismatch.md` | 75 | 2 | 0 | `codex_model_routing_mismatch`, `weak_evidence` | pass |
|
|
35
35
|
| Codex model and runtime latency regression | `fixtures/codex-latency-regression.md` | 75 | 2 | 0 | `codex_latency_regression`, `weak_evidence` | pass |
|
|
36
36
|
| Codex thinking and stream hang | `fixtures/codex-thinking-hang.md` | 75 | 2 | 0 | `codex_thinking_hang`, `weak_evidence` | pass |
|
|
37
|
+
| Codex CLI no-response and command execution hang | `fixtures/codex-cli-no-response.md` | 75 | 2 | 0 | `codex_thinking_hang`, `weak_evidence` | pass |
|
|
37
38
|
| Codex clipboard, paste, and attachment workflow regression | `fixtures/codex-clipboard-attachment.md` | 75 | 2 | 0 | `codex_clipboard_attachment`, `weak_evidence` | pass |
|
|
38
39
|
| Codex deeplink, OAuth callback, and external launch regression | `fixtures/codex-deeplink-launch.md` | 50 | 4 | 0 | `codex_deeplink_launch`, `codex_remote_control`, `hallucinated_file`, `weak_evidence` | pass |
|
|
39
40
|
| Codex app connector auth cache and stale link regression | `fixtures/codex-connector-auth-cache.md` | 75 | 2 | 0 | `codex_connector_auth_cache`, `weak_evidence` | pass |
|
|
@@ -58,6 +59,7 @@ This benchmark runs the public fixture pack that ships with the repository and p
|
|
|
58
59
|
| Codex usage bucket scope and percentage confusion | `fixtures/codex-usage-bucket-confusion.md` | 59 | 3 | 0 | `codex_token_burn`, `codex_usage_bucket_confusion`, `weak_evidence` | pass |
|
|
59
60
|
| Codex context or token usage indicator missing | `fixtures/codex-context-visibility.md` | 75 | 2 | 0 | `codex_context_visibility`, `weak_evidence` | pass |
|
|
60
61
|
| Codex remote connection or SSH workspace failure | `fixtures/codex-remote-connection.md` | 75 | 2 | 0 | `codex_remote_connection`, `weak_evidence` | pass |
|
|
62
|
+
| Codex platform availability and unsupported surface demand | `fixtures/codex-platform-availability.md` | 75 | 2 | 0 | `codex_platform_availability`, `weak_evidence` | pass |
|
|
61
63
|
| Codex token burn and usage-drain loop | `fixtures/codex-token-burn.md` | 75 | 2 | 0 | `codex_token_burn`, `weak_evidence` | pass |
|
|
62
64
|
| Codex resource leak and runaway process | `fixtures/codex-resource-leak.md` | 75 | 2 | 0 | `codex_resource_leak`, `weak_evidence` | pass |
|
|
63
65
|
| Codex tool-call integrity and rollback failure | `fixtures/codex-tool-call-integrity.md` | 43 | 4 | 0 | `codex_resource_leak`, `codex_subagent_lifecycle`, `codex_tool_call_integrity`, `weak_evidence` | pass |
|
package/docs/USE_CASES.md
CHANGED
|
@@ -59,7 +59,7 @@ What it proves:
|
|
|
59
59
|
Recommended CI surface:
|
|
60
60
|
|
|
61
61
|
```yaml
|
|
62
|
-
- uses: grnbtqdbyx-create/trace-to-skill@v0.1.
|
|
62
|
+
- uses: grnbtqdbyx-create/trace-to-skill@v0.1.99
|
|
63
63
|
with:
|
|
64
64
|
mode: all
|
|
65
65
|
doctor-threshold: "85"
|
|
@@ -361,6 +361,32 @@ This catches signals such as `Remote Development in Codex Desktop App`, missing
|
|
|
361
361
|
|
|
362
362
|
Include Codex Desktop version, remote Codex CLI/app-server version, local OS, remote OS/architecture, selected SSH host/path, whether the remote filesystem is the source of truth, exact tunnel/app-server/folder-listing/model-list/auth/proxy error, process evidence such as `ps -ef | rg 'codex app-server|openai.chatgpt.*/codex'` when available, and whether killing codex-server, reinstalling remote Codex, reconnecting, or trying a clean host changes the result.
|
|
363
363
|
|
|
364
|
+
## 18.3. Codex Platform Availability Evidence
|
|
365
|
+
|
|
366
|
+
Use this when Codex demand is blocked by unsupported architecture, OS, package format, or IDE surface, such as macOS Intel Desktop support, Linux Desktop app support, or a JetBrains extension.
|
|
367
|
+
|
|
368
|
+
```bash
|
|
369
|
+
npx trace-to-skill demo platform-availability
|
|
370
|
+
npx trace-to-skill codex-report ./runs --output openai-codex-platform-availability.md
|
|
371
|
+
```
|
|
372
|
+
|
|
373
|
+
This catches signals such as `macOS Intel`, `x86_64`, Universal build requests, `Codex.app` showing the prohibited icon, incompatible architecture, CLI working on the same machine while the Desktop app cannot launch, Linux desktop app requests for Ubuntu/Arch/NixOS/Wayland/package formats, and JetBrains/PyCharm/IntelliJ plugin demand.
|
|
374
|
+
|
|
375
|
+
Include requested surface, platform and architecture, install artifact, exact launch/install error, screenshot text, CLI version and whether CLI works on the same machine, package format, IDE/version, linked issue, comments/reactions, signup form, and whether docs/release notes state whether support is planned.
|
|
376
|
+
|
|
377
|
+
## 18.4. Codex CLI No-Response Evidence
|
|
378
|
+
|
|
379
|
+
Use this when Codex CLI accepts prompts but produces no streaming output, no error, no timeout, or hangs during command execution.
|
|
380
|
+
|
|
381
|
+
```bash
|
|
382
|
+
npx trace-to-skill demo cli-no-response
|
|
383
|
+
npx trace-to-skill codex-report ./runs --output openai-codex-cli-no-response.md
|
|
384
|
+
```
|
|
385
|
+
|
|
386
|
+
This catches signals such as all-model hangs, `gpt-5.4 high - 100% left`, no tokens consumed, simple prompts like `Hello` or `ping` never producing output, `codex exec --sandbox read-only --model gpt-5.3-codex 'ping'`, `mcp startup: no servers`, unhandled responses events, CLI/VS Code stuck on Thinking or Working, terminal command execution hangs, status incidents, unhealthy cluster reroutes, and Ctrl+C or `/exit` delays.
|
|
387
|
+
|
|
388
|
+
Include CLI/app/extension version, OS/terminal/WSL, subscription/workspace, model and reasoning/speed settings, prompt timestamp, exact prompt, whether the prompt was accepted but no stream/error/timeout appeared, status bar or usage percent, `RUST_LOG` SSE snippets, transport, first `response_item` or assistant timestamp if it appears later, reconnect or stream-disconnect lines, status incident link, and recovery attempts such as new thread, downgrade, logout/login, API billing path, or minimal config without MCPs.
|
|
389
|
+
|
|
364
390
|
## 19. Codex File Tree UI Evidence
|
|
365
391
|
|
|
366
392
|
Use this when Codex Desktop cannot reveal project files through the native file tree, folder icon, floating file panel, or built-in preview.
|