autonomous-coding-toolkit 1.0.1 → 1.0.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (2) hide show
  1. package/README.md +5 -4
  2. package/package.json +1 -1
package/README.md CHANGED
@@ -155,10 +155,11 @@ Core skill chain forked from [superpowers](https://github.com/obra/superpowers)
155
155
 
156
156
  The toolkit's design is grounded in peer-reviewed research. Key papers:
157
157
 
158
- - **SWE-bench Pro** (Xia et al., 2025) — 1,865 programming problems; removing specifications degraded agent success from 25.9% to 8.4%
159
- - **Chroma** (Hong et al., 2025) — Long-context coding benchmark; 11 of 12 models scored below 50% accuracy at 32K tokens
160
- - **Lost in the Middle** (Liu et al., Stanford TACL 2024) — Information placed mid-context suffers up to 20 percentage point accuracy loss
161
- - **OOPSLA 2025** — Property-based testing finds ~50x more mutations per test than traditional unit tests
158
+ - [**SWE-bench Pro**](https://arxiv.org/pdf/2509.16941) (Xia et al., 2025) — 1,865 programming problems; removing specifications degraded agent success from 25.9% to 8.4%
159
+ - [**Context Rot**](https://research.trychroma.com/context-rot) (Hong et al., Chroma 2025) — 11 of 12 models scored below 50% of short-context performance at 32K tokens
160
+ - [**Lost in the Middle**](https://arxiv.org/abs/2307.03172) (Liu et al., Stanford TACL 2024) — Information placed mid-context suffers up to 20 percentage point accuracy loss
161
+ - [**Agentic Property-Based Testing**](https://arxiv.org/html/2510.09907v1) (OOPSLA 2025) — Property-based testing finds ~50x more mutations per test than traditional unit tests
162
+ - [**Bugs in LLM-Generated Code**](https://arxiv.org/abs/2403.08937) (Tambon et al., 2024) — Empirical taxonomy of AI code generation failures
162
163
  - **Cooper Stage-Gate** — Projects with stable, upfront definitions are 3x more likely to succeed
163
164
 
164
165
  16 research reports synthesizing 25+ papers: [`docs/RESEARCH.md`](docs/RESEARCH.md)
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "autonomous-coding-toolkit",
3
- "version": "1.0.1",
3
+ "version": "1.0.2",
4
4
  "description": "Autonomous AI coding pipeline: quality gates, fresh-context execution, community lessons, and compounding learning",
5
5
  "license": "MIT",
6
6
  "author": "Justin McFarland <parthalon025@gmail.com>",