researchloop 0.1.0
- package/CHANGELOG.md +28 -0
- package/LICENSE +21 -0
- package/README.md +146 -0
- package/bin/researchloop.js +900 -0
- package/docs/getting-started.md +283 -0
- package/package.json +37 -0
- package/templates/adapters/generic.md +18 -0
- package/templates/adapters/huggingface.md +15 -0
- package/templates/adapters/llm-research-kit.md +20 -0
- package/templates/adapters/pytorch.md +26 -0
- package/templates/base/AGENTS.md +47 -0
- package/templates/base/goal.md +22 -0
- package/templates/base/plan.md +22 -0
- package/templates/base/scratchpad/THREAD.md +13 -0
- package/templates/base/scratchpad/audits.md +14 -0
- package/templates/base/scratchpad/ideas/.gitkeep +1 -0
- package/templates/base/scratchpad/papers/.gitkeep +1 -0
- package/templates/base/scratchpad/picklist.md +15 -0
- package/templates/base/scratchpad/runs.jsonl +1 -0
- package/templates/base/scratchpad/sweeps/.gitkeep +1 -0
- package/templates/base/scratchpad/variants/.gitkeep +1 -0
- package/templates/dashboard/index.html +627 -0
- package/templates/prompts/claude-code.md +30 -0
- package/templates/prompts/codex.md +29 -0
- package/templates/prompts/focus/architecture.md +30 -0
- package/templates/prompts/focus/attention.md +27 -0
- package/templates/prompts/focus/hyperparameters.md +32 -0
- package/templates/prompts/generic.md +8 -0
- package/templates/prompts/hermes.md +26 -0
package/CHANGELOG.md
ADDED
@@ -0,0 +1,28 @@
+# Changelog
+
+## 0.1.0
+
+First public ResearchLoop release.
+
+Includes:
+
+- `researchloop init`
+- `researchloop goal`
+- `researchloop inspect`
+- `researchloop idea`
+- `researchloop prompt`
+- `researchloop doctor`
+- `researchloop record`
+- `researchloop compare`
+- `researchloop dashboard`
+- `researchloop report`
+
+Also includes:
+
+- local-first dashboard
+- MacBook / MPS smoke path
+- startup docs
+- onboarding tests
+- competitor research notes
+- open source release plan
+
package/LICENSE
ADDED
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2026 Vuk Rosic
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
package/README.md
ADDED
@@ -0,0 +1,146 @@
+# ResearchLoop
+
+ResearchLoop is an open source npm package for autonomous AI research workflows.
+
+If you want the full step-by-step usage and publish guide, read [docs/getting-started.md](./docs/getting-started.md).
+
+It installs a durable research harness into a machine learning repo so agents like Codex, Claude Code, Hermes, Cursor, and similar tools can inspect the code, propose experiments, run small checks, log results, and continue the loop without losing context.
+
+This repo is both the product and the startup home base.
+
+## Install
+
+```bash
+npm install -g researchloop
+```
+
+Local development from this checkout:
+
+```bash
+cd /Users/vukrosic/my-life/researchloop
+npm link
+researchloop --help
+```
+
+## Quick Start
+
+```bash
+researchloop init --agent codex
+researchloop goal "lower validation loss"
+researchloop inspect
+researchloop idea --write
+researchloop prompt --agent codex
+researchloop prompt --agent codex --focus hyperparameters
+researchloop dashboard
+researchloop doctor
+researchloop record --id first-run --status complete --metric val_loss=2.31 --note "First logged experiment"
+researchloop compare --metric val_loss --direction lower
+researchloop report
+```
+
+Then paste the generated prompt into the coding agent.
+
+## What It Creates
+
+```text
+.researchloop/
+  AGENTS.md
+  goal.md
+  plan.md
+  repo-profile.json
+  adapters/
+  scratchpad/
+    THREAD.md
+    runs.jsonl
+    ideas/
+    papers/
+    variants/
+    sweeps/
+```
+
+The package does not claim to magically train every model. It gives an agent the operating system for serious research: constraints, baseline-first behavior, experiment logs, idea files, and reproducible reports.
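Reviewer note on the run ledger: the Quick Start passes `--metric val_loss=2.31` to `researchloop record`, and the tree lists `scratchpad/runs.jsonl`, but the README excerpt does not show what a ledger line looks like. The sketch below is one plausible way to append such a record as a JSON line. The field names (`id`, `status`, `metrics`, `note`, `recordedAt`) and the helper names `parseMetric`, `appendRun`, and `readRuns` are assumptions made for illustration, not the schema or API of `bin/researchloop.js`.

```javascript
// Hypothetical sketch: the record shape is an assumption, not the
// schema actually written by bin/researchloop.js.
const fs = require("node:fs");
const os = require("node:os");
const path = require("node:path");

// Parse a "name=value" pair such as "val_loss=2.31" into [name, number].
function parseMetric(pair) {
  const idx = pair.indexOf("=");
  if (idx <= 0) throw new Error(`expected name=value, got "${pair}"`);
  const name = pair.slice(0, idx);
  const raw = pair.slice(idx + 1).trim();
  const value = raw === "" ? NaN : Number(raw);
  if (!Number.isFinite(value)) {
    throw new Error(`metric "${name}" is not a finite number`);
  }
  return [name, value];
}

// Append one run as a single JSON line, so the ledger stays append-only.
function appendRun(ledgerPath, { id, status, metrics = [], note = "" }) {
  const record = {
    id,
    status,
    metrics: Object.fromEntries(metrics.map(parseMetric)),
    note,
    recordedAt: new Date().toISOString(),
  };
  fs.mkdirSync(path.dirname(ledgerPath), { recursive: true });
  fs.appendFileSync(ledgerPath, JSON.stringify(record) + "\n");
  return record;
}

// Read the ledger back, skipping blank lines.
function readRuns(ledgerPath) {
  return fs
    .readFileSync(ledgerPath, "utf8")
    .split("\n")
    .filter((line) => line.trim() !== "")
    .map((line) => JSON.parse(line));
}

// Usage: write to a throwaway directory rather than a real repo.
const dir = fs.mkdtempSync(path.join(os.tmpdir(), "researchloop-"));
const ledger = path.join(dir, ".researchloop", "scratchpad", "runs.jsonl");
appendRun(ledger, {
  id: "first-run",
  status: "complete",
  metrics: ["val_loss=2.31"],
  note: "First logged experiment",
});
console.log(readRuns(ledger));
```

One JSON object per line means a new result can be appended without rewriting earlier entries, which fits the README's description of `record` as appending to `runs.jsonl`.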
+
+## Repo Layout
+
+```text
+bin/                 CLI entrypoint
+templates/           Harness, adapters, and agent prompts
+docs/site/           Landing page
+docs/research/       Local testing notes and research logs
+docs/competitors/    Competitor and adjacent-project research
+docs/testing/        Setup and onboarding test plans
+docs/startup/        Users, customers, open source, and go-to-market
+examples/            Copyable end-to-end usage examples
+examples/fixtures/   Minimal repo fixtures used by setup tests
+scripts/             Smoke tests for the npm package
+```
+
+## Current Evidence
+
+Tested on this MacBook:
+
+- `researchloop init`, `inspect`, `prompt`, `doctor`, and `report` pass in a clean temp repo.
+- `researchloop inspect` correctly detects `llm-research-kit` as `generic`, `pytorch`, `huggingface`, and `llm-research-kit`.
+- `researchloop doctor` confirms local torch 2.8.0, CUDA false, MPS true.
+- A tiny synthetic LLM training run completed locally through `llm-research-kit` on MPS.
+
+See `docs/research/experiments/macbook-e2e-2026-05-15.md`.
+
+## Product Thesis
+
+Autonomous AI research is bottlenecked less by model access than by research discipline. Most repos lack a stable loop for:
+
+- clear goals
+- baselines
+- small experiments
+- run logs
+- comparison
+- pruning
+- continuation
+
+ResearchLoop packages that loop as an open source npm tool.
+
+## Users
+
+Primary users:
+
+- PhD students running ablations
+- small AI labs
+- independent AI researchers
+- companies with model, prompt, or eval optimization work
+
+The startup plan is in `docs/startup/`.
+
+## Commands
+
+- `researchloop init` creates `.researchloop/` and agent instruction files.
+- `researchloop goal` saves a durable research objective in `.researchloop/goal.md`.
+- `researchloop inspect` writes `.researchloop/repo-profile.json`.
+- `researchloop idea` generates ranked experiment ideas and can write an idea note.
+- `researchloop prompt` prints an agent-ready autonomous research prompt, with optional focus playbooks.
+- `researchloop dashboard` starts a local localhost dashboard for experiment tracking.
+- `researchloop doctor` checks basic local tooling.
+- `researchloop record` appends a structured run result to `runs.jsonl`.
+- `researchloop compare` ranks runs by a chosen metric.
+- `researchloop report` summarizes the run ledger.
+- `npm run test:setup` runs the blank-repo and minimal-fixture setup checks.
+- `npm run test:compare` checks comparison output for a few recorded runs.
+- `npm run test:goal` checks goal saving and prompt handoff.
+- `npm run test:idea` checks idea generation for a blank repo and an llm-research-kit-shaped repo.
+- `npm run test:dashboard` checks the local dashboard server and API.
+- `npm run test:prompts` checks prompt templates for placeholder drift.
+- `npm run test:focus-prompts` checks the hyperparameter, architecture, and attention playbooks.
+- `npm run test:site` checks the public landing page copy.
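Reviewer note on `compare`: the list above says `researchloop compare` ranks runs by a chosen metric, and the Quick Start calls it with `--metric val_loss --direction lower`. A minimal sketch of that kind of ranking follows. It assumes each run is an object shaped like `{ id, metrics: { name: number } }` and that the direction is either `lower` or `higher`. The function name `rankRuns`, the record shape, and the sample runs are made up for illustration and are not taken from `bin/researchloop.js`.

```javascript
// Hypothetical sketch: rank runs by one metric, best first.
// Assumes runs look like { id, metrics: { val_loss: 2.31 } }.
function rankRuns(runs, metric, direction = "lower") {
  if (direction !== "lower" && direction !== "higher") {
    throw new Error(`direction must be "lower" or "higher", got "${direction}"`);
  }
  const sign = direction === "lower" ? 1 : -1;
  return runs
    // Drop runs that never reported this metric (for example, crashed runs).
    .filter((run) => Number.isFinite(run.metrics?.[metric]))
    // filter() returned a new array, so sorting does not mutate the input.
    .sort((a, b) => sign * (a.metrics[metric] - b.metrics[metric]));
}

// Made-up sample runs, for illustration only.
const sample = [
  { id: "first-run", metrics: { val_loss: 2.31 } },
  { id: "wider-mlp", metrics: { val_loss: 2.24 } },
  { id: "crashed", metrics: {} },
];

console.log(rankRuns(sample, "val_loss", "lower").map((run) => run.id));
// prints [ 'wider-mlp', 'first-run' ]
```

Excluding runs without the metric, rather than treating a missing value as zero, keeps a failed run from ranking first when lower is better.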
+
+## Open Source
+
+ResearchLoop should stay open source at the core. The npm package, prompts, adapters, and run ledger format should be inspectable and forkable.
+
+Possible paid layers later:
+
+- hosted dashboard
+- team run history
+- managed GPU runners
+- private lab templates
+- compliance/export support
+- priority support for labs and companies