tycono 0.1.94-beta.6 → 0.1.94-beta.7

package/package.json CHANGED
@@ -1,6 +1,6 @@
  {
  "name": "tycono",
- "version": "0.1.94-beta.6",
+ "version": "0.1.94-beta.7",
  "description": "Build an AI company. Watch them work.",
  "type": "module",
  "bin": {
@@ -745,11 +745,12 @@ function buildSupervisionSection(node: OrgNode): string {
  - 🛑 **Abort**: Seriously wrong → \`python3 "$SUPERVISION_CMD" abort <ses-id> --reason "why"\`
  - ✅ **All done?** → Before reporting done, **verify deliverables** (see Quality Gate below)
  4. **Repeat** watch until all subordinates complete. Do NOT stop after one tick.
- 5. **Quality Gate**: When subordinates report done, **read their actual output**:
- - Check files exist and are non-trivial
- - Verify key requirements from your task are met
- - If gaps found → re-dispatch with specific feedback: "Missing X, Y, Z. Continue."
- - There is NO time limit. Iterate until the work truly meets the requirements.
+ 5. **Quality Gate**: When subordinates report done, **run and test** the output:
+ - For web apps/games: start a local server and open in browser to verify it actually works
+ - Try the core user interactions; if basic things don't work, it's NOT done
+ - Check that required libraries/tools mentioned in the task are actually used
+ - If gaps found → re-dispatch with **specific, actionable** feedback (not "improve quality")
+ - There is NO time limit. Non-working code is worse than less code that works.
 
  ## Supervision Commands
 
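The revised Quality Gate in this hunk asks the supervisor to serve the output locally, confirm it loads, and check that required libraries are actually used. A minimal sketch of automating that first pass is shown here, assuming the deliverable is a static directory with an `index.html` entry point. The helper names `smoke_test_static_site` and `uses_library` are hypothetical and are not part of tycono.

```python
import functools
import http.server
import threading
import urllib.request
from pathlib import Path


class _QuietHandler(http.server.SimpleHTTPRequestHandler):
    """Static file handler that suppresses per-request log lines."""

    def log_message(self, format, *args):
        pass


def smoke_test_static_site(code_dir: str, page: str = "index.html") -> bool:
    """Serve code_dir on a local port and check that `page` loads and is non-empty."""
    handler = functools.partial(_QuietHandler, directory=code_dir)
    # Port 0 lets the OS pick a free port (the prompt text itself uses 9999).
    server = http.server.ThreadingHTTPServer(("127.0.0.1", 0), handler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    # Bypass any configured proxy so the request really goes to localhost.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        url = f"http://127.0.0.1:{server.server_address[1]}/{page}"
        with opener.open(url, timeout=5) as resp:
            return resp.status == 200 and len(resp.read().strip()) > 0
    except OSError:
        # Covers connection failures and HTTP errors such as 404.
        return False
    finally:
        server.shutdown()
        server.server_close()


def uses_library(code_dir: str, name: str,
                 suffixes: tuple = (".js", ".ts", ".html")) -> bool:
    """Rough equivalent of grepping the source tree for a required library name."""
    for path in Path(code_dir).rglob("*"):
        if path.is_file() and path.suffix in suffixes:
            if name in path.read_text(encoding="utf-8", errors="ignore"):
                return True
    return False
```

A check like this only catches a missing or blank entry point and an unused library. The core interactions (clicking, key presses, navigation) still have to be tried in a browser, as the prompt text requires.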
@@ -296,16 +296,23 @@ ${recoveryContext}
 
  ## Quality Gate (CRITICAL — G-09)
  ⛔ **"Subordinate said done" ≠ "Work is actually done."**
- Before declaring yourself done, you MUST verify the deliverables meet the directive's requirements:
+ **"Code exists" ≠ "Code works."** You MUST run and test the output, not just read files.
 
- 1. **Read the actual output files** — don't trust status reports. Check the code, docs, or artifacts yourself.
- 2. **Test if it works** — if the directive asks for a working game/app, check if it actually runs.
+ Before declaring yourself done, you MUST:
+
+ 1. **Read the actual output files** — don't trust status reports. Check the code yourself.
+ 2. **RUN it and test it** — this is the most important step:
+ - For web apps/games: \`cd <code-dir> && python3 -m http.server 9999\` then open in browser
+ - Actually try the core interactions (click buttons, press keys, navigate)
+ - If basic interactions fail (can't move, can't click, blank screen) → it's NOT done
  3. **Count against requirements** — if the directive says "15 monsters, 7 maps", count them.
- 4. **If quality is insufficient → re-dispatch** with specific feedback:
- - "You implemented 4/11 systems. Still missing: NPC dialogue, inventory, capture. Continue."
- - "The game doesn't load in browser. Fix the entry point and test."
- 5. **Iterate until the directive is truly fulfilled.** There is NO time limit.
- A half-finished deliverable is worse than taking 2-3 hours to get it right.
+ 4. **Check the directive's specific tech requirements**: if it mentions a specific library/engine, verify it's actually used in the code (grep for it).
+ 5. **If quality is insufficient → re-dispatch** with specific, actionable feedback:
+ - "Arrow keys don't move the player. Fix input handling in WorldScene."
+ - "TyconoForge was required but not used. Add character rendering with TyconoForge.render()."
+ - NOT vague feedback like "improve quality" or "make it better"
+ 6. **Iterate until the directive is truly fulfilled.** There is NO time limit.
+ 20,000 lines of non-working code is worse than 5,000 lines that actually play.
 
  Re-dispatch pattern:
  - dispatch same C-Level with specific gaps identified