@neuroverseos/governance 0.3.0 → 0.3.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +20 -0
- package/package.json +16 -3
- package/policies/content-moderation-rules.txt +8 -0
- package/policies/marketing-rules.txt +8 -0
- package/policies/science-research-rules.txt +11 -0
- package/policies/social-media-rules.txt +7 -0
- package/policies/strict-rules.txt +8 -0
- package/policies/trading-rules.txt +8 -0
- package/simulate.html +1899 -0
- package/dist/adapters/autoresearch.cjs +0 -196
- package/dist/adapters/autoresearch.d.cts +0 -103
- package/dist/adapters/autoresearch.d.ts +0 -103
- package/dist/adapters/autoresearch.js +0 -7
- package/dist/adapters/express.cjs +0 -1114
- package/dist/adapters/express.d.cts +0 -66
- package/dist/adapters/express.d.ts +0 -66
- package/dist/adapters/express.js +0 -12
- package/dist/adapters/index.cjs +0 -1669
- package/dist/adapters/index.d.cts +0 -6
- package/dist/adapters/index.d.ts +0 -6
- package/dist/adapters/index.js +0 -46
- package/dist/adapters/langchain.cjs +0 -1155
- package/dist/adapters/langchain.d.cts +0 -89
- package/dist/adapters/langchain.d.ts +0 -89
- package/dist/adapters/langchain.js +0 -16
- package/dist/adapters/openai.cjs +0 -1185
- package/dist/adapters/openai.d.cts +0 -99
- package/dist/adapters/openai.d.ts +0 -99
- package/dist/adapters/openai.js +0 -16
- package/dist/adapters/openclaw.cjs +0 -1177
- package/dist/adapters/openclaw.d.cts +0 -99
- package/dist/adapters/openclaw.d.ts +0 -99
- package/dist/adapters/openclaw.js +0 -16
- package/dist/bootstrap-GXVDZNF7.js +0 -114
- package/dist/build-P42YFKQV.js +0 -339
- package/dist/chunk-2NICNKOM.js +0 -100
- package/dist/chunk-2PQU3VAN.js +0 -131
- package/dist/chunk-4A7LISES.js +0 -324
- package/dist/chunk-4JRYGIO7.js +0 -727
- package/dist/chunk-4NGDRRQH.js +0 -10
- package/dist/chunk-4QXB6PEO.js +0 -232
- package/dist/chunk-6CZSKEY5.js +0 -164
- package/dist/chunk-7P3S7MAY.js +0 -1090
- package/dist/chunk-A5W4GNQO.js +0 -130
- package/dist/chunk-AKW5YVCE.js +0 -96
- package/dist/chunk-BUWWN2NX.js +0 -192
- package/dist/chunk-COT5XS4V.js +0 -109
- package/dist/chunk-ER62HNGF.js +0 -139
- package/dist/chunk-FYS2CBUW.js +0 -304
- package/dist/chunk-GR6DGCZ2.js +0 -340
- package/dist/chunk-I3RRAYK2.js +0 -11
- package/dist/chunk-JZPQGIKR.js +0 -79
- package/dist/chunk-MWDQ4MJB.js +0 -11
- package/dist/chunk-NF5POFCI.js +0 -622
- package/dist/chunk-OGL7QXZS.js +0 -608
- package/dist/chunk-OT6PXH54.js +0 -61
- package/dist/chunk-PDOZHZWL.js +0 -225
- package/dist/chunk-Q6O7ZLO2.js +0 -62
- package/dist/chunk-QPASI2BR.js +0 -187
- package/dist/chunk-T5EUJQE5.js +0 -172
- package/dist/chunk-XPDMYECO.js +0 -642
- package/dist/chunk-YZFATT7X.js +0 -9
- package/dist/cli/neuroverse.cjs +0 -11448
- package/dist/cli/neuroverse.d.cts +0 -1
- package/dist/cli/neuroverse.d.ts +0 -1
- package/dist/cli/neuroverse.js +0 -196
- package/dist/cli/plan.cjs +0 -1599
- package/dist/cli/plan.d.cts +0 -20
- package/dist/cli/plan.d.ts +0 -20
- package/dist/cli/plan.js +0 -361
- package/dist/cli/run.cjs +0 -1746
- package/dist/cli/run.d.cts +0 -20
- package/dist/cli/run.d.ts +0 -20
- package/dist/cli/run.js +0 -143
- package/dist/configure-ai-TK67ZWZL.js +0 -132
- package/dist/derive-TLIV4OOU.js +0 -152
- package/dist/doctor-XPDLEYXN.js +0 -171
- package/dist/explain-IDCRWMPX.js +0 -70
- package/dist/guard-RV65TT4L.js +0 -96
- package/dist/guard-contract-WZx__PmU.d.cts +0 -709
- package/dist/guard-contract-WZx__PmU.d.ts +0 -709
- package/dist/guard-engine-JLTUARGU.js +0 -10
- package/dist/impact-XPECYRLH.js +0 -59
- package/dist/improve-GPUBKTEA.js +0 -85
- package/dist/index.cjs +0 -6273
- package/dist/index.d.cts +0 -1616
- package/dist/index.d.ts +0 -1616
- package/dist/index.js +0 -379
- package/dist/infer-world-7GVZWFX4.js +0 -543
- package/dist/init-PKPIYHYE.js +0 -144
- package/dist/init-world-VWMQZQC7.js +0 -223
- package/dist/mcp-server-FPVSU32Z.js +0 -13
- package/dist/model-adapter-BB7G4MFI.js +0 -11
- package/dist/playground-E664U4T6.js +0 -550
- package/dist/redteam-Z7WREJ44.js +0 -357
- package/dist/session-EKTRSR7C.js +0 -14
- package/dist/simulate-VDOYQFRO.js +0 -108
- package/dist/test-OGXJK4QU.js +0 -217
- package/dist/trace-JVF67VR3.js +0 -166
- package/dist/validate-LLBWVPGV.js +0 -81
- package/dist/validate-engine-UIABSIHD.js +0 -7
- package/dist/world-LAXO6DOX.js +0 -378
- package/dist/world-loader-HMPTOEA2.js +0 -9
- package/dist/worlds/autoresearch.nv-world.md +0 -230
- package/dist/worlds/derivation-world.nv-world.md +0 -278
|
@@ -1,230 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
world_id: autoresearch
|
|
3
|
-
name: Autoresearch Governance
|
|
4
|
-
version: 1.0.0
|
|
5
|
-
runtime_mode: SIMULATION
|
|
6
|
-
default_profile: conservative
|
|
7
|
-
alternative_profile: exploratory
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
# Thesis
|
|
11
|
-
|
|
12
|
-
Autonomous AI research loops must operate within structured governance: experiments are reproducible, metrics are tracked, compute budgets are enforced, and agents cannot drift beyond their declared research context. A research world without constraints produces noise, not knowledge.
|
|
13
|
-
|
|
14
|
-
# Invariants
|
|
15
|
-
|
|
16
|
-
- `experiments_must_be_reproducible` — Every experiment must log architecture, hyperparameters, dataset, and training config sufficient to reproduce results (structural, immutable)
|
|
17
|
-
- `metrics_must_be_recorded` — Every training run must produce at least one evaluation metric; runs without metrics are invalid (structural, immutable)
|
|
18
|
-
- `dataset_must_be_declared` — The dataset used for training and evaluation must be explicitly declared and never changed without governance approval (structural, immutable)
|
|
19
|
-
- `goal_must_be_defined` — The optimization goal (metric + direction) must be defined before any experiment runs (structural, immutable)
|
|
20
|
-
- `no_data_leakage` — Training data must never contaminate evaluation data; train/val/test splits must be fixed (structural, immutable)
|
|
21
|
-
- `compute_budget_enforced` — Experiments must respect declared compute limits; exceeding budget halts the loop (structural, immutable)
|
|
22
|
-
- `architecture_constraints_honored` — If the research context declares architectural constraints, experiments must satisfy them (prompt, immutable)
|
|
23
|
-
|
|
24
|
-
# State
|
|
25
|
-
|
|
26
|
-
## experiments_run
|
|
27
|
-
- type: number
|
|
28
|
-
- min: 0
|
|
29
|
-
- max: 10000
|
|
30
|
-
- step: 1
|
|
31
|
-
- default: 0
|
|
32
|
-
- label: Experiments Run
|
|
33
|
-
- description: Total number of experiments completed in this research loop
|
|
34
|
-
|
|
35
|
-
## best_metric_value
|
|
36
|
-
- type: number
|
|
37
|
-
- min: -1000
|
|
38
|
-
- max: 1000
|
|
39
|
-
- step: 0.01
|
|
40
|
-
- default: 100
|
|
41
|
-
- label: Best Metric Value
|
|
42
|
-
- description: Best value achieved for the primary evaluation metric
|
|
43
|
-
|
|
44
|
-
## keep_rate
|
|
45
|
-
- type: number
|
|
46
|
-
- min: 0
|
|
47
|
-
- max: 100
|
|
48
|
-
- step: 1
|
|
49
|
-
- default: 0
|
|
50
|
-
- label: Keep Rate
|
|
51
|
-
- description: Percentage of experiments that improved upon the previous best result
|
|
52
|
-
|
|
53
|
-
## compute_used_minutes
|
|
54
|
-
- type: number
|
|
55
|
-
- min: 0
|
|
56
|
-
- max: 100000
|
|
57
|
-
- step: 1
|
|
58
|
-
- default: 0
|
|
59
|
-
- label: Compute Used (minutes)
|
|
60
|
-
- description: Total wall-clock training time consumed across all experiments
|
|
61
|
-
|
|
62
|
-
## compute_budget_minutes
|
|
63
|
-
- type: number
|
|
64
|
-
- min: 0
|
|
65
|
-
- max: 100000
|
|
66
|
-
- step: 60
|
|
67
|
-
- default: 1440
|
|
68
|
-
- label: Compute Budget (minutes)
|
|
69
|
-
- description: Maximum allowed wall-clock training time for the research loop
|
|
70
|
-
|
|
71
|
-
## research_context_drift
|
|
72
|
-
- type: number
|
|
73
|
-
- min: 0
|
|
74
|
-
- max: 100
|
|
75
|
-
- step: 1
|
|
76
|
-
- default: 0
|
|
77
|
-
- label: Context Drift
|
|
78
|
-
- description: Degree to which recent experiments have diverged from the declared research context. 0 = on-topic. 100 = unrelated.
|
|
79
|
-
|
|
80
|
-
## metric_improvement_rate
|
|
81
|
-
- type: number
|
|
82
|
-
- min: 0
|
|
83
|
-
- max: 100
|
|
84
|
-
- step: 1
|
|
85
|
-
- default: 0
|
|
86
|
-
- label: Improvement Rate
|
|
87
|
-
- description: Rate of metric improvement over the last 10 experiments. 0 = stagnant. 100 = rapid improvement.
|
|
88
|
-
|
|
89
|
-
## failed_experiments
|
|
90
|
-
- type: number
|
|
91
|
-
- min: 0
|
|
92
|
-
- max: 10000
|
|
93
|
-
- step: 1
|
|
94
|
-
- default: 0
|
|
95
|
-
- label: Failed Experiments
|
|
96
|
-
- description: Number of experiments that crashed, timed out, or produced no valid metrics
|
|
97
|
-
|
|
98
|
-
# Assumptions
|
|
99
|
-
|
|
100
|
-
## conservative
|
|
101
|
-
- name: Conservative Research
|
|
102
|
-
- description: Prioritize reproducibility and careful iteration. Small architectural changes per experiment. Strict compute limits. Reject experiments that drift from the research context.
|
|
103
|
-
- iteration_style: incremental
|
|
104
|
-
- drift_tolerance: low
|
|
105
|
-
- compute_strictness: high
|
|
106
|
-
- failure_tolerance: low
|
|
107
|
-
|
|
108
|
-
## exploratory
|
|
109
|
-
- name: Exploratory Research
|
|
110
|
-
- description: Allow broader architectural exploration. Larger jumps between experiments. More lenient compute budget. Accept higher context drift if metrics improve.
|
|
111
|
-
- iteration_style: explorative
|
|
112
|
-
- drift_tolerance: moderate
|
|
113
|
-
- compute_strictness: moderate
|
|
114
|
-
- failure_tolerance: moderate
|
|
115
|
-
|
|
116
|
-
# Rules
|
|
117
|
-
|
|
118
|
-
## rule-001: Compute Budget Exhausted (structural)
|
|
119
|
-
When compute budget is exceeded, the research loop must halt. No further experiments are allowed.
|
|
120
|
-
|
|
121
|
-
When compute_used_minutes > compute_budget_minutes [state]
|
|
122
|
-
Then research_viability *= 0.00
|
|
123
|
-
Collapse: research_viability < 0.05
|
|
124
|
-
|
|
125
|
-
> trigger: Compute usage exceeds declared budget — no training time remains.
|
|
126
|
-
> rule: Unbounded compute makes research ungovernable. The budget is a hard constraint, not a suggestion.
|
|
127
|
-
> shift: Research loop halts. Final results are reported. No new experiments start.
|
|
128
|
-
> effect: Research viability set to zero. Loop terminated.
|
|
129
|
-
|
|
130
|
-
## rule-002: High Failure Rate (degradation)
|
|
131
|
-
Too many failed experiments indicate a systemic problem — bad code, misconfigured environment, or impossible architecture.
|
|
132
|
-
|
|
133
|
-
When failed_experiments > 5 [state] AND experiments_run > 0 [state]
|
|
134
|
-
Then research_viability *= 0.50
|
|
135
|
-
|
|
136
|
-
> trigger: More than 5 experiments have failed — possible systemic issue.
|
|
137
|
-
> rule: Failures consume compute without producing knowledge. High failure rates signal infrastructure problems, not research progress.
|
|
138
|
-
> shift: Research viability degrades. Agent should investigate root cause before continuing.
|
|
139
|
-
> effect: Research viability reduced to 50%.
|
|
140
|
-
|
|
141
|
-
## rule-003: Context Drift Warning (degradation)
|
|
142
|
-
Experiments diverging from the declared research context waste compute and produce irrelevant results.
|
|
143
|
-
|
|
144
|
-
When research_context_drift > 40 [state]
|
|
145
|
-
Then research_viability *= 0.60
|
|
146
|
-
|
|
147
|
-
> trigger: Context drift above 40% — experiments are straying from the research topic.
|
|
148
|
-
> rule: Governance exists to keep research focused. Agents exploring unrelated architectures are not contributing to the declared goal.
|
|
149
|
-
> shift: Research viability degrades. Agent must return to the declared research context.
|
|
150
|
-
> effect: Research viability reduced to 60%.
|
|
151
|
-
|
|
152
|
-
## rule-004: Metric Stagnation (degradation)
|
|
153
|
-
When experiments stop improving the primary metric, the research approach may need fundamental revision.
|
|
154
|
-
|
|
155
|
-
When metric_improvement_rate < 5 [state] AND experiments_run > 10 [state]
|
|
156
|
-
Then research_viability *= 0.70
|
|
157
|
-
|
|
158
|
-
> trigger: Improvement rate below 5% after 10+ experiments — research may have plateaued.
|
|
159
|
-
> rule: Stagnant metrics indicate diminishing returns from the current approach. The agent should consider a strategy change.
|
|
160
|
-
> shift: Research viability degrades. Agent should try a substantially different approach or conclude the loop.
|
|
161
|
-
> effect: Research viability reduced to 70%.
|
|
162
|
-
|
|
163
|
-
## rule-005: Strong Progress (advantage)
|
|
164
|
-
Consistent metric improvement validates the research approach and warrants continued investment.
|
|
165
|
-
|
|
166
|
-
When metric_improvement_rate > 30 [state] AND keep_rate > 20 [state]
|
|
167
|
-
Then research_viability *= 1.20
|
|
168
|
-
|
|
169
|
-
> trigger: Improvement rate above 30% with keep rate above 20% — research is productive.
|
|
170
|
-
> rule: Productive research should be encouraged. Strong metric trends indicate a promising research direction.
|
|
171
|
-
> shift: Research viability improves. Continued experimentation is well-justified.
|
|
172
|
-
> effect: Research viability boosted by 20%.
|
|
173
|
-
|
|
174
|
-
## rule-006: No Metrics Recorded (structural)
|
|
175
|
-
An experiment that produces no evaluation metrics is invalid and must not count as progress.
|
|
176
|
-
|
|
177
|
-
When experiments_run > 0 [state] AND best_metric_value == 100 [state]
|
|
178
|
-
Then research_viability *= 0.30
|
|
179
|
-
Collapse: research_viability < 0.05
|
|
180
|
-
|
|
181
|
-
> trigger: Experiments have run but no metric improvement from default — metrics may not be recording.
|
|
182
|
-
> rule: Research without measurement is not research. Every experiment must produce at least one evaluation metric.
|
|
183
|
-
> shift: Research viability drops sharply. Agent must fix metric recording before continuing.
|
|
184
|
-
> effect: Research viability reduced to 30%.
|
|
185
|
-
|
|
186
|
-
## rule-007: Efficient Compute Usage (advantage)
|
|
187
|
-
High keep rate with low compute usage indicates efficient research methodology.
|
|
188
|
-
|
|
189
|
-
When keep_rate > 30 [state] AND compute_used_minutes < compute_budget_minutes [state]
|
|
190
|
-
Then research_viability *= 1.15
|
|
191
|
-
|
|
192
|
-
> trigger: Keep rate above 30% with compute budget remaining — efficient experimentation.
|
|
193
|
-
> rule: Efficient use of compute demonstrates disciplined research. Not every experiment needs to be expensive.
|
|
194
|
-
> shift: Research viability improves. The research methodology is sustainable.
|
|
195
|
-
> effect: Research viability boosted by 15%.
|
|
196
|
-
|
|
197
|
-
# Gates
|
|
198
|
-
|
|
199
|
-
- BREAKTHROUGH: research_viability >= 90
|
|
200
|
-
- PRODUCTIVE: research_viability >= 60
|
|
201
|
-
- ONGOING: research_viability >= 35
|
|
202
|
-
- STRUGGLING: research_viability > 10
|
|
203
|
-
- HALTED: research_viability <= 10
|
|
204
|
-
|
|
205
|
-
# Outcomes
|
|
206
|
-
|
|
207
|
-
## research_viability
|
|
208
|
-
- type: number
|
|
209
|
-
- range: 0-100
|
|
210
|
-
- display: percentage
|
|
211
|
-
- label: Research Viability
|
|
212
|
-
- primary: true
|
|
213
|
-
|
|
214
|
-
## best_metric_value
|
|
215
|
-
- type: number
|
|
216
|
-
- range: -1000-1000
|
|
217
|
-
- display: decimal
|
|
218
|
-
- label: Best Metric Value
|
|
219
|
-
|
|
220
|
-
## keep_rate
|
|
221
|
-
- type: number
|
|
222
|
-
- range: 0-100
|
|
223
|
-
- display: percentage
|
|
224
|
-
- label: Keep Rate
|
|
225
|
-
|
|
226
|
-
## experiments_run
|
|
227
|
-
- type: number
|
|
228
|
-
- range: 0-10000
|
|
229
|
-
- display: integer
|
|
230
|
-
- label: Experiments Run
|
|
@@ -1,278 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
world_id: derivationworld
|
|
3
|
-
name: DerivationWorld
|
|
4
|
-
version: 1.0.0
|
|
5
|
-
runtime_mode: synthesis
|
|
6
|
-
default_profile: strict_synthesis
|
|
7
|
-
alternative_profile: permissive_synthesis
|
|
8
|
-
---
|
|
9
|
-
|
|
10
|
-
# Thesis
|
|
11
|
-
|
|
12
|
-
AI-synthesized governance documents must be structurally valid, epistemically honest, and deterministically verifiable. A derived .nv-world.md is only legitimate if it satisfies the same parser constraints as a hand-authored world, distinguishes declared facts from inferred claims, and never introduces governance domains beyond the source material.
|
|
13
|
-
|
|
14
|
-
# Invariants
|
|
15
|
-
|
|
16
|
-
- `output_must_be_valid_nv_world` — Synthesized output must parse successfully under parseWorldMarkdown with zero errors (prompt, immutable)
|
|
17
|
-
- `must_include_required_sections` — Output must contain Thesis, Invariants, State, Rules, Gates, and Outcomes sections (prompt, immutable)
|
|
18
|
-
- `must_distinguish_declared_vs_inferred` — Invariants derived from explicit source statements must be marked structural; those inferred by the model must be marked operational (prompt, immutable)
|
|
19
|
-
- `must_not_invent_external_domains` — All state variables, rules, and invariants must trace to concepts present in the input markdown (prompt, immutable)
|
|
20
|
-
- `invariants_must_be_enforceable_or_marked` — Every invariant must be structurally enforceable via rules, or explicitly tagged as non-enforceable with rationale (prompt, immutable)
|
|
21
|
-
- `no_json_output` — Output must be .nv-world.md markdown only, never JSON (prompt, immutable)
|
|
22
|
-
- `no_extra_commentary` — Output must contain only the .nv-world.md document, no preamble, explanation, or trailing commentary (prompt, immutable)
|
|
23
|
-
- `frontmatter_must_be_complete` — Output frontmatter must include world_id, name, and version fields (prompt, immutable)
|
|
24
|
-
- `rules_must_have_triggers_and_effects` — Every rule must include a When trigger line and a Then effect line (prompt, immutable)
|
|
25
|
-
- `gate_thresholds_must_be_ordered` — Gate thresholds must be monotonically decreasing from best to worst status (prompt, immutable)
|
|
26
|
-
|
|
27
|
-
# State
|
|
28
|
-
|
|
29
|
-
## source_section_count
|
|
30
|
-
- type: number
|
|
31
|
-
- min: 0
|
|
32
|
-
- max: 100
|
|
33
|
-
- step: 1
|
|
34
|
-
- default: 5
|
|
35
|
-
- label: Source Section Count
|
|
36
|
-
- description: Number of distinct sections or files in the input markdown. More sections generally means richer synthesis material.
|
|
37
|
-
|
|
38
|
-
## source_token_estimate
|
|
39
|
-
- type: number
|
|
40
|
-
- min: 0
|
|
41
|
-
- max: 200000
|
|
42
|
-
- step: 100
|
|
43
|
-
- default: 2000
|
|
44
|
-
- label: Source Token Estimate
|
|
45
|
-
- description: Approximate token count of concatenated input. Determines whether context window constraints may truncate material.
|
|
46
|
-
|
|
47
|
-
## declared_concept_count
|
|
48
|
-
- type: number
|
|
49
|
-
- min: 0
|
|
50
|
-
- max: 200
|
|
51
|
-
- step: 1
|
|
52
|
-
- default: 10
|
|
53
|
-
- label: Declared Concept Count
|
|
54
|
-
- description: Number of distinct governance concepts explicitly named in source material. Drives state variable and rule generation.
|
|
55
|
-
|
|
56
|
-
## concept_specificity
|
|
57
|
-
- type: number
|
|
58
|
-
- min: 0
|
|
59
|
-
- max: 100
|
|
60
|
-
- default: 50
|
|
61
|
-
- label: Concept Specificity
|
|
62
|
-
- description: How precisely the source material defines its governance concepts. 0 = vague aspirations. 100 = precise structural claims with measurable criteria.
|
|
63
|
-
|
|
64
|
-
## domain_coherence
|
|
65
|
-
- type: number
|
|
66
|
-
- min: 0
|
|
67
|
-
- max: 100
|
|
68
|
-
- default: 60
|
|
69
|
-
- label: Domain Coherence
|
|
70
|
-
- description: How well source sections relate to a single governance domain. Low coherence indicates conflicting or unrelated source material.
|
|
71
|
-
|
|
72
|
-
## synthesis_fidelity
|
|
73
|
-
- type: number
|
|
74
|
-
- min: 0.00
|
|
75
|
-
- max: 1.00
|
|
76
|
-
- step: 0.01
|
|
77
|
-
- default: 0.70
|
|
78
|
-
- label: Synthesis Fidelity
|
|
79
|
-
- description: Measure of how faithfully the derived world represents the source material. Primary outcome metric.
|
|
80
|
-
|
|
81
|
-
## structural_completeness
|
|
82
|
-
- type: number
|
|
83
|
-
- min: 0
|
|
84
|
-
- max: 100
|
|
85
|
-
- default: 60
|
|
86
|
-
- label: Structural Completeness
|
|
87
|
-
- description: Percentage of required .nv-world.md sections that contain meaningful content rather than stubs.
|
|
88
|
-
|
|
89
|
-
## epistemic_honesty
|
|
90
|
-
- type: number
|
|
91
|
-
- min: 0
|
|
92
|
-
- max: 100
|
|
93
|
-
- default: 70
|
|
94
|
-
- label: Epistemic Honesty
|
|
95
|
-
- description: Degree to which the output correctly distinguishes source-declared constraints from model-inferred constraints. 0 = everything claimed as declared. 100 = perfect attribution.
|
|
96
|
-
|
|
97
|
-
## invention_ratio
|
|
98
|
-
- type: number
|
|
99
|
-
- min: 0.00
|
|
100
|
-
- max: 1.00
|
|
101
|
-
- step: 0.01
|
|
102
|
-
- default: 0.10
|
|
103
|
-
- label: Invention Ratio
|
|
104
|
-
- description: Fraction of output concepts that have no traceable origin in the source material. Should be near zero. Above 0.30 indicates hallucination.
|
|
105
|
-
|
|
106
|
-
# Assumptions
|
|
107
|
-
|
|
108
|
-
## strict_synthesis
|
|
109
|
-
- name: Strict Synthesis
|
|
110
|
-
- description: Conservative derivation that only produces governance elements with clear source basis. Prefers omission over invention. Marks all inferred elements as operational.
|
|
111
|
-
- invention_tolerance: minimal
|
|
112
|
-
- attribution_mode: strict
|
|
113
|
-
- completeness_priority: low
|
|
114
|
-
- fidelity_priority: high
|
|
115
|
-
|
|
116
|
-
## permissive_synthesis
|
|
117
|
-
- name: Permissive Synthesis
|
|
118
|
-
- description: Broader derivation that fills structural gaps with reasonable inferences. Produces more complete worlds but with higher invention ratio. All inferences are still marked operational.
|
|
119
|
-
- invention_tolerance: moderate
|
|
120
|
-
- attribution_mode: standard
|
|
121
|
-
- completeness_priority: high
|
|
122
|
-
- fidelity_priority: moderate
|
|
123
|
-
|
|
124
|
-
# Rules
|
|
125
|
-
|
|
126
|
-
## rule-001: Empty Source Rejection (structural)
|
|
127
|
-
Synthesis from empty or trivially short input cannot produce meaningful governance. The derivation must fail rather than fabricate.
|
|
128
|
-
|
|
129
|
-
When source_section_count < 1 [state]
|
|
130
|
-
Then synthesis_fidelity *= 0.00
|
|
131
|
-
Collapse: synthesis_fidelity < 0.05
|
|
132
|
-
|
|
133
|
-
> trigger: Source input contains no sections — nothing to derive from.
|
|
134
|
-
> rule: A world cannot be synthesized from nothing. Empty input must produce a clear failure, not a fabricated world.
|
|
135
|
-
> shift: Derivation halts. No output file is written.
|
|
136
|
-
> effect: Synthesis fidelity set to zero. Derivation rejected.
|
|
137
|
-
|
|
138
|
-
## rule-002: Sparse Source Warning (degradation)
|
|
139
|
-
Minimal source material limits the quality of derived governance. Output will be structurally thin.
|
|
140
|
-
|
|
141
|
-
When source_section_count < 3 [state] AND source_token_estimate < 500 [state]
|
|
142
|
-
Then synthesis_fidelity *= 0.50, structural_completeness *= 0.60
|
|
143
|
-
|
|
144
|
-
> trigger: Source has fewer than 3 sections and under 500 tokens — sparse material.
|
|
145
|
-
> rule: Sparse input yields sparse governance. The model cannot reliably infer structure from fragments.
|
|
146
|
-
> shift: Output quality degrades. State variables and rules will be minimal.
|
|
147
|
-
> effect: Synthesis fidelity reduced to 50%. Structural completeness reduced to 60%.
|
|
148
|
-
|
|
149
|
-
## rule-003: Concept Vagueness Penalty (degradation)
|
|
150
|
-
Source material with low concept specificity produces invariants and rules that are aspirational rather than structural.
|
|
151
|
-
|
|
152
|
-
When concept_specificity < 25 [state]
|
|
153
|
-
Then synthesis_fidelity *= 0.60, epistemic_honesty *= 0.70
|
|
154
|
-
|
|
155
|
-
> trigger: Source concept specificity below 25% — governance concepts are vague.
|
|
156
|
-
> rule: Vague concepts cannot produce structural invariants. The model must either invent specificity or produce unenforceable constraints.
|
|
157
|
-
> shift: Output invariants trend toward aspiration. Rules lack deterministic triggers.
|
|
158
|
-
> effect: Synthesis fidelity reduced to 60%. Epistemic honesty reduced to 70%.
|
|
159
|
-
|
|
160
|
-
## rule-004: Domain Incoherence Penalty (degradation)
|
|
161
|
-
Source material spanning unrelated domains produces a world with conflicting governance logic.
|
|
162
|
-
|
|
163
|
-
When domain_coherence < 30 [state]
|
|
164
|
-
Then synthesis_fidelity *= 0.55
|
|
165
|
-
|
|
166
|
-
> trigger: Domain coherence below 30% — source material is internally contradictory or covers unrelated domains.
|
|
167
|
-
> rule: A single .nv-world.md should govern a coherent domain. Mixed domains produce conflicting rules and meaningless invariants.
|
|
168
|
-
> shift: Output becomes structurally confused. State variables may not relate to each other.
|
|
169
|
-
> effect: Synthesis fidelity reduced to 55%.
|
|
170
|
-
|
|
171
|
-
## rule-005: Invention Threshold Breach (structural)
|
|
172
|
-
Excessive invention without source basis constitutes fabrication, not derivation.
|
|
173
|
-
|
|
174
|
-
When invention_ratio > 0.30 [state]
|
|
175
|
-
Then synthesis_fidelity *= 0.30, epistemic_honesty *= 0.40
|
|
176
|
-
Collapse: synthesis_fidelity < 0.05
|
|
177
|
-
|
|
178
|
-
> trigger: Invention ratio exceeds 30% — more than a third of output has no source basis.
|
|
179
|
-
> rule: Derivation must be grounded. A world that is mostly invented does not represent the user's governance intent.
|
|
180
|
-
> shift: Output crosses from synthesis to hallucination. Fidelity drops below usable threshold.
|
|
181
|
-
> effect: Synthesis fidelity reduced to 30%. Epistemic honesty reduced to 40%.
|
|
182
|
-
|
|
183
|
-
## rule-006: High Fidelity Source (advantage)
|
|
184
|
-
Rich, specific, coherent source material enables high-quality derivation.
|
|
185
|
-
|
|
186
|
-
When concept_specificity > 70 [state] AND domain_coherence > 70 [state] AND declared_concept_count > 8 [state]
|
|
187
|
-
Then synthesis_fidelity *= 1.20, structural_completeness *= 1.15
|
|
188
|
-
|
|
189
|
-
> trigger: High concept specificity, strong domain coherence, and rich concept count.
|
|
190
|
-
> rule: Quality source material produces quality governance. The model has enough structure to derive rather than invent.
|
|
191
|
-
> shift: Output is well-grounded. Most invariants and rules trace directly to source.
|
|
192
|
-
> effect: Synthesis fidelity boosted by 20%. Structural completeness boosted by 15%.
|
|
193
|
-
|
|
194
|
-
## rule-007: Structural Completeness Gate (degradation)
|
|
195
|
-
A derived world missing critical sections is not usable regardless of quality in present sections.
|
|
196
|
-
|
|
197
|
-
When structural_completeness < 40 [state]
|
|
198
|
-
Then synthesis_fidelity *= 0.50
|
|
199
|
-
|
|
200
|
-
> trigger: Structural completeness below 40% — too many required sections are empty or stub.
|
|
201
|
-
> rule: A partial world is not a valid world. Missing sections mean missing governance.
|
|
202
|
-
> shift: The output may parse but cannot function as meaningful governance.
|
|
203
|
-
> effect: Synthesis fidelity reduced to 50%.
|
|
204
|
-
|
|
205
|
-
## rule-008: Epistemic Honesty Reward (advantage)
|
|
206
|
-
Correct attribution of declared versus inferred constraints makes output trustworthy and auditable.
|
|
207
|
-
|
|
208
|
-
When epistemic_honesty > 80 [state]
|
|
209
|
-
Then synthesis_fidelity *= 1.10
|
|
210
|
-
|
|
211
|
-
> trigger: Epistemic honesty above 80% — model correctly attributes constraint origins.
|
|
212
|
-
> rule: Honest attribution makes governance auditable. Users can verify which constraints they declared versus which the model suggested.
|
|
213
|
-
> shift: Output gains trust. Declared constraints can be relied upon; inferred ones can be reviewed.
|
|
214
|
-
> effect: Synthesis fidelity boosted by 10%.
|
|
215
|
-
|
|
216
|
-
## rule-009: Context Window Overflow Risk (degradation)
|
|
217
|
-
Extremely large source material risks truncation and missed governance concepts.
|
|
218
|
-
|
|
219
|
-
When source_token_estimate > 100000 [state]
|
|
220
|
-
Then synthesis_fidelity *= 0.75
|
|
221
|
-
|
|
222
|
-
> trigger: Source material exceeds 100k tokens — likely to be truncated.
|
|
223
|
-
> rule: Truncated input means incomplete synthesis. The model may miss governance concepts that appear late in the concatenation.
|
|
224
|
-
> shift: Output may be partial. Critical sections from later source files may be absent.
|
|
225
|
-
> effect: Synthesis fidelity reduced to 75% due to truncation risk.
|
|
226
|
-
|
|
227
|
-
## rule-010: Derivation Coherence Reward (advantage)
|
|
228
|
-
Aligned quality metrics across fidelity, honesty, and invention produce a genuine governance document.
|
|
229
|
-
|
|
230
|
-
When synthesis_fidelity > 0.80 [state] AND epistemic_honesty > 75 [state] AND invention_ratio < 0.15 [state]
|
|
231
|
-
Then synthesis_fidelity *= 1.15
|
|
232
|
-
Collapse: synthesis_fidelity < 0.05
|
|
233
|
-
|
|
234
|
-
> trigger: Synthesis fidelity above 80%, epistemic honesty above 75%, and invention ratio below 15%.
|
|
235
|
-
> rule: Coherent derivation across all metrics indicates a faithful, usable governance document.
|
|
236
|
-
> shift: The derived world moves from draft to production-quality. Suitable for bootstrap and validation.
|
|
237
|
-
> effect: Synthesis fidelity boosted by 15%. Derivation coherence achieved.
|
|
238
|
-
|
|
239
|
-
# Gates
|
|
240
|
-
|
|
241
|
-
- FAITHFUL: synthesis_fidelity >= 0.85
|
|
242
|
-
- USABLE: synthesis_fidelity >= 0.60
|
|
243
|
-
- REVIEWABLE: synthesis_fidelity >= 0.40
|
|
244
|
-
- SUSPECT: synthesis_fidelity > 0.15
|
|
245
|
-
- DERIVATION_REJECTED: synthesis_fidelity <= 0.15
|
|
246
|
-
|
|
247
|
-
# Outcomes
|
|
248
|
-
|
|
249
|
-
## synthesis_fidelity
|
|
250
|
-
- type: number
|
|
251
|
-
- range: 0-1
|
|
252
|
-
- display: percentage
|
|
253
|
-
- label: Synthesis Fidelity
|
|
254
|
-
- primary: true
|
|
255
|
-
|
|
256
|
-
## structural_completeness
|
|
257
|
-
- type: number
|
|
258
|
-
- range: 0-100
|
|
259
|
-
- display: percentage
|
|
260
|
-
- label: Structural Completeness
|
|
261
|
-
|
|
262
|
-
## epistemic_honesty
|
|
263
|
-
- type: number
|
|
264
|
-
- range: 0-100
|
|
265
|
-
- display: percentage
|
|
266
|
-
- label: Epistemic Honesty
|
|
267
|
-
|
|
268
|
-
## invention_ratio
|
|
269
|
-
- type: number
|
|
270
|
-
- range: 0-1
|
|
271
|
-
- display: percentage
|
|
272
|
-
- label: Invention Ratio
|
|
273
|
-
- assignment: external
|
|
274
|
-
|
|
275
|
-
## derivation_status
|
|
276
|
-
- type: enum
|
|
277
|
-
- label: Derivation Status
|
|
278
|
-
- assignment: external
|