@probelabs/visor 0.1.156-ee → 0.1.157

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (57) hide show
  1. package/defaults/assistant.yaml +76 -8
  2. package/defaults/code-talk.yaml +20 -6
  3. package/dist/defaults/assistant.yaml +76 -8
  4. package/dist/defaults/code-talk.yaml +20 -6
  5. package/dist/index.js +25 -1842
  6. package/dist/output/traces/run-2026-03-04T20-18-38-877Z.ndjson +138 -0
  7. package/dist/output/traces/run-2026-03-04T20-19-23-273Z.ndjson +2197 -0
  8. package/dist/sdk/check-provider-registry-46N5DY3V.mjs +29 -0
  9. package/dist/sdk/{check-provider-registry-OBUYAPPC.mjs → check-provider-registry-GJ4EZAIO.mjs} +2 -2
  10. package/dist/sdk/chunk-74V66BDH.mjs +443 -0
  11. package/dist/sdk/chunk-74V66BDH.mjs.map +1 -0
  12. package/dist/sdk/chunk-ADYVRVDK.mjs +43774 -0
  13. package/dist/sdk/{chunk-2XSKH755.mjs.map → chunk-ADYVRVDK.mjs.map} +1 -1
  14. package/dist/sdk/chunk-BAHN2OFA.mjs +1502 -0
  15. package/dist/sdk/chunk-BAHN2OFA.mjs.map +1 -0
  16. package/dist/sdk/{chunk-2XSKH755.mjs → chunk-D7STLGAD.mjs} +9 -9
  17. package/dist/sdk/chunk-D7STLGAD.mjs.map +1 -0
  18. package/dist/sdk/chunk-RR6AHRAI.mjs +739 -0
  19. package/dist/sdk/chunk-RR6AHRAI.mjs.map +1 -0
  20. package/dist/sdk/failure-condition-evaluator-BQXDXL44.mjs +17 -0
  21. package/dist/sdk/github-frontend-DLMDMCE3.mjs +1368 -0
  22. package/dist/sdk/github-frontend-DLMDMCE3.mjs.map +1 -0
  23. package/dist/sdk/{host-MIHKJ63G.mjs → host-OO5PLU4L.mjs} +2 -2
  24. package/dist/sdk/routing-IK63T3E4.mjs +25 -0
  25. package/dist/sdk/schedule-tool-NEH23XPP.mjs +35 -0
  26. package/dist/sdk/schedule-tool-NEH23XPP.mjs.map +1 -0
  27. package/dist/sdk/{schedule-tool-M2H7O5WL.mjs → schedule-tool-SLR7ZHBZ.mjs} +2 -2
  28. package/dist/sdk/schedule-tool-SLR7ZHBZ.mjs.map +1 -0
  29. package/dist/sdk/schedule-tool-handler-TPNX4EAS.mjs +39 -0
  30. package/dist/sdk/schedule-tool-handler-TPNX4EAS.mjs.map +1 -0
  31. package/dist/sdk/{schedule-tool-handler-J3CJWB6A.mjs → schedule-tool-handler-XVHYK4KT.mjs} +2 -2
  32. package/dist/sdk/schedule-tool-handler-XVHYK4KT.mjs.map +1 -0
  33. package/dist/sdk/sdk.js +259 -1633
  34. package/dist/sdk/sdk.js.map +1 -1
  35. package/dist/sdk/sdk.mjs +4 -4
  36. package/dist/sdk/trace-helpers-L5ZNK3CK.mjs +25 -0
  37. package/dist/sdk/trace-helpers-L5ZNK3CK.mjs.map +1 -0
  38. package/dist/sdk/{workflow-check-provider-7RLM3T2I.mjs → workflow-check-provider-NYBQAYVH.mjs} +2 -2
  39. package/dist/sdk/workflow-check-provider-NYBQAYVH.mjs.map +1 -0
  40. package/dist/sdk/workflow-check-provider-TWDKDSUK.mjs +29 -0
  41. package/dist/sdk/workflow-check-provider-TWDKDSUK.mjs.map +1 -0
  42. package/dist/traces/run-2026-03-04T20-18-38-877Z.ndjson +138 -0
  43. package/dist/traces/run-2026-03-04T20-19-23-273Z.ndjson +2197 -0
  44. package/package.json +1 -1
  45. package/dist/sdk/knex-store-CRORFJE6.mjs +0 -527
  46. package/dist/sdk/knex-store-CRORFJE6.mjs.map +0 -1
  47. package/dist/sdk/loader-NJCF7DUS.mjs +0 -89
  48. package/dist/sdk/loader-NJCF7DUS.mjs.map +0 -1
  49. package/dist/sdk/opa-policy-engine-S2S2ULEI.mjs +0 -655
  50. package/dist/sdk/opa-policy-engine-S2S2ULEI.mjs.map +0 -1
  51. package/dist/sdk/validator-XTZJZZJH.mjs +0 -134
  52. package/dist/sdk/validator-XTZJZZJH.mjs.map +0 -1
  53. /package/dist/sdk/{check-provider-registry-OBUYAPPC.mjs.map → check-provider-registry-46N5DY3V.mjs.map} +0 -0
  54. /package/dist/sdk/{schedule-tool-M2H7O5WL.mjs.map → check-provider-registry-GJ4EZAIO.mjs.map} +0 -0
  55. /package/dist/sdk/{schedule-tool-handler-J3CJWB6A.mjs.map → failure-condition-evaluator-BQXDXL44.mjs.map} +0 -0
  56. /package/dist/sdk/{host-MIHKJ63G.mjs.map → host-OO5PLU4L.mjs.map} +0 -0
  57. /package/dist/sdk/{workflow-check-provider-7RLM3T2I.mjs.map → routing-IK63T3E4.mjs.map} +0 -0
@@ -26,6 +26,21 @@
26
26
  # - id: chat
27
27
  # description: general Q&A
28
28
  # skills:
29
+ # - id: code-explorer
30
+ # description: needs codebase exploration
31
+ # knowledge: |
32
+ # ## Code Explorer
33
+ # Use the code-talk tool to explore code repositories.
34
+ # The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
35
+ # - If a call returns confidence "high", trust the answer — do NOT re-call
36
+ # with a rephrased version of the same question
37
+ # - Only call again for a genuinely DIFFERENT aspect of the codebase
38
+ # - If confidence is "medium" or "low", check confidence_reason for what to
39
+ # refine before re-exploring
40
+ # tools:
41
+ # code-talk:
42
+ # workflow: code-talk
43
+ # inputs: {}
29
44
  # - id: jira
30
45
  # description: needs Jira access
31
46
  # knowledge: "Use jira_get_issue to fetch tickets."
@@ -938,8 +953,14 @@ tests:
938
953
  knowledge:
939
954
  - intent: code_help
940
955
  content: |
941
- ## Code Exploration
942
- Use the code-explorer tool to search the codebase.
956
+ ## Code Explorer
957
+ Use the code-talk tool to explore code repositories.
958
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
959
+ - If a call returns confidence "high", trust the answer — do NOT re-call
960
+ with a rephrased version of the same question
961
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
962
+ - If confidence is "medium" or "low", check confidence_reason for what to
963
+ refine before re-exploring
943
964
  mocks:
944
965
  route-intent:
945
966
  intent: code_help
@@ -1102,7 +1123,13 @@ tests:
1102
1123
  description: needs codebase exploration
1103
1124
  knowledge: |
1104
1125
  ## Code Explorer
1105
- Use to search codebase.
1126
+ Use the code-talk tool to explore code repositories.
1127
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
1128
+ - If a call returns confidence "high", trust the answer — do NOT re-call
1129
+ with a rephrased version of the same question
1130
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
1131
+ - If confidence is "medium" or "low", check confidence_reason for what to
1132
+ refine before re-exploring
1106
1133
  tools:
1107
1134
  code-talk:
1108
1135
  workflow: code-talk
@@ -1192,6 +1219,15 @@ tests:
1192
1219
  skills:
1193
1220
  - id: code-explorer
1194
1221
  description: needs codebase exploration
1222
+ knowledge: |
1223
+ ## Code Explorer
1224
+ Use the code-talk tool to explore code repositories.
1225
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
1226
+ - If a call returns confidence "high", trust the answer — do NOT re-call
1227
+ with a rephrased version of the same question
1228
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
1229
+ - If confidence is "medium" or "low", check confidence_reason for what to
1230
+ refine before re-exploring
1195
1231
  tools:
1196
1232
  code-explorer:
1197
1233
  workflow: code-talk
@@ -1600,6 +1636,15 @@ tests:
1600
1636
  skills:
1601
1637
  - id: code-explorer
1602
1638
  description: needs code exploration
1639
+ knowledge: |
1640
+ ## Code Explorer
1641
+ Use the code-talk tool to explore code repositories.
1642
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
1643
+ - If a call returns confidence "high", trust the answer — do NOT re-call
1644
+ with a rephrased version of the same question
1645
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
1646
+ - If confidence is "medium" or "low", check confidence_reason for what to
1647
+ refine before re-exploring
1603
1648
  tools:
1604
1649
  code-explorer:
1605
1650
  workflow: code-talk
@@ -1781,8 +1826,14 @@ tests:
1781
1826
  - id: code-explorer
1782
1827
  description: needs code exploration
1783
1828
  knowledge: |
1784
- ## Code Exploration
1785
- Use code-explorer tool to search code.
1829
+ ## Code Explorer
1830
+ Use the code-talk tool to explore code repositories.
1831
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
1832
+ - If a call returns confidence "high", trust the answer — do NOT re-call
1833
+ with a rephrased version of the same question
1834
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
1835
+ - If confidence is "medium" or "low", check confidence_reason for what to
1836
+ refine before re-exploring
1786
1837
  mocks:
1787
1838
  route-intent:
1788
1839
  intent: chat
@@ -1804,7 +1855,7 @@ tests:
1804
1855
  - "<id>jira</id>"
1805
1856
  - "## Jira Tools"
1806
1857
  - "<id>code-explorer</id>"
1807
- - "## Code Exploration"
1858
+ - "## Code Explorer"
1808
1859
  outputs:
1809
1860
  - step: build-config
1810
1861
  path: knowledge_content
@@ -1817,7 +1868,7 @@ tests:
1817
1868
  matches: "<id>code-explorer</id>"
1818
1869
  - step: build-config
1819
1870
  path: knowledge_content
1820
- matches: "Code Exploration"
1871
+ matches: "Code Explorer"
1821
1872
 
1822
1873
  # =========================================================================
1823
1874
  # MCP Server Configuration Assertions
@@ -2070,7 +2121,15 @@ tests:
2070
2121
  skills:
2071
2122
  - id: code-explorer
2072
2123
  description: code exploration
2073
- knowledge: "Use code-explorer for questions"
2124
+ knowledge: |
2125
+ ## Code Explorer
2126
+ Use the code-talk tool to explore code repositories.
2127
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
2128
+ - If a call returns confidence "high", trust the answer — do NOT re-call
2129
+ with a rephrased version of the same question
2130
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
2131
+ - If confidence is "medium" or "low", check confidence_reason for what to
2132
+ refine before re-exploring
2074
2133
  tools:
2075
2134
  code-explorer:
2076
2135
  workflow: code-talk
@@ -2117,6 +2176,15 @@ tests:
2117
2176
  skills:
2118
2177
  - id: code-explorer
2119
2178
  description: code exploration
2179
+ knowledge: |
2180
+ ## Code Explorer
2181
+ Use the code-talk tool to explore code repositories.
2182
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
2183
+ - If a call returns confidence "high", trust the answer — do NOT re-call
2184
+ with a rephrased version of the same question
2185
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
2186
+ - If confidence is "medium" or "low", check confidence_reason for what to
2187
+ refine before re-exploring
2120
2188
  tools:
2121
2189
  code-explorer:
2122
2190
  workflow: code-talk
@@ -538,13 +538,13 @@ steps:
538
538
  6. Have you ruled out other possibilities with evidence, not assumptions?
539
539
 
540
540
  Cross-project verification:
541
- 4. Are there dependencies or interactions between projects you examined?
542
- 5. Did you verify how data/config flows between components?
543
- 6. Did you check both the happy path AND error handling paths?
541
+ 7. Are there dependencies or interactions between projects you examined?
542
+ 8. Did you verify how data/config flows between components?
543
+ 9. Did you check both the happy path AND error handling paths?
544
544
 
545
545
  Reference accuracy:
546
- 7. Are all code references accurate (file paths, function names, line numbers)?
547
- 8. Do the docs match what the code actually does?
546
+ 10. Are all code references accurate (file paths, function names, line numbers)?
547
+ 11. Do the docs match what the code actually does?
548
548
 
549
549
  CRITICAL: When you identify a configuration variable, you MUST then perform
550
550
  a second search to find the code that consumes that variable. You cannot draw
@@ -554,11 +554,25 @@ steps:
554
554
  If you found any ambiguity, gaps in your investigation, or unexplored
555
555
  hypotheses - use delegate tool to investigate further before concluding.
556
556
 
557
+ Confidence calibration — be HONEST, not optimistic:
558
+ - "high" ONLY when you found definitive code evidence that FULLY answers the
559
+ question, you verified the complete call chain, AND you ruled out alternative
560
+ explanations with evidence (not assumptions)
561
+ - "medium" when you found relevant code but could not verify all aspects, when
562
+ there are alternative explanations you did not fully rule out, or when you
563
+ traced only part of the execution path
564
+ - "low" when your answer is based on naming conventions, comments, partial
565
+ evidence, or inference without direct code confirmation
566
+ - Do NOT default to "high" — inflated confidence causes the caller to skip
567
+ needed follow-up investigation, wasting time on redundant re-exploration
568
+ - If confidence is not "high", the confidence_reason MUST clearly state what
569
+ evidence is missing or which alternative theories remain unverified
570
+
557
571
  When you finish, ensure your answer includes:
558
572
  - All relevant details grounded in code
559
573
  - If multiple theories were considered, explain which one is correct and WHY
560
574
  (with evidence ruling out alternatives)
561
- - A confidence score ("high", "medium", or "low")
575
+ - A confidence score ("high", "medium", or "low") calibrated per the rules above
562
576
  - If confidence is "medium" or "low", include a clear confidence_reason
563
577
  explaining what evidence is missing or ambiguous
564
578
  - At the END of your answer.text, append a "## References" section with a
@@ -26,6 +26,21 @@
26
26
  # - id: chat
27
27
  # description: general Q&A
28
28
  # skills:
29
+ # - id: code-explorer
30
+ # description: needs codebase exploration
31
+ # knowledge: |
32
+ # ## Code Explorer
33
+ # Use the code-talk tool to explore code repositories.
34
+ # The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
35
+ # - If a call returns confidence "high", trust the answer — do NOT re-call
36
+ # with a rephrased version of the same question
37
+ # - Only call again for a genuinely DIFFERENT aspect of the codebase
38
+ # - If confidence is "medium" or "low", check confidence_reason for what to
39
+ # refine before re-exploring
40
+ # tools:
41
+ # code-talk:
42
+ # workflow: code-talk
43
+ # inputs: {}
29
44
  # - id: jira
30
45
  # description: needs Jira access
31
46
  # knowledge: "Use jira_get_issue to fetch tickets."
@@ -938,8 +953,14 @@ tests:
938
953
  knowledge:
939
954
  - intent: code_help
940
955
  content: |
941
- ## Code Exploration
942
- Use the code-explorer tool to search the codebase.
956
+ ## Code Explorer
957
+ Use the code-talk tool to explore code repositories.
958
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
959
+ - If a call returns confidence "high", trust the answer — do NOT re-call
960
+ with a rephrased version of the same question
961
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
962
+ - If confidence is "medium" or "low", check confidence_reason for what to
963
+ refine before re-exploring
943
964
  mocks:
944
965
  route-intent:
945
966
  intent: code_help
@@ -1102,7 +1123,13 @@ tests:
1102
1123
  description: needs codebase exploration
1103
1124
  knowledge: |
1104
1125
  ## Code Explorer
1105
- Use to search codebase.
1126
+ Use the code-talk tool to explore code repositories.
1127
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
1128
+ - If a call returns confidence "high", trust the answer — do NOT re-call
1129
+ with a rephrased version of the same question
1130
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
1131
+ - If confidence is "medium" or "low", check confidence_reason for what to
1132
+ refine before re-exploring
1106
1133
  tools:
1107
1134
  code-talk:
1108
1135
  workflow: code-talk
@@ -1192,6 +1219,15 @@ tests:
1192
1219
  skills:
1193
1220
  - id: code-explorer
1194
1221
  description: needs codebase exploration
1222
+ knowledge: |
1223
+ ## Code Explorer
1224
+ Use the code-talk tool to explore code repositories.
1225
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
1226
+ - If a call returns confidence "high", trust the answer — do NOT re-call
1227
+ with a rephrased version of the same question
1228
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
1229
+ - If confidence is "medium" or "low", check confidence_reason for what to
1230
+ refine before re-exploring
1195
1231
  tools:
1196
1232
  code-explorer:
1197
1233
  workflow: code-talk
@@ -1600,6 +1636,15 @@ tests:
1600
1636
  skills:
1601
1637
  - id: code-explorer
1602
1638
  description: needs code exploration
1639
+ knowledge: |
1640
+ ## Code Explorer
1641
+ Use the code-talk tool to explore code repositories.
1642
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
1643
+ - If a call returns confidence "high", trust the answer — do NOT re-call
1644
+ with a rephrased version of the same question
1645
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
1646
+ - If confidence is "medium" or "low", check confidence_reason for what to
1647
+ refine before re-exploring
1603
1648
  tools:
1604
1649
  code-explorer:
1605
1650
  workflow: code-talk
@@ -1781,8 +1826,14 @@ tests:
1781
1826
  - id: code-explorer
1782
1827
  description: needs code exploration
1783
1828
  knowledge: |
1784
- ## Code Exploration
1785
- Use code-explorer tool to search code.
1829
+ ## Code Explorer
1830
+ Use the code-talk tool to explore code repositories.
1831
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
1832
+ - If a call returns confidence "high", trust the answer — do NOT re-call
1833
+ with a rephrased version of the same question
1834
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
1835
+ - If confidence is "medium" or "low", check confidence_reason for what to
1836
+ refine before re-exploring
1786
1837
  mocks:
1787
1838
  route-intent:
1788
1839
  intent: chat
@@ -1804,7 +1855,7 @@ tests:
1804
1855
  - "<id>jira</id>"
1805
1856
  - "## Jira Tools"
1806
1857
  - "<id>code-explorer</id>"
1807
- - "## Code Exploration"
1858
+ - "## Code Explorer"
1808
1859
  outputs:
1809
1860
  - step: build-config
1810
1861
  path: knowledge_content
@@ -1817,7 +1868,7 @@ tests:
1817
1868
  matches: "<id>code-explorer</id>"
1818
1869
  - step: build-config
1819
1870
  path: knowledge_content
1820
- matches: "Code Exploration"
1871
+ matches: "Code Explorer"
1821
1872
 
1822
1873
  # =========================================================================
1823
1874
  # MCP Server Configuration Assertions
@@ -2070,7 +2121,15 @@ tests:
2070
2121
  skills:
2071
2122
  - id: code-explorer
2072
2123
  description: code exploration
2073
- knowledge: "Use code-explorer for questions"
2124
+ knowledge: |
2125
+ ## Code Explorer
2126
+ Use the code-talk tool to explore code repositories.
2127
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
2128
+ - If a call returns confidence "high", trust the answer — do NOT re-call
2129
+ with a rephrased version of the same question
2130
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
2131
+ - If confidence is "medium" or "low", check confidence_reason for what to
2132
+ refine before re-exploring
2074
2133
  tools:
2075
2134
  code-explorer:
2076
2135
  workflow: code-talk
@@ -2117,6 +2176,15 @@ tests:
2117
2176
  skills:
2118
2177
  - id: code-explorer
2119
2178
  description: code exploration
2179
+ knowledge: |
2180
+ ## Code Explorer
2181
+ Use the code-talk tool to explore code repositories.
2182
+ The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
2183
+ - If a call returns confidence "high", trust the answer — do NOT re-call
2184
+ with a rephrased version of the same question
2185
+ - Only call again for a genuinely DIFFERENT aspect of the codebase
2186
+ - If confidence is "medium" or "low", check confidence_reason for what to
2187
+ refine before re-exploring
2120
2188
  tools:
2121
2189
  code-explorer:
2122
2190
  workflow: code-talk
@@ -538,13 +538,13 @@ steps:
538
538
  6. Have you ruled out other possibilities with evidence, not assumptions?
539
539
 
540
540
  Cross-project verification:
541
- 4. Are there dependencies or interactions between projects you examined?
542
- 5. Did you verify how data/config flows between components?
543
- 6. Did you check both the happy path AND error handling paths?
541
+ 7. Are there dependencies or interactions between projects you examined?
542
+ 8. Did you verify how data/config flows between components?
543
+ 9. Did you check both the happy path AND error handling paths?
544
544
 
545
545
  Reference accuracy:
546
- 7. Are all code references accurate (file paths, function names, line numbers)?
547
- 8. Do the docs match what the code actually does?
546
+ 10. Are all code references accurate (file paths, function names, line numbers)?
547
+ 11. Do the docs match what the code actually does?
548
548
 
549
549
  CRITICAL: When you identify a configuration variable, you MUST then perform
550
550
  a second search to find the code that consumes that variable. You cannot draw
@@ -554,11 +554,25 @@ steps:
554
554
  If you found any ambiguity, gaps in your investigation, or unexplored
555
555
  hypotheses - use delegate tool to investigate further before concluding.
556
556
 
557
+ Confidence calibration — be HONEST, not optimistic:
558
+ - "high" ONLY when you found definitive code evidence that FULLY answers the
559
+ question, you verified the complete call chain, AND you ruled out alternative
560
+ explanations with evidence (not assumptions)
561
+ - "medium" when you found relevant code but could not verify all aspects, when
562
+ there are alternative explanations you did not fully rule out, or when you
563
+ traced only part of the execution path
564
+ - "low" when your answer is based on naming conventions, comments, partial
565
+ evidence, or inference without direct code confirmation
566
+ - Do NOT default to "high" — inflated confidence causes the caller to skip
567
+ needed follow-up investigation, wasting time on redundant re-exploration
568
+ - If confidence is not "high", the confidence_reason MUST clearly state what
569
+ evidence is missing or which alternative theories remain unverified
570
+
557
571
  When you finish, ensure your answer includes:
558
572
  - All relevant details grounded in code
559
573
  - If multiple theories were considered, explain which one is correct and WHY
560
574
  (with evidence ruling out alternatives)
561
- - A confidence score ("high", "medium", or "low")
575
+ - A confidence score ("high", "medium", or "low") calibrated per the rules above
562
576
  - If confidence is "medium" or "low", include a clear confidence_reason
563
577
  explaining what evidence is missing or ambiguous
564
578
  - At the END of your answer.text, append a "## References" section with a