@probelabs/visor 0.1.156 → 0.1.157-ee
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/defaults/assistant.yaml +76 -8
- package/defaults/code-talk.yaml +20 -6
- package/dist/defaults/assistant.yaml +76 -8
- package/dist/defaults/code-talk.yaml +20 -6
- package/dist/index.js +1842 -25
- package/dist/sdk/{check-provider-registry-GJ4EZAIO.mjs → check-provider-registry-OBUYAPPC.mjs} +2 -2
- package/dist/sdk/{chunk-D7STLGAD.mjs → chunk-2XSKH755.mjs} +9 -9
- package/dist/sdk/{chunk-2UN6C3VO.mjs.map → chunk-2XSKH755.mjs.map} +1 -1
- package/dist/sdk/{host-3CBVPE5U.mjs → host-MIHKJ63G.mjs} +2 -2
- package/dist/sdk/knex-store-CRORFJE6.mjs +527 -0
- package/dist/sdk/knex-store-CRORFJE6.mjs.map +1 -0
- package/dist/sdk/loader-NJCF7DUS.mjs +89 -0
- package/dist/sdk/loader-NJCF7DUS.mjs.map +1 -0
- package/dist/sdk/opa-policy-engine-S2S2ULEI.mjs +655 -0
- package/dist/sdk/opa-policy-engine-S2S2ULEI.mjs.map +1 -0
- package/dist/sdk/{schedule-tool-SLR7ZHBZ.mjs → schedule-tool-M2H7O5WL.mjs} +2 -2
- package/dist/sdk/{schedule-tool-handler-XVHYK4KT.mjs → schedule-tool-handler-J3CJWB6A.mjs} +2 -2
- package/dist/sdk/sdk.js +1633 -259
- package/dist/sdk/sdk.js.map +1 -1
- package/dist/sdk/sdk.mjs +4 -4
- package/dist/sdk/validator-XTZJZZJH.mjs +134 -0
- package/dist/sdk/validator-XTZJZZJH.mjs.map +1 -0
- package/dist/sdk/{workflow-check-provider-NYBQAYVH.mjs → workflow-check-provider-7RLM3T2I.mjs} +2 -2
- package/package.json +1 -1
- package/dist/output/traces/run-2026-03-04T18-51-17-205Z.ndjson +0 -138
- package/dist/output/traces/run-2026-03-04T18-52-01-841Z.ndjson +0 -2197
- package/dist/sdk/check-provider-registry-NXGF7OAN.mjs +0 -29
- package/dist/sdk/chunk-2UN6C3VO.mjs +0 -43774
- package/dist/sdk/chunk-D7STLGAD.mjs.map +0 -1
- package/dist/sdk/chunk-G5JCPBXD.mjs +0 -739
- package/dist/sdk/chunk-G5JCPBXD.mjs.map +0 -1
- package/dist/sdk/chunk-YNZCDEI5.mjs +0 -443
- package/dist/sdk/chunk-YNZCDEI5.mjs.map +0 -1
- package/dist/sdk/chunk-ZSXL4I4C.mjs +0 -1502
- package/dist/sdk/chunk-ZSXL4I4C.mjs.map +0 -1
- package/dist/sdk/failure-condition-evaluator-QGLA6IJ6.mjs +0 -17
- package/dist/sdk/github-frontend-G6NTMPEG.mjs +0 -1368
- package/dist/sdk/github-frontend-G6NTMPEG.mjs.map +0 -1
- package/dist/sdk/routing-GYSENS76.mjs +0 -25
- package/dist/sdk/schedule-tool-JLSHSHQA.mjs +0 -35
- package/dist/sdk/schedule-tool-JLSHSHQA.mjs.map +0 -1
- package/dist/sdk/schedule-tool-SLR7ZHBZ.mjs.map +0 -1
- package/dist/sdk/schedule-tool-handler-5A37VO62.mjs +0 -39
- package/dist/sdk/schedule-tool-handler-5A37VO62.mjs.map +0 -1
- package/dist/sdk/schedule-tool-handler-XVHYK4KT.mjs.map +0 -1
- package/dist/sdk/trace-helpers-HU65M6UT.mjs +0 -25
- package/dist/sdk/trace-helpers-HU65M6UT.mjs.map +0 -1
- package/dist/sdk/workflow-check-provider-M77ZHQDH.mjs +0 -29
- package/dist/sdk/workflow-check-provider-M77ZHQDH.mjs.map +0 -1
- package/dist/sdk/workflow-check-provider-NYBQAYVH.mjs.map +0 -1
- package/dist/traces/run-2026-03-04T18-51-17-205Z.ndjson +0 -138
- package/dist/traces/run-2026-03-04T18-52-01-841Z.ndjson +0 -2197
- /package/dist/sdk/{check-provider-registry-GJ4EZAIO.mjs.map → check-provider-registry-OBUYAPPC.mjs.map} +0 -0
- /package/dist/sdk/{host-3CBVPE5U.mjs.map → host-MIHKJ63G.mjs.map} +0 -0
- /package/dist/sdk/{check-provider-registry-NXGF7OAN.mjs.map → schedule-tool-M2H7O5WL.mjs.map} +0 -0
- /package/dist/sdk/{failure-condition-evaluator-QGLA6IJ6.mjs.map → schedule-tool-handler-J3CJWB6A.mjs.map} +0 -0
- /package/dist/sdk/{routing-GYSENS76.mjs.map → workflow-check-provider-7RLM3T2I.mjs.map} +0 -0
package/defaults/assistant.yaml
CHANGED
|
@@ -26,6 +26,21 @@
|
|
|
26
26
|
# - id: chat
|
|
27
27
|
# description: general Q&A
|
|
28
28
|
# skills:
|
|
29
|
+
# - id: code-explorer
|
|
30
|
+
# description: needs codebase exploration
|
|
31
|
+
# knowledge: |
|
|
32
|
+
# ## Code Explorer
|
|
33
|
+
# Use the code-talk tool to explore code repositories.
|
|
34
|
+
# The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
35
|
+
# - If a call returns confidence "high", trust the answer — do NOT re-call
|
|
36
|
+
# with a rephrased version of the same question
|
|
37
|
+
# - Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
38
|
+
# - If confidence is "medium" or "low", check confidence_reason for what to
|
|
39
|
+
# refine before re-exploring
|
|
40
|
+
# tools:
|
|
41
|
+
# code-talk:
|
|
42
|
+
# workflow: code-talk
|
|
43
|
+
# inputs: {}
|
|
29
44
|
# - id: jira
|
|
30
45
|
# description: needs Jira access
|
|
31
46
|
# knowledge: "Use jira_get_issue to fetch tickets."
|
|
@@ -938,8 +953,14 @@ tests:
|
|
|
938
953
|
knowledge:
|
|
939
954
|
- intent: code_help
|
|
940
955
|
content: |
|
|
941
|
-
## Code
|
|
942
|
-
Use the code-
|
|
956
|
+
## Code Explorer
|
|
957
|
+
Use the code-talk tool to explore code repositories.
|
|
958
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
959
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
960
|
+
with a rephrased version of the same question
|
|
961
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
962
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
963
|
+
refine before re-exploring
|
|
943
964
|
mocks:
|
|
944
965
|
route-intent:
|
|
945
966
|
intent: code_help
|
|
@@ -1102,7 +1123,13 @@ tests:
|
|
|
1102
1123
|
description: needs codebase exploration
|
|
1103
1124
|
knowledge: |
|
|
1104
1125
|
## Code Explorer
|
|
1105
|
-
Use to
|
|
1126
|
+
Use the code-talk tool to explore code repositories.
|
|
1127
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
1128
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
1129
|
+
with a rephrased version of the same question
|
|
1130
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
1131
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
1132
|
+
refine before re-exploring
|
|
1106
1133
|
tools:
|
|
1107
1134
|
code-talk:
|
|
1108
1135
|
workflow: code-talk
|
|
@@ -1192,6 +1219,15 @@ tests:
|
|
|
1192
1219
|
skills:
|
|
1193
1220
|
- id: code-explorer
|
|
1194
1221
|
description: needs codebase exploration
|
|
1222
|
+
knowledge: |
|
|
1223
|
+
## Code Explorer
|
|
1224
|
+
Use the code-talk tool to explore code repositories.
|
|
1225
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
1226
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
1227
|
+
with a rephrased version of the same question
|
|
1228
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
1229
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
1230
|
+
refine before re-exploring
|
|
1195
1231
|
tools:
|
|
1196
1232
|
code-explorer:
|
|
1197
1233
|
workflow: code-talk
|
|
@@ -1600,6 +1636,15 @@ tests:
|
|
|
1600
1636
|
skills:
|
|
1601
1637
|
- id: code-explorer
|
|
1602
1638
|
description: needs code exploration
|
|
1639
|
+
knowledge: |
|
|
1640
|
+
## Code Explorer
|
|
1641
|
+
Use the code-talk tool to explore code repositories.
|
|
1642
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
1643
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
1644
|
+
with a rephrased version of the same question
|
|
1645
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
1646
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
1647
|
+
refine before re-exploring
|
|
1603
1648
|
tools:
|
|
1604
1649
|
code-explorer:
|
|
1605
1650
|
workflow: code-talk
|
|
@@ -1781,8 +1826,14 @@ tests:
|
|
|
1781
1826
|
- id: code-explorer
|
|
1782
1827
|
description: needs code exploration
|
|
1783
1828
|
knowledge: |
|
|
1784
|
-
## Code
|
|
1785
|
-
Use code-
|
|
1829
|
+
## Code Explorer
|
|
1830
|
+
Use the code-talk tool to explore code repositories.
|
|
1831
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
1832
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
1833
|
+
with a rephrased version of the same question
|
|
1834
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
1835
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
1836
|
+
refine before re-exploring
|
|
1786
1837
|
mocks:
|
|
1787
1838
|
route-intent:
|
|
1788
1839
|
intent: chat
|
|
@@ -1804,7 +1855,7 @@ tests:
|
|
|
1804
1855
|
- "<id>jira</id>"
|
|
1805
1856
|
- "## Jira Tools"
|
|
1806
1857
|
- "<id>code-explorer</id>"
|
|
1807
|
-
- "## Code
|
|
1858
|
+
- "## Code Explorer"
|
|
1808
1859
|
outputs:
|
|
1809
1860
|
- step: build-config
|
|
1810
1861
|
path: knowledge_content
|
|
@@ -1817,7 +1868,7 @@ tests:
|
|
|
1817
1868
|
matches: "<id>code-explorer</id>"
|
|
1818
1869
|
- step: build-config
|
|
1819
1870
|
path: knowledge_content
|
|
1820
|
-
matches: "Code
|
|
1871
|
+
matches: "Code Explorer"
|
|
1821
1872
|
|
|
1822
1873
|
# =========================================================================
|
|
1823
1874
|
# MCP Server Configuration Assertions
|
|
@@ -2070,7 +2121,15 @@ tests:
|
|
|
2070
2121
|
skills:
|
|
2071
2122
|
- id: code-explorer
|
|
2072
2123
|
description: code exploration
|
|
2073
|
-
knowledge:
|
|
2124
|
+
knowledge: |
|
|
2125
|
+
## Code Explorer
|
|
2126
|
+
Use the code-talk tool to explore code repositories.
|
|
2127
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
2128
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
2129
|
+
with a rephrased version of the same question
|
|
2130
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
2131
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
2132
|
+
refine before re-exploring
|
|
2074
2133
|
tools:
|
|
2075
2134
|
code-explorer:
|
|
2076
2135
|
workflow: code-talk
|
|
@@ -2117,6 +2176,15 @@ tests:
|
|
|
2117
2176
|
skills:
|
|
2118
2177
|
- id: code-explorer
|
|
2119
2178
|
description: code exploration
|
|
2179
|
+
knowledge: |
|
|
2180
|
+
## Code Explorer
|
|
2181
|
+
Use the code-talk tool to explore code repositories.
|
|
2182
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
2183
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
2184
|
+
with a rephrased version of the same question
|
|
2185
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
2186
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
2187
|
+
refine before re-exploring
|
|
2120
2188
|
tools:
|
|
2121
2189
|
code-explorer:
|
|
2122
2190
|
workflow: code-talk
|
package/defaults/code-talk.yaml
CHANGED
|
@@ -538,13 +538,13 @@ steps:
|
|
|
538
538
|
6. Have you ruled out other possibilities with evidence, not assumptions?
|
|
539
539
|
|
|
540
540
|
Cross-project verification:
|
|
541
|
-
|
|
542
|
-
|
|
543
|
-
|
|
541
|
+
7. Are there dependencies or interactions between projects you examined?
|
|
542
|
+
8. Did you verify how data/config flows between components?
|
|
543
|
+
9. Did you check both the happy path AND error handling paths?
|
|
544
544
|
|
|
545
545
|
Reference accuracy:
|
|
546
|
-
|
|
547
|
-
|
|
546
|
+
10. Are all code references accurate (file paths, function names, line numbers)?
|
|
547
|
+
11. Do the docs match what the code actually does?
|
|
548
548
|
|
|
549
549
|
CRITICAL: When you identify a configuration variable, you MUST then perform
|
|
550
550
|
a second search to find the code that consumes that variable. You cannot draw
|
|
@@ -554,11 +554,25 @@ steps:
|
|
|
554
554
|
If you found any ambiguity, gaps in your investigation, or unexplored
|
|
555
555
|
hypotheses - use delegate tool to investigate further before concluding.
|
|
556
556
|
|
|
557
|
+
Confidence calibration — be HONEST, not optimistic:
|
|
558
|
+
- "high" ONLY when you found definitive code evidence that FULLY answers the
|
|
559
|
+
question, you verified the complete call chain, AND you ruled out alternative
|
|
560
|
+
explanations with evidence (not assumptions)
|
|
561
|
+
- "medium" when you found relevant code but could not verify all aspects, when
|
|
562
|
+
there are alternative explanations you did not fully rule out, or when you
|
|
563
|
+
traced only part of the execution path
|
|
564
|
+
- "low" when your answer is based on naming conventions, comments, partial
|
|
565
|
+
evidence, or inference without direct code confirmation
|
|
566
|
+
- Do NOT default to "high" — inflated confidence causes the caller to skip
|
|
567
|
+
needed follow-up investigation, wasting time on redundant re-exploration
|
|
568
|
+
- If confidence is not "high", the confidence_reason MUST clearly state what
|
|
569
|
+
evidence is missing or which alternative theories remain unverified
|
|
570
|
+
|
|
557
571
|
When you finish, ensure your answer includes:
|
|
558
572
|
- All relevant details grounded in code
|
|
559
573
|
- If multiple theories were considered, explain which one is correct and WHY
|
|
560
574
|
(with evidence ruling out alternatives)
|
|
561
|
-
- A confidence score ("high", "medium", or "low")
|
|
575
|
+
- A confidence score ("high", "medium", or "low") calibrated per the rules above
|
|
562
576
|
- If confidence is "medium" or "low", include a clear confidence_reason
|
|
563
577
|
explaining what evidence is missing or ambiguous
|
|
564
578
|
- At the END of your answer.text, append a "## References" section with a
|
|
@@ -26,6 +26,21 @@
|
|
|
26
26
|
# - id: chat
|
|
27
27
|
# description: general Q&A
|
|
28
28
|
# skills:
|
|
29
|
+
# - id: code-explorer
|
|
30
|
+
# description: needs codebase exploration
|
|
31
|
+
# knowledge: |
|
|
32
|
+
# ## Code Explorer
|
|
33
|
+
# Use the code-talk tool to explore code repositories.
|
|
34
|
+
# The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
35
|
+
# - If a call returns confidence "high", trust the answer — do NOT re-call
|
|
36
|
+
# with a rephrased version of the same question
|
|
37
|
+
# - Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
38
|
+
# - If confidence is "medium" or "low", check confidence_reason for what to
|
|
39
|
+
# refine before re-exploring
|
|
40
|
+
# tools:
|
|
41
|
+
# code-talk:
|
|
42
|
+
# workflow: code-talk
|
|
43
|
+
# inputs: {}
|
|
29
44
|
# - id: jira
|
|
30
45
|
# description: needs Jira access
|
|
31
46
|
# knowledge: "Use jira_get_issue to fetch tickets."
|
|
@@ -938,8 +953,14 @@ tests:
|
|
|
938
953
|
knowledge:
|
|
939
954
|
- intent: code_help
|
|
940
955
|
content: |
|
|
941
|
-
## Code
|
|
942
|
-
Use the code-
|
|
956
|
+
## Code Explorer
|
|
957
|
+
Use the code-talk tool to explore code repositories.
|
|
958
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
959
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
960
|
+
with a rephrased version of the same question
|
|
961
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
962
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
963
|
+
refine before re-exploring
|
|
943
964
|
mocks:
|
|
944
965
|
route-intent:
|
|
945
966
|
intent: code_help
|
|
@@ -1102,7 +1123,13 @@ tests:
|
|
|
1102
1123
|
description: needs codebase exploration
|
|
1103
1124
|
knowledge: |
|
|
1104
1125
|
## Code Explorer
|
|
1105
|
-
Use to
|
|
1126
|
+
Use the code-talk tool to explore code repositories.
|
|
1127
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
1128
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
1129
|
+
with a rephrased version of the same question
|
|
1130
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
1131
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
1132
|
+
refine before re-exploring
|
|
1106
1133
|
tools:
|
|
1107
1134
|
code-talk:
|
|
1108
1135
|
workflow: code-talk
|
|
@@ -1192,6 +1219,15 @@ tests:
|
|
|
1192
1219
|
skills:
|
|
1193
1220
|
- id: code-explorer
|
|
1194
1221
|
description: needs codebase exploration
|
|
1222
|
+
knowledge: |
|
|
1223
|
+
## Code Explorer
|
|
1224
|
+
Use the code-talk tool to explore code repositories.
|
|
1225
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
1226
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
1227
|
+
with a rephrased version of the same question
|
|
1228
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
1229
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
1230
|
+
refine before re-exploring
|
|
1195
1231
|
tools:
|
|
1196
1232
|
code-explorer:
|
|
1197
1233
|
workflow: code-talk
|
|
@@ -1600,6 +1636,15 @@ tests:
|
|
|
1600
1636
|
skills:
|
|
1601
1637
|
- id: code-explorer
|
|
1602
1638
|
description: needs code exploration
|
|
1639
|
+
knowledge: |
|
|
1640
|
+
## Code Explorer
|
|
1641
|
+
Use the code-talk tool to explore code repositories.
|
|
1642
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
1643
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
1644
|
+
with a rephrased version of the same question
|
|
1645
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
1646
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
1647
|
+
refine before re-exploring
|
|
1603
1648
|
tools:
|
|
1604
1649
|
code-explorer:
|
|
1605
1650
|
workflow: code-talk
|
|
@@ -1781,8 +1826,14 @@ tests:
|
|
|
1781
1826
|
- id: code-explorer
|
|
1782
1827
|
description: needs code exploration
|
|
1783
1828
|
knowledge: |
|
|
1784
|
-
## Code
|
|
1785
|
-
Use code-
|
|
1829
|
+
## Code Explorer
|
|
1830
|
+
Use the code-talk tool to explore code repositories.
|
|
1831
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
1832
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
1833
|
+
with a rephrased version of the same question
|
|
1834
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
1835
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
1836
|
+
refine before re-exploring
|
|
1786
1837
|
mocks:
|
|
1787
1838
|
route-intent:
|
|
1788
1839
|
intent: chat
|
|
@@ -1804,7 +1855,7 @@ tests:
|
|
|
1804
1855
|
- "<id>jira</id>"
|
|
1805
1856
|
- "## Jira Tools"
|
|
1806
1857
|
- "<id>code-explorer</id>"
|
|
1807
|
-
- "## Code
|
|
1858
|
+
- "## Code Explorer"
|
|
1808
1859
|
outputs:
|
|
1809
1860
|
- step: build-config
|
|
1810
1861
|
path: knowledge_content
|
|
@@ -1817,7 +1868,7 @@ tests:
|
|
|
1817
1868
|
matches: "<id>code-explorer</id>"
|
|
1818
1869
|
- step: build-config
|
|
1819
1870
|
path: knowledge_content
|
|
1820
|
-
matches: "Code
|
|
1871
|
+
matches: "Code Explorer"
|
|
1821
1872
|
|
|
1822
1873
|
# =========================================================================
|
|
1823
1874
|
# MCP Server Configuration Assertions
|
|
@@ -2070,7 +2121,15 @@ tests:
|
|
|
2070
2121
|
skills:
|
|
2071
2122
|
- id: code-explorer
|
|
2072
2123
|
description: code exploration
|
|
2073
|
-
knowledge:
|
|
2124
|
+
knowledge: |
|
|
2125
|
+
## Code Explorer
|
|
2126
|
+
Use the code-talk tool to explore code repositories.
|
|
2127
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
2128
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
2129
|
+
with a rephrased version of the same question
|
|
2130
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
2131
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
2132
|
+
refine before re-exploring
|
|
2074
2133
|
tools:
|
|
2075
2134
|
code-explorer:
|
|
2076
2135
|
workflow: code-talk
|
|
@@ -2117,6 +2176,15 @@ tests:
|
|
|
2117
2176
|
skills:
|
|
2118
2177
|
- id: code-explorer
|
|
2119
2178
|
description: code exploration
|
|
2179
|
+
knowledge: |
|
|
2180
|
+
## Code Explorer
|
|
2181
|
+
Use the code-talk tool to explore code repositories.
|
|
2182
|
+
The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
|
|
2183
|
+
- If a call returns confidence "high", trust the answer — do NOT re-call
|
|
2184
|
+
with a rephrased version of the same question
|
|
2185
|
+
- Only call again for a genuinely DIFFERENT aspect of the codebase
|
|
2186
|
+
- If confidence is "medium" or "low", check confidence_reason for what to
|
|
2187
|
+
refine before re-exploring
|
|
2120
2188
|
tools:
|
|
2121
2189
|
code-explorer:
|
|
2122
2190
|
workflow: code-talk
|
|
@@ -538,13 +538,13 @@ steps:
|
|
|
538
538
|
6. Have you ruled out other possibilities with evidence, not assumptions?
|
|
539
539
|
|
|
540
540
|
Cross-project verification:
|
|
541
|
-
|
|
542
|
-
|
|
543
|
-
|
|
541
|
+
7. Are there dependencies or interactions between projects you examined?
|
|
542
|
+
8. Did you verify how data/config flows between components?
|
|
543
|
+
9. Did you check both the happy path AND error handling paths?
|
|
544
544
|
|
|
545
545
|
Reference accuracy:
|
|
546
|
-
|
|
547
|
-
|
|
546
|
+
10. Are all code references accurate (file paths, function names, line numbers)?
|
|
547
|
+
11. Do the docs match what the code actually does?
|
|
548
548
|
|
|
549
549
|
CRITICAL: When you identify a configuration variable, you MUST then perform
|
|
550
550
|
a second search to find the code that consumes that variable. You cannot draw
|
|
@@ -554,11 +554,25 @@ steps:
|
|
|
554
554
|
If you found any ambiguity, gaps in your investigation, or unexplored
|
|
555
555
|
hypotheses - use delegate tool to investigate further before concluding.
|
|
556
556
|
|
|
557
|
+
Confidence calibration — be HONEST, not optimistic:
|
|
558
|
+
- "high" ONLY when you found definitive code evidence that FULLY answers the
|
|
559
|
+
question, you verified the complete call chain, AND you ruled out alternative
|
|
560
|
+
explanations with evidence (not assumptions)
|
|
561
|
+
- "medium" when you found relevant code but could not verify all aspects, when
|
|
562
|
+
there are alternative explanations you did not fully rule out, or when you
|
|
563
|
+
traced only part of the execution path
|
|
564
|
+
- "low" when your answer is based on naming conventions, comments, partial
|
|
565
|
+
evidence, or inference without direct code confirmation
|
|
566
|
+
- Do NOT default to "high" — inflated confidence causes the caller to skip
|
|
567
|
+
needed follow-up investigation, wasting time on redundant re-exploration
|
|
568
|
+
- If confidence is not "high", the confidence_reason MUST clearly state what
|
|
569
|
+
evidence is missing or which alternative theories remain unverified
|
|
570
|
+
|
|
557
571
|
When you finish, ensure your answer includes:
|
|
558
572
|
- All relevant details grounded in code
|
|
559
573
|
- If multiple theories were considered, explain which one is correct and WHY
|
|
560
574
|
(with evidence ruling out alternatives)
|
|
561
|
-
- A confidence score ("high", "medium", or "low")
|
|
575
|
+
- A confidence score ("high", "medium", or "low") calibrated per the rules above
|
|
562
576
|
- If confidence is "medium" or "low", include a clear confidence_reason
|
|
563
577
|
explaining what evidence is missing or ambiguous
|
|
564
578
|
- At the END of your answer.text, append a "## References" section with a
|