npm - @probelabs/visor - Versions diffs - 0.1.156 → 0.1.157-ee - Mend

@probelabs/visor 0.1.156 → 0.1.157-ee

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (57)

package/defaults/assistant.yaml CHANGED

@@ -26,6 +26,21 @@
 #           - id: chat
 #             description: general Q&A
 #         skills:
+#           - id: code-explorer
+#             description: needs codebase exploration
+#             knowledge: |
+#               ## Code Explorer
+#               Use the code-talk tool to explore code repositories.
+#               The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+#               - If a call returns confidence "high", trust the answer — do NOT re-call
+#                 with a rephrased version of the same question
+#               - Only call again for a genuinely DIFFERENT aspect of the codebase
+#               - If confidence is "medium" or "low", check confidence_reason for what to
+#                 refine before re-exploring
+#             tools:
+#               code-talk:
+#                 workflow: code-talk
+#                 inputs: {}
 #           - id: jira
 #             description: needs Jira access
 #             knowledge: "Use jira_get_issue to fetch tickets."
@@ -938,8 +953,14 @@ tests:
         knowledge:
           - intent: code_help
             content: |
-              ## Code Exploration
-              Use the code-explorer tool to search the codebase.
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
       mocks:
         route-intent:
           intent: code_help
@@ -1102,7 +1123,13 @@ tests:
             description: needs codebase exploration
             knowledge: |
               ## Code Explorer
-              Use to search codebase.
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-talk:
                 workflow: code-talk
@@ -1192,6 +1219,15 @@ tests:
         skills:
           - id: code-explorer
             description: needs codebase exploration
+            knowledge: |
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-explorer:
                 workflow: code-talk
@@ -1600,6 +1636,15 @@ tests:
         skills:
           - id: code-explorer
             description: needs code exploration
+            knowledge: |
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-explorer:
                 workflow: code-talk
@@ -1781,8 +1826,14 @@ tests:
           - id: code-explorer
             description: needs code exploration
             knowledge: |
-              ## Code Exploration
-              Use code-explorer tool to search code.
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
       mocks:
         route-intent:
           intent: chat
@@ -1804,7 +1855,7 @@ tests:
               - "<id>jira</id>"
               - "## Jira Tools"
               - "<id>code-explorer</id>"
-              - "## Code Exploration"
+              - "## Code Explorer"
         outputs:
           - step: build-config
             path: knowledge_content
@@ -1817,7 +1868,7 @@ tests:
             matches: "<id>code-explorer</id>"
           - step: build-config
             path: knowledge_content
-            matches: "Code Exploration"
+            matches: "Code Explorer"
     # =========================================================================
     # MCP Server Configuration Assertions
@@ -2070,7 +2121,15 @@ tests:
         skills:
           - id: code-explorer
             description: code exploration
-            knowledge: "Use code-explorer for questions"
+            knowledge: |
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-explorer:
                 workflow: code-talk
@@ -2117,6 +2176,15 @@ tests:
         skills:
           - id: code-explorer
             description: code exploration
+            knowledge: |
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-explorer:
                 workflow: code-talk
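
The knowledge text added in these hunks describes a caller-side policy for acting on the `confidence` field. A minimal sketch of that policy in Python follows, as an illustration only. The `CodeTalkResult` class, the `plan_follow_up` function, and the returned action strings are hypothetical names; only `confidence` and `confidence_reason` appear in the diff, and nothing here is taken from the package's own code.

```python
from dataclasses import dataclass
from typing import Optional

VALID_LEVELS = ("high", "medium", "low")


@dataclass
class CodeTalkResult:
    """Hypothetical reply shape; only the two confidence fields come from the diff."""
    text: str
    confidence: str
    confidence_reason: Optional[str] = None


def plan_follow_up(result: CodeTalkResult, different_aspect: bool = False) -> str:
    """Return the next action suggested by the knowledge text (assumed labels)."""
    if result.confidence not in VALID_LEVELS:
        raise ValueError(f"unexpected confidence: {result.confidence!r}")
    if different_aspect:
        # A genuinely different aspect of the codebase justifies a new call.
        return "call_again"
    if result.confidence == "high":
        # Same question, high confidence: trust the answer, do not rephrase and re-call.
        return "trust"
    # "medium" or "low": read confidence_reason to decide what to refine first.
    return "refine"
```

For example, `plan_follow_up(CodeTalkResult("answer", "low", "only matched names"))` yields `"refine"`, while a `"high"` result for the same question yields `"trust"`.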

package/defaults/code-talk.yaml CHANGED

@@ -538,13 +538,13 @@ steps:
         6. Have you ruled out other possibilities with evidence, not assumptions?
         Cross-project verification:
-        4. Are there dependencies or interactions between projects you examined?
-        5. Did you verify how data/config flows between components?
-        6. Did you check both the happy path AND error handling paths?
+        7. Are there dependencies or interactions between projects you examined?
+        8. Did you verify how data/config flows between components?
+        9. Did you check both the happy path AND error handling paths?
         Reference accuracy:
-        7. Are all code references accurate (file paths, function names, line numbers)?
-        8. Do the docs match what the code actually does?
+        10. Are all code references accurate (file paths, function names, line numbers)?
+        11. Do the docs match what the code actually does?
         CRITICAL: When you identify a configuration variable, you MUST then perform
         a second search to find the code that consumes that variable. You cannot draw
@@ -554,11 +554,25 @@ steps:
         If you found any ambiguity, gaps in your investigation, or unexplored
         hypotheses - use delegate tool to investigate further before concluding.
+        Confidence calibration — be HONEST, not optimistic:
+        - "high" ONLY when you found definitive code evidence that FULLY answers the
+          question, you verified the complete call chain, AND you ruled out alternative
+          explanations with evidence (not assumptions)
+        - "medium" when you found relevant code but could not verify all aspects, when
+          there are alternative explanations you did not fully rule out, or when you
+          traced only part of the execution path
+        - "low" when your answer is based on naming conventions, comments, partial
+          evidence, or inference without direct code confirmation
+        - Do NOT default to "high" — inflated confidence causes the caller to skip
+          needed follow-up investigation, wasting time on redundant re-exploration
+        - If confidence is not "high", the confidence_reason MUST clearly state what
+          evidence is missing or which alternative theories remain unverified
         When you finish, ensure your answer includes:
         - All relevant details grounded in code
         - If multiple theories were considered, explain which one is correct and WHY
           (with evidence ruling out alternatives)
-        - A confidence score ("high", "medium", or "low")
+        - A confidence score ("high", "medium", or "low") calibrated per the rules above
         - If confidence is "medium" or "low", include a clear confidence_reason
           explaining what evidence is missing or ambiguous
         - At the END of your answer.text, append a "## References" section with a
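
The calibration rules added in this hunk can be read as a small decision procedure. The sketch below is one possible reading in Python, not part of the package. The function names and the four boolean inputs are assumptions introduced for illustration; the prompt itself leaves these judgments to the model.

```python
from typing import Optional


def calibrate_confidence(
    direct_code_evidence: bool,
    fully_answers: bool,
    call_chain_verified: bool,
    alternatives_ruled_out: bool,
) -> str:
    """Map the evidence gathered (hypothetical flags) to a confidence level."""
    if not direct_code_evidence:
        # Naming conventions, comments, or inference only.
        return "low"
    if fully_answers and call_chain_verified and alternatives_ruled_out:
        # All three conditions are required; "high" is never the default.
        return "high"
    # Relevant code was found but something remains unverified.
    return "medium"


def reason_is_acceptable(confidence: str, confidence_reason: Optional[str]) -> bool:
    """A non-"high" confidence must come with a non-empty confidence_reason."""
    if confidence == "high":
        return True
    return bool(confidence_reason and confidence_reason.strip())
```

Under this reading, a partially traced execution path with direct code evidence gives `"medium"`, and a `"medium"` answer with an empty `confidence_reason` fails the check.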

package/dist/defaults/assistant.yaml CHANGED

@@ -26,6 +26,21 @@
 #           - id: chat
 #             description: general Q&A
 #         skills:
+#           - id: code-explorer
+#             description: needs codebase exploration
+#             knowledge: |
+#               ## Code Explorer
+#               Use the code-talk tool to explore code repositories.
+#               The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+#               - If a call returns confidence "high", trust the answer — do NOT re-call
+#                 with a rephrased version of the same question
+#               - Only call again for a genuinely DIFFERENT aspect of the codebase
+#               - If confidence is "medium" or "low", check confidence_reason for what to
+#                 refine before re-exploring
+#             tools:
+#               code-talk:
+#                 workflow: code-talk
+#                 inputs: {}
 #           - id: jira
 #             description: needs Jira access
 #             knowledge: "Use jira_get_issue to fetch tickets."
@@ -938,8 +953,14 @@ tests:
         knowledge:
           - intent: code_help
             content: |
-              ## Code Exploration
-              Use the code-explorer tool to search the codebase.
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
       mocks:
         route-intent:
           intent: code_help
@@ -1102,7 +1123,13 @@ tests:
             description: needs codebase exploration
             knowledge: |
               ## Code Explorer
-              Use to search codebase.
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-talk:
                 workflow: code-talk
@@ -1192,6 +1219,15 @@ tests:
         skills:
           - id: code-explorer
             description: needs codebase exploration
+            knowledge: |
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-explorer:
                 workflow: code-talk
@@ -1600,6 +1636,15 @@ tests:
         skills:
           - id: code-explorer
             description: needs code exploration
+            knowledge: |
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-explorer:
                 workflow: code-talk
@@ -1781,8 +1826,14 @@ tests:
           - id: code-explorer
             description: needs code exploration
             knowledge: |
-              ## Code Exploration
-              Use code-explorer tool to search code.
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
       mocks:
         route-intent:
           intent: chat
@@ -1804,7 +1855,7 @@ tests:
               - "<id>jira</id>"
               - "## Jira Tools"
               - "<id>code-explorer</id>"
-              - "## Code Exploration"
+              - "## Code Explorer"
         outputs:
           - step: build-config
             path: knowledge_content
@@ -1817,7 +1868,7 @@ tests:
             matches: "<id>code-explorer</id>"
           - step: build-config
             path: knowledge_content
-            matches: "Code Exploration"
+            matches: "Code Explorer"
     # =========================================================================
     # MCP Server Configuration Assertions
@@ -2070,7 +2121,15 @@ tests:
         skills:
           - id: code-explorer
             description: code exploration
-            knowledge: "Use code-explorer for questions"
+            knowledge: |
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-explorer:
                 workflow: code-talk
@@ -2117,6 +2176,15 @@ tests:
         skills:
           - id: code-explorer
             description: code exploration
+            knowledge: |
+              ## Code Explorer
+              Use the code-talk tool to explore code repositories.
+              The tool returns `confidence` ("high"/"medium"/"low") and `confidence_reason`.
+              - If a call returns confidence "high", trust the answer — do NOT re-call
+                with a rephrased version of the same question
+              - Only call again for a genuinely DIFFERENT aspect of the codebase
+              - If confidence is "medium" or "low", check confidence_reason for what to
+                refine before re-exploring
             tools:
               code-explorer:
                 workflow: code-talk

package/dist/defaults/code-talk.yaml CHANGED

@@ -538,13 +538,13 @@ steps:
         6. Have you ruled out other possibilities with evidence, not assumptions?
         Cross-project verification:
-        4. Are there dependencies or interactions between projects you examined?
-        5. Did you verify how data/config flows between components?
-        6. Did you check both the happy path AND error handling paths?
+        7. Are there dependencies or interactions between projects you examined?
+        8. Did you verify how data/config flows between components?
+        9. Did you check both the happy path AND error handling paths?
         Reference accuracy:
-        7. Are all code references accurate (file paths, function names, line numbers)?
-        8. Do the docs match what the code actually does?
+        10. Are all code references accurate (file paths, function names, line numbers)?
+        11. Do the docs match what the code actually does?
         CRITICAL: When you identify a configuration variable, you MUST then perform
         a second search to find the code that consumes that variable. You cannot draw
@@ -554,11 +554,25 @@ steps:
         If you found any ambiguity, gaps in your investigation, or unexplored
         hypotheses - use delegate tool to investigate further before concluding.
+        Confidence calibration — be HONEST, not optimistic:
+        - "high" ONLY when you found definitive code evidence that FULLY answers the
+          question, you verified the complete call chain, AND you ruled out alternative
+          explanations with evidence (not assumptions)
+        - "medium" when you found relevant code but could not verify all aspects, when
+          there are alternative explanations you did not fully rule out, or when you
+          traced only part of the execution path
+        - "low" when your answer is based on naming conventions, comments, partial
+          evidence, or inference without direct code confirmation
+        - Do NOT default to "high" — inflated confidence causes the caller to skip
+          needed follow-up investigation, wasting time on redundant re-exploration
+        - If confidence is not "high", the confidence_reason MUST clearly state what
+          evidence is missing or which alternative theories remain unverified
         When you finish, ensure your answer includes:
         - All relevant details grounded in code
         - If multiple theories were considered, explain which one is correct and WHY
           (with evidence ruling out alternatives)
-        - A confidence score ("high", "medium", or "low")
+        - A confidence score ("high", "medium", or "low") calibrated per the rules above
         - If confidence is "medium" or "low", include a clear confidence_reason
           explaining what evidence is missing or ambiguous
         - At the END of your answer.text, append a "## References" section with a