RubyGems - cov-loupe - Versions diffs - 3.0.0 → 4.0.0.pre - Mend

cov-loupe 3.0.0 → 4.0.0.pre

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (281) hide show

checksums.yaml +4 -4
data/AGENTS.md +230 -0
data/CLAUDE.md +5 -0
data/CODE_OF_CONDUCT.md +62 -0
data/CONTRIBUTING.md +102 -0
data/GEMINI.md +5 -0
data/README.md +154 -51
data/RELEASE_NOTES.md +452 -0
data/dev/images/cov-loupe-icon-lores.png +0 -0
data/dev/images/cov-loupe-icon-square.png +0 -0
data/dev/images/cov-loupe-icon.png +0 -0
data/dev/images/cov-loupe-logo.png +0 -0
data/dev/prompts/README.md +74 -0
data/dev/prompts/archive/architectural-review-and-actions-prompt.md +53 -0
data/dev/prompts/archive/investigate-and-report-issues-prompt.md +33 -0
data/dev/prompts/archive/produce-action-items-prompt.md +25 -0
data/dev/prompts/guidelines/ai-code-evaluator-guidelines.md +337 -0
data/dev/prompts/improve/refactor-test-suite.md +18 -0
data/dev/prompts/improve/simplify-code-logic.md +133 -0
data/dev/prompts/improve/update-documentation.md +21 -0
data/dev/prompts/review/comprehensive-codebase-review.md +176 -0
data/dev/prompts/review/identify-action-items.md +143 -0
data/dev/prompts/review/verify-code-changes.md +54 -0
data/dev/prompts/validate/create-screencast-outline.md +234 -0
data/dev/prompts/validate/test-documentation-examples.md +180 -0
data/docs/QUICKSTART.md +63 -0
data/docs/assets/images/cov-loupe-logo-lores.png +0 -0
data/docs/assets/images/cov-loupe-logo.png +0 -0
data/docs/assets/images/favicon.png +0 -0
data/docs/assets/stylesheets/branding.css +16 -0
data/docs/assets/stylesheets/extra.css +15 -0
data/docs/code_of_conduct.md +1 -0
data/docs/contributing.md +1 -0
data/docs/dev/ARCHITECTURE.md +56 -11
data/docs/dev/DEVELOPMENT.md +116 -12
data/docs/dev/FUTURE_ENHANCEMENTS.md +14 -0
data/docs/dev/README.md +3 -2
data/docs/dev/RELEASING.md +2 -0
data/docs/dev/arch-decisions/README.md +10 -7
data/docs/dev/arch-decisions/application-architecture.md +259 -0
data/docs/dev/arch-decisions/coverage-data-quality.md +193 -0
data/docs/dev/arch-decisions/output-character-mode.md +217 -0
data/docs/dev/arch-decisions/path-resolution.md +90 -0
data/docs/dev/arch-decisions/{004-x-arch-decision.md → policy-validation.md} +32 -28
data/docs/dev/arch-decisions/{005-x-arch-decision.md → simplecov-integration.md} +47 -44
data/docs/dev/presentations/cov-loupe-presentation.md +15 -13
data/docs/examples/mcp-inputs.md +3 -0
data/docs/examples/prompts.md +3 -0
data/docs/examples/success_predicates.md +3 -0
data/docs/fixtures/demo_project/.resultset.json +170 -0
data/docs/fixtures/demo_project/README.md +6 -0
data/docs/fixtures/demo_project/app/controllers/admin/audit_logs_controller.rb +19 -0
data/docs/fixtures/demo_project/app/controllers/orders_controller.rb +26 -0
data/docs/fixtures/demo_project/app/models/order.rb +20 -0
data/docs/fixtures/demo_project/app/models/user.rb +19 -0
data/docs/fixtures/demo_project/lib/api/client.rb +22 -0
data/docs/fixtures/demo_project/lib/ops/jobs/cleanup_job.rb +16 -0
data/docs/fixtures/demo_project/lib/ops/jobs/report_job.rb +17 -0
data/docs/fixtures/demo_project/lib/payments/processor.rb +15 -0
data/docs/fixtures/demo_project/lib/payments/refund_service.rb +15 -0
data/docs/fixtures/demo_project/lib/payments/reporting/exporter.rb +16 -0
data/docs/index.md +1 -0
data/docs/license.md +3 -0
data/docs/release_notes.md +3 -0
data/docs/user/ADVANCED_USAGE.md +208 -115
data/docs/user/CLI_FALLBACK_FOR_LLMS.md +2 -0
data/docs/user/CLI_USAGE.md +276 -101
data/docs/user/ERROR_HANDLING.md +4 -4
data/docs/user/EXAMPLES.md +121 -128
data/docs/user/INSTALLATION.md +9 -28
data/docs/user/LIBRARY_API.md +227 -122
data/docs/user/MCP_INTEGRATION.md +114 -203
data/docs/user/README.md +5 -1
data/docs/user/TROUBLESHOOTING.md +49 -27
data/docs/user/installing-a-prelease-version-of-covloupe.md +43 -0
data/docs/user/{V2-BREAKING-CHANGES.md → migrations/MIGRATING_TO_V2.md} +62 -72
data/docs/user/migrations/MIGRATING_TO_V3.md +72 -0
data/docs/user/migrations/MIGRATING_TO_V4.md +591 -0
data/docs/user/migrations/README.md +22 -0
data/docs/user/prompts/README.md +9 -0
data/docs/user/prompts/non-web-coverage-analysis-prompt.md +103 -0
data/docs/user/prompts/rails-coverage-analysis-prompt.md +94 -0
data/docs/user/prompts/use-cli-not-mcp-prompt.md +53 -0
data/examples/cli_demo.sh +77 -0
data/examples/filter_and_table_demo-output.md +114 -0
data/examples/filter_and_table_demo.rb +174 -0
data/examples/fixtures/demo_project/coverage/.resultset.json +10 -0
data/examples/mcp-inputs/README.md +66 -0
data/examples/mcp-inputs/coverage_detailed.json +1 -0
data/examples/mcp-inputs/coverage_raw.json +1 -0
data/examples/mcp-inputs/coverage_summary.json +1 -0
data/examples/mcp-inputs/list.json +1 -0
data/examples/mcp-inputs/uncovered_lines.json +1 -0
data/examples/prompts/README.md +27 -0
data/examples/prompts/custom_resultset.txt +2 -0
data/examples/prompts/detailed_with_source.txt +2 -0
data/examples/prompts/list_lowest.txt +2 -0
data/examples/prompts/summary.txt +2 -0
data/examples/prompts/uncovered.txt +2 -0
data/examples/success_predicates/README.md +198 -0
data/examples/success_predicates/all_files_above_threshold_predicate.rb +21 -0
data/examples/success_predicates/directory_specific_thresholds_predicate.rb +30 -0
data/examples/success_predicates/project_coverage_minimum_predicate.rb +6 -0
data/lib/cov_loupe/base_tool.rb +229 -20
data/lib/cov_loupe/cli.rb +132 -23
data/lib/cov_loupe/commands/base_command.rb +25 -6
data/lib/cov_loupe/commands/command_factory.rb +0 -1
data/lib/cov_loupe/commands/detailed_command.rb +10 -5
data/lib/cov_loupe/commands/list_command.rb +2 -1
data/lib/cov_loupe/commands/raw_command.rb +7 -5
data/lib/cov_loupe/commands/summary_command.rb +12 -7
data/lib/cov_loupe/commands/totals_command.rb +74 -10
data/lib/cov_loupe/commands/uncovered_command.rb +7 -5
data/lib/cov_loupe/commands/validate_command.rb +11 -3
data/lib/cov_loupe/commands/version_command.rb +6 -4
data/lib/cov_loupe/{app_config.rb → config/app_config.rb} +13 -5
data/lib/cov_loupe/config/app_context.rb +43 -0
data/lib/cov_loupe/config/boolean_type.rb +91 -0
data/lib/cov_loupe/config/logger.rb +92 -0
data/lib/cov_loupe/{option_normalizers.rb → config/option_normalizers.rb} +55 -24
data/lib/cov_loupe/{option_parser_builder.rb → config/option_parser_builder.rb} +46 -24
data/lib/cov_loupe/coverage/coverage_calculator.rb +53 -0
data/lib/cov_loupe/coverage/coverage_reporter.rb +63 -0
data/lib/cov_loupe/coverage/coverage_table_formatter.rb +133 -0
data/lib/cov_loupe/{error_handler.rb → errors/error_handler.rb} +21 -33
data/lib/cov_loupe/{errors.rb → errors/errors.rb} +48 -71
data/lib/cov_loupe/formatters/formatters.rb +75 -0
data/lib/cov_loupe/formatters/source_formatter.rb +18 -7
data/lib/cov_loupe/formatters/table_formatter.rb +80 -0
data/lib/cov_loupe/loaders/all.rb +15 -0
data/lib/cov_loupe/loaders/all_cli.rb +10 -0
data/lib/cov_loupe/loaders/all_mcp.rb +23 -0
data/lib/cov_loupe/loaders/resultset_loader.rb +147 -0
data/lib/cov_loupe/mcp_server.rb +3 -2
data/lib/cov_loupe/model/model.rb +520 -0
data/lib/cov_loupe/model/model_data.rb +13 -0
data/lib/cov_loupe/model/model_data_cache.rb +116 -0
data/lib/cov_loupe/option_parsers/env_options_parser.rb +17 -6
data/lib/cov_loupe/option_parsers/error_helper.rb +16 -10
data/lib/cov_loupe/output_chars.rb +192 -0
data/lib/cov_loupe/paths/glob_utils.rb +100 -0
data/lib/cov_loupe/{path_relativizer.rb → paths/path_relativizer.rb} +5 -13
data/lib/cov_loupe/paths/path_utils.rb +265 -0
data/lib/cov_loupe/paths/volume_case_sensitivity.rb +173 -0
data/lib/cov_loupe/presenters/base_coverage_presenter.rb +9 -13
data/lib/cov_loupe/presenters/coverage_payload_presenter.rb +21 -0
data/lib/cov_loupe/presenters/payload_caching.rb +23 -0
data/lib/cov_loupe/presenters/project_coverage_presenter.rb +73 -21
data/lib/cov_loupe/presenters/project_totals_presenter.rb +16 -10
data/lib/cov_loupe/repositories/coverage_repository.rb +149 -0
data/lib/cov_loupe/resolvers/coverage_line_resolver.rb +90 -76
data/lib/cov_loupe/resolvers/{resolver_factory.rb → resolver_helpers.rb} +6 -5
data/lib/cov_loupe/resolvers/resultset_path_resolver.rb +40 -12
data/lib/cov_loupe/scripts/command_execution.rb +113 -0
data/lib/cov_loupe/scripts/latest_ci_status.rb +97 -0
data/lib/cov_loupe/scripts/pre_release_check.rb +164 -0
data/lib/cov_loupe/scripts/setup_doc_server.rb +23 -0
data/lib/cov_loupe/scripts/start_doc_server.rb +24 -0
data/lib/cov_loupe/staleness/stale_status.rb +23 -0
data/lib/cov_loupe/staleness/staleness_checker.rb +328 -0
data/lib/cov_loupe/staleness/staleness_message_formatter.rb +91 -0
data/lib/cov_loupe/tools/coverage_detailed_tool.rb +14 -15
data/lib/cov_loupe/tools/coverage_raw_tool.rb +14 -14
data/lib/cov_loupe/tools/coverage_summary_tool.rb +16 -16
data/lib/cov_loupe/tools/coverage_table_tool.rb +139 -21
data/lib/cov_loupe/tools/coverage_totals_tool.rb +31 -13
data/lib/cov_loupe/tools/help_tool.rb +16 -20
data/lib/cov_loupe/tools/list_tool.rb +65 -0
data/lib/cov_loupe/tools/uncovered_lines_tool.rb +14 -14
data/lib/cov_loupe/tools/validate_tool.rb +18 -24
data/lib/cov_loupe/tools/version_tool.rb +8 -3
data/lib/cov_loupe/version.rb +1 -1
data/lib/cov_loupe.rb +83 -55
metadata +184 -154
data/docs/dev/BRANCH_ONLY_COVERAGE.md +0 -158
data/docs/dev/arch-decisions/001-x-arch-decision.md +0 -95
data/docs/dev/arch-decisions/002-x-arch-decision.md +0 -159
data/docs/dev/arch-decisions/003-x-arch-decision.md +0 -165
data/lib/cov_loupe/app_context.rb +0 -26
data/lib/cov_loupe/constants.rb +0 -22
data/lib/cov_loupe/coverage_reporter.rb +0 -31
data/lib/cov_loupe/formatters.rb +0 -51
data/lib/cov_loupe/mode_detector.rb +0 -56
data/lib/cov_loupe/model.rb +0 -339
data/lib/cov_loupe/presenters/coverage_detailed_presenter.rb +0 -14
data/lib/cov_loupe/presenters/coverage_raw_presenter.rb +0 -14
data/lib/cov_loupe/presenters/coverage_summary_presenter.rb +0 -14
data/lib/cov_loupe/presenters/coverage_uncovered_presenter.rb +0 -14
data/lib/cov_loupe/resultset_loader.rb +0 -131
data/lib/cov_loupe/staleness_checker.rb +0 -247
data/lib/cov_loupe/table_formatter.rb +0 -64
data/lib/cov_loupe/tools/all_files_coverage_tool.rb +0 -51
data/lib/cov_loupe/util.rb +0 -88
data/spec/MCP_INTEGRATION_TESTS_README.md +0 -111
data/spec/TIMESTAMPS.md +0 -48
data/spec/all_files_coverage_tool_spec.rb +0 -53
data/spec/app_config_spec.rb +0 -142
data/spec/base_tool_spec.rb +0 -62
data/spec/cli/show_default_report_spec.rb +0 -33
data/spec/cli_enumerated_options_spec.rb +0 -90
data/spec/cli_error_spec.rb +0 -184
data/spec/cli_format_spec.rb +0 -123
data/spec/cli_json_options_spec.rb +0 -50
data/spec/cli_source_spec.rb +0 -44
data/spec/cli_spec.rb +0 -192
data/spec/cli_table_spec.rb +0 -28
data/spec/cli_usage_spec.rb +0 -42
data/spec/commands/base_command_spec.rb +0 -107
data/spec/commands/command_factory_spec.rb +0 -76
data/spec/commands/detailed_command_spec.rb +0 -34
data/spec/commands/list_command_spec.rb +0 -28
data/spec/commands/raw_command_spec.rb +0 -69
data/spec/commands/summary_command_spec.rb +0 -34
data/spec/commands/totals_command_spec.rb +0 -34
data/spec/commands/uncovered_command_spec.rb +0 -55
data/spec/commands/validate_command_spec.rb +0 -213
data/spec/commands/version_command_spec.rb +0 -38
data/spec/constants_spec.rb +0 -61
data/spec/cov_loupe/formatters/source_formatter_spec.rb +0 -267
data/spec/cov_loupe/formatters_spec.rb +0 -76
data/spec/cov_loupe/presenters/base_coverage_presenter_spec.rb +0 -79
data/spec/cov_loupe_model_spec.rb +0 -454
data/spec/cov_loupe_module_spec.rb +0 -37
data/spec/cov_loupe_opts_spec.rb +0 -185
data/spec/coverage_reporter_spec.rb +0 -102
data/spec/coverage_table_tool_spec.rb +0 -59
data/spec/coverage_totals_tool_spec.rb +0 -37
data/spec/error_handler_spec.rb +0 -197
data/spec/error_mode_spec.rb +0 -139
data/spec/errors_edge_cases_spec.rb +0 -312
data/spec/errors_stale_spec.rb +0 -83
data/spec/file_based_mcp_tools_spec.rb +0 -99
data/spec/help_tool_spec.rb +0 -26
data/spec/integration_spec.rb +0 -789
data/spec/logging_fallback_spec.rb +0 -128
data/spec/mcp_logging_spec.rb +0 -44
data/spec/mcp_server_integration_spec.rb +0 -23
data/spec/mcp_server_spec.rb +0 -106
data/spec/mode_detector_spec.rb +0 -153
data/spec/model_error_handling_spec.rb +0 -269
data/spec/model_staleness_spec.rb +0 -79
data/spec/option_normalizers_spec.rb +0 -203
data/spec/option_parsers/env_options_parser_spec.rb +0 -221
data/spec/option_parsers/error_helper_spec.rb +0 -222
data/spec/path_relativizer_spec.rb +0 -98
data/spec/presenters/coverage_detailed_presenter_spec.rb +0 -19
data/spec/presenters/coverage_raw_presenter_spec.rb +0 -15
data/spec/presenters/coverage_summary_presenter_spec.rb +0 -15
data/spec/presenters/coverage_uncovered_presenter_spec.rb +0 -16
data/spec/presenters/project_coverage_presenter_spec.rb +0 -87
data/spec/presenters/project_totals_presenter_spec.rb +0 -144
data/spec/resolvers/coverage_line_resolver_spec.rb +0 -282
data/spec/resolvers/resolver_factory_spec.rb +0 -61
data/spec/resolvers/resultset_path_resolver_spec.rb +0 -60
data/spec/resultset_loader_spec.rb +0 -167
data/spec/shared_examples/README.md +0 -115
data/spec/shared_examples/coverage_presenter_examples.rb +0 -66
data/spec/shared_examples/file_based_mcp_tools.rb +0 -179
data/spec/shared_examples/formatted_command_examples.rb +0 -64
data/spec/shared_examples/mcp_tool_text_json_response.rb +0 -16
data/spec/spec_helper.rb +0 -127
data/spec/staleness_checker_spec.rb +0 -374
data/spec/staleness_more_spec.rb +0 -42
data/spec/support/cli_helpers.rb +0 -22
data/spec/support/control_flow_helpers.rb +0 -20
data/spec/support/fake_mcp.rb +0 -40
data/spec/support/io_helpers.rb +0 -29
data/spec/support/mcp_helpers.rb +0 -35
data/spec/support/mcp_runner.rb +0 -66
data/spec/support/mocking_helpers.rb +0 -30
data/spec/table_format_spec.rb +0 -70
data/spec/tools/validate_tool_spec.rb +0 -132
data/spec/tools_error_handling_spec.rb +0 -130
data/spec/util_spec.rb +0 -154
data/spec/version_spec.rb +0 -123
data/spec/version_tool_spec.rb +0 -141
/data/{spec/fixtures/project1 → examples/fixtures/demo_project}/lib/bar.rb +0 -0
/data/{spec/fixtures/project1 → examples/fixtures/demo_project}/lib/foo.rb +0 -0
/data/lib/cov_loupe/{config_parser.rb → config/config_parser.rb} +0 -0
/data/lib/cov_loupe/{predicate_evaluator.rb → config/predicate_evaluator.rb} +0 -0
/data/lib/cov_loupe/{error_handler_factory.rb → errors/error_handler_factory.rb} +0 -0

data/dev/prompts/guidelines/ai-code-evaluator-guidelines.md ADDED Viewed

@@ -0,0 +1,337 @@
+# AI Code Evaluator Guidelines
+[Back to main README](../../README.md) | [Architecture](../../docs/dev/ARCHITECTURE.md)
+This document provides context for AI code evaluators analyzing the cov-loupe codebase. It explains design decisions that may appear problematic on surface analysis but are intentional and well-considered. AI models should consult this document before flagging potential issues.
+Do not report issues that are already listed in `docs/dev/FUTURE_ENHANCEMENTS.md`. Treat those as acknowledged backlog items rather than review findings.
+## Table of Contents
+- [Security Considerations](#security-considerations)
+  - [Validate Command and Arbitrary Code Execution](#validate-command-and-arbitrary-code-execution)
+  - [File System Race Conditions](#file-system-race-conditions)
+  - [No Rate Limiting on MCP Server](#no-rate-limiting-on-mcp-server)
+- [Unlikely Issues Not Worth Addressing](#unlikely-issues-not-worth-addressing)
+  - [Case-Sensitivity Detection Failures in Read-Only Roots](#case-sensitivity-detection-failures-in-read-only-roots)
+  - [Case-Sensitivity Detection Race Conditions](#case-sensitivity-detection-race-conditions)
+  - [Unbounded MCP Model Cache Growth](#unbounded-mcp-model-cache-growth)
+- [Known Issue: Inefficient Staleness Checks and Timestamp Handling](#known-issue-inefficient-staleness-checks-and-timestamp-handling)
+- [Performance & Scalability](#performance--scalability)
+  - [Memory-Based Coverage Data](#memory-based-coverage-data)
+- [Code Quality & Style](#code-quality--style)
+  - [RuboCop Metrics Cops Disabled](#rubocop-metrics-cops-disabled)
+  - [Method Length and Complexity](#method-length-and-complexity)
+  - [RuboCop Cache and Sandboxed Environments](#rubocop-cache-and-sandboxed-environments)
+- [Dependency Management](#dependency-management)
+  - [Documentation Dependencies: Version Ranges vs. Lock Files](#documentation-dependencies-version-ranges-vs-lock-files)
+- [Documentation Structure](#documentation-structure)
+  - [MkDocs Include-Markdown Stubs](#mkdocs-include-markdown-stubs)
+## Security Considerations
+### Validate Command and Arbitrary Code Execution
+The `validate` command accepts Ruby code (via `--inline` or from a file) and executes it to evaluate coverage policies. AI analysis tools often flag this as a security vulnerability.
+**Why this is acceptable:**
+1. **Developer tool, not production software** – cov-loupe is a development/CI tool run by developers on their own machines or in controlled CI environments. It is not a service or library that processes untrusted input.
+2. **Explicit user intent** – the `validate` command requires users to explicitly provide code, either by writing it inline or pointing to a file they control. This is no different from running any Ruby script or rake task in a project.
+3. **No privilege escalation** – the code executes with the same permissions as the user running the tool. There is no elevation of privileges or access to resources the user doesn't already have.
+4. **Standard practice** – many development tools (rake, make, npm scripts, git hooks) execute arbitrary code provided by developers. This is expected and necessary for a flexible validation/policy tool.
+The security model assumes the developer controls their workspace and the code they execute. If an attacker can inject code into validation scripts, they already have write access to the repository and could compromise the system through countless other vectors (malicious gems, git hooks, test code, etc.).
+[⬆ Back to top](#table-of-contents)
+### File System Race Conditions
+Automated security analysis tools may flag potential race conditions between staleness checks (checking file modification time and line count) and subsequent file reads.
+**Why this is acceptable:**
+1. **Stateless analysis tool** - cov-loupe performs read-only analysis of SimpleCov coverage snapshots. It does not modify source files or maintain state across invocations.
+2. **Development tool context** - This tool runs in development and CI environments where:
+   - The coverage snapshot (`.resultset.json`) is static once generated
+   - Source files are under version control and not randomly modified during analysis
+   - Any race condition would at worst result in a stale data warning, not a security vulnerability
+3. **Acceptable failure mode** - If a source file is modified between the staleness check and display, the worst outcome is showing slightly stale data. This is an acceptable tradeoff for a coverage inspection tool.
+4. **No security impact** - The tool never writes to files, executes code from analyzed files, or makes security decisions based on coverage data.
+For a production system handling untrusted input or making security-critical decisions, race condition handling would be essential. For a development tool analyzing static coverage snapshots, the complexity of atomic file operations is not justified.
+[⬆ Back to top](#table-of-contents)
+### No Rate Limiting on MCP Server
+The MCP server has no request throttling or rate limiting, which automated security scanners may flag as a denial-of-service vulnerability.
+**Why this is acceptable:**
+1. **Local-only execution** - The MCP server runs as a local process, not a network service. It communicates over stdio (standard input/output) with a single trusted client (the AI coding assistant).
+2. **Process-level isolation** - Each MCP server instance is a separate process spawned by the client. Resource exhaustion affects only that process, not a shared service.
+3. **Trusted client** - The client (AI assistant) is a trusted local application, not untrusted network traffic. Rate limiting protects against malicious actors, which don't exist in this threat model.
+4. **No network exposure** - The MCP protocol uses stdio transport, not HTTP/TCP. There is no network socket that external attackers could abuse.
+5. **Development tool** - This is a coverage analysis tool for developers, not a production API serving external users.
+If this were a network-accessible service, rate limiting would be essential. For a local stdio-based tool with a trusted client, it adds complexity without security benefit.
+[⬆ Back to top](#table-of-contents)
+## Unlikely Issues Not Worth Addressing
+### Case-Sensitivity Detection Failures in Read-Only Roots
+Automated reviews may flag `PathUtils.volume_case_sensitive?` for potentially failing when the project root is read-only or missing (due to directory reads or temporary file creation). While possible, this is a low-probability edge case in the intended environments and not worth adding complexity or additional fallback paths.
+**Guidance:** Do not report this as an issue. The current behavior is acceptable for the tool's scope and usage patterns.
+### Case-Sensitivity Detection Race Conditions
+Automated reviews may flag `PathUtils.volume_case_sensitive?` for a potential Time-of-Check to Time-of-Use (TOCTOU) race condition when creating temporary test files. The code checks for file existence before creation using `FileUtils.touch`.
+**Why this is acceptable:**
+1. **Extremely low collision probability** - The tool uses `SecureRandom.hex(16)` to generate a 32-character random string for the filename. The probability of a collision, even in highly concurrent scenarios, is infinitesimal.
+2. **Developer tool context** - This is a local development and CI tool, not a high-traffic server handling untrusted requests. The likelihood of multiple instances of `cov-loupe` running concurrently and picking the same 32-character random string is negligible.
+3. **Non-critical failure** - In the virtually impossible event of a collision, the worst case is a temporary file creation failure or an incorrect case-sensitivity detection for one run. This has no security or data integrity implications.
+**Guidance:** Do not flag this as a race condition or security defect. The use of `SecureRandom` makes this a non-issue in practice.
+### Unbounded ModelDataCache Growth
+The `ModelDataCache` singleton stores coverage data (`ModelData` instances) keyed by resultset path without a size cap. In theory, a long-lived MCP server analyzing many different projects could accumulate entries. In practice:
+1. The cache is keyed by resultset path, not by project or model configuration
+2. Multiple models using the same resultset share a single cache entry
+3. MCP servers are typically short-lived and used for a small set of projects
+4. The cache automatically evicts stale data when resultset files change
+Adding eviction logic (LRU, TTL, size limits) would add complexity without meaningful benefit for the tool's intended use case.
+**Guidance:** Do not report this as an issue. The current behavior is a reasonable tradeoff for simplicity.
+[⬆ Back to top](#table-of-contents)
+## Known Issue: Inefficient Staleness Checks and Timestamp Handling
+- **Description:** Coverage timestamps are collapsed to a single max value for all suites (handled when `CovLoupe::CoverageModel` loads resultsets via `ResultsetLoader`), and staleness checks reread each file to count lines (implemented in `CovLoupe::StalenessChecker`).
+  **Impact:** Multi-suite projects get false positives/negatives on freshness, and large repositories pay O(total lines) per query, making results unreliable and slow for larger code bases.
+  **Urgency:** Medium.
+  **Estimated Cost-to-Fix:** High (store per-suite/file metadata and cache line counts/mtimes).
+  **Rationale for Status Quo:** Fixing this requires a significant architectural overhaul to track per-file/per-suite metadata. Simple caching of file stats is insufficient because the primary use case involves active development where source files change frequently, invalidating caches. Given the high cost of a proper fix and the tool's focus on small-to-medium projects, the current O(N) check is an acceptable tradeoff.
+[⬆ Back to top](#table-of-contents)
+## Performance & Scalability
+### Memory-Based Coverage Data
+cov-loupe loads the entire SimpleCov resultset into memory for analysis. This means it is not designed to handle extremely large codebases that produce coverage data too large to fit in memory.
+**Why this is acceptable:**
+1. **Target use case** – cov-loupe is designed for small-to-medium sized Ruby projects. The typical SimpleCov resultset for such projects is measured in megabytes, well within modern system capabilities.
+2. **SimpleCov's own limitations** – SimpleCov itself loads coverage data into memory. If SimpleCov can generate the resultset, cov-loupe can analyze it.
+3. **Performance trade-off** – in-memory processing enables fast queries, rich data transformations, and a responsive CLI. Streaming or database-backed approaches would add significant complexity for marginal benefit in the target use case.
+4. **Practical upper bound** – even large Ruby projects (Rails, GitLab) generate resultsets in the tens of megabytes. Modern machines have gigabytes of RAM. The constraint is theoretical rather than practical for the intended audience.
+If a project grows large enough that coverage analysis becomes a memory bottleneck, it likely has deeper problems (test suite organization, monolith vs services architecture) that should be addressed at that level rather than by adding complexity to a coverage inspection tool.
+[⬆ Back to top](#table-of-contents)
+### Model Cache Digest Reads
+The MCP model cache computes an MD5 digest of the resultset file on each fetch. This can look like unnecessary file I/O, but it is intentional.
+**Why this is acceptable:**
+1. **Correctness over metadata shortcuts** – mtime/size/inode can miss changes (e.g., clock skew, restored files, or filesystem quirks). Hashing is the only reliable way to guarantee the content is unchanged.
+2. **Cheaper than parsing** – reading and hashing the file is far faster than parsing JSON into memory, so the cache still provides meaningful savings.
+3. **Scoped to MCP usage** – this path is used for repeated MCP tool calls where avoiding full parsing is the main goal.
+**Guidance:** Do not flag this as a performance defect; it is a deliberate tradeoff for correctness.
+[⬆ Back to top](#table-of-contents)
+## Code Quality & Style
+### RuboCop Metrics Cops Disabled
+All RuboCop Metrics cops (AbcSize, BlockLength, ClassLength, CyclomaticComplexity, MethodLength, ModuleLength, ParameterLists, PerceivedComplexity, BlockNesting) are intentionally disabled in `.rubocop.yml`.
+**Why this is acceptable:**
+1. **Arbitrary thresholds don't account for domain complexity** – Some problems are inherently complex. SimpleCov coverage analysis involves edge cases (staleness checking, path resolution, multi-suite merging) that require comprehensive logic. Artificial method splitting can scatter cohesive logic and reduce clarity.
+2. **Comprehensive error handling adds necessary lines** – This project prioritizes reliability through extensive error handling with context-rich messages. Error handling code is inherently verbose but critical for user experience across three modes (CLI, library, MCP).
+3. **Quality maintained through other means** – The codebase achieves:
+   - 100% line coverage (1815/1815 lines)
+   - 94% branch coverage
+   - 0 RuboCop violations (all non-Metrics cops)
+   - Comprehensive code review
+   - Clear inline documentation for complex logic
+   - Voluntary file size restraint (most files < 200 lines)
+4. **Readability over arbitrary limits** – The project values clear, cohesive methods over arbitrary line limits. When a method's length accurately reflects its necessary complexity, splitting it just to meet a metric harms rather than helps. Key examples:
+   - `StalenessChecker#compute_file_staleness_details` (30 lines) handles complex edge cases with clear documentation
+   - `CoverageDataProjectStaleError#build_details` (22 lines) builds error messages through simple sequential operations
+**Evidence:** Manual review shows appropriate complexity for domain logic, with no god objects or unclear methods.
+[⬆ Back to top](#table-of-contents)
+### Method Length and Complexity
+AI code analysis tools may flag methods as "too long" based on line count alone. However, **line count is not always a proxy for complexity**. Many long methods in this codebase are intentionally structured for clarity and maintainability.
+**When long methods are acceptable:**
+1. **Sequential data structure building** – Methods that build arrays or hashes through sequential operations are highly readable despite line count:
+   - `CoverageDataProjectStaleError#build_details` (22 lines, lib/cov_loupe/errors.rb:163-184) – builds error message parts in an array through simple, similar operations
+   - This pattern is far clearer than extracting fragmented helper methods
+2. **Comprehensive conditional handling** – Methods with extensive if-elsif-else chains that handle different cases clearly:
+   - `ErrorHandler#convert_runtime_error` (22 lines, lib/cov_loupe/error_handler.rb:103-124) – converts different error types with appropriate context
+   - Each branch is simple; complexity lies in coverage, not algorithmic difficulty
+3. **Well-documented edge case handling** – Methods handling multiple edge cases with clear inline documentation:
+   - `StalenessChecker#compute_file_staleness_details` (30 lines, lib/cov_loupe/staleness_checker.rb:160-190) – handles file existence, timestamps, and line count mismatches with inline explanations
+   - Comments clarify intent; the logic itself remains straightforward
+4. **Coordinated multi-step operations** – Methods that orchestrate several well-defined steps in a logical flow:
+   - While `CoverageModel#list` was previously 48 lines, it has been refactored to 16 lines (lib/cov_loupe/model.rb:80-95) by extracting helper methods
+   - This demonstrates that extraction makes sense when it improves clarity, not to meet arbitrary line limits
+**When to refactor:**
+Long methods warrant refactoring when they exhibit:
+- Deeply nested conditionals (>3 levels)
+- Mixed concerns (I/O, business logic, and presentation in one method)
+- High cyclomatic complexity (many independent decision points)
+- Difficulty understanding purpose or flow even with comments
+**Guidance for AI analysis tools:**
+Before flagging a long method, assess:
+- **What is it doing?** One cohesive thing, or mixing concerns?
+- **Would splitting improve readability?** Or just scatter related logic?
+- **What causes the length?** Data structures, error handling, or algorithmic complexity?
+- **Is it well-documented?** Inline comments can make complex logic clear.
+Method length is a heuristic, not a rule. Judge methods by clarity, cohesion, and testability rather than line count alone.
+[⬆ Back to top](#table-of-contents)
+### RuboCop Cache and Sandboxed Environments
+RuboCop may crash in sandboxed environments (such as AI coding assistants with file system restrictions) when attempting to write cache files:
+```
+Read-only file system @ rb_sysopen
+  → /home/user/.cache/rubocop_cache/...
+  → Parallel.work_in_processes
+```
+**Why this happens:**
+RuboCop runs in parallel mode by default, forking worker processes via the `parallel` gem. Each worker attempts to cache analysis results to `~/.cache/rubocop_cache/`. When sandbox restrictions prevent writes outside the project directory, the cache write fails and crashes the analysis.
+**Why this is not a code quality issue:**
+Running RuboCop with `--cache false` completes successfully with **0 violations**:
+```
+164 files inspected, no offenses detected
+```
+The codebase has perfect RuboCop compliance. The crash is purely environmental.
+**Workaround:**
+Use `bundle exec rubocop --cache false` in sandboxed environments. This adds approximately 5 seconds to execution time (3s → 8s) but ensures successful analysis. Cache performance benefits are modest for this project size, making the tradeoff acceptable.
+**Why caching is not disabled by default:**
+The 3-second speedup is valuable for frequent local development. Developers in non-sandboxed environments (the common case) benefit from faster linting. The issue only affects specific sandboxed AI tools and CI environments, which can use the `--cache false` flag when needed.
+[⬆ Back to top](#table-of-contents)
+## Dependency Management
+### Documentation Dependencies: Version Ranges vs. Lock Files
+The project uses **both** `requirements.txt` (version ranges) and `requirements-lock.txt` (exact pins) for Python documentation build dependencies.
+**Why version ranges are used in `requirements.txt`:**
+1. **Optional dependencies** - These Python packages are only needed for building documentation. They are NOT part of the Ruby gem or required for using cov-loupe as an MCP server.
+2. **Library compatibility** - Contributors may be working on multiple projects with different documentation tooling. Flexible ranges allow them to use compatible versions already in their environment without conflicts.
+3. **Development flexibility** - Local documentation builds should work with any compatible version. Overly strict pinning would create unnecessary friction for contributors.
+**Why lock files are used in CI (`requirements-lock.txt`):**
+1. **Reproducible builds** - CI documentation builds must be deterministic. The same commit should always produce the same documentation output.
+2. **Prevent drift** - Without locked versions, a new minor/patch release could silently change docs rendering, break the build, or introduce bugs.
+3. **Standard practice** - This is the recommended pattern in Python: flexible ranges for development (`requirements.txt`/`requirements.in`), exact pins for deployment (`requirements-lock.txt`).
+**How it works:**
+- Contributors run `pip install -r requirements.txt` locally (flexible)
+- CI runs `pip install -r requirements-lock.txt` (reproducible)
+- Users of the gem/MCP server are unaffected (these are Python-only doc dependencies)
+This dual-file approach is intentional and follows Python packaging best practices for applications with optional documentation tooling.
+[⬆ Back to top](#table-of-contents)
+## Documentation Structure
+### MkDocs Include-Markdown Stubs
+The files `docs/contributing.md` and `docs/code_of_conduct.md` appear to be minimal 46-byte stubs when examined directly. AI code analysis tools often flag these as missing or incomplete documentation.
+**Why this is not an issue:**
+These files use MkDocs' `include-markdown` plugin to pull in comprehensive documentation from the repository root:
+- `docs/contributing.md` → `{% include-markdown "../CONTRIBUTING.md" %}`
+- `docs/code_of_conduct.md` → `{% include-markdown "../CODE_OF_CONDUCT.md" %}`
+The actual comprehensive documentation exists at:
+- `CONTRIBUTING.md` (103 lines) - Full contributing guide with PR workflow, development setup, testing requirements, and release process
+- `CODE_OF_CONDUCT.md` (61 lines) - Complete Contributor Covenant v2.1
+**Why this pattern is used:**
+1. **Single source of truth** - The actual content lives in standard locations (`CONTRIBUTING.md` and `CODE_OF_CONDUCT.md` at repository root) where GitHub, developers, and tools expect to find them.
+2. **Documentation site integration** - MkDocs automatically includes these files in the generated documentation website without duplication or manual synchronization.
+3. **Standard practice** - This is the recommended approach in the MkDocs documentation for including existing project files in the documentation site.
+AI tools analyzing file sizes directly will see 46-byte stubs, but the documentation is complete and properly structured.
+[⬆ Back to top](#table-of-contents)
+---
+*This document should be updated whenever design decisions are made that might appear problematic to automated analysis but are intentional and defensible.*

data/dev/prompts/improve/refactor-test-suite.md ADDED Viewed

@@ -0,0 +1,18 @@
+# Thoroughly Review Test Suite
+Carefully examine the test suite. Report and fix:
+- duplicate tests testing the same thing
+- tests that test the test setup rather than the actual production code
+- verbose tests, e.g.:
+  - multiple calls to `to include` that should be compressed into a single call with a comma separated string list
+  - duplicate test code that can be made more concise by the use of arrays of test data with an `.each`, etc. block
+- complex tests that could be clarified with comments, intermediate variables, extracted methods, etc.
+Ensure that any code changes comply with rubocop linting:
+- run `rubocop` to see if there are any errors. If cache writes fail (e.g., in a sandboxed environment), use `bundle exec rubocop --cache false`.
+- run `rubocop -A` (or `bundle exec rubocop -A --cache false` if needed) to fix anything rubocop is capable of fixing
+- fix the other errors yourself
+Run the test suite as necessary to verify that all tests pass.

data/dev/prompts/improve/simplify-code-logic.md ADDED Viewed

@@ -0,0 +1,133 @@
+# Simplify and Document Code Logic
+**Purpose:** Identify and improve complex, unclear, or surprising code logic through simplification or documentation.
+## When to Use This
+- Code has complex conditionals (>3 levels of nesting)
+- Logic is surprising or differs from standard conventions
+- Methods/functions are difficult to understand
+- Variable or method names are unclear
+- Edge cases lack explanation
+## What to Look For
+### Complexity Indicators
+- **Deep nesting:** Conditionals or loops nested more than 3 levels
+- **Long methods:** Methods that require excessive mental effort to understand
+- **Unclear variable names:** Names that don't clearly indicate purpose
+- **Magic numbers/strings:** Unexplained literal values
+- **Complex boolean expressions:** Compound conditions that are hard to parse
+### Surprising Behavior
+- Logic that differs from typical language/framework conventions
+- Non-obvious side effects
+- Implicit assumptions about state or input
+- Edge case handling that isn't self-evident
+### Missing Context
+- Unclear intent or purpose
+- Inadequate or missing explanatory comments
+- Undocumented edge cases
+- Assumptions that aren't stated
+## Actions to Take
+For each instance of complex or unclear logic:
+### 1. Assess Simplification Potential
+- Can the logic be rewritten more clearly?
+- Would extracting helper methods improve clarity?
+- Can complex conditions be simplified or inverted?
+- Would better variable names help?
+### 2. If Simplification is Possible
+- Refactor to simpler, more readable code
+- Extract helper methods with clear, descriptive names
+- Use early returns to reduce nesting
+- Break complex expressions into named intermediate variables
+- Replace magic values with named constants
+### 3. If Simplification is Not Possible
+- Add clarifying comments explaining the "why"
+- Document edge cases and assumptions
+- Add examples in comments if helpful
+- Explain why simpler approaches won't work
+### 4. Maintain Functionality
+- Add tests if coverage is missing
+- Run existing tests to verify no regressions
+- Follow Rubocop rules
+- Preserve documented design decisions
+## Constraints
+- **Follow guidelines:** Respect decisions documented in `dev/prompts/guidelines/ai-code-evaluator-guidelines.md`
+- **Maintain behavior:** Do not change functionality
+- **Add tests:** Ensure adequate test coverage for any refactored code
+- **Rubocop compliance:** Run `rubocop` (or `rubocop --cache false` in sandboxed environments) when feasible
+- **Preserve intent:** Maintain the original purpose and behavior
+## Examples
+### Before: Complex nested conditionals
+```ruby
+def process_order(order)
+  if order.valid?
+    if order.items.any?
+      if order.payment_method
+        if order.payment_method.authorized?
+          complete_order(order)
+        else
+          reject_order(order, "Payment not authorized")
+        end
+      else
+        reject_order(order, "No payment method")
+      end
+    else
+      reject_order(order, "No items")
+    end
+  else
+    reject_order(order, "Invalid order")
+  end
+end
+```
+### After: Using early returns
+```ruby
+def process_order(order)
+  return reject_order(order, "Invalid order") unless order.valid?
+  return reject_order(order, "No items") if order.items.empty?
+  return reject_order(order, "No payment method") unless order.payment_method
+  return reject_order(order, "Payment not authorized") unless order.payment_method.authorized?
+  complete_order(order)
+end
+```
+### Before: Unclear logic
+```ruby
+def calculate_price(item)
+  # What is 0.8? Why multiply by it?
+  item.base_price * 0.8 if item.category == 3
+end
+```
+### After: Documented with constants
+```ruby
+# Discount rate for clearance items (20% off)
+CLEARANCE_DISCOUNT = 0.8
+CLEARANCE_CATEGORY = 3
+def calculate_price(item)
+  return item.base_price unless item.category == CLEARANCE_CATEGORY
+  item.base_price * CLEARANCE_DISCOUNT
+end
+```
+## Output
+Make changes directly to the code files. No separate report is needed unless you want to summarize the improvements made.
+Do not run `git commit`. If the user asks for a commit, suggest a concise message that explains what was simplified or documented and why.

data/dev/prompts/improve/update-documentation.md ADDED Viewed

@@ -0,0 +1,21 @@
+# Review and Revise Documentation as Necessary
+Examine carefully the Markdown documentation files in:
+- *.md
+- docs/**/*.md
+- docs/user/**/*.md
+- docs/dev/arch-decisions/**/*.md
+Make any changes necessary to make them accurate, clear, and complete.
+Make sure that documents are linked in both directions, e.g. from the top level readme
+to the specialized document, and vice versa.
+Ensure that there is enough verbiage to explain a given point, but not too much as to
+unnecessarily reduce the signal to noise ratio.
+If a given point is addressed in multiple documents, look to see if that is sensible, and fix if necessary.
+Ensure that any code examples are correct and relevant.