RubyGems - data_porter - Versions diffs - 0.1.0 - Mend

data_porter 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (159) hide show

checksums.yaml +7 -0
data/.claude/commands/blog-status.md +10 -0
data/.claude/commands/blog.md +109 -0
data/.claude/commands/task-done.md +27 -0
data/.claude/commands/tm/add-dependency.md +58 -0
data/.claude/commands/tm/add-subtask.md +79 -0
data/.claude/commands/tm/add-task.md +81 -0
data/.claude/commands/tm/analyze-complexity.md +124 -0
data/.claude/commands/tm/analyze-project.md +100 -0
data/.claude/commands/tm/auto-implement-tasks.md +100 -0
data/.claude/commands/tm/command-pipeline.md +80 -0
data/.claude/commands/tm/complexity-report.md +120 -0
data/.claude/commands/tm/convert-task-to-subtask.md +74 -0
data/.claude/commands/tm/expand-all-tasks.md +52 -0
data/.claude/commands/tm/expand-task.md +52 -0
data/.claude/commands/tm/fix-dependencies.md +82 -0
data/.claude/commands/tm/help.md +101 -0
data/.claude/commands/tm/init-project-quick.md +49 -0
data/.claude/commands/tm/init-project.md +53 -0
data/.claude/commands/tm/install-taskmaster.md +118 -0
data/.claude/commands/tm/learn.md +106 -0
data/.claude/commands/tm/list-tasks-by-status.md +42 -0
data/.claude/commands/tm/list-tasks-with-subtasks.md +30 -0
data/.claude/commands/tm/list-tasks.md +46 -0
data/.claude/commands/tm/next-task.md +69 -0
data/.claude/commands/tm/parse-prd-with-research.md +51 -0
data/.claude/commands/tm/parse-prd.md +52 -0
data/.claude/commands/tm/project-status.md +67 -0
data/.claude/commands/tm/quick-install-taskmaster.md +23 -0
data/.claude/commands/tm/remove-all-subtasks.md +94 -0
data/.claude/commands/tm/remove-dependency.md +65 -0
data/.claude/commands/tm/remove-subtask.md +87 -0
data/.claude/commands/tm/remove-subtasks.md +89 -0
data/.claude/commands/tm/remove-task.md +110 -0
data/.claude/commands/tm/setup-models.md +52 -0
data/.claude/commands/tm/show-task.md +85 -0
data/.claude/commands/tm/smart-workflow.md +58 -0
data/.claude/commands/tm/sync-readme.md +120 -0
data/.claude/commands/tm/tm-main.md +147 -0
data/.claude/commands/tm/to-cancelled.md +58 -0
data/.claude/commands/tm/to-deferred.md +50 -0
data/.claude/commands/tm/to-done.md +47 -0
data/.claude/commands/tm/to-in-progress.md +39 -0
data/.claude/commands/tm/to-pending.md +35 -0
data/.claude/commands/tm/to-review.md +43 -0
data/.claude/commands/tm/update-single-task.md +122 -0
data/.claude/commands/tm/update-task.md +75 -0
data/.claude/commands/tm/update-tasks-from-id.md +111 -0
data/.claude/commands/tm/validate-dependencies.md +72 -0
data/.claude/commands/tm/view-models.md +52 -0
data/.env.example +12 -0
data/.mcp.json +24 -0
data/.taskmaster/CLAUDE.md +435 -0
data/.taskmaster/config.json +44 -0
data/.taskmaster/docs/prd.txt +2044 -0
data/.taskmaster/state.json +6 -0
data/.taskmaster/tasks/task_001.md +19 -0
data/.taskmaster/tasks/task_002.md +19 -0
data/.taskmaster/tasks/task_003.md +19 -0
data/.taskmaster/tasks/task_004.md +19 -0
data/.taskmaster/tasks/task_005.md +19 -0
data/.taskmaster/tasks/task_006.md +19 -0
data/.taskmaster/tasks/task_007.md +19 -0
data/.taskmaster/tasks/task_008.md +19 -0
data/.taskmaster/tasks/task_009.md +19 -0
data/.taskmaster/tasks/task_010.md +19 -0
data/.taskmaster/tasks/task_011.md +19 -0
data/.taskmaster/tasks/task_012.md +19 -0
data/.taskmaster/tasks/task_013.md +19 -0
data/.taskmaster/tasks/task_014.md +19 -0
data/.taskmaster/tasks/task_015.md +19 -0
data/.taskmaster/tasks/task_016.md +19 -0
data/.taskmaster/tasks/task_017.md +19 -0
data/.taskmaster/tasks/task_018.md +19 -0
data/.taskmaster/tasks/task_019.md +19 -0
data/.taskmaster/tasks/task_020.md +19 -0
data/.taskmaster/tasks/tasks.json +299 -0
data/.taskmaster/templates/example_prd.txt +47 -0
data/.taskmaster/templates/example_prd_rpg.txt +511 -0
data/CHANGELOG.md +29 -0
data/CLAUDE.md +65 -0
data/CODE_OF_CONDUCT.md +10 -0
data/CONTRIBUTING.md +49 -0
data/LICENSE +21 -0
data/README.md +463 -0
data/Rakefile +12 -0
data/app/assets/stylesheets/data_porter/application.css +646 -0
data/app/channels/data_porter/import_channel.rb +10 -0
data/app/controllers/data_porter/imports_controller.rb +68 -0
data/app/javascript/data_porter/progress_controller.js +33 -0
data/app/jobs/data_porter/dry_run_job.rb +12 -0
data/app/jobs/data_porter/import_job.rb +12 -0
data/app/jobs/data_porter/parse_job.rb +12 -0
data/app/models/data_porter/data_import.rb +49 -0
data/app/views/data_porter/imports/index.html.erb +142 -0
data/app/views/data_porter/imports/new.html.erb +88 -0
data/app/views/data_porter/imports/show.html.erb +49 -0
data/config/database.yml +3 -0
data/config/routes.rb +12 -0
data/docs/SPEC.md +2012 -0
data/docs/UI.md +32 -0
data/docs/blog/001-why-build-a-data-import-engine.md +166 -0
data/docs/blog/002-scaffolding-a-rails-engine.md +188 -0
data/docs/blog/003-configuration-dsl.md +222 -0
data/docs/blog/004-store-model-jsonb.md +237 -0
data/docs/blog/005-target-dsl.md +284 -0
data/docs/blog/006-parsing-csv-sources.md +300 -0
data/docs/blog/007-orchestrator.md +247 -0
data/docs/blog/008-actioncable-stimulus.md +376 -0
data/docs/blog/009-phlex-ui-components.md +446 -0
data/docs/blog/010-controllers-routing.md +374 -0
data/docs/blog/011-generators.md +364 -0
data/docs/blog/012-json-api-sources.md +323 -0
data/docs/blog/013-testing-rails-engine.md +618 -0
data/docs/blog/014-dry-run.md +307 -0
data/docs/blog/015-publishing-retro.md +264 -0
data/docs/blog/016-erb-view-templates.md +431 -0
data/docs/blog/017-showcase-final-retro.md +220 -0
data/docs/blog/BACKLOG.md +8 -0
data/docs/blog/SERIES.md +154 -0
data/docs/screenshots/index-with-previewing.jpg +0 -0
data/docs/screenshots/index.jpg +0 -0
data/docs/screenshots/modal-new-import.jpg +0 -0
data/docs/screenshots/preview.jpg +0 -0
data/lib/data_porter/broadcaster.rb +29 -0
data/lib/data_porter/components/base.rb +10 -0
data/lib/data_porter/components/failure_alert.rb +20 -0
data/lib/data_porter/components/preview_table.rb +54 -0
data/lib/data_porter/components/progress_bar.rb +33 -0
data/lib/data_porter/components/results_summary.rb +19 -0
data/lib/data_porter/components/status_badge.rb +16 -0
data/lib/data_porter/components/summary_cards.rb +30 -0
data/lib/data_porter/components.rb +14 -0
data/lib/data_porter/configuration.rb +25 -0
data/lib/data_porter/dsl/api_config.rb +25 -0
data/lib/data_porter/dsl/column.rb +17 -0
data/lib/data_porter/engine.rb +15 -0
data/lib/data_porter/orchestrator.rb +141 -0
data/lib/data_porter/record_validator.rb +32 -0
data/lib/data_porter/registry.rb +33 -0
data/lib/data_porter/sources/api.rb +49 -0
data/lib/data_porter/sources/base.rb +35 -0
data/lib/data_porter/sources/csv.rb +43 -0
data/lib/data_porter/sources/json.rb +45 -0
data/lib/data_porter/sources.rb +20 -0
data/lib/data_porter/store_models/error.rb +13 -0
data/lib/data_porter/store_models/import_record.rb +52 -0
data/lib/data_porter/store_models/report.rb +21 -0
data/lib/data_porter/target.rb +89 -0
data/lib/data_porter/type_validator.rb +46 -0
data/lib/data_porter/version.rb +5 -0
data/lib/data_porter.rb +32 -0
data/lib/generators/data_porter/install/install_generator.rb +33 -0
data/lib/generators/data_porter/install/templates/create_data_porter_imports.rb.erb +21 -0
data/lib/generators/data_porter/install/templates/initializer.rb +30 -0
data/lib/generators/data_porter/target/target_generator.rb +44 -0
data/lib/generators/data_porter/target/templates/target.rb.tt +20 -0
data/sig/data_porter.rbs +4 -0
metadata +274 -0

data/.taskmaster/templates/example_prd_rpg.txt ADDED Viewed

@@ -0,0 +1,511 @@
+<rpg-method>
+# Repository Planning Graph (RPG) Method - PRD Template
+This template teaches you (AI or human) how to create structured, dependency-aware PRDs using the RPG methodology from Microsoft Research. The key insight: separate WHAT (functional) from HOW (structural), then connect them with explicit dependencies.
+## Core Principles
+1. **Dual-Semantics**: Think functional (capabilities) AND structural (code organization) separately, then map them
+2. **Explicit Dependencies**: Never assume - always state what depends on what
+3. **Topological Order**: Build foundation first, then layers on top
+4. **Progressive Refinement**: Start broad, refine iteratively
+## How to Use This Template
+- Follow the instructions in each `<instruction>` block
+- Look at `<example>` blocks to see good vs bad patterns
+- Fill in the content sections with your project details
+- The AI reading this will learn the RPG method by following along
+- Task Master will parse the resulting PRD into dependency-aware tasks
+## Recommended Tools for Creating PRDs
+When using this template to **create** a PRD (not parse it), use **code-context-aware AI assistants** for best results:
+**Why?** The AI needs to understand your existing codebase to make good architectural decisions about modules, dependencies, and integration points.
+**Recommended tools:**
+- **Claude Code** (claude-code CLI) - Best for structured reasoning and large contexts
+- **Cursor/Windsurf** - IDE integration with full codebase context
+- **Gemini CLI** (gemini-cli) - Massive context window for large codebases
+- **Codex/Grok CLI** - Strong code generation with context awareness
+**Note:** Once your PRD is created, `task-master parse-prd` works with any configured AI model - it just needs to read the PRD text itself, not your codebase.
+</rpg-method>
+---
+<overview>
+<instruction>
+Start with the problem, not the solution. Be specific about:
+- What pain point exists?
+- Who experiences it?
+- Why existing solutions don't work?
+- What success looks like (measurable outcomes)?
+Keep this section focused - don't jump into implementation details yet.
+</instruction>
+## Problem Statement
+[Describe the core problem. Be concrete about user pain points.]
+## Target Users
+[Define personas, their workflows, and what they're trying to achieve.]
+## Success Metrics
+[Quantifiable outcomes. Examples: "80% task completion via autopilot", "< 5% manual intervention rate"]
+</overview>
+---
+<functional-decomposition>
+<instruction>
+Now think about CAPABILITIES (what the system DOES), not code structure yet.
+Step 1: Identify high-level capability domains
+- Think: "What major things does this system do?"
+- Examples: Data Management, Core Processing, Presentation Layer
+Step 2: For each capability, enumerate specific features
+- Use explore-exploit strategy:
+  * Exploit: What features are REQUIRED for core value?
+  * Explore: What features make this domain COMPLETE?
+Step 3: For each feature, define:
+- Description: What it does in one sentence
+- Inputs: What data/context it needs
+- Outputs: What it produces/returns
+- Behavior: Key logic or transformations
+<example type="good">
+Capability: Data Validation
+  Feature: Schema validation
+    - Description: Validate JSON payloads against defined schemas
+    - Inputs: JSON object, schema definition
+    - Outputs: Validation result (pass/fail) + error details
+    - Behavior: Iterate fields, check types, enforce constraints
+  Feature: Business rule validation
+    - Description: Apply domain-specific validation rules
+    - Inputs: Validated data object, rule set
+    - Outputs: Boolean + list of violated rules
+    - Behavior: Execute rules sequentially, short-circuit on failure
+</example>
+<example type="bad">
+Capability: validation.js
+  (Problem: This is a FILE, not a CAPABILITY. Mixing structure into functional thinking.)
+Capability: Validation
+  Feature: Make sure data is good
+  (Problem: Too vague. No inputs/outputs. Not actionable.)
+</example>
+</instruction>
+## Capability Tree
+### Capability: [Name]
+[Brief description of what this capability domain covers]
+#### Feature: [Name]
+- **Description**: [One sentence]
+- **Inputs**: [What it needs]
+- **Outputs**: [What it produces]
+- **Behavior**: [Key logic]
+#### Feature: [Name]
+- **Description**:
+- **Inputs**:
+- **Outputs**:
+- **Behavior**:
+### Capability: [Name]
+...
+</functional-decomposition>
+---
+<structural-decomposition>
+<instruction>
+NOW think about code organization. Map capabilities to actual file/folder structure.
+Rules:
+1. Each capability maps to a module (folder or file)
+2. Features within a capability map to functions/classes
+3. Use clear module boundaries - each module has ONE responsibility
+4. Define what each module exports (public interface)
+The goal: Create a clear mapping between "what it does" (functional) and "where it lives" (structural).
+<example type="good">
+Capability: Data Validation
+  → Maps to: src/validation/
+    ├── schema-validator.js      (Schema validation feature)
+    ├── rule-validator.js         (Business rule validation feature)
+    └── index.js                  (Public exports)
+Exports:
+  - validateSchema(data, schema)
+  - validateRules(data, rules)
+</example>
+<example type="bad">
+Capability: Data Validation
+  → Maps to: src/utils.js
+  (Problem: "utils" is not a clear module boundary. Where do I find validation logic?)
+Capability: Data Validation
+  → Maps to: src/validation/everything.js
+  (Problem: One giant file. Features should map to separate files for maintainability.)
+</example>
+</instruction>
+## Repository Structure
+```
+project-root/
+├── src/
+│   ├── [module-name]/       # Maps to: [Capability Name]
+│   │   ├── [file].js        # Maps to: [Feature Name]
+│   │   └── index.js         # Public exports
+│   └── [module-name]/
+├── tests/
+└── docs/
+```
+## Module Definitions
+### Module: [Name]
+- **Maps to capability**: [Capability from functional decomposition]
+- **Responsibility**: [Single clear purpose]
+- **File structure**:
+  ```
+  module-name/
+  ├── feature1.js
+  ├── feature2.js
+  └── index.js
+  ```
+- **Exports**:
+  - `functionName()` - [what it does]
+  - `ClassName` - [what it does]
+</structural-decomposition>
+---
+<dependency-graph>
+<instruction>
+This is THE CRITICAL SECTION for Task Master parsing.
+Define explicit dependencies between modules. This creates the topological order for task execution.
+Rules:
+1. List modules in dependency order (foundation first)
+2. For each module, state what it depends on
+3. Foundation modules should have NO dependencies
+4. Every non-foundation module should depend on at least one other module
+5. Think: "What must EXIST before I can build this module?"
+<example type="good">
+Foundation Layer (no dependencies):
+  - error-handling: No dependencies
+  - config-manager: No dependencies
+  - base-types: No dependencies
+Data Layer:
+  - schema-validator: Depends on [base-types, error-handling]
+  - data-ingestion: Depends on [schema-validator, config-manager]
+Core Layer:
+  - algorithm-engine: Depends on [base-types, error-handling]
+  - pipeline-orchestrator: Depends on [algorithm-engine, data-ingestion]
+</example>
+<example type="bad">
+- validation: Depends on API
+- API: Depends on validation
+(Problem: Circular dependency. This will cause build/runtime issues.)
+- user-auth: Depends on everything
+(Problem: Too many dependencies. Should be more focused.)
+</example>
+</instruction>
+## Dependency Chain
+### Foundation Layer (Phase 0)
+No dependencies - these are built first.
+- **[Module Name]**: [What it provides]
+- **[Module Name]**: [What it provides]
+### [Layer Name] (Phase 1)
+- **[Module Name]**: Depends on [[module-from-phase-0], [module-from-phase-0]]
+- **[Module Name]**: Depends on [[module-from-phase-0]]
+### [Layer Name] (Phase 2)
+- **[Module Name]**: Depends on [[module-from-phase-1], [module-from-foundation]]
+[Continue building up layers...]
+</dependency-graph>
+---
+<implementation-roadmap>
+<instruction>
+Turn the dependency graph into concrete development phases.
+Each phase should:
+1. Have clear entry criteria (what must exist before starting)
+2. Contain tasks that can be parallelized (no inter-dependencies within phase)
+3. Have clear exit criteria (how do we know phase is complete?)
+4. Build toward something USABLE (not just infrastructure)
+Phase ordering follows topological sort of dependency graph.
+<example type="good">
+Phase 0: Foundation
+  Entry: Clean repository
+  Tasks:
+    - Implement error handling utilities
+    - Create base type definitions
+    - Setup configuration system
+  Exit: Other modules can import foundation without errors
+Phase 1: Data Layer
+  Entry: Phase 0 complete
+  Tasks:
+    - Implement schema validator (uses: base types, error handling)
+    - Build data ingestion pipeline (uses: validator, config)
+  Exit: End-to-end data flow from input to validated output
+</example>
+<example type="bad">
+Phase 1: Build Everything
+  Tasks:
+    - API
+    - Database
+    - UI
+    - Tests
+  (Problem: No clear focus. Too broad. Dependencies not considered.)
+</example>
+</instruction>
+## Development Phases
+### Phase 0: [Foundation Name]
+**Goal**: [What foundational capability this establishes]
+**Entry Criteria**: [What must be true before starting]
+**Tasks**:
+- [ ] [Task name] (depends on: [none or list])
+  - Acceptance criteria: [How we know it's done]
+  - Test strategy: [What tests prove it works]
+- [ ] [Task name] (depends on: [none or list])
+**Exit Criteria**: [Observable outcome that proves phase complete]
+**Delivers**: [What can users/developers do after this phase?]
+---
+### Phase 1: [Layer Name]
+**Goal**:
+**Entry Criteria**: Phase 0 complete
+**Tasks**:
+- [ ] [Task name] (depends on: [[tasks-from-phase-0]])
+- [ ] [Task name] (depends on: [[tasks-from-phase-0]])
+**Exit Criteria**:
+**Delivers**:
+---
+[Continue with more phases...]
+</implementation-roadmap>
+---
+<test-strategy>
+<instruction>
+Define how testing will be integrated throughout development (TDD approach).
+Specify:
+1. Test pyramid ratios (unit vs integration vs e2e)
+2. Coverage requirements
+3. Critical test scenarios
+4. Test generation guidelines for Surgical Test Generator
+This section guides the AI when generating tests during the RED phase of TDD.
+<example type="good">
+Critical Test Scenarios for Data Validation module:
+  - Happy path: Valid data passes all checks
+  - Edge cases: Empty strings, null values, boundary numbers
+  - Error cases: Invalid types, missing required fields
+  - Integration: Validator works with ingestion pipeline
+</example>
+</instruction>
+## Test Pyramid
+```
+        /\
+       /E2E\       ← [X]% (End-to-end, slow, comprehensive)
+      /------\
+     /Integration\ ← [Y]% (Module interactions)
+    /------------\
+   /  Unit Tests  \ ← [Z]% (Fast, isolated, deterministic)
+  /----------------\
+```
+## Coverage Requirements
+- Line coverage: [X]% minimum
+- Branch coverage: [X]% minimum
+- Function coverage: [X]% minimum
+- Statement coverage: [X]% minimum
+## Critical Test Scenarios
+### [Module/Feature Name]
+**Happy path**:
+- [Scenario description]
+- Expected: [What should happen]
+**Edge cases**:
+- [Scenario description]
+- Expected: [What should happen]
+**Error cases**:
+- [Scenario description]
+- Expected: [How system handles failure]
+**Integration points**:
+- [What interactions to test]
+- Expected: [End-to-end behavior]
+## Test Generation Guidelines
+[Specific instructions for Surgical Test Generator about what to focus on, what patterns to follow, project-specific test conventions]
+</test-strategy>
+---
+<architecture>
+<instruction>
+Describe technical architecture, data models, and key design decisions.
+Keep this section AFTER functional/structural decomposition - implementation details come after understanding structure.
+</instruction>
+## System Components
+[Major architectural pieces and their responsibilities]
+## Data Models
+[Core data structures, schemas, database design]
+## Technology Stack
+[Languages, frameworks, key libraries]
+**Decision: [Technology/Pattern]**
+- **Rationale**: [Why chosen]
+- **Trade-offs**: [What we're giving up]
+- **Alternatives considered**: [What else we looked at]
+</architecture>
+---
+<risks>
+<instruction>
+Identify risks that could derail development and how to mitigate them.
+Categories:
+- Technical risks (complexity, unknowns)
+- Dependency risks (blocking issues)
+- Scope risks (creep, underestimation)
+</instruction>
+## Technical Risks
+**Risk**: [Description]
+- **Impact**: [High/Medium/Low - effect on project]
+- **Likelihood**: [High/Medium/Low]
+- **Mitigation**: [How to address]
+- **Fallback**: [Plan B if mitigation fails]
+## Dependency Risks
+[External dependencies, blocking issues]
+## Scope Risks
+[Scope creep, underestimation, unclear requirements]
+</risks>
+---
+<appendix>
+## References
+[Papers, documentation, similar systems]
+## Glossary
+[Domain-specific terms]
+## Open Questions
+[Things to resolve during development]
+</appendix>
+---
+<task-master-integration>
+# How Task Master Uses This PRD
+When you run `task-master parse-prd <file>.txt`, the parser:
+1. **Extracts capabilities** → Main tasks
+   - Each `### Capability:` becomes a top-level task
+2. **Extracts features** → Subtasks
+   - Each `#### Feature:` becomes a subtask under its capability
+3. **Parses dependencies** → Task dependencies
+   - `Depends on: [X, Y]` sets task.dependencies = ["X", "Y"]
+4. **Orders by phases** → Task priorities
+   - Phase 0 tasks = highest priority
+   - Phase N tasks = lower priority, properly sequenced
+5. **Uses test strategy** → Test generation context
+   - Feeds test scenarios to Surgical Test Generator during implementation
+**Result**: A dependency-aware task graph that can be executed in topological order.
+## Why RPG Structure Matters
+Traditional flat PRDs lead to:
+- ❌ Unclear task dependencies
+- ❌ Arbitrary task ordering
+- ❌ Circular dependencies discovered late
+- ❌ Poorly scoped tasks
+RPG-structured PRDs provide:
+- ✅ Explicit dependency chains
+- ✅ Topological execution order
+- ✅ Clear module boundaries
+- ✅ Validated task graph before implementation
+## Tips for Best Results
+1. **Spend time on dependency graph** - This is the most valuable section for Task Master
+2. **Keep features atomic** - Each feature should be independently testable
+3. **Progressive refinement** - Start broad, use `task-master expand` to break down complex tasks
+4. **Use research mode** - `task-master parse-prd --research` leverages AI for better task generation
+</task-master-integration>

data/CHANGELOG.md ADDED Viewed

@@ -0,0 +1,29 @@
+# Changelog
+All notable changes to this project will be documented in this file.
+The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
+and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [Unreleased]
+## [0.1.0] - 2026-02-06
+### Added
+- **Target DSL** -- Declarative class-level DSL (`label`, `model_name`, `columns`, `csv_mapping`, `deduplicate_by`, `dry_run_enabled`) with auto-registration via `Registry`
+- **CSV source** -- Parse CSV files via ActiveStorage with header mapping and custom separators
+- **JSON source** -- Parse JSON files with configurable `json_root` path extraction
+- **API source** -- Fetch records from HTTP endpoints with dynamic `endpoint` and `headers` lambdas and `response_root` extraction
+- **Orchestrator** -- Coordinates parse, import, and dry run workflows with per-record error handling
+- **Dry run mode** -- Transaction-based validation that rolls back after testing all records against the database
+- **Real-time progress** -- ActionCable broadcaster with Stimulus controller for live progress updates
+- **Phlex UI components** -- StatusBadge, SummaryCards, PreviewTable, ProgressBar, ResultsSummary, FailureAlert (pure Ruby, no phlex-rails dependency)
+- **ERB view templates** -- Index (with modal form and dropzone), new, and show pages composing Phlex components via `.call`
+- **Plain CSS stylesheet** -- `dp-*` prefixed classes with CSS custom properties (`--dp-*`) for theming, auto-precompiled via Sprockets
+- **StoreModel JSONB columns** -- ImportRecord, Error, and Report models stored as JSONB on the DataImport model
+- **Install generator** -- Creates migration, initializer, routes mount, and `app/importers/` directory
+- **Target generator** -- Scaffolds target classes with column parsing from CLI arguments
+- **Configuration DSL** -- `DataPorter.configure` block with `parent_controller`, `queue_name`, `storage_service`, `cable_channel_prefix`, `context_builder`, `preview_limit`, `enabled_sources`
+- **ActiveJob integration** -- ParseJob, ImportJob, DryRunJob with configurable queue name
+- **221 RSpec examples** covering models, sources, orchestrator, jobs, channels, components, controllers, routes, generators, and views

data/CLAUDE.md ADDED Viewed

@@ -0,0 +1,65 @@
+# CLAUDE.md
+## Project
+DataPorter - Mountable Rails engine for 3-step data import workflows.
+## Stack
+- Ruby >= 3.2, Rails >= 7.0
+- store_model, phlex, turbo-rails, stimulus
+- Tailwind CSS (prefixed `dp-`, scoped `.data-porter`)
+- RSpec for testing
+## Language
+- ALL code, comments, commits, docs, specs, error messages in English
+- NO French anywhere in the codebase
+## Conventions
+- NO COMMENTS in generated code
+- Conventional Commits (feat, fix, test, refactor, chore, docs)
+- Frozen string literals (`# frozen_string_literal: true` in every .rb file)
+## Development Constraints
+### TDD
+- Always write specs BEFORE implementation code
+- Red -> Green -> Refactor cycle
+- Run `bundle exec rspec` to validate before moving on
+### Code Quality
+- One file = one class/module
+- Max 10 lines per method (excluding private keyword lines)
+- Single Responsibility Principle: each class does one thing
+- No `class_eval`, no monkey-patching
+- No implicit dependencies between modules (explicit requires)
+- Everything namespaced under `DataPorter::`
+- Run `bundle exec rubocop` before every commit
+### Commits
+- Small, focused commits (one concern per commit)
+- Never commit large chunks of unrelated code together
+- Each commit should pass specs and rubocop
+### Design Principles
+- Balance simplicity and extensibility: simple code that can evolve
+- The gem MUST remain business-agnostic (no domain logic, no hardcoded model names)
+- All business logic belongs in Targets defined by the host app
+- Prefer composition over inheritance
+- Expose hooks and configuration, not internal state
+## Blog Series Automation
+After completing a task, ALWAYS check `docs/blog/SERIES.md` to see if all tasks
+for a blog part are now done. If yes:
+1. Immediately generate the article draft following the `/blog` command process
+2. Update `docs/blog/BACKLOG.md` and `docs/blog/SERIES.md`
+3. Commit the draft, then resume development
+## Architecture
+See docs/SPEC.md for full specification.
+## Commands
+- `bundle exec rspec` - run tests
+- `bundle exec rubocop` - lint
+## Task Master AI Instructions
+**Import Task Master's development workflow commands and guidelines, treat as if import is in the main CLAUDE.md file.**
+@./.taskmaster/CLAUDE.md

data/CODE_OF_CONDUCT.md ADDED Viewed

@@ -0,0 +1,10 @@
+# Code of Conduct
+"data_porter" follows [The Ruby Community Conduct Guideline](https://www.ruby-lang.org/en/conduct) in all "collaborative space", which is defined as community communications channels (such as mailing lists, submitted patches, commit comments, etc.):
+* Participants will be tolerant of opposing views.
+* Participants must ensure that their language and actions are free of personal attacks and disparaging personal remarks.
+* When interpreting the words and actions of others, participants should always assume good intentions.
+* Behaviour which can be reasonably considered harassment will not be tolerated.
+If you have any concerns about behaviour within this project, please contact us at ["seryllounis@outlook.fr"](mailto:"seryllounis@outlook.fr").

data/CONTRIBUTING.md ADDED Viewed

@@ -0,0 +1,49 @@
+# Contributing to DataPorter
+Thank you for considering contributing to DataPorter!
+## Bug Reports
+Open an issue on [GitHub](https://github.com/SerylLns/data_porter/issues) with:
+- Ruby and Rails versions
+- Steps to reproduce
+- Expected vs actual behavior
+- Relevant logs or error messages
+## Pull Requests
+1. Fork the repo and create your branch from `main`
+2. Write specs first (TDD -- red, green, refactor)
+3. Ensure `bundle exec rspec` passes (221+ examples, 0 failures)
+4. Ensure `bundle exec rubocop` passes (0 offenses)
+5. One concern per commit, using [Conventional Commits](https://www.conventionalcommits.org/) (`feat:`, `fix:`, `test:`, `refactor:`, `chore:`, `docs:`)
+6. Open a PR against `main`
+## Development Setup
+```bash
+git clone https://github.com/SerylLns/data_porter.git
+cd data_porter
+bin/setup
+bundle exec rspec
+bundle exec rubocop
+```
+## Code Style
+- `# frozen_string_literal: true` in every `.rb` file
+- Max 10 lines per method
+- No comments in code -- code should be self-explanatory
+- Everything namespaced under `DataPorter::`
+- All English (code, commits, docs, specs, error messages)
+- `dp-` CSS prefix, scoped under `.data-porter`
+## Architecture
+- The gem must remain **business-agnostic** -- no domain logic, no hardcoded model names
+- All business logic belongs in Targets defined by the host app
+- Prefer composition over inheritance
+- Expose hooks and configuration, not internal state
+See [docs/SPEC.md](docs/SPEC.md) for the full specification.

data/LICENSE ADDED Viewed

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2026 Seryl Lounis
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.