PyPI - dataflow-cv - Versions diffs - 0.3.0__tar.gz → 0.4.0__tar.gz - Mend

dataflow-cv 0.3.0tar.gz → 0.4.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (30) hide show

{dataflow_cv-0.3.0/dataflow_cv.egg-info → dataflow_cv-0.4.0}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: dataflow-cv
-Version: 0.3.0
+Version: 0.4.0
 Summary: A data processing library for computer vision datasets
 Home-page: https://github.com/zjykzj/DataFlow-CV
 Author: DataFlow Team
@@ -34,10 +34,8 @@ Dynamic: requires-python
 > **Where Vibe Coding meets CV data.** 🌊
 > Convert & visualize datasets. Built with the flow of Claude Code.
-![Python Version](https://img.shields.io/badge/python-3.8%20|%203.9%20|%203.10%20|%203.11%20|%203.12-blue)
-![License](https://img.shields.io/badge/license-MIT-green)
-![Version](https://img.shields.io/badge/version-0.3.0-orange)
-![Development Status](https://img.shields.io/badge/status-alpha-yellow)
+![Python Version](https://img.shields.io/badge/python-3.8%20|%203.9%20|%203.10%20|%203.11%20|%203.12-blue) ![License](https://img.shields.io/badge/license-MIT-green) [![PyPI](https://img.shields.io/pypi/v/dataflow-cv.svg)](https://pypi.org/project/dataflow-cv/) ![Development Status](https://img.shields.io/badge/status-alpha-yellow) [![GitHub Actions](https://github.com/zjykzj/DataFlow-CV/actions/workflows/python-publish.yml/badge.svg)](https://github.com/zjykzj/DataFlow-CV/actions/workflows/python-publish.yml)
 A data processing library for computer vision datasets, focusing on format conversion and visualization between LabelMe, COCO, and YOLO formats. Provides both a CLI and Python API.
@@ -50,6 +48,8 @@ A data processing library for computer vision datasets, focusing on format conve
     - [Core Dependencies](#core-dependencies)
   - [Quick Start](#quick-start)
     - [Installation](#installation)
+      - [Editable Installation (Development Mode)](#editable-installation-development-mode)
+      - [Build System](#build-system)
     - [Command Line Usage](#command-line-usage)
     - [Python API Usage](#python-api-usage)
     - [CLI Reference](#cli-reference)
@@ -61,6 +61,7 @@ A data processing library for computer vision datasets, focusing on format conve
     - [Segmentation Support](#segmentation-support)
     - [Running Tests](#running-tests)
     - [Examples](#examples)
+    - [Documentation](#documentation)
   - [License](#license)
 ## Project Structure
@@ -79,6 +80,7 @@ dataflow/
 ├── visualize/               # Annotation visualization module
 │   ├── __init__.py
 │   ├── base.py            # Visualizer base class
+│   ├── generic.py         # Generic visualizer base class using label handlers
 │   ├── yolo.py            # YOLO annotation visualizer
 │   ├── coco.py            # COCO annotation visualizer
 │   └── labelme.py         # LabelMe annotation visualizer
@@ -92,7 +94,11 @@ tests/
 ├── convert/                # Conversion tests
 │   ├── __init__.py
 │   ├── test_coco_to_yolo.py
-│   └── test_yolo_to_coco.py
+│   ├── test_yolo_to_coco.py
+│   ├── test_coco_to_labelme.py
+│   ├── test_labelme_to_coco.py
+│   ├── test_labelme_to_yolo.py
+│   └── test_yolo_to_labelme.py
 ├── visualize/              # Visualization tests
 │   ├── __init__.py
 │   ├── test_yolo.py
@@ -130,6 +136,11 @@ samples/
         ├── api_yolo.py
         ├── api_coco.py
         └── api_labelme.py
+docs/                       # Data format documentation
+├── README.md              # Documentation index
+├── yolo.md                # YOLO format specification
+├── labelme.md             # LabelMe format specification
+└── coco.md                # COCO format specification
 ```
 ## Requirements
@@ -468,6 +479,20 @@ Check the `samples/` directory for detailed usage examples:
 - `samples/api/convert/` - Python API conversion examples
 - `samples/api/visualize/` - Python API visualization examples
+### Documentation
+Detailed data format specifications are available in the `docs/` directory:
+- [`docs/README.md`](docs/README.md) - Documentation index
+- [`docs/yolo.md`](docs/yolo.md) - YOLO format specification
+- [`docs/labelme.md`](docs/labelme.md) - LabelMe format specification
+- [`docs/coco.md`](docs/coco.md) - COCO format specification
+These documents describe the annotation formats supported by DataFlow-CV, without covering tool usage.
+## Development
+For development guidelines, architecture details, and contribution instructions, see [CLAUDE.md](CLAUDE.md). This file provides guidance for working with the codebase, including common development commands, architectural patterns, and writing principles.
 ## License
 [MIT License](LICENSE) © 2026 zjykzj

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/README.md RENAMED Viewed

@@ -3,10 +3,8 @@
 > **Where Vibe Coding meets CV data.** 🌊
 > Convert & visualize datasets. Built with the flow of Claude Code.
-![Python Version](https://img.shields.io/badge/python-3.8%20|%203.9%20|%203.10%20|%203.11%20|%203.12-blue)
-![License](https://img.shields.io/badge/license-MIT-green)
-![Version](https://img.shields.io/badge/version-0.3.0-orange)
-![Development Status](https://img.shields.io/badge/status-alpha-yellow)
+![Python Version](https://img.shields.io/badge/python-3.8%20|%203.9%20|%203.10%20|%203.11%20|%203.12-blue) ![License](https://img.shields.io/badge/license-MIT-green) [![PyPI](https://img.shields.io/pypi/v/dataflow-cv.svg)](https://pypi.org/project/dataflow-cv/) ![Development Status](https://img.shields.io/badge/status-alpha-yellow) [![GitHub Actions](https://github.com/zjykzj/DataFlow-CV/actions/workflows/python-publish.yml/badge.svg)](https://github.com/zjykzj/DataFlow-CV/actions/workflows/python-publish.yml)
 A data processing library for computer vision datasets, focusing on format conversion and visualization between LabelMe, COCO, and YOLO formats. Provides both a CLI and Python API.
@@ -19,6 +17,8 @@ A data processing library for computer vision datasets, focusing on format conve
     - [Core Dependencies](#core-dependencies)
   - [Quick Start](#quick-start)
     - [Installation](#installation)
+      - [Editable Installation (Development Mode)](#editable-installation-development-mode)
+      - [Build System](#build-system)
     - [Command Line Usage](#command-line-usage)
     - [Python API Usage](#python-api-usage)
     - [CLI Reference](#cli-reference)
@@ -30,6 +30,7 @@ A data processing library for computer vision datasets, focusing on format conve
     - [Segmentation Support](#segmentation-support)
     - [Running Tests](#running-tests)
     - [Examples](#examples)
+    - [Documentation](#documentation)
   - [License](#license)
 ## Project Structure
@@ -48,6 +49,7 @@ dataflow/
 ├── visualize/               # Annotation visualization module
 │   ├── __init__.py
 │   ├── base.py            # Visualizer base class
+│   ├── generic.py         # Generic visualizer base class using label handlers
 │   ├── yolo.py            # YOLO annotation visualizer
 │   ├── coco.py            # COCO annotation visualizer
 │   └── labelme.py         # LabelMe annotation visualizer
@@ -61,7 +63,11 @@ tests/
 ├── convert/                # Conversion tests
 │   ├── __init__.py
 │   ├── test_coco_to_yolo.py
-│   └── test_yolo_to_coco.py
+│   ├── test_yolo_to_coco.py
+│   ├── test_coco_to_labelme.py
+│   ├── test_labelme_to_coco.py
+│   ├── test_labelme_to_yolo.py
+│   └── test_yolo_to_labelme.py
 ├── visualize/              # Visualization tests
 │   ├── __init__.py
 │   ├── test_yolo.py
@@ -99,6 +105,11 @@ samples/
         ├── api_yolo.py
         ├── api_coco.py
         └── api_labelme.py
+docs/                       # Data format documentation
+├── README.md              # Documentation index
+├── yolo.md                # YOLO format specification
+├── labelme.md             # LabelMe format specification
+└── coco.md                # COCO format specification
 ```
 ## Requirements
@@ -437,6 +448,20 @@ Check the `samples/` directory for detailed usage examples:
 - `samples/api/convert/` - Python API conversion examples
 - `samples/api/visualize/` - Python API visualization examples
+### Documentation
+Detailed data format specifications are available in the `docs/` directory:
+- [`docs/README.md`](docs/README.md) - Documentation index
+- [`docs/yolo.md`](docs/yolo.md) - YOLO format specification
+- [`docs/labelme.md`](docs/labelme.md) - LabelMe format specification
+- [`docs/coco.md`](docs/coco.md) - COCO format specification
+These documents describe the annotation formats supported by DataFlow-CV, without covering tool usage.
+## Development
+For development guidelines, architecture details, and contribution instructions, see [CLAUDE.md](CLAUDE.md). This file provides guidance for working with the codebase, including common development commands, architectural patterns, and writing principles.
 ## License
 [MIT License](LICENSE) © 2026 zjykzj

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/__init__.py RENAMED Viewed

@@ -7,7 +7,7 @@
 @Description: DataFlow-CV: A data processing library for computer vision datasets
 """
-__version__ = "0.3.0"
+__version__ = "0.4.0"
 __author__ = "DataFlow Team"
 __description__ = "A data processing library for computer vision datasets"
@@ -42,14 +42,15 @@ def coco_to_yolo(coco_json_path: str, output_dir: str, **kwargs):
     Args:
         coco_json_path: Path to COCO JSON file
-        output_dir: Output directory where labels/ and class.names will be created
+        output_dir: Output directory where YOLO label files will be created
+            (class.names will be auto-generated in output_dir)
         **kwargs: Additional options passed to CocoToYoloConverter.convert()
     Returns:
         Dictionary with conversion statistics
     """
     converter = CocoToYoloConverter()
-    return converter.convert(coco_json_path, output_dir, **kwargs)
+    return converter.convert(coco_json_path, output_dir, classes_path=None, **kwargs)
 def yolo_to_coco(
@@ -130,20 +131,21 @@ def yolo_to_labelme(image_dir: str, label_dir: str, classes_path: str, output_di
     return converter.convert(image_dir, label_dir, classes_path, output_dir, **kwargs)
-def labelme_to_yolo(label_dir: str, output_dir: str, **kwargs):
+def labelme_to_yolo(label_dir: str, classes_path: str, output_dir: str, **kwargs):
     """
     Convert LabelMe format to YOLO format.
     Args:
         label_dir: Directory containing LabelMe JSON files
-        output_dir: Output directory where labels/ and class.names will be created
+        classes_path: Path to class names file (e.g., class.names)
+        output_dir: Output directory where YOLO label files will be created
         **kwargs: Additional options passed to LabelMeToYoloConverter.convert()
     Returns:
         Dictionary with conversion statistics
     """
     converter = LabelMeToYoloConverter()
-    return converter.convert(label_dir, output_dir, **kwargs)
+    return converter.convert(label_dir, classes_path, output_dir, **kwargs)
 # Convenience functions for visualization

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/cli.py RENAMED Viewed

@@ -31,6 +31,11 @@ from dataflow import __version__
 @click.pass_context
 def cli(ctx, verbose, overwrite):
     """DataFlow-CV: Computer vision dataset processing tool."""
+    # If -v is used alone (no subcommand), show version and exit
+    if verbose and ctx.invoked_subcommand is None:
+        click.echo(f"DataFlow-CV, version {__version__}")
+        ctx.exit()
     # Store configuration in context
     ctx.ensure_object(dict)
     ctx.obj['verbose'] = verbose
@@ -70,29 +75,30 @@ def coco2yolo(ctx, coco_json_path, output_dir, segmentation):
     \b
     COCO_JSON_PATH: Path to COCO JSON annotation file
-    OUTPUT_DIR: Directory where labels/ and class.names will be created
+    OUTPUT_DIR: Directory where YOLO label files will be created (class.names will be auto-generated)
     """
     try:
         # Segmentation parameter is passed directly to converter
         click.echo(f"Converting COCO JSON: {coco_json_path}")
         click.echo(f"Output directory: {output_dir}")
+        # Classes file will be auto-generated as {os.path.join(output_dir, Config.YOLO_CLASSES_FILENAME)}
         # Create converter and perform conversion
         converter = CocoToYoloConverter(verbose=ctx.obj['verbose'])
-        result = converter.convert(coco_json_path, output_dir, segmentation=segmentation)
+        result = converter.convert(coco_json_path, output_dir, classes_path=None, segmentation=segmentation)
         # Print summary
         click.echo("\n" + "="*50)
         click.echo("CONVERSION SUMMARY")
         click.echo("="*50)
         click.echo(f"COCO JSON: {coco_json_path}")
-        click.echo(f"Output directory: {result.get('output_dir')}")
-        click.echo(f"Labels directory: {result.get('labels_dir')}")
         click.echo(f"Classes file: {result.get('classes_file')}")
+        click.echo(f"Output directory: {result.get('output_dir')}")
         click.echo(f"Images processed: {result.get('images_processed', 0)}")
         click.echo(f"Annotations processed: {result.get('annotations_processed', 0)}")
-        click.echo(f"Categories found: {result.get('categories_found', 0)}")
+        click.echo(f"Categories in classes file: {result.get('categories_found', 0)}")
+        click.echo(f"Categories in data: {result.get('categories_in_data', 0)}")
         click.echo(f"Segmentation mode: {'ON' if segmentation else 'OFF'}")
         click.echo("\n✅ Conversion completed successfully!")
@@ -240,38 +246,41 @@ def labelme2coco(ctx, label_dir, classes_path, output_json_path, segmentation):
 @convert.command(name='labelme2yolo')
 @click.argument('label_dir', type=click.Path(exists=True, file_okay=False))
+@click.argument('classes_path', type=click.Path(exists=True, dir_okay=False))
 @click.argument('output_dir', type=click.Path(file_okay=False))
 @click.option('--segmentation', '-s', is_flag=True, help='Handle segmentation annotations')
 @click.pass_context
-def labelme2yolo(ctx, label_dir, output_dir, segmentation):
+def labelme2yolo(ctx, label_dir, classes_path, output_dir, segmentation):
     """
     Convert LabelMe format to YOLO format.
     \b
     LABEL_DIR: Directory containing LabelMe JSON files
-    OUTPUT_DIR: Directory where labels/ and class.names will be created
+    CLASSES_PATH: Path to class names file (e.g., class.names)
+    OUTPUT_DIR: Directory where YOLO label files will be created
     """
     try:
         click.echo(f"Label directory: {label_dir}")
+        click.echo(f"Classes file: {classes_path}")
         click.echo(f"Output directory: {output_dir}")
         if segmentation:
             click.echo("Segmentation mode: ON (strict)")
         # Create converter and perform conversion
         converter = LabelMeToYoloConverter(verbose=ctx.obj['verbose'])
-        result = converter.convert(label_dir, output_dir, segmentation=segmentation)
+        result = converter.convert(label_dir, classes_path, output_dir, segmentation=segmentation)
         # Print summary
         click.echo("\n" + "="*50)
         click.echo("CONVERSION SUMMARY")
         click.echo("="*50)
         click.echo(f"Label directory: {result.get('label_dir')}")
+        click.echo(f"Classes file: {result.get('classes_file')}")
         click.echo(f"Output directory: {result.get('output_dir')}")
-        click.echo(f"Labels directory: {result.get('labels_dir')}")
-        click.echo(f"Classes file: {result.get('classes_file', 'Not created')}")
         click.echo(f"Images processed: {result.get('images_processed', 0)}")
         click.echo(f"Annotations processed: {result.get('annotations_processed', 0)}")
-        click.echo(f"Categories found: {result.get('categories_found', 0)}")
+        click.echo(f"Categories in classes file: {result.get('categories_found', 0)}")
+        click.echo(f"Categories in data: {result.get('categories_in_data', 0)}")
         click.echo(f"Segmentation mode: {'ON' if segmentation else 'OFF'}")
         click.echo("\n✅ Conversion completed successfully!")

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/convert/coco_and_labelme.py RENAMED Viewed

@@ -29,7 +29,9 @@ class CocoToLabelMeConverter(LabelBasedConverter):
             coco_json_path: Path to COCO JSON file
             output_dir: Output directory where LabelMe JSON files will be created
             segmentation: Whether to enforce segmentation annotations.
-                If True, only annotations with segmentation data will be processed.
+                If True, only annotations with polygon segmentation data will be processed,
+                bounding box annotations will be skipped. If False, both bounding box and
+                segmentation annotations are processed.
         Returns:
             Dictionary with conversion statistics
@@ -119,7 +121,9 @@ class LabelMeToCocoConverter(LabelBasedConverter):
             classes_path: Path to class names file (e.g., class.names)
             output_json_path: Path to save COCO JSON file
             segmentation: Whether to enforce segmentation annotations.
-                If True, only annotations with segmentation data will be processed.
+                If True, only polygon shapes (shape_type="polygon") will be processed,
+                rectangle shapes will be skipped. If False, both rectangle and polygon
+                shapes are processed.
         Returns:
             Dictionary with conversion statistics

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/convert/coco_and_yolo.py RENAMED Viewed

@@ -10,7 +10,7 @@ reusing the label module handlers for consistent parsing and serialization.
 """
 import os
-from typing import Dict, List, Any
+from typing import Dict, List, Any, Optional
 from .base import LabelBasedConverter
 from ..config import Config
@@ -21,13 +21,15 @@ from ..label.yolo import YoloHandler
 class CocoToYoloConverter(LabelBasedConverter):
     """Convert COCO JSON format to YOLO label format."""
-    def convert(self, coco_json_path: str, output_dir: str, segmentation: bool = False) -> Dict[str, Any]:
+    def convert(self, coco_json_path: str, output_dir: str, classes_path: Optional[str] = None, segmentation: bool = False) -> Dict[str, Any]:
         """
         Convert COCO JSON file to YOLO format.
         Args:
             coco_json_path: Path to COCO JSON file
-            output_dir: Output directory where labels/ and class.names will be created
+            output_dir: Output directory where YOLO label files will be created
+            classes_path: Optional path to class names file (e.g., class.names).
+                If not provided, will be automatically generated as `output_dir/class.names`.
             segmentation: Whether to enforce segmentation annotations.
                 If True, only annotations with segmentation data will be processed.
@@ -39,6 +41,9 @@ class CocoToYoloConverter(LabelBasedConverter):
         """
         self.segmentation = segmentation
+        # Track if classes_path was auto-generated
+        original_classes_path = classes_path
         # 1. Validate input and output paths
         if not self.validate_input_path(coco_json_path, is_dir=False):
             raise ValueError(f"Invalid COCO JSON file: {coco_json_path}")
@@ -46,6 +51,17 @@ class CocoToYoloConverter(LabelBasedConverter):
         if not self.validate_output_path(output_dir, is_dir=True, create=True):
             raise ValueError(f"Invalid output directory: {output_dir}")
+        # 1.1 Handle classes_path: if None, auto-generate; otherwise validate
+        if classes_path is None:
+            classes_path = os.path.join(output_dir, Config.YOLO_CLASSES_FILENAME)
+            self.logger.info(f"Classes file not provided, will auto-generate: {classes_path}")
+        else:
+            if not self.validate_input_path(classes_path, is_dir=False):
+                raise ValueError(f"Invalid classes file: {classes_path}")
+        # 1.2 Create labels directory for YOLO output
+        labels_dir = self._create_labels_directory(output_dir)
         self.logger.info(f"Converting COCO to YOLO: {coco_json_path} -> {output_dir}")
         # 2. Use CocoHandler to read COCO data and convert to unified format
@@ -75,30 +91,47 @@ class CocoToYoloConverter(LabelBasedConverter):
                 for ann in img_data.get("annotations", []):
                     ann.pop("segmentation", None)
-        # 4. Extract unique categories and write class.names file
-        categories = self._extract_unique_categories(unified_data)
-        classes_path = os.path.join(output_dir, Config.YOLO_CLASSES_FILENAME)
-        if categories:
-            if not self.write_classes_file(categories, classes_path):
-                raise ValueError(f"Failed to write classes file: {classes_path}")
-            self.logger.info(f"Written {len(categories)} categories to {classes_path}")
-        # 5. Create labels directory
-        labels_dir = os.path.join(output_dir, Config.YOLO_LABELS_DIRNAME)
-        self.ensure_directory(labels_dir)
-        # 6. Use YoloHandler to write YOLO format
+        # 4. Extract unique categories from COCO data
+        data_categories = self._extract_unique_categories(unified_data)
+        if not data_categories:
+            self.logger.warning("No categories found in COCO data")
+            data_categories = []  # Ensure it's an empty list
+        # 5. Handle classes based on whether it was auto-generated or provided
+        if original_classes_path is None:
+            # Auto-generated classes path: write categories to file if we have any
+            categories = data_categories
+            if data_categories:
+                self.write_classes_file(data_categories, classes_path)
+                self.logger.info(f"Auto-generated classes file: {classes_path}")
+            else:
+                self.logger.warning(f"No categories to write to classes file: {classes_path}")
+                # Still create empty file or leave it? For now, create empty file
+                self.write_classes_file([], classes_path)
+        else:
+            # User-provided classes path: read and validate
+            categories = self.read_classes_file(classes_path)
+            if not categories:
+                raise ValueError(f"No categories found in classes file: {classes_path}")
+            # Validate all categories in data are present in provided classes file
+            if data_categories:
+                for category in data_categories:
+                    if category not in categories:
+                        raise ValueError(f"Category '{category}' found in COCO data but not in classes file")
+        # 6. Use YoloHandler to write YOLO format to labels directory
         yolo_handler = YoloHandler(verbose=self.verbose)
         success = False
-        if unified_data:
+        if unified_data and categories:
             success = yolo_handler.write_batch(unified_data, labels_dir, classes_path)
         else:
             # Create empty directory structure
             success = True
-            self.logger.info("No annotations to write, created empty directory structure")
+            self.logger.info("No annotations or categories to write, created empty directory structure")
         if not success:
-            raise ValueError(f"Failed to write YOLO label files to {labels_dir}")
+            raise ValueError(f"Failed to write YOLO label files to {output_dir}")
         # 7. Return statistics
         total_annotations = sum(len(img["annotations"]) for img in unified_data)
@@ -106,9 +139,9 @@ class CocoToYoloConverter(LabelBasedConverter):
             "images_processed": len(unified_data),
             "annotations_processed": total_annotations,
             "categories_found": len(categories),
+            "categories_in_data": len(data_categories) if unified_data else 0,
             "output_dir": output_dir,
             "classes_file": classes_path,
-            "labels_dir": labels_dir,
             "segmentation_mode": segmentation,
         }

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/convert/yolo_and_labelme.py RENAMED Viewed

@@ -32,7 +32,9 @@ class YoloToLabelMeConverter(LabelBasedConverter):
             classes_path: Path to YOLO class names file (e.g., class.names)
             output_dir: Output directory where LabelMe JSON files will be created
             segmentation: Whether to enforce segmentation annotations.
-                If True, only annotations with segmentation data will be processed.
+                If True, detection annotations (4 coordinates) will be converted to polygons
+                from bounding boxes, and segmentation annotations (6+ coordinates) will be
+                processed normally. If False, automatic format detection is used.
         Returns:
             Dictionary with conversion statistics
@@ -107,15 +109,18 @@ class YoloToLabelMeConverter(LabelBasedConverter):
 class LabelMeToYoloConverter(LabelBasedConverter):
     """Convert LabelMe format to YOLO format."""
-    def convert(self, label_dir: str, output_dir: str, segmentation: bool = False) -> Dict[str, Any]:
+    def convert(self, label_dir: str, classes_path: str, output_dir: str, segmentation: bool = False) -> Dict[str, Any]:
         """
         Convert LabelMe format to YOLO format.
         Args:
             label_dir: Directory containing LabelMe JSON files
-            output_dir: Output directory where labels/ and class.names will be created
+            classes_path: Path to class names file (e.g., class.names)
+            output_dir: Output directory where YOLO label files will be created
             segmentation: Whether to enforce segmentation annotations.
-                If True, only annotations with segmentation data will be processed.
+                If True, only polygon shapes (shape_type="polygon") will be processed,
+                rectangle shapes will be skipped. If False, both rectangle and polygon
+                shapes are processed.
         Returns:
             Dictionary with conversion statistics
@@ -129,10 +134,16 @@ class LabelMeToYoloConverter(LabelBasedConverter):
         if not self.validate_input_path(label_dir, is_dir=True):
             raise ValueError(f"Invalid label directory: {label_dir}")
+        if not self.validate_input_path(classes_path, is_dir=False):
+            raise ValueError(f"Invalid classes file: {classes_path}")
         if not self.validate_output_path(output_dir, is_dir=True, create=True):
             raise ValueError(f"Invalid output directory: {output_dir}")
-        self.logger.info(f"Converting LabelMe to YOLO: {label_dir} -> {output_dir}")
+        # Create labels directory for YOLO output
+        labels_dir = self._create_labels_directory(output_dir)
+        self.logger.info(f"Converting LabelMe to YOLO: {label_dir} -> {output_dir} (labels in {labels_dir})")
         # 2. Use LabelMeHandler to read LabelMe data in batch
         labelme_handler = LabelMeHandler(verbose=self.verbose)
@@ -152,23 +163,22 @@ class LabelMeToYoloConverter(LabelBasedConverter):
                 if not self._validate_segmentation_annotations(img_data["annotations"]):
                     raise ValueError(f"Image {img_data['image_id']} missing segmentation annotations")
-        # 4. Extract unique categories from LabelMe data
-        categories = self._extract_unique_categories(unified_data)
+        # 4. Read provided classes file and validate
+        categories = self.read_classes_file(classes_path)
         if not categories:
-            self.logger.warning("No categories found in LabelMe data")
+            raise ValueError(f"No categories found in classes file: {classes_path}")
-        # 5. Write class.names file
-        classes_path = os.path.join(output_dir, Config.YOLO_CLASSES_FILENAME)
-        if categories:
-            if not self.write_classes_file(categories, classes_path):
-                raise ValueError(f"Failed to write classes file: {classes_path}")
-            self.logger.info(f"Written {len(categories)} categories to {classes_path}")
-        # 6. Create labels directory
-        labels_dir = os.path.join(output_dir, Config.YOLO_LABELS_DIRNAME)
-        self.ensure_directory(labels_dir)
+        # 5. Extract unique categories from LabelMe data and validate against provided classes
+        data_categories = self._extract_unique_categories(unified_data)
+        if not data_categories:
+            self.logger.warning("No categories found in LabelMe data")
+        else:
+            # Validate all categories in data are present in provided classes file
+            for category in data_categories:
+                if category not in categories:
+                    raise ValueError(f"Category '{category}' found in LabelMe data but not in classes file")
-        # 7. Use YoloHandler to write YOLO format
+        # 6. Use YoloHandler to write YOLO format to labels directory
         yolo_handler = YoloHandler(verbose=self.verbose)
         success = False
         if unified_data and categories:
@@ -179,18 +189,18 @@ class LabelMeToYoloConverter(LabelBasedConverter):
             self.logger.info("No annotations or categories to write, created empty directory structure")
         if not success:
-            raise ValueError(f"Failed to write YOLO label files to {labels_dir}")
+            raise ValueError(f"Failed to write YOLO label files to {output_dir}")
         # 8. Return statistics
         total_annotations = sum(len(img["annotations"]) for img in unified_data)
         stats = {
             "label_dir": label_dir,
+            "classes_file": classes_path,
             "output_dir": output_dir,
-            "classes_file": classes_path if categories else None,
-            "labels_dir": labels_dir,
             "images_processed": len(unified_data),
             "annotations_processed": total_annotations,
             "categories_found": len(categories),
+            "categories_in_data": len(data_categories) if unified_data else 0,
             "segmentation_mode": segmentation,
         }

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/label/labelme.py RENAMED Viewed

@@ -358,24 +358,44 @@ class LabelMeHandler:
         width, height = image_size
-        # 优先使用分割数据
+        # 检查分割数据
         if annotation.get("segmentation") and annotation["segmentation"][0]:
-            # 分割标注
             points_flat = annotation["segmentation"][0]
-            # 将展平的坐标转换为点列表
-            points = []
-            for i in range(0, len(points_flat), 2):
-                if i + 1 < len(points_flat):
-                    x = max(0, min(points_flat[i], width - 1))
-                    y = max(0, min(points_flat[i + 1], height - 1))
-                    points.append([x, y])
-            if len(points) >= 3:
-                shape["points"] = points
-                shape["shape_type"] = "polygon"
-                return shape
-        # 使用边界框数据
+            # 检查是否为从边界框生成的4点多边形（8个坐标）
+            is_bbox_polygon = (len(points_flat) == 8 and
+                              annotation.get("force_polygon", False))
+            if is_bbox_polygon:
+                # 强制分割模式：从边界框生成的多边形→多边形
+                # 将展平的坐标转换为点列表
+                points = []
+                for i in range(0, len(points_flat), 2):
+                    if i + 1 < len(points_flat):
+                        x = max(0, min(points_flat[i], width - 1))
+                        y = max(0, min(points_flat[i + 1], height - 1))
+                        points.append([x, y])
+                if len(points) >= 3:
+                    shape["points"] = points
+                    shape["shape_type"] = "polygon"
+                    return shape
+            else:
+                # 真实分割数据→多边形
+                # 将展平的坐标转换为点列表
+                points = []
+                for i in range(0, len(points_flat), 2):
+                    if i + 1 < len(points_flat):
+                        x = max(0, min(points_flat[i], width - 1))
+                        y = max(0, min(points_flat[i + 1], height - 1))
+                        points.append([x, y])
+                if len(points) >= 3:
+                    shape["points"] = points
+                    shape["shape_type"] = "polygon"
+                    return shape
+        # 使用边界框数据→矩形
         if annotation.get("bbox"):
             bbox = annotation["bbox"]
             x_min, y_min, bbox_width, bbox_height = bbox

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/label/yolo.py RENAMED Viewed

@@ -48,7 +48,7 @@ class YoloHandler:
             image_path: 对应图像文件路径
             classes: 类别名称列表
             image_size: 可选图像尺寸 (width, height)。如果未提供，将尝试从图像文件读取
-            require_segmentation: 是否要求分割格式。如果True，只接受分割标注（至少6个坐标），检测格式将跳过
+            require_segmentation: 是否强制分割模式。如果True，检测标注（4个坐标）将生成为从边界框创建的多边形，分割标注（6+个坐标）正常处理。如果False，自动检测格式类型。
         Returns:
             统一格式的图像标注数据字典，结构如下：
@@ -113,14 +113,14 @@ class YoloHandler:
                 # 分割格式: 2n个坐标 (多边形顶点)
                 if len(coords) == 4:
                     # 检测格式
-                    if require_segmentation:
-                        if self.verbose:
-                            print(f"警告: 行 {line_num}: 要求分割格式但检测到检测格式，跳过")
-                        continue
-                    annotation = self._parse_detection(coords, class_id, classes, (width, height))
+                    # require_segmentation=True 表示强制分割模式，检测标注应生成为多边形
+                    annotation = self._parse_detection(coords, class_id, classes, (width, height),
+                                                      force_polygon=require_segmentation)
                 elif len(coords) >= 6 and len(coords) % 2 == 0:
                     # 分割格式（至少3个点）
-                    annotation = self._parse_segmentation(coords, class_id, classes, (width, height))
+                    # require_segmentation=True 表示强制分割模式，但真实分割标注不需要force_polygon标记
+                    annotation = self._parse_segmentation(coords, class_id, classes, (width, height),
+                                                         force_polygon=require_segmentation)
                 else:
                     if self.verbose:
                         print(f"警告: 行 {line_num}: 坐标数量无效: {len(coords)}")
@@ -183,7 +183,7 @@ class YoloHandler:
             classes_path: 类别文件路径
             label_ext: 标签文件扩展名
             image_exts: 图像文件扩展名元组
-            require_segmentation: 是否要求分割格式。如果True，只接受分割标注（至少6个坐标），检测格式将跳过
+            require_segmentation: 是否强制分割模式。如果True，检测标注（4个坐标）将生成为从边界框创建的多边形，分割标注（6+个坐标）正常处理。如果False，自动检测格式类型。
         Returns:
             图像标注数据列表，每个元素为read()返回的格式
@@ -445,7 +445,8 @@ class YoloHandler:
             return False
     def _parse_detection(self, coords: List[float], class_id: int,
-                        classes: List[str], image_size: Tuple[int, int]) -> Optional[Dict]:
+                        classes: List[str], image_size: Tuple[int, int],
+                        force_polygon: bool = False) -> Optional[Dict]:
         """解析检测格式标注
         Args:
@@ -453,6 +454,7 @@ class YoloHandler:
             class_id: 类别ID
             classes: 类别名称列表
             image_size: 图像尺寸 (width, height)
+            force_polygon: 是否强制生成多边形分割数据
         Returns:
             统一格式的标注字典
@@ -486,20 +488,23 @@ class YoloHandler:
             "segmentation": None
         }
-        # 从边界框创建简单多边形
-        x_max = x_min + bbox_width
-        y_max = y_min + bbox_height
-        annotation["segmentation"] = [[
-            x_min, y_min,
-            x_max, y_min,
-            x_max, y_max,
-            x_min, y_max
-        ]]
+        # 只有在强制分割模式时才从边界框创建简单多边形
+        if force_polygon:
+            x_max = x_min + bbox_width
+            y_max = y_min + bbox_height
+            annotation["segmentation"] = [[
+                x_min, y_min,
+                x_max, y_min,
+                x_max, y_max,
+                x_min, y_max
+            ]]
+            annotation["force_polygon"] = True  # 标记为强制转换的多边形
         return annotation
     def _parse_segmentation(self, coords: List[float], class_id: int,
-                           classes: List[str], image_size: Tuple[int, int]) -> Optional[Dict]:
+                           classes: List[str], image_size: Tuple[int, int],
+                           force_polygon: bool = False) -> Optional[Dict]:
         """解析分割格式标注
         Args:
@@ -507,6 +512,7 @@ class YoloHandler:
             class_id: 类别ID
             classes: 类别名称列表
             image_size: 图像尺寸 (width, height)
+            force_polygon: 是否强制生成多边形分割数据（对于真实分割标注应为False）
         Returns:
             统一格式的标注字典
@@ -548,6 +554,10 @@ class YoloHandler:
             "segmentation": [denormalized_coords]
         }
+        # 真实分割标注不标记force_polygon，或标记为False
+        if force_polygon:
+            annotation["force_polygon"] = False
         return annotation
     def _normalize_coords(self, coords: List[float], image_size: Tuple[int, int]) -> Optional[List[float]]:

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/visualize/base.py RENAMED Viewed

@@ -135,9 +135,23 @@ class BaseVisualizer:
             color_idx = class_id % len(self.DEFAULT_COLORS)
             return self.DEFAULT_COLORS[color_idx]
         else:
-            # Generate distinct colors using HSV color space
-            hue = int(179 * class_id / max(num_classes, 1))
-            hsv_color = np.uint8([[[hue, 255, 255]]])
+            # Generate distinct colors using HSV color space with golden ratio distribution
+            # This provides better color separation than linear spacing
+            # Golden angle in degrees: 137.508 (gives optimal spacing on color wheel)
+            golden_angle = 137.508
+            # Use modulo 360 to wrap around the hue circle
+            hue_angle = (class_id * golden_angle) % 360.0
+            # Convert to OpenCV hue range (0-179, corresponding to 0-360 degrees)
+            hue = int(hue_angle * 179.0 / 360.0)
+            # Vary saturation and value slightly to increase color distinction
+            # while keeping colors bright and vibrant
+            # Use class_id to create patterns in saturation and value
+            # This adds extra dimension of variation beyond just hue
+            saturation = 220 + (class_id % 4) * 12  # 220-256 range
+            value = 220 + ((class_id // 4) % 4) * 12  # 220-256 range
+            hsv_color = np.uint8([[[hue, saturation, value]]])
             bgr_color = cv2.cvtColor(hsv_color, cv2.COLOR_HSV2BGR)
             return tuple(map(int, bgr_color[0][0]))

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/visualize/generic.py RENAMED Viewed

@@ -10,6 +10,7 @@
 import os
 import cv2
 import numpy as np
+import logging
 from typing import List, Dict, Any, Optional, Tuple
 from .base import BaseVisualizer
@@ -74,6 +75,7 @@ class GenericVisualizer(BaseVisualizer):
             Image with annotations drawn
         """
         result_image = image.copy()
+        self.logger.debug(f"_draw_annotations called with {len(annotations)} annotations, {len(classes)} classes: {classes}")
         # Validate segmentation format if required (strict mode)
         if self.segmentation:
@@ -82,7 +84,61 @@ class GenericVisualizer(BaseVisualizer):
         for ann in annotations:
             category_id = ann.get("category_id", 0)
             category_name = ann.get("category_name", f"class_{category_id}")
-            color = self.get_color_for_class(category_id, len(classes))
+            # Get color based on class name index in classes list
+            class_idx = None
+            original_category_name = category_name
+            # 1. Try exact match
+            try:
+                class_idx = classes.index(category_name)
+                color = self.get_color_for_class(class_idx, len(classes))
+                # Debug logging for successful color assignment
+                self.logger.debug(f"Color assigned: category_name='{category_name}', class_idx={class_idx}, color={color}")
+            except ValueError:
+                # 2. Try normalized match (strip whitespace, case-insensitive)
+                normalized_name = category_name.strip()
+                # Try case-insensitive match
+                try:
+                    # Find case-insensitive match
+                    for idx, cls in enumerate(classes):
+                        if cls.strip().lower() == normalized_name.lower():
+                            class_idx = idx
+                            break
+                except Exception:
+                    pass
+                if class_idx is not None:
+                    color = self.get_color_for_class(class_idx, len(classes))
+                    self.logger.warning(f"Class '{original_category_name}' matched case-insensitively to '{classes[class_idx]}', using index {class_idx}")
+                    self.logger.debug(f"Case-insensitive match: category_name='{original_category_name}', matched='{classes[class_idx]}', class_idx={class_idx}, color={color}")
+                else:
+                    # 3. Try to parse "class_X" format
+                    if category_name.startswith("class_") and category_name[6:].isdigit():
+                        try:
+                            parsed_id = int(category_name[6:])
+                            # Use parsed_id as fallback, but ensure it's within reasonable bounds
+                            if parsed_id < len(classes) * 2:  # Allow some flexibility
+                                class_idx = parsed_id % len(classes) if len(classes) > 0 else 0
+                                self.logger.warning(f"Class '{category_name}' parsed as class_{parsed_id}, using index {class_idx}")
+                            else:
+                                self.logger.warning(f"Parsed class ID {parsed_id} from '{category_name}' is too large, using category_id")
+                        except ValueError:
+                            pass
+                    # 4. Final fallback to category_id
+                    if class_idx is None:
+                        self.logger.warning(f"Class '{category_name}' not found in classes list, using category_id {category_id} for color")
+                        # Ensure category_id is within bounds
+                        if category_id < len(classes):
+                            class_idx = category_id
+                        else:
+                            # If category_id is out of bounds, use modulo
+                            class_idx = category_id % len(classes) if len(classes) > 0 else 0
+                    color = self.get_color_for_class(class_idx, len(classes))
+                    # Debug logging for fallback case
+                    self.logger.debug(f"Fallback color: category_name='{original_category_name}', category_id={category_id}, class_idx={class_idx}, color={color}")
             # Determine what to draw based on mode and available data
             if self.segmentation:

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/visualize/labelme.py RENAMED Viewed

@@ -171,6 +171,10 @@ class LabelMeVisualizer(GenericVisualizer):
                 class_name = annotation.get("category_name")
                 if class_name:
                     class_names.add(class_name)
+                else:
+                    # Fallback to category_id if category_name is missing
+                    category_id = annotation.get("category_id", 0)
+                    class_names.add(f"class_{category_id}")
         return sorted(class_names)
     def _resolve_image_paths(self, annotations_list: List[Dict], image_dir: str) -> List[Dict]:

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/dataflow/visualize/yolo.py RENAMED Viewed

@@ -8,6 +8,8 @@
 """
 import os
+import sys
+import logging
 from typing import List, Dict, Any, Optional
 from .generic import GenericVisualizer
@@ -55,6 +57,10 @@ class YoloVisualizer(GenericVisualizer):
                 - save_dir: Path where images were saved (if save_dir provided)
         """
         # Validate inputs
+        # Temporarily enable debug logging for troubleshooting
+        self.logger.setLevel(logging.DEBUG)
+        self.logger.debug("Debug logging enabled for YOLO visualization")
         if not self.validate_input_path(image_dir, is_dir=True):
             raise ValueError(f"Invalid image directory: {image_dir}")
         if not self.validate_input_path(label_dir, is_dir=True):
@@ -64,9 +70,11 @@ class YoloVisualizer(GenericVisualizer):
         if save_dir and not self.validate_output_path(save_dir, is_dir=True, create=True):
             raise ValueError(f"Invalid save directory: {save_dir}")
         # Read classes
         classes = self._read_classes_file(class_path)
         self.logger.info(f"Loaded {len(classes)} classes from {class_path}")
+        self.logger.debug(f"Classes list: {classes}")
         # Read annotations using YoloHandler
         try:
@@ -76,6 +84,14 @@ class YoloVisualizer(GenericVisualizer):
                 classes_path=class_path,
                 require_segmentation=self.segmentation
             )
+            # Debug logging for annotation data
+            if annotations_list:
+                print(f"[DEBUG] Read {len(annotations_list)} image annotations", flush=True)
+                for i, img_data in enumerate(annotations_list[:3]):  # Check first 3 images
+                    anns = img_data.get("annotations", [])
+                    print(f"[DEBUG]   Image {i}: {img_data.get('image_id', 'unknown')}, {len(anns)} annotations", flush=True)
+                    for j, ann in enumerate(anns[:3]):  # Check first 3 annotations per image
+                        print(f"[DEBUG]     Annotation {j}: category_id={ann.get('category_id')}, category_name={ann.get('category_name')}", flush=True)
         except Exception as e:
             raise ValueError(f"Failed to read YOLO annotations: {e}")
@@ -110,6 +126,10 @@ class YoloVisualizer(GenericVisualizer):
                     f"Found {len(label_files)} label file(s) with only detection format or no annotations."
                 )
+        # Merge classes from file with classes found in annotations
+        classes = self._extract_classes(annotations_list, classes)
+        self.logger.info(f"Using {len(classes)} classes for visualization")
         # Create results template
         results = self._create_results_template(
             image_dir=image_dir,
@@ -178,6 +198,87 @@ class YoloVisualizer(GenericVisualizer):
             self.logger.error(f"Error reading class file {class_path}: {e}")
             raise ValueError(f"Could not read class file: {class_path}") from e
+    def _extract_classes(self, annotations_list: List[Dict], file_classes: List[str]) -> List[str]:
+        """Extract and merge class names from annotations and file.
+        Args:
+            annotations_list: List of image annotation data
+            file_classes: List of class names from file
+        Returns:
+            Merged list of class names, ensuring all annotations have matching names
+        """
+        # Normalize file classes: strip whitespace, create mapping from normalized to original
+        file_class_map = {}  # normalized -> original
+        normalized_file_classes = []
+        for class_name in file_classes:
+            normalized = class_name.strip()
+            file_class_map[normalized] = class_name
+            normalized_file_classes.append(normalized)
+        # Extract unique class names from annotations with normalization
+        annotation_classes = set()  # Store normalized names
+        original_annotation_names = {}  # normalized -> first original encountered
+        for image_data in annotations_list:
+            for annotation in image_data.get("annotations", []):
+                class_name = annotation.get("category_name")
+                if class_name:
+                    normalized = class_name.strip()
+                    annotation_classes.add(normalized)
+                    if normalized not in original_annotation_names:
+                        original_annotation_names[normalized] = class_name  # Keep original for reference
+                else:
+                    # Fallback to category_id
+                    category_id = annotation.get("category_id", 0)
+                    class_name = f"class_{category_id}"
+                    normalized = class_name.strip()
+                    annotation_classes.add(normalized)
+                    if normalized not in original_annotation_names:
+                        original_annotation_names[normalized] = class_name
+        # Merge with file classes, preserving file order for existing classes
+        merged_classes = []
+        matched_normalized = set()
+        # First add file classes that appear in annotations (case-insensitive and whitespace-insensitive)
+        for normalized, original in file_class_map.items():
+            if normalized in annotation_classes:
+                merged_classes.append(original)  # Use original file class name
+                matched_normalized.add(normalized)
+                annotation_classes.remove(normalized)
+            else:
+                # Also check case-insensitive match
+                matched = False
+                for ann_normalized in list(annotation_classes):
+                    if ann_normalized.lower() == normalized.lower():
+                        merged_classes.append(original)  # Use original file class name
+                        matched_normalized.add(ann_normalized)
+                        annotation_classes.remove(ann_normalized)
+                        matched = True
+                        self.logger.warning(f"Class name '{ann_normalized}' matched case-insensitively to file class '{original}'")
+                        break
+                if not matched:
+                    # File class not found in annotations, still include it
+                    merged_classes.append(original)
+        # Add remaining annotation classes (not matched to file classes)
+        remaining = sorted(annotation_classes)
+        for normalized in remaining:
+            # Use original annotation name if available, otherwise normalized
+            original = original_annotation_names.get(normalized, normalized)
+            merged_classes.append(original)
+        self.logger.info(f"Merged classes: {len(file_classes)} from file, {len(merged_classes)} total after merge")
+        if len(merged_classes) > len(file_classes):
+            self.logger.warning(f"Found {len(merged_classes) - len(file_classes)} classes in annotations not in file")
+        # Debug logging for color assignment consistency
+        self.logger.debug(f"Merged classes list: {merged_classes}")
+        self.logger.debug(f"File classes: {file_classes}")
+        self.logger.debug(f"Annotation classes (normalized): {original_annotation_names}")
+        return merged_classes
     def batch_visualize(self,
                         image_dirs: List[str],
                         label_dirs: List[str],

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0/dataflow_cv.egg-info}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: dataflow-cv
-Version: 0.3.0
+Version: 0.4.0
 Summary: A data processing library for computer vision datasets
 Home-page: https://github.com/zjykzj/DataFlow-CV
 Author: DataFlow Team
@@ -34,10 +34,8 @@ Dynamic: requires-python
 > **Where Vibe Coding meets CV data.** 🌊
 > Convert & visualize datasets. Built with the flow of Claude Code.
-![Python Version](https://img.shields.io/badge/python-3.8%20|%203.9%20|%203.10%20|%203.11%20|%203.12-blue)
-![License](https://img.shields.io/badge/license-MIT-green)
-![Version](https://img.shields.io/badge/version-0.3.0-orange)
-![Development Status](https://img.shields.io/badge/status-alpha-yellow)
+![Python Version](https://img.shields.io/badge/python-3.8%20|%203.9%20|%203.10%20|%203.11%20|%203.12-blue) ![License](https://img.shields.io/badge/license-MIT-green) [![PyPI](https://img.shields.io/pypi/v/dataflow-cv.svg)](https://pypi.org/project/dataflow-cv/) ![Development Status](https://img.shields.io/badge/status-alpha-yellow) [![GitHub Actions](https://github.com/zjykzj/DataFlow-CV/actions/workflows/python-publish.yml/badge.svg)](https://github.com/zjykzj/DataFlow-CV/actions/workflows/python-publish.yml)
 A data processing library for computer vision datasets, focusing on format conversion and visualization between LabelMe, COCO, and YOLO formats. Provides both a CLI and Python API.
@@ -50,6 +48,8 @@ A data processing library for computer vision datasets, focusing on format conve
     - [Core Dependencies](#core-dependencies)
   - [Quick Start](#quick-start)
     - [Installation](#installation)
+      - [Editable Installation (Development Mode)](#editable-installation-development-mode)
+      - [Build System](#build-system)
     - [Command Line Usage](#command-line-usage)
     - [Python API Usage](#python-api-usage)
     - [CLI Reference](#cli-reference)
@@ -61,6 +61,7 @@ A data processing library for computer vision datasets, focusing on format conve
     - [Segmentation Support](#segmentation-support)
     - [Running Tests](#running-tests)
     - [Examples](#examples)
+    - [Documentation](#documentation)
   - [License](#license)
 ## Project Structure
@@ -79,6 +80,7 @@ dataflow/
 ├── visualize/               # Annotation visualization module
 │   ├── __init__.py
 │   ├── base.py            # Visualizer base class
+│   ├── generic.py         # Generic visualizer base class using label handlers
 │   ├── yolo.py            # YOLO annotation visualizer
 │   ├── coco.py            # COCO annotation visualizer
 │   └── labelme.py         # LabelMe annotation visualizer
@@ -92,7 +94,11 @@ tests/
 ├── convert/                # Conversion tests
 │   ├── __init__.py
 │   ├── test_coco_to_yolo.py
-│   └── test_yolo_to_coco.py
+│   ├── test_yolo_to_coco.py
+│   ├── test_coco_to_labelme.py
+│   ├── test_labelme_to_coco.py
+│   ├── test_labelme_to_yolo.py
+│   └── test_yolo_to_labelme.py
 ├── visualize/              # Visualization tests
 │   ├── __init__.py
 │   ├── test_yolo.py
@@ -130,6 +136,11 @@ samples/
         ├── api_yolo.py
         ├── api_coco.py
         └── api_labelme.py
+docs/                       # Data format documentation
+├── README.md              # Documentation index
+├── yolo.md                # YOLO format specification
+├── labelme.md             # LabelMe format specification
+└── coco.md                # COCO format specification
 ```
 ## Requirements
@@ -468,6 +479,20 @@ Check the `samples/` directory for detailed usage examples:
 - `samples/api/convert/` - Python API conversion examples
 - `samples/api/visualize/` - Python API visualization examples
+### Documentation
+Detailed data format specifications are available in the `docs/` directory:
+- [`docs/README.md`](docs/README.md) - Documentation index
+- [`docs/yolo.md`](docs/yolo.md) - YOLO format specification
+- [`docs/labelme.md`](docs/labelme.md) - LabelMe format specification
+- [`docs/coco.md`](docs/coco.md) - COCO format specification
+These documents describe the annotation formats supported by DataFlow-CV, without covering tool usage.
+## Development
+For development guidelines, architecture details, and contribution instructions, see [CLAUDE.md](CLAUDE.md). This file provides guidance for working with the codebase, including common development commands, architectural patterns, and writing principles.
 ## License
 [MIT License](LICENSE) © 2026 zjykzj

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/pyproject.toml RENAMED Viewed

@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 [project]
 name = "dataflow-cv"
-version = "0.3.0"
+version = "0.4.0"
 description = "A data processing library for computer vision datasets"
 readme = "README.md"
 authors = [

{dataflow_cv-0.3.0 → dataflow_cv-0.4.0}/setup.py RENAMED Viewed

@@ -37,7 +37,7 @@ class DevelopCommand(_develop):
 setup(
     name="dataflow-cv",
-    version="0.3.0",
+    version="0.3.1",
     author="DataFlow Team",
     description="A data processing library for computer vision datasets",
     long_description=long_description,