PyPI - dataframe-textual - Versions diffs - 1.2.0__tar.gz → 1.4.0__tar.gz - Mend

dataframe-textual 1.2.0tar.gz → 1.4.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (18) hide show

{dataframe_textual-1.2.0 → dataframe_textual-1.4.0}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: dataframe-textual
-Version: 1.2.0
+Version: 1.4.0
 Summary: Interactive terminal viewer/editor for tabular data
 Project-URL: Homepage, https://github.com/need47/dataframe-textual
 Project-URL: Repository, https://github.com/need47/dataframe-textual.git
@@ -29,7 +29,7 @@ Classifier: Topic :: Utilities
 Classifier: Typing :: Typed
 Requires-Python: >=3.11
 Requires-Dist: polars>=1.34.0
-Requires-Dist: textual>=6.5.0
+Requires-Dist: textual[syntax]>=6.5.0
 Provides-Extra: dev
 Requires-Dist: textual-dev>=1.8.0; extra == 'dev'
 Provides-Extra: excel
@@ -129,6 +129,13 @@ uv run python main.py pokemon.csv
 # Read from stdin (auto-detects format; defaults to TSV if not recognized)
 cat data.tsv | dv
 dv < data.tsv
+# Gzipped files are supported
+dv data.csv.gz
+dv large_dataset.tsv.gz
+# Specify format for gzipped stdin
+zcat data.csv.gz | dv -f csv
 ```
 ### Multi-File Usage - Multiple Tabs
@@ -142,6 +149,9 @@ dv file.xlsx
 # Mix files and stdin (read from stdin, then open file)
 dv data1.tsv < data2.tsv
+# Mix regular and gzipped files
+dv data1.csv data2.csv.gz data3.tsv.gz
 ```
 When multiple files are opened:
@@ -151,6 +161,67 @@ When multiple files are opened:
 - Close the current tab with `Ctrl+W`
 - Each file maintains its own state (edits, sort order, selections, history, etc.)
+## Command Line Options
+```
+usage: dv [-h] [-f {csv,excel,tsv,parquet,json,ndjson}] [-H] [-I] [-L SKIP_LINES] [-K SKIP_ROWS_AFTER_HEADER] [-U NULL [NULL ...]] [files ...]
+Interactive terminal based viewer/editor for tabular data (e.g., CSV/Excel).
+positional arguments:
+  files                 Files to view (or read from stdin)
+options:
+  -h, --help            show this help message and exit
+  -f, --format {csv,excel,tsv,parquet,json,ndjson}
+                        Specify the format of the input files
+  -H, --no-header       Specify that input files have no header row
+  -I, --no-inferrence   Do not infer data types when reading CSV/TSV
+  -L, --skip-lines SKIP_LINES
+                        Skip lines when reading CSV/TSV (default: 0)
+  -K, --skip-rows-after-header SKIP_ROWS_AFTER_HEADER
+                        Skip rows after header when reading CSV/TSV (default: 0)
+  -U, --null NULL [NULL ...]
+                        Values to interpret as null values when reading CSV/TSV
+```
+### CLI Examples
+```bash
+# View CSV file without header row
+dv -H data_no_header.csv
+# Disable type inference for faster loading
+dv -I large_data.csv
+# Skip first 3 lines of file (e.g., comments, metadata)
+dv -L 3 data_with_comments.csv
+# Skip 1 row after header (e.g., units row)
+dv -K 1 data_with_units.csv
+# Treat specific values as null/missing (e.g., 'NA', 'N/A', '-')
+dv -U NA N/A - data.csv
+# Multiple null values with different formats
+dv -U NULL NA "" "Not Available" messy_data.csv
+# Complex CSV with comments and units row
+dv -L 3 -K 1 -I messy_scientific_data.csv
+# Combine all options: skip lines, skip after header, no header, no inference, gzipped
+dv -L 2 -K 1 -H -I complex_data.csv.gz
+# Process compressed data from stdin with line skipping
+zcat compressed_data.csv.gz | dv -f csv -L 2
+# CSV with custom null values and no header
+dv -H -U NA "N/A" "-" raw_data.csv
+# Skip lines, specify null values, and disable type inference
+dv -L 5 -U NA "" data_with_metadata.csv
+```
 ## Keyboard Shortcuts
 ### App-Level Controls
@@ -161,7 +232,7 @@ When multiple files are opened:
 |-----|--------|
 | `Ctrl+O` | Open file in a new tab |
 | `Ctrl+W` | Close current tab |
-| `Ctrl+Shift+S` | Save all open tabs to Excel file |
+| `Ctrl+A` | Save all open tabs to Excel file |
 | `>` or `b` | Move to next tab |
 | `<` | Move to previous tab |
 | `B` | Toggle tab bar visibility |
@@ -171,7 +242,7 @@ When multiple files are opened:
 | Key | Action |
 |-----|--------|
-| `Ctrl+H` | Toggle help panel |
+| `F1` | Toggle help panel |
 | `k` | Cycle through themes |
 ---
@@ -189,6 +260,8 @@ When multiple files are opened:
 | `Home` / `End` | Jump to first/last column in current row |
 | `Ctrl + Home` / `Ctrl + End` | Jump to top/bottom in current page |
 | `PageDown` / `PageUp` | Scroll down/up one page |
+| `Ctrl+F` | Page down |
+| `Ctrl+B` | Page up |
 #### Viewing & Display
@@ -200,6 +273,7 @@ When multiple files are opened:
 | `S` | Show statistics for entire dataframe |
 | `K` | Cycle cursor type: cell → row → column → cell |
 | `~` | Toggle row labels |
+| `_` (underscore) | Expand column to full width |
 #### Data Editing
@@ -212,8 +286,6 @@ When multiple files are opened:
 | `a` | Add empty column after current |
 | `A` | Add column with name and value/expression |
 | `-` (minus) | Delete current column |
-| `_` (underscore) | Delete current column and all columns after |
-| `Ctrl+-` | Delete current column and all columns before |
 | `x` | Delete current row |
 | `X` | Delete current row and all rows below |
 | `Ctrl+X` | Delete current row and all rows above |
@@ -241,12 +313,19 @@ When multiple files are opened:
 | `v` | View only rows by selected rows and/or matches or cursor value |
 | `V` | View only rows by expression |
+#### SQL Interface
+| Key | Action |
+|-----|--------|
+| `l` | Simple SQL interface (select columns & WHERE clause) |
+| `L` | Advanced SQL interface (full SQL queries) |
 #### Find & Replace
 | Key | Action |
 |-----|--------|
-| `f` | Find across all columns with cursor value |
-| `Ctrl+F` | Find across all columns with expression |
+| `;` | Find across all columns with cursor value |
+| `:` | Find across all columns with expression |
 | `r` | Find and replace in current column (interactive or replace all) |
 | `R` | Find and replace across all columns (interactive or replace all) |
@@ -322,8 +401,8 @@ The application provides multiple search modes for different use cases:
 **Find Operations** - Find by value/expression:
 - **`/` - Column Find**: Find cursor value within current column
 - **`?` - Column Expression Find**: Open dialog to search current column with expression
-- **`f` - Global Find**: Find cursor value across all columns
-- **`Ctrl+f` - Global Expression Find**: Open dialog to search all columns with expression
+- **`;` - Global Find**: Find cursor value across all columns
+- **`:` - Global Expression Find**: Open dialog to search all columns with expression
 **Selection & Filtering**:
 - **`'` - Toggle Row Selection**: Select/deselect current row (marks it for filtering)
@@ -703,7 +782,40 @@ Press `@` to make URLs in the current column clickable:
 - **Scans** all cells in the current column for URLs starting with `http://` or `https://`
 - **Applies** link styling to make them clickable and dataframe remains unchanged
-### 19. Clipboard Operations
+### 19. SQL Interface
+The SQL interface provides two modes for querying your dataframe:
+#### Simple SQL Interface (`l`)
+Select specific columns and apply WHERE conditions without writing full SQL:
+- Choose which columns to include in results
+- Specify WHERE clause for filtering
+- Ideal for quick filtering and column selection
+#### Advanced SQL Interface (`L`)
+Execute complete SQL queries for advanced data manipulation:
+- Write full SQL queries with standard [SQL syntax](https://docs.pola.rs/api/python/stable/reference/sql/index.html)
+- Support for JOINs, GROUP BY, aggregations, and more
+- Access to all SQL capabilities for complex transformations
+- Always use `self` as the table name
+**Examples:**
+```sql
+-- Filter and select specific rows and/or columns
+SELECT name, age FROM self WHERE age > 30
+-- Aggregate with GROUP BY
+SELECT department, COUNT(*) as count, AVG(salary) as avg_salary
+FROM self
+GROUP BY department
+-- Complex filtering with multiple conditions
+SELECT *
+FROM self
+WHERE (age > 25 AND salary > 50000) OR department = 'Management'
+```
+### 20. Clipboard Operations
 Copies value to system clipboard with `pbcopy` on macOS and `xclip` on Linux
@@ -722,6 +834,30 @@ dv pokemon.csv
 # Chain with other command and specify input file format
 cut -d',' -f1,2,3 pokemon.csv | dv -f csv
+# Work with gzipped files
+dv large_dataset.csv.gz
+# CSV file without header row
+dv -H raw_data.csv
+# Skip type inference for faster loading
+dv -I huge_file.csv
+# Skip first 5 lines (comments, metadata)
+dv -L 5 data_with_metadata.csv
+# Skip 1 row after header (units row)
+dv -K 1 data_with_units.csv
+# Complex CSV with comments and units row
+dv -L 3 -K 1 -I messy_scientific_data.csv
+# Combine all options: skip lines, skip after header, no header, no inference, gzipped
+dv -L 2 -K 1 -H -I complex_data.csv.gz
+# Process compressed data from stdin with line skipping
+zcat compressed_data.csv.gz | dv -f csv -L 2
 ```
 ### Multi-File/Tab Examples
@@ -730,8 +866,8 @@ cut -d',' -f1,2,3 pokemon.csv | dv -f csv
 # Open multiple sheets as tabs in a single Excel
 dv sales.xlsx
-# Open multiple files as tabs
-dv pokemon.csv titanic.csv
+# Open multiple files as tabs (including gzipped)
+dv pokemon.csv titanic.csv large_data.csv.gz
 # Start with one file, then open others using Ctrl+O
 dv initial_data.csv

{dataframe_textual-1.2.0 → dataframe_textual-1.4.0}/README.md RENAMED Viewed

@@ -90,6 +90,13 @@ uv run python main.py pokemon.csv
 # Read from stdin (auto-detects format; defaults to TSV if not recognized)
 cat data.tsv | dv
 dv < data.tsv
+# Gzipped files are supported
+dv data.csv.gz
+dv large_dataset.tsv.gz
+# Specify format for gzipped stdin
+zcat data.csv.gz | dv -f csv
 ```
 ### Multi-File Usage - Multiple Tabs
@@ -103,6 +110,9 @@ dv file.xlsx
 # Mix files and stdin (read from stdin, then open file)
 dv data1.tsv < data2.tsv
+# Mix regular and gzipped files
+dv data1.csv data2.csv.gz data3.tsv.gz
 ```
 When multiple files are opened:
@@ -112,6 +122,67 @@ When multiple files are opened:
 - Close the current tab with `Ctrl+W`
 - Each file maintains its own state (edits, sort order, selections, history, etc.)
+## Command Line Options
+```
+usage: dv [-h] [-f {csv,excel,tsv,parquet,json,ndjson}] [-H] [-I] [-L SKIP_LINES] [-K SKIP_ROWS_AFTER_HEADER] [-U NULL [NULL ...]] [files ...]
+Interactive terminal based viewer/editor for tabular data (e.g., CSV/Excel).
+positional arguments:
+  files                 Files to view (or read from stdin)
+options:
+  -h, --help            show this help message and exit
+  -f, --format {csv,excel,tsv,parquet,json,ndjson}
+                        Specify the format of the input files
+  -H, --no-header       Specify that input files have no header row
+  -I, --no-inferrence   Do not infer data types when reading CSV/TSV
+  -L, --skip-lines SKIP_LINES
+                        Skip lines when reading CSV/TSV (default: 0)
+  -K, --skip-rows-after-header SKIP_ROWS_AFTER_HEADER
+                        Skip rows after header when reading CSV/TSV (default: 0)
+  -U, --null NULL [NULL ...]
+                        Values to interpret as null values when reading CSV/TSV
+```
+### CLI Examples
+```bash
+# View CSV file without header row
+dv -H data_no_header.csv
+# Disable type inference for faster loading
+dv -I large_data.csv
+# Skip first 3 lines of file (e.g., comments, metadata)
+dv -L 3 data_with_comments.csv
+# Skip 1 row after header (e.g., units row)
+dv -K 1 data_with_units.csv
+# Treat specific values as null/missing (e.g., 'NA', 'N/A', '-')
+dv -U NA N/A - data.csv
+# Multiple null values with different formats
+dv -U NULL NA "" "Not Available" messy_data.csv
+# Complex CSV with comments and units row
+dv -L 3 -K 1 -I messy_scientific_data.csv
+# Combine all options: skip lines, skip after header, no header, no inference, gzipped
+dv -L 2 -K 1 -H -I complex_data.csv.gz
+# Process compressed data from stdin with line skipping
+zcat compressed_data.csv.gz | dv -f csv -L 2
+# CSV with custom null values and no header
+dv -H -U NA "N/A" "-" raw_data.csv
+# Skip lines, specify null values, and disable type inference
+dv -L 5 -U NA "" data_with_metadata.csv
+```
 ## Keyboard Shortcuts
 ### App-Level Controls
@@ -122,7 +193,7 @@ When multiple files are opened:
 |-----|--------|
 | `Ctrl+O` | Open file in a new tab |
 | `Ctrl+W` | Close current tab |
-| `Ctrl+Shift+S` | Save all open tabs to Excel file |
+| `Ctrl+A` | Save all open tabs to Excel file |
 | `>` or `b` | Move to next tab |
 | `<` | Move to previous tab |
 | `B` | Toggle tab bar visibility |
@@ -132,7 +203,7 @@ When multiple files are opened:
 | Key | Action |
 |-----|--------|
-| `Ctrl+H` | Toggle help panel |
+| `F1` | Toggle help panel |
 | `k` | Cycle through themes |
 ---
@@ -150,6 +221,8 @@ When multiple files are opened:
 | `Home` / `End` | Jump to first/last column in current row |
 | `Ctrl + Home` / `Ctrl + End` | Jump to top/bottom in current page |
 | `PageDown` / `PageUp` | Scroll down/up one page |
+| `Ctrl+F` | Page down |
+| `Ctrl+B` | Page up |
 #### Viewing & Display
@@ -161,6 +234,7 @@ When multiple files are opened:
 | `S` | Show statistics for entire dataframe |
 | `K` | Cycle cursor type: cell → row → column → cell |
 | `~` | Toggle row labels |
+| `_` (underscore) | Expand column to full width |
 #### Data Editing
@@ -173,8 +247,6 @@ When multiple files are opened:
 | `a` | Add empty column after current |
 | `A` | Add column with name and value/expression |
 | `-` (minus) | Delete current column |
-| `_` (underscore) | Delete current column and all columns after |
-| `Ctrl+-` | Delete current column and all columns before |
 | `x` | Delete current row |
 | `X` | Delete current row and all rows below |
 | `Ctrl+X` | Delete current row and all rows above |
@@ -202,12 +274,19 @@ When multiple files are opened:
 | `v` | View only rows by selected rows and/or matches or cursor value |
 | `V` | View only rows by expression |
+#### SQL Interface
+| Key | Action |
+|-----|--------|
+| `l` | Simple SQL interface (select columns & WHERE clause) |
+| `L` | Advanced SQL interface (full SQL queries) |
 #### Find & Replace
 | Key | Action |
 |-----|--------|
-| `f` | Find across all columns with cursor value |
-| `Ctrl+F` | Find across all columns with expression |
+| `;` | Find across all columns with cursor value |
+| `:` | Find across all columns with expression |
 | `r` | Find and replace in current column (interactive or replace all) |
 | `R` | Find and replace across all columns (interactive or replace all) |
@@ -283,8 +362,8 @@ The application provides multiple search modes for different use cases:
 **Find Operations** - Find by value/expression:
 - **`/` - Column Find**: Find cursor value within current column
 - **`?` - Column Expression Find**: Open dialog to search current column with expression
-- **`f` - Global Find**: Find cursor value across all columns
-- **`Ctrl+f` - Global Expression Find**: Open dialog to search all columns with expression
+- **`;` - Global Find**: Find cursor value across all columns
+- **`:` - Global Expression Find**: Open dialog to search all columns with expression
 **Selection & Filtering**:
 - **`'` - Toggle Row Selection**: Select/deselect current row (marks it for filtering)
@@ -664,7 +743,40 @@ Press `@` to make URLs in the current column clickable:
 - **Scans** all cells in the current column for URLs starting with `http://` or `https://`
 - **Applies** link styling to make them clickable and dataframe remains unchanged
-### 19. Clipboard Operations
+### 19. SQL Interface
+The SQL interface provides two modes for querying your dataframe:
+#### Simple SQL Interface (`l`)
+Select specific columns and apply WHERE conditions without writing full SQL:
+- Choose which columns to include in results
+- Specify WHERE clause for filtering
+- Ideal for quick filtering and column selection
+#### Advanced SQL Interface (`L`)
+Execute complete SQL queries for advanced data manipulation:
+- Write full SQL queries with standard [SQL syntax](https://docs.pola.rs/api/python/stable/reference/sql/index.html)
+- Support for JOINs, GROUP BY, aggregations, and more
+- Access to all SQL capabilities for complex transformations
+- Always use `self` as the table name
+**Examples:**
+```sql
+-- Filter and select specific rows and/or columns
+SELECT name, age FROM self WHERE age > 30
+-- Aggregate with GROUP BY
+SELECT department, COUNT(*) as count, AVG(salary) as avg_salary
+FROM self
+GROUP BY department
+-- Complex filtering with multiple conditions
+SELECT *
+FROM self
+WHERE (age > 25 AND salary > 50000) OR department = 'Management'
+```
+### 20. Clipboard Operations
 Copies value to system clipboard with `pbcopy` on macOS and `xclip` on Linux
@@ -683,6 +795,30 @@ dv pokemon.csv
 # Chain with other command and specify input file format
 cut -d',' -f1,2,3 pokemon.csv | dv -f csv
+# Work with gzipped files
+dv large_dataset.csv.gz
+# CSV file without header row
+dv -H raw_data.csv
+# Skip type inference for faster loading
+dv -I huge_file.csv
+# Skip first 5 lines (comments, metadata)
+dv -L 5 data_with_metadata.csv
+# Skip 1 row after header (units row)
+dv -K 1 data_with_units.csv
+# Complex CSV with comments and units row
+dv -L 3 -K 1 -I messy_scientific_data.csv
+# Combine all options: skip lines, skip after header, no header, no inference, gzipped
+dv -L 2 -K 1 -H -I complex_data.csv.gz
+# Process compressed data from stdin with line skipping
+zcat compressed_data.csv.gz | dv -f csv -L 2
 ```
 ### Multi-File/Tab Examples
@@ -691,8 +827,8 @@ cut -d',' -f1,2,3 pokemon.csv | dv -f csv
 # Open multiple sheets as tabs in a single Excel
 dv sales.xlsx
-# Open multiple files as tabs
-dv pokemon.csv titanic.csv
+# Open multiple files as tabs (including gzipped)
+dv pokemon.csv titanic.csv large_data.csv.gz
 # Start with one file, then open others using Ctrl+O
 dv initial_data.csv

{dataframe_textual-1.2.0 → dataframe_textual-1.4.0}/pyproject.toml RENAMED Viewed

@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
 [project]
 name = "dataframe-textual"
-version = "1.2.0"
+version = "1.4.0"
 description = "Interactive terminal viewer/editor for tabular data"
 readme = "README.md"
 requires-python = ">=3.11"
@@ -34,7 +34,7 @@ classifiers = [
 ]
 dependencies = [
     "polars>=1.34.0",
-    "textual>=6.5.0",
+    "textual[syntax]>=6.5.0",
 ]
 [project.urls]
@@ -69,4 +69,5 @@ dev = [
 ]
 [project.scripts]
+dataframe-textual = "dataframe_textual.__main__:main"
 dv = "dataframe_textual.__main__:main"

dataframe_textual-1.4.0/src/dataframe_textual/__main__.py ADDED Viewed

@@ -0,0 +1,90 @@
+"""Entry point for running DataFrameViewer as a module."""
+import argparse
+import sys
+from pathlib import Path
+from .common import SUPPORTED_FORMATS, load_dataframe
+from .data_frame_viewer import DataFrameViewer
+def cli() -> argparse.Namespace:
+    """Parse command-line arguments.
+    Determines input files or stdin and validates file existence
+    """
+    parser = argparse.ArgumentParser(
+        prog="dv",
+        description="Interactive terminal based viewer/editor for tabular data (e.g., CSV/Excel).",
+        formatter_class=argparse.RawDescriptionHelpFormatter,
+        epilog="Examples:\n"
+        "  %(prog)s data.csv\n"
+        "  %(prog)s file1.csv file2.csv file3.csv\n"
+        "  %(prog)s data.xlsx  (opens each sheet in separate tab)\n"
+        "  cat data.csv | %(prog)s --format csv\n",
+    )
+    parser.add_argument("files", nargs="*", help="Files to view (or read from stdin)")
+    parser.add_argument(
+        "-f",
+        "--format",
+        choices=SUPPORTED_FORMATS,
+        help="Specify the format of the input files (csv, excel, tsv etc.)",
+    )
+    parser.add_argument(
+        "-H",
+        "--no-header",
+        action="store_true",
+        help="Specify that input files have no header row when reading CSV/TSV",
+    )
+    parser.add_argument(
+        "-I", "--no-inferrence", action="store_true", help="Do not infer data types when reading CSV/TSV"
+    )
+    parser.add_argument(
+        "-C", "--comment-prefix", nargs="?", const="#", help="Comment lines are skipped when reading CSV/TSV"
+    )
+    parser.add_argument("-L", "--skip-lines", type=int, default=0, help="Skip lines when reading CSV/TSV")
+    parser.add_argument(
+        "-K", "--skip-rows-after-header", type=int, default=0, help="Skip rows after header when reading CSV/TSV"
+    )
+    parser.add_argument("-U", "--null", nargs="+", help="Values to interpret as null values when reading CSV/TSV")
+    args = parser.parse_args()
+    if args.files is None:
+        args.files = []
+    # Check if reading from stdin (pipe or redirect)
+    if not sys.stdin.isatty():
+        args.files.append("-")
+    else:
+        # Validate all files exist
+        for filename in args.files:
+            if not Path(filename).exists():
+                print(f"File not found: {filename}")
+                sys.exit(1)
+    if not args.files:
+        parser.print_help()
+        sys.exit(1)
+    return args
+def main() -> None:
+    """Run the DataFrame Viewer application."""
+    args = cli()
+    sources = load_dataframe(
+        args.files,
+        file_format=args.format,
+        has_header=not args.no_header,
+        infer_schema=not args.no_inferrence,
+        comment_prefix=args.comment_prefix,
+        skip_lines=args.skip_lines,
+        skip_rows_after_header=args.skip_rows_after_header,
+        null_values=args.null,
+    )
+    app = DataFrameViewer(*sources)
+    app.run()
+if __name__ == "__main__":
+    main()

dataframe-textual 1.2.0__tar.gz → 1.4.0__tar.gz

dataframe-textual 1.2.0tar.gz → 1.4.0tar.gz