deepcsv 0.6.2b2__tar.gz → 0.6.3__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- deepcsv-0.6.3/CHANGELOG.md +21 -0
- {deepcsv-0.6.2b2/deepcsv.egg-info → deepcsv-0.6.3}/PKG-INFO +37 -18
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/README.md +24 -8
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/deepcsv/deepcsv.py +23 -10
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/deepcsv/utils.py +84 -3
- {deepcsv-0.6.2b2 → deepcsv-0.6.3/deepcsv.egg-info}/PKG-INFO +37 -18
- deepcsv-0.6.3/deepcsv.egg-info/SOURCES.txt +29 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.5.0-py3-none-any.whl +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.5.0.tar.gz +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.5.0b1-py3-none-any.whl +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.5.0b1.tar.gz +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.0-py3-none-any.whl +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.0.tar.gz +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.1-py3-none-any.whl +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.1.tar.gz +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.2-py3-none-any.whl +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.2.tar.gz +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.2b1-py3-none-any.whl +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.2b1.tar.gz +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.2b2-py3-none-any.whl +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.2b2.tar.gz +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.3b1-py3-none-any.whl +0 -0
- deepcsv-0.6.3/deepcsv.egg-info/dist/deepcsv-0.6.3b1.tar.gz +0 -0
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/setup.py +1 -1
- deepcsv-0.6.2b2/CHANGELOG.md +0 -18
- deepcsv-0.6.2b2/deepcsv.egg-info/SOURCES.txt +0 -13
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/LICENSE +0 -0
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/MANIFEST.in +0 -0
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/deepcsv/__init__.py +0 -0
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/deepcsv.egg-info/dependency_links.txt +0 -0
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/deepcsv.egg-info/requires.txt +0 -0
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/deepcsv.egg-info/top_level.txt +0 -0
- {deepcsv-0.6.2b2 → deepcsv-0.6.3}/setup.cfg +0 -0
deepcsv-0.6.3/CHANGELOG.md ADDED
@@ -0,0 +1,21 @@
+# Changelog
+
+---
+
+### Added
+
+- `process_file` — Added docs & examples for the new parameters
+- `process_all_files` — Added docs & examples for the new parameters
+- `process_file` & `process_all_files` — Added new `to_list` parameter to return real Python lists if you don't need arrays
+- Added `auto_fix` function for automatic data type correction in DataFrames with mixed dtypes.
+- Added a log to `auto_fix` to track the changes made to columns.
+- Added documentation for the `auto_fix` function.
+
+---
+
+### Changed
+
+- `process_file()` — Changed the `save_file_extension` parameter to `file_format`
+- `process_all_files()` — Changed the `file_extension` parameter to `file_format`
+
+---
{deepcsv-0.6.2b2/deepcsv.egg-info → deepcsv-0.6.3}/PKG-INFO
@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: deepcsv
-Version: 0.6.2b2
+Version: 0.6.3
 Summary: Automatically processes data files in directories, converts array-like strings to NumPy arrays, detects and fixes data type issues, and saves results as optimized Parquet files and MORE!
 Home-page: https://github.com/abdubakr77/deepcsv
 Author: Abdullah Bakr
@@ -50,6 +50,7 @@ Your column has numbers — it secretly has 3 different data types.
 You have 200 CSV files across 40 folders — and you process them one by one.
 You load a file and spend 20 minutes just picking the right reader.
 You have nulls scattered everywhere with no clean way to handle them.
+You have a lot of mixed-dtype columns with no way to handle them one by one.
 
 This is the silent killer of every data pipeline.
 
@@ -65,7 +66,7 @@ This is the silent killer of every data pipeline.
 - Catches mixed-type columns and fixes them automatically
 - Saves everything in any format you choose — not just Parquet
 - Reads any file format with one function — no more picking the right reader
-- Cleans nulls with full control over columns, rows, indexes, values, and
+- Cleans nulls with full control over columns, rows, indexes, values, and fixes mixed dtypes
 
 ---
 
@@ -79,7 +80,7 @@ pip install deepcsv
 
 ## Functions
 
-### `process_file(data_input,
+### `process_file(data_input, file_format=str, to_list=False)`
 
 Reads a file or DataFrame, converts array-like strings to NumPy arrays, fixes mixed-type columns, and optionally saves the result in any format you choose.
 
@@ -90,17 +91,20 @@ import deepcsv
 df = deepcsv.process_file('path/to/file.csv')
 
 # Process and save as parquet
-df = deepcsv.process_file('path/to/file.csv',
+df = deepcsv.process_file('path/to/file.csv', file_format='parquet')
 
 # Process and save as Excel
-df = deepcsv.process_file('path/to/file.csv',
+df = deepcsv.process_file('path/to/file.csv', file_format='xlsx')
+
+# Process and convert array columns to real Python lists
+df = deepcsv.process_file('path/to/file.csv', to_list=True)
 ```
 
-**Supported
+**Supported file formats:** `.csv` `.tsv` `.txt` `.xlsx` `.json` `.parquet` `.pkl` `.feather` `.html` `.xml`
 
 ---
 
-### `process_all_files(directory_path, output_dir="All CSV Files is Converted Here",
+### `process_all_files(directory_path, output_dir="All CSV Files is Converted Here", file_format="parquet", to_list=False)`
 
 Walks through all folders and subfolders, applies `process_file` on every supported file, and saves results in the format you choose.
 
@@ -121,7 +125,7 @@ deepcsv.process_all_files('path/to/folder', file_extension='csv')
 
 ---
 
-### `read_any(file_path)`
+### `read_any(file_path)`
 
 Reads any supported file format and returns a pandas DataFrame — one function for everything.
 
@@ -138,7 +142,7 @@ df = read_any('local.db')
 
 ---
 
-### `clean_values(data_input, ...)`
+### `clean_values(data_input, ...)`
 
 Cleans a DataFrame by removing nulls, specific values, specific types, or rows by index — with full control over which columns to target and optional conditions.
 
@@ -182,6 +186,17 @@ df = clean_values('data.csv', all_cols_except=['id', 'name'])
 
 ---
 
+### `auto_fix(data_input)`
+
+Automatic data type correction for DataFrames with mixed dtypes, with a log that tracks the changes made to each column.
+
+```python
+from deepcsv import auto_fix
+
+df = auto_fix('My_Data')
+```
+---
+
 ## Function Signatures
 
 ```python
@@ -189,6 +204,7 @@ process_file(data_input: Union[str, pd.DataFrame], save_file_extension: str = No
 process_all_files(directory_path: str, output_dir: str = "All CSV Files is Converted Here", file_extension: str = "parquet") -> None
 read_any(file_path: str) -> pd.DataFrame
 clean_values(data_input, cols=None, ax_0=False, index=None, condition=None, all_cols_except=None, finding_value=None, finding_type=None) -> pd.DataFrame
+auto_fix(data_input: Union[str, pd.DataFrame])
 ```
 
 ---
@@ -228,16 +244,19 @@ clean_values(data_input, cols=None, ax_0=False, index=None, condition=None, all_
 ---
 
 ### Added
-
-- `
-- `
-- `
-- `
--
--
+
+- `process_file` — Added docs & examples for the new parameters
+- `process_all_files` — Added docs & examples for the new parameters
+- `process_file` & `process_all_files` — Added new `to_list` parameter to return real Python lists if you don't need arrays
+- Added `auto_fix` function for automatic data type correction in DataFrames with mixed dtypes.
+- Added a log to `auto_fix` to track the changes made to columns.
+- Added documentation for the `auto_fix` function.
+
+---
 
 ### Changed
-
-- `
+
+- `process_file()` — Changed the `save_file_extension` parameter to `file_format`
+- `process_all_files()` — Changed the `file_extension` parameter to `file_format`
 
 ---
{deepcsv-0.6.2b2 → deepcsv-0.6.3}/README.md
@@ -14,6 +14,7 @@ Your column has numbers — it secretly has 3 different data types.
 You have 200 CSV files across 40 folders — and you process them one by one.
 You load a file and spend 20 minutes just picking the right reader.
 You have nulls scattered everywhere with no clean way to handle them.
+You have a lot of mixed-dtype columns with no way to handle them one by one.
 
 This is the silent killer of every data pipeline.
 
@@ -29,7 +30,7 @@ This is the silent killer of every data pipeline.
 - Catches mixed-type columns and fixes them automatically
 - Saves everything in any format you choose — not just Parquet
 - Reads any file format with one function — no more picking the right reader
-- Cleans nulls with full control over columns, rows, indexes, values, and
+- Cleans nulls with full control over columns, rows, indexes, values, and fixes mixed dtypes
 
 ---
 
@@ -43,7 +44,7 @@ pip install deepcsv
 
 ## Functions
 
-### `process_file(data_input,
+### `process_file(data_input, file_format=str, to_list=False)`
 
 Reads a file or DataFrame, converts array-like strings to NumPy arrays, fixes mixed-type columns, and optionally saves the result in any format you choose.
 
@@ -54,17 +55,20 @@ import deepcsv
 df = deepcsv.process_file('path/to/file.csv')
 
 # Process and save as parquet
-df = deepcsv.process_file('path/to/file.csv',
+df = deepcsv.process_file('path/to/file.csv', file_format='parquet')
 
 # Process and save as Excel
-df = deepcsv.process_file('path/to/file.csv',
+df = deepcsv.process_file('path/to/file.csv', file_format='xlsx')
+
+# Process and convert array columns to real Python lists
+df = deepcsv.process_file('path/to/file.csv', to_list=True)
 ```
 
-**Supported
+**Supported file formats:** `.csv` `.tsv` `.txt` `.xlsx` `.json` `.parquet` `.pkl` `.feather` `.html` `.xml`
 
 ---
 
-### `process_all_files(directory_path, output_dir="All CSV Files is Converted Here",
+### `process_all_files(directory_path, output_dir="All CSV Files is Converted Here", file_format="parquet", to_list=False)`
 
 Walks through all folders and subfolders, applies `process_file` on every supported file, and saves results in the format you choose.
 
@@ -85,7 +89,7 @@ deepcsv.process_all_files('path/to/folder', file_extension='csv')
 
 ---
 
-### `read_any(file_path)`
+### `read_any(file_path)`
 
 Reads any supported file format and returns a pandas DataFrame — one function for everything.
 
@@ -102,7 +106,7 @@ df = read_any('local.db')
 
 ---
 
-### `clean_values(data_input, ...)`
+### `clean_values(data_input, ...)`
 
 Cleans a DataFrame by removing nulls, specific values, specific types, or rows by index — with full control over which columns to target and optional conditions.
 
@@ -146,6 +150,17 @@ df = clean_values('data.csv', all_cols_except=['id', 'name'])
 
 ---
 
+### `auto_fix(data_input)`
+
+Automatic data type correction for DataFrames with mixed dtypes, with a log that tracks the changes made to each column.
+
+```python
+from deepcsv import auto_fix
+
+df = auto_fix('My_Data')
+```
+---
+
 ## Function Signatures
 
 ```python
@@ -153,6 +168,7 @@ process_file(data_input: Union[str, pd.DataFrame], save_file_extension: str = No
 process_all_files(directory_path: str, output_dir: str = "All CSV Files is Converted Here", file_extension: str = "parquet") -> None
 read_any(file_path: str) -> pd.DataFrame
 clean_values(data_input, cols=None, ax_0=False, index=None, condition=None, all_cols_except=None, finding_value=None, finding_type=None) -> pd.DataFrame
+auto_fix(data_input: Union[str, pd.DataFrame])
 ```
 
 ---
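Worth noting for readers of the new `auto_fix` README entry: the example passes the bare string `'My_Data'`, but per the function's own docstring (see the utils.py diff below) `data_input` should be a readable file path or an existing DataFrame. A hedged usage sketch, with a hypothetical file name:

```python
import pandas as pd
from deepcsv import auto_fix

df = auto_fix("data/mixed_types.csv")                      # from a file path
df = auto_fix(pd.DataFrame({"price": [1.0, "2.5", 3.0]}))  # or an in-memory DataFrame
```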
{deepcsv-0.6.2b2 → deepcsv-0.6.3}/deepcsv/deepcsv.py
@@ -9,14 +9,18 @@ from os.path import join,relpath,dirname,isfile,isdir
 from warnings import filterwarnings
 filterwarnings("ignore")
 
-def process_file(data_input: Union[str, pd.DataFrame] ,
+def process_file(data_input: Union[str, pd.DataFrame] , file_format= str, to_list = False) -> pd.DataFrame:
     """
     Parses string representations of lists in DataFrame columns to actual NumPy arrays.
 
     Parameters
     ----------
-    data_input
-
+    data_input: str or pd.DataFrame
+        Path to the CSV/XLSX file or an existing DataFrame.
+    file_format: str
+        Saves a DataFrame to a file with the specified format.
+    to_list: False -> (Array) is better
+             True -> it will convert to list
 
     Returns
     -------
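One thing the new signature flags for review: `file_format` defaults to the `str` type object itself rather than a string value such as `None` or `""` (the hunk headers show the pre-change code had the same pattern with `save_file_extension=str`). If that default ever reaches the string methods used later in the function, it fails. A minimal sketch of the assumed code path:

```python
file_format = str  # the default from the signature above: a type, not a string

try:
    file_format.strip().lower()  # str.strip() has no instance to operate on
except TypeError as err:
    print(f"TypeError: {err}")   # raised before any format check can run
```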
@@ -28,6 +32,8 @@ def process_file(data_input: Union[str, pd.DataFrame] , save_file_extension=str)
     --------
     >>> df = process_file('path/to/file.csv')
     >>> df = process_file(my_dataframe)
+    >>> df = process_file(my_dataframe , save_format="parquet")
+    >>> df = process_file(my_dataframe , save_format="parquet", to_list = True)
     """
 
     try:
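Note that these new doctest lines pass `save_format=`, while the keyword actually added to the signature above is `file_format=`; run as-is they would raise `TypeError: ... got an unexpected keyword argument 'save_format'`. A corrected form, assuming the signature in this diff:

```python
df = process_file(my_dataframe, file_format="parquet")
df = process_file(my_dataframe, file_format="parquet", to_list=True)
```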
@@ -54,16 +60,20 @@ def process_file(data_input: Union[str, pd.DataFrame] , save_file_extension=str)
             print("System : Done!")
 
         elif isinstance(First_Value , str) and First_Value.strip().startswith("["):
-
-
+            if to_list:
+                data[f"{ColName.capitalize()}List"] = data[ColName].apply(lambda x : list(literal_eval(x)) if pd.notna(x) else nan)
+            else:
+                data[f"{ColName.capitalize()}List"] = data[ColName].apply(lambda x : array(literal_eval(x)) if pd.notna(x) else nan)
            data.drop(ColName,inplace=True,axis=1)
 
-    if
-        _save_as(data=data,ext=
+    if file_format.strip().lower() in ['csv','txt','tsv','xls','xlsx','json','parquet','pkl','feather','db','sqlite']:
+        _save_as(data=data,ext=file_format)
+
+
     return data
 
 
-def process_all_files(directory_path: str, output_dir="All CSV Files is Converted Here",
+def process_all_files(directory_path: str, output_dir="All CSV Files is Converted Here",file_format= "parquet",to_list = False) -> None:
     """
     Recursively processes all CSV and XLSX files in a directory,
     converts array strings to NumPy arrays, and saves as Parquet files.
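For context on what the new `to_list` branch above does: both paths parse an array-like string with `ast.literal_eval`, and the flag only decides the container type of the result. A self-contained sketch on a toy column (the DataFrame here is illustrative; `process_file` additionally renames and drops the source column as shown above):

```python
from ast import literal_eval

import numpy as np
import pandas as pd

df = pd.DataFrame({"scores": ["[1, 2, 3]", "[4, 5]", None]})

# to_list=False (default): each parsed value becomes a NumPy array
as_arrays = df["scores"].apply(lambda x: np.array(literal_eval(x)) if pd.notna(x) else np.nan)

# to_list=True: each parsed value stays a plain Python list
as_lists = df["scores"].apply(lambda x: list(literal_eval(x)) if pd.notna(x) else np.nan)

print(type(as_arrays[0]))  # <class 'numpy.ndarray'>
print(type(as_lists[0]))   # <class 'list'>
```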
@@ -74,6 +84,8 @@ def process_all_files(directory_path: str, output_dir="All CSV Files is Converte
         Root directory path to search for CSV/XLSX files.
     output_dir : str, default 'All CSV Files is Converted Here'
         Folder name where converted files will be saved.
+    file_format: str
+        Saves a DataFrame to a file with the specified format for every file.
 
     Returns
     -------
@@ -83,6 +95,7 @@ def process_all_files(directory_path: str, output_dir="All CSV Files is Converte
     --------
     >>> process_all_files('/path/to/directory')
     >>> process_all_files('/path/to/directory', output_dir="Converted Files")
+    >>> process_all_files('/path/to/directory', output_dir="Converted Files", file_format="tsv")
     """
 
     base_output = join(directory_path, output_dir)
@@ -113,8 +126,8 @@ def process_all_files(directory_path: str, output_dir="All CSV Files is Converte
                 print(Sub_Item_Path)
                 makedirs(dirname(output),exist_ok=True)
                 _save_as(data=df_converted,
-                         current_dir=output.replace(f".{Sub_Item_Path.split(".")[-1].strip().lower()}", f".{
-                         ext=
+                         current_dir=output.replace(f".{Sub_Item_Path.split(".")[-1].strip().lower()}", f".{file_format}"),
+                         ext=file_format,to_list=to_list)
 
         elif isdir(Sub_Item_Path):
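One caveat with the `current_dir=output.replace(...)` pattern above: `str.replace` substitutes every occurrence of the old extension anywhere in the path, not only the trailing suffix. A hedged sketch of the difference, using a deliberately awkward hypothetical path:

```python
from pathlib import Path

output = "converted/reports.csv/summary.csv"  # a folder name that also ends in .csv

# The diff's approach: every ".csv" in the path is rewritten
print(output.replace(".csv", ".parquet"))
# converted/reports.parquet/summary.parquet

# A suffix-only alternative
print(Path(output).with_suffix(".parquet"))
# converted/reports.csv/summary.parquet
```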
{deepcsv-0.6.2b2 → deepcsv-0.6.3}/deepcsv/utils.py
@@ -110,8 +110,8 @@ def _save_as(data: pd.DataFrame, current_dir = str(Path.cwd()), ext= str) -> Non
 
     Examples
     --------
-    >>>
-    >>>
+    >>> _save_as(df, "data/myfile", ".parquet")
+    >>> _save_as(df, "data/myfile", ".csv")
     """
     ext = ext.strip().lower()
     if not ext.startswith("."):
@@ -143,6 +143,14 @@ def _save_as(data: pd.DataFrame, current_dir = str(Path.cwd()), ext= str) -> Non
     print("-"*50)
 
 
+def _val_dtype(x,dtype):
+    if dtype == str:
+        return str(x)
+    elif dtype == float:
+        return float(x)
+    else:
+        return bool(x)
+
 # ──────────────────────────────────────────────
 # PUBLIC FUNCTIONS
 # ──────────────────────────────────────────────
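`_val_dtype` is the coercion primitive that the new `auto_fix` below leans on: it maps one value onto one of the three supported target types, and any failure (e.g. `float('abc')`) propagates up as the exception that triggers `auto_fix`'s fallback. A quick behavioral sketch; the helper is re-declared here so the snippet runs standalone:

```python
def _val_dtype(x, dtype):
    # Same logic as the helper added above
    if dtype == str:
        return str(x)
    elif dtype == float:
        return float(x)
    else:
        return bool(x)

print(_val_dtype("3.5", float))  # 3.5
print(_val_dtype(3.5, str))      # '3.5'
print(_val_dtype(0, bool))       # False

try:
    _val_dtype("abc", float)     # non-numeric string
except ValueError:
    print("float() failed; this is the case auto_fix's fallback handles")
```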
@@ -299,4 +307,77 @@ def clean_values(data_input: Union[str, pd.DataFrame],
     else:
         data.dropna(axis=1, inplace=True)
 
-    return data
+    return data
+
+
+
+def auto_fix(data_input: Union[str, pd.DataFrame]):
+    """
+    Automatically detects and fixes columns with mixed data types in a DataFrame.
+
+    This function scans each column for mixed data types (columns containing exactly
+    2 different Python types) and attempts to convert all values to the most common
+    type. If the primary conversion fails, it falls back to the secondary type.
+
+    Parameters
+    ----------
+    data_input : str or pd.DataFrame
+        File path to read from or an existing DataFrame to process.
+
+    Returns
+    -------
+    pd.DataFrame
+        DataFrame with mixed-type columns automatically converted to consistent types.
+
+    Notes
+    -----
+    - Only processes columns with exactly 2 different data types
+    - Attempts conversion to the most frequent type first
+    - Falls back to the less frequent type if primary conversion fails
+    - Prints progress messages for each column being processed
+    - Supported target types: str, float, bool
+
+    Examples
+    --------
+    >>> df = auto_fix('data/mixed_types.csv')
+    Found a column (price) Have mixed DTypes!
+    This Col Have These DTypes: [<class 'str'>, <class 'float'>]
+    NOW TRYING TO FIX!
+    Done!
+    -----------------------------------
+
+    >>> df = auto_fix(my_dataframe)
+    """
+
+    try:
+        df = read_any(data_input)
+    except Exception:
+        df = data_input
+
+
+    for ColName in df.columns:
+        if len(df[ColName].apply(type).unique()) == 2:
+            print(f"Found a column ({ColName}) Have mixed DTypes!")
+            print(f"This Col Have These DTypes: {df[ColName].apply(type).unique()}\nNOW TRYING TO FIX!")
+
+            dtype_dict = dict(df[ColName].apply(type).value_counts().to_dict())
+            dtype_values_list = [dtype_value for dtype_value in dtype_dict.values()]
+            dtype = None
+            try:
+                for dtype_name in dtype_dict.keys():
+                    if dtype_dict[dtype_name] == max(dtype_values_list):
+                        dtype = dtype_name
+
+                df[ColName] = df[ColName].apply(lambda x: _val_dtype(x,dtype))
+
+            except:
+                for dtype_name in dtype_dict.keys():
+                    if dtype_dict[dtype_name] == min(dtype_values_list):
+                        dtype = dtype_name
+
+                df[ColName] = df[ColName].apply(lambda x: _val_dtype(x,dtype))
+            print("Done!")
+            print("—"*35)
+    return df
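To make the majority-vote-with-fallback behavior concrete, here is a hedged, self-contained re-creation of `auto_fix`'s core loop on a single toy column (simplified: no file reading, no log output, and the package's bare `except:` narrowed to the exceptions type coercion actually raises):

```python
import pandas as pd

def _coerce(x, target):
    # The three target types the diff's _val_dtype supports
    if target == str:
        return str(x)
    if target == float:
        return float(x)
    return bool(x)

df = pd.DataFrame({"price": [1.0, 2.5, "3.75", 4.0]})  # 3 floats, 1 str

counts = df["price"].apply(type).value_counts()        # index holds the type objects
try:
    df["price"] = df["price"].apply(lambda x: _coerce(x, counts.idxmax()))
except (TypeError, ValueError):
    # Fallback: retry with the minority type, as auto_fix does
    df["price"] = df["price"].apply(lambda x: _coerce(x, counts.idxmin()))

print(df["price"].apply(type).unique())               # [<class 'float'>]
```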
{deepcsv-0.6.2b2 → deepcsv-0.6.3/deepcsv.egg-info}/PKG-INFO
Identical changes to the PKG-INFO diff above; the egg-info copy mirrors the top-level package metadata.
deepcsv-0.6.3/deepcsv.egg-info/SOURCES.txt ADDED
@@ -0,0 +1,29 @@
+CHANGELOG.md
+LICENSE
+MANIFEST.in
+README.md
+setup.py
+deepcsv/__init__.py
+deepcsv/deepcsv.py
+deepcsv/utils.py
+deepcsv.egg-info/PKG-INFO
+deepcsv.egg-info/SOURCES.txt
+deepcsv.egg-info/dependency_links.txt
+deepcsv.egg-info/requires.txt
+deepcsv.egg-info/top_level.txt
+deepcsv.egg-info/dist/deepcsv-0.5.0-py3-none-any.whl
+deepcsv.egg-info/dist/deepcsv-0.5.0.tar.gz
+deepcsv.egg-info/dist/deepcsv-0.5.0b1-py3-none-any.whl
+deepcsv.egg-info/dist/deepcsv-0.5.0b1.tar.gz
+deepcsv.egg-info/dist/deepcsv-0.6.0-py3-none-any.whl
+deepcsv.egg-info/dist/deepcsv-0.6.0.tar.gz
+deepcsv.egg-info/dist/deepcsv-0.6.1-py3-none-any.whl
+deepcsv.egg-info/dist/deepcsv-0.6.1.tar.gz
+deepcsv.egg-info/dist/deepcsv-0.6.2-py3-none-any.whl
+deepcsv.egg-info/dist/deepcsv-0.6.2.tar.gz
+deepcsv.egg-info/dist/deepcsv-0.6.2b1-py3-none-any.whl
+deepcsv.egg-info/dist/deepcsv-0.6.2b1.tar.gz
+deepcsv.egg-info/dist/deepcsv-0.6.2b2-py3-none-any.whl
+deepcsv.egg-info/dist/deepcsv-0.6.2b2.tar.gz
+deepcsv.egg-info/dist/deepcsv-0.6.3b1-py3-none-any.whl
+deepcsv.egg-info/dist/deepcsv-0.6.3b1.tar.gz
deepcsv-0.6.3/deepcsv.egg-info/dist/* (the 16 bundled wheels and sdists listed above, deepcsv-0.5.0 through deepcsv-0.6.3b1): binary files, no diff shown.
{deepcsv-0.6.2b2 → deepcsv-0.6.3}/setup.py
@@ -8,7 +8,7 @@ changelog = (this_directory / "CHANGELOG.md").read_text(encoding="utf-8")
 
 setup(
     name="deepcsv",
-    version="0.6.2b2",
+    version="0.6.3",
     author="Abdullah Bakr",
     author_email="abdubakora1232@gmail.com",
     description="Automatically processes data files in directories, converts array-like strings to NumPy arrays, detects and fixes data type issues, and saves results as optimized Parquet files and MORE!",
deepcsv-0.6.2b2/CHANGELOG.md DELETED
@@ -1,18 +0,0 @@
-# Changelog
-
----
-
-### Added
-- `process_all_files` — Added option for user to customize the output folder name in
-- `read_any()` — Reads any supported file format and returns a pandas DataFrame automatically. Supports: `.csv`, `.txt`, `.tsv`, `.xls`, `.xlsx`, `.json`, `.parquet`, `.pkl`, `.feather`, `.db`, `.sqlite`
-- `clean_values()` — Cleans a DataFrame by removing nulls, specific values, specific types, or rows by index. Supports optional condition filtering with 6 operators
-- `_validate_cols()` — Internal helper: validates cols is a non-empty list and all columns exist in the DataFrame
-- `_validate_index()` — Internal helper: validates index is a non-empty list and all indexes exist in the DataFrame. Supports optional `reset_index` before validation
-- `_validate_condition()` — Internal helper: validates condition list and returns `(operator_func, value)`
-- `_parse_operator()` — Internal helper: converts operator string like `'>='` into its Python operator function
-
-### Changed
-- `process_file()` — Added `save_file_extension` parameter. Now supports saving the processed DataFrame in any format after conversion, not just returning it
-- `process_all_files()` — Added `file_extension` parameter. Now supports saving converted files in any format instead of always saving as Parquet. Also expanded supported input formats beyond `.csv` and `.xlsx` to cover all formats supported by `read_any()`
-
----
deepcsv-0.6.2b2/deepcsv.egg-info/SOURCES.txt DELETED
@@ -1,13 +0,0 @@
-CHANGELOG.md
-LICENSE
-MANIFEST.in
-README.md
-setup.py
-deepcsv/__init__.py
-deepcsv/deepcsv.py
-deepcsv/utils.py
-deepcsv.egg-info/PKG-INFO
-deepcsv.egg-info/SOURCES.txt
-deepcsv.egg-info/dependency_links.txt
-deepcsv.egg-info/requires.txt
-deepcsv.egg-info/top_level.txt
Files without changes: LICENSE, MANIFEST.in, deepcsv/__init__.py, deepcsv.egg-info/dependency_links.txt, deepcsv.egg-info/requires.txt, deepcsv.egg-info/top_level.txt, setup.cfg.