informatica-python 1.9.7.tar.gz → 1.9.9.tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (31)

{informatica_python-1.9.7 → informatica_python-1.9.9}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: informatica-python
-Version: 1.9.7
+Version: 1.9.9
 Summary: Convert Informatica PowerCenter workflow XML to Python/PySpark code
 Author: Nick
 License: MIT
@@ -124,7 +124,7 @@ The code generator produces real, runnable Python for these transformation types
 - **Expression** — Field-level expressions converted to vectorized pandas operations (`df["COL"]` style) with 40+ vectorized function handlers
 - **Filter** — Row filtering with vectorized converted conditions
 - **Joiner** — `pd.merge()` with join type and condition parsing (inner/left/right/outer)
-- **Lookup** — `pd.merge()` lookups with connection-aware DB reads, multiple match policies, default values, `$$PARAM` substitution
+- **Lookup** — `pd.merge()` lookups with connection-aware DB reads, multiple match policies, default values, `$$PARAM` substitution, SQL override support, table caching via `lookup_func()`
 - **Aggregator** — `groupby().agg()` with SUM/COUNT/AVG/MIN/MAX/FIRST/LAST, computed aggregates
 - **Sorter** — `sort_values()` with multi-key ascending/descending per-field direction from SORTDIRECTION attribute
 - **Router** — Multi-group conditional routing with named groups
@@ -196,7 +196,7 @@ Column-level pandas operations instead of row-level iteration. The expression co
 - `REPLACECHR/REPLACESTR` → `.str.replace()`
 - `REG_EXTRACT/REG_REPLACE` → `.str.extract()/.str.replace(regex=True)`
 - `CHR(code)` → `chr(int(code))`
-- `||` concatenation → `+` with `.astype(str)` on non-literals
+- `||` concatenation → `+` with smart coercion: `.fillna('').astype(str)` for Series, `str()` for scalars
 **Date/Time:**
 - `TO_DATE(val, fmt)` → `pd.to_datetime()` with Informatica→Python format conversion
@@ -343,10 +343,12 @@ Target field datatypes are mapped to pandas types and generate proper casting co
 - Decimals/Floats: `pd.to_numeric(errors='coerce')`
 - Booleans: `.astype('boolean')`
-### Flat File Handling (v1.3+)
+### Flat File Handling (v1.3+, enhanced v1.9.8)
 Parses FLATFILE metadata for delimiter, fixed-width, header lines, skip rows, quote/escape chars. Generates `pd.read_fwf()` for fixed-width or enriched `read_file()` for delimited.
+**Fixed-width enhancements (v1.9.8):** `OFFSET`, `PHYSICALLENGTH`, and `PHYSICALOFFSET` are parsed from `SOURCEFIELD` attributes. `physical_length` is preferred over `precision` for accurate column width calculations in `pd.read_fwf()`.
 ### Mapplet Inlining (v1.3+)
 Expands Mapplet instances into prefixed transforms, rewires connectors, and eliminates duplication.
@@ -371,12 +373,17 @@ The generated `helper_functions.py` provides a complete runtime library:
 ### Database Operations
 | Function | Description |
 |----------|-------------|
-| `get_db_connection(config, conn_name)` | Create DB connection (pyodbc/pymssql/sqlalchemy fallback for MSSQL) |
+| `get_db_connection(config, conn_name)` | SQLAlchemy-first DB connection with engine caching and connection pooling; DBAPI fallback for pyodbc/pymssql |
 | `read_from_db(config, query, conn_name)` | Execute SQL query and return DataFrame |
 | `write_to_db(config, df, table, conn_name)` | Write DataFrame to database table via `.to_sql()` |
-| `execute_sql(config, sql, conn_name)` | Execute DDL/DML statement (INSERT, UPDATE, DELETE) |
+| `execute_sql(config, sql, conn_name)` | Execute DDL/DML statement; auto-detects SQLAlchemy vs DBAPI via `dialect` attribute |
 | `write_with_update_strategy(config, df, table, ...)` | Split rows by `_update_strategy` column into INSERT/UPDATE/DELETE/REJECT operations |
 | `call_stored_procedure(config, proc, params, ...)` | Execute stored procedure with input/output parameter mapping (Oracle/MSSQL/generic) |
+| `lookup_func(table, *args)` | Full lookup implementation with table caching, condition parsing, and default value support |
+| `resolve_env(value)` | Resolve `${VAR}` placeholders from environment variables with config fallback |
+| `resolve_builtin_variable(var_name, ...)` | Resolve `$PMMappingName`, `$PMSessionName`, `$PMFolderName`, etc. |
+| `rename_with_duplicates(df, col_map)` | Safe column rename supporting one-source-to-many-target mapping |
+| `_safe_close(conn)` | Safe connection cleanup handling both SQLAlchemy and raw DBAPI connections |
 ### File Operations
 | Function | Description |
@@ -407,7 +414,34 @@ The generated `helper_functions.py` provides a complete runtime library:
 ## Changelog
-### v1.9.3 (Current)
+### v1.9.8 (Current)
+- **NOT(expr) function-call form**: `NOT(ISNULL(x))` now correctly converts to `~(df["x"].isna())` — handles both `NOT ` (with space) and `NOT(` (without space) forms
+- **AND/OR/NOT as field names fix**: Logical operators no longer mangled into `df["AND"]` / `df["OR"]` — conversion moved before field substitution in both `_vec_recursive` fallback and `_vectorize_simple`
+- **Condition tokenizer word-boundary fix**: `_split_condition_tokens` no longer splits on `OR` inside field names like `DeletedIndicator` — verifies preceding character is a real word boundary
+- **`$PMMappingName` in expressions**: `$PM*` built-in variables in expression context properly convert to `resolve_builtin_variable("PMMappingName")` instead of being mangled to `$df["PMMappingName"]`
+- **TO_CHAR arithmetic parenthesization**: `TO_CHAR(TO_INTEGER(x) - 1)` now produces `(pd.to_numeric(...) - 1).astype(str)` instead of incorrect `- 1.astype(str)` binding
+- **String literal early-return fix**: Expressions like `'PER_' || X || '_suffix'` no longer short-circuit as a single string literal
+- **Fixed-width file enhancements**: `OFFSET`, `PHYSICALLENGTH`, `PHYSICALOFFSET` parsed from SOURCEFIELD XML; `physical_length` preferred over `precision` for `read_fwf` column widths
+- **Smart concat coercion**: Scalar returns (e.g. `resolve_builtin_variable()`, `get_variable()`) use `str()` wrapping; Series use `.fillna('').astype(str)`
+- **700 tests** passing
+### v1.9.5 / v1.9.6
+- **`rename_with_duplicates`** helper for one-source-to-many-target column mapping
+- **`resolve_env()`** for `${VAR}` placeholder resolution (env → config fallback)
+- **`resolve_builtin_variable()`** for `$PMMappingName`, `$PMSessionName`, `$PMFolderName`, etc.
+- **SQLAlchemy-first `get_db_connection`**: Engine caching and connection pooling; DBAPI fallback for pyodbc/pymssql
+- **`_safe_close()`**: Safe connection cleanup handling both SQLAlchemy and raw DBAPI connections
+- **Full `lookup_func()` implementation**: Table caching, condition parsing, default value support
+- **Null-safe `||` concatenation**: `.fillna('').astype(str)` prevents "nan" strings in concatenation
+- **`$PM*` variable substitution in SQL Override queries**
+- **`execute_sql` dialect detection**: Uses `dialect` attribute to choose SQLAlchemy `text()` vs DBAPI `cursor.execute()`
+- **678 tests** passing
+### v1.9.4
+- Extended expression function coverage and edge-case fixes
+- Improved mapplet and connector handling
+### v1.9.3
 - **Smart target write detection**: Bare targets default to `write_to_db()` instead of `write_file()`; file extension allowlist (`.csv`, `.dat`, `.txt`, `.xml`, `.json`, `.parquet`, `.xlsx`, `.xls`, `.tsv`, `.avro`) for file targets; schema-qualified names (`dbo.TABLE`) correctly route to database
 - **DECODE vectorization**: `DECODE(TRUE, cond1, val1, ..., default)` → nested `np.where()` chains; value-matching DECODE; handles IN() conditions and complex boolean nesting
 - **IS_SPACES vectorization**: `IS_SPACES(field)` → `field.str.strip().eq("")`
@@ -495,7 +529,7 @@ The generated `helper_functions.py` provides a complete runtime library:
 cd informatica_python
 pip install -e ".[dev]"
-# Run tests (663 tests)
+# Run tests (700 tests)
 pytest tests/ -v
 ```
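
The fixed-width note in the hunks above says `physical_length` is preferred over `precision` when computing column widths. A minimal sketch of that preference follows. It assumes each parsed field is a plain dict with hypothetical keys `physical_length`, `physical_offset`, and `precision`; the package's real data model and function names are not shown in this diff, so `fwf_colspecs` is illustrative only.

```python
from typing import List, Tuple


def fwf_colspecs(fields: List[dict]) -> List[Tuple[int, int]]:
    """Build half-open (start, end) column spans for a fixed-width reader.

    Hypothetical helper: key names are assumptions, not the package's API.
    """
    colspecs = []
    cursor = 0  # running offset, used when a field carries no physical_offset
    for field in fields:
        # Prefer the on-disk width; fall back to the logical precision.
        width = field.get("physical_length") or field.get("precision", 0)
        start = field.get("physical_offset", cursor)
        colspecs.append((start, start + width))
        cursor = start + width
    return colspecs


fields = [
    {"name": "ID", "precision": 10, "physical_length": 5, "physical_offset": 0},
    {"name": "NAME", "precision": 20},  # no physical_length: precision is used
]
print(fwf_colspecs(fields))  # [(0, 5), (5, 25)]
```

The resulting list has the shape that the `colspecs` parameter of `pd.read_fwf()` accepts (half-open `(from, to)` intervals), which is why the sketch returns tuples rather than widths.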

{informatica_python-1.9.7 → informatica_python-1.9.9}/README.md RENAMED

@@ -97,7 +97,7 @@ The code generator produces real, runnable Python for these transformation types
 - **Expression** — Field-level expressions converted to vectorized pandas operations (`df["COL"]` style) with 40+ vectorized function handlers
 - **Filter** — Row filtering with vectorized converted conditions
 - **Joiner** — `pd.merge()` with join type and condition parsing (inner/left/right/outer)
-- **Lookup** — `pd.merge()` lookups with connection-aware DB reads, multiple match policies, default values, `$$PARAM` substitution
+- **Lookup** — `pd.merge()` lookups with connection-aware DB reads, multiple match policies, default values, `$$PARAM` substitution, SQL override support, table caching via `lookup_func()`
 - **Aggregator** — `groupby().agg()` with SUM/COUNT/AVG/MIN/MAX/FIRST/LAST, computed aggregates
 - **Sorter** — `sort_values()` with multi-key ascending/descending per-field direction from SORTDIRECTION attribute
 - **Router** — Multi-group conditional routing with named groups
@@ -169,7 +169,7 @@ Column-level pandas operations instead of row-level iteration. The expression co
 - `REPLACECHR/REPLACESTR` → `.str.replace()`
 - `REG_EXTRACT/REG_REPLACE` → `.str.extract()/.str.replace(regex=True)`
 - `CHR(code)` → `chr(int(code))`
-- `||` concatenation → `+` with `.astype(str)` on non-literals
+- `||` concatenation → `+` with smart coercion: `.fillna('').astype(str)` for Series, `str()` for scalars
 **Date/Time:**
 - `TO_DATE(val, fmt)` → `pd.to_datetime()` with Informatica→Python format conversion
@@ -316,10 +316,12 @@ Target field datatypes are mapped to pandas types and generate proper casting co
 - Decimals/Floats: `pd.to_numeric(errors='coerce')`
 - Booleans: `.astype('boolean')`
-### Flat File Handling (v1.3+)
+### Flat File Handling (v1.3+, enhanced v1.9.8)
 Parses FLATFILE metadata for delimiter, fixed-width, header lines, skip rows, quote/escape chars. Generates `pd.read_fwf()` for fixed-width or enriched `read_file()` for delimited.
+**Fixed-width enhancements (v1.9.8):** `OFFSET`, `PHYSICALLENGTH`, and `PHYSICALOFFSET` are parsed from `SOURCEFIELD` attributes. `physical_length` is preferred over `precision` for accurate column width calculations in `pd.read_fwf()`.
 ### Mapplet Inlining (v1.3+)
 Expands Mapplet instances into prefixed transforms, rewires connectors, and eliminates duplication.
@@ -344,12 +346,17 @@ The generated `helper_functions.py` provides a complete runtime library:
 ### Database Operations
 | Function | Description |
 |----------|-------------|
-| `get_db_connection(config, conn_name)` | Create DB connection (pyodbc/pymssql/sqlalchemy fallback for MSSQL) |
+| `get_db_connection(config, conn_name)` | SQLAlchemy-first DB connection with engine caching and connection pooling; DBAPI fallback for pyodbc/pymssql |
 | `read_from_db(config, query, conn_name)` | Execute SQL query and return DataFrame |
 | `write_to_db(config, df, table, conn_name)` | Write DataFrame to database table via `.to_sql()` |
-| `execute_sql(config, sql, conn_name)` | Execute DDL/DML statement (INSERT, UPDATE, DELETE) |
+| `execute_sql(config, sql, conn_name)` | Execute DDL/DML statement; auto-detects SQLAlchemy vs DBAPI via `dialect` attribute |
 | `write_with_update_strategy(config, df, table, ...)` | Split rows by `_update_strategy` column into INSERT/UPDATE/DELETE/REJECT operations |
 | `call_stored_procedure(config, proc, params, ...)` | Execute stored procedure with input/output parameter mapping (Oracle/MSSQL/generic) |
+| `lookup_func(table, *args)` | Full lookup implementation with table caching, condition parsing, and default value support |
+| `resolve_env(value)` | Resolve `${VAR}` placeholders from environment variables with config fallback |
+| `resolve_builtin_variable(var_name, ...)` | Resolve `$PMMappingName`, `$PMSessionName`, `$PMFolderName`, etc. |
+| `rename_with_duplicates(df, col_map)` | Safe column rename supporting one-source-to-many-target mapping |
+| `_safe_close(conn)` | Safe connection cleanup handling both SQLAlchemy and raw DBAPI connections |
 ### File Operations
 | Function | Description |
@@ -380,7 +387,34 @@ The generated `helper_functions.py` provides a complete runtime library:
 ## Changelog
-### v1.9.3 (Current)
+### v1.9.8 (Current)
+- **NOT(expr) function-call form**: `NOT(ISNULL(x))` now correctly converts to `~(df["x"].isna())` — handles both `NOT ` (with space) and `NOT(` (without space) forms
+- **AND/OR/NOT as field names fix**: Logical operators no longer mangled into `df["AND"]` / `df["OR"]` — conversion moved before field substitution in both `_vec_recursive` fallback and `_vectorize_simple`
+- **Condition tokenizer word-boundary fix**: `_split_condition_tokens` no longer splits on `OR` inside field names like `DeletedIndicator` — verifies preceding character is a real word boundary
+- **`$PMMappingName` in expressions**: `$PM*` built-in variables in expression context properly convert to `resolve_builtin_variable("PMMappingName")` instead of being mangled to `$df["PMMappingName"]`
+- **TO_CHAR arithmetic parenthesization**: `TO_CHAR(TO_INTEGER(x) - 1)` now produces `(pd.to_numeric(...) - 1).astype(str)` instead of incorrect `- 1.astype(str)` binding
+- **String literal early-return fix**: Expressions like `'PER_' || X || '_suffix'` no longer short-circuit as a single string literal
+- **Fixed-width file enhancements**: `OFFSET`, `PHYSICALLENGTH`, `PHYSICALOFFSET` parsed from SOURCEFIELD XML; `physical_length` preferred over `precision` for `read_fwf` column widths
+- **Smart concat coercion**: Scalar returns (e.g. `resolve_builtin_variable()`, `get_variable()`) use `str()` wrapping; Series use `.fillna('').astype(str)`
+- **700 tests** passing
+### v1.9.5 / v1.9.6
+- **`rename_with_duplicates`** helper for one-source-to-many-target column mapping
+- **`resolve_env()`** for `${VAR}` placeholder resolution (env → config fallback)
+- **`resolve_builtin_variable()`** for `$PMMappingName`, `$PMSessionName`, `$PMFolderName`, etc.
+- **SQLAlchemy-first `get_db_connection`**: Engine caching and connection pooling; DBAPI fallback for pyodbc/pymssql
+- **`_safe_close()`**: Safe connection cleanup handling both SQLAlchemy and raw DBAPI connections
+- **Full `lookup_func()` implementation**: Table caching, condition parsing, default value support
+- **Null-safe `||` concatenation**: `.fillna('').astype(str)` prevents "nan" strings in concatenation
+- **`$PM*` variable substitution in SQL Override queries**
+- **`execute_sql` dialect detection**: Uses `dialect` attribute to choose SQLAlchemy `text()` vs DBAPI `cursor.execute()`
+- **678 tests** passing
+### v1.9.4
+- Extended expression function coverage and edge-case fixes
+- Improved mapplet and connector handling
+### v1.9.3
 - **Smart target write detection**: Bare targets default to `write_to_db()` instead of `write_file()`; file extension allowlist (`.csv`, `.dat`, `.txt`, `.xml`, `.json`, `.parquet`, `.xlsx`, `.xls`, `.tsv`, `.avro`) for file targets; schema-qualified names (`dbo.TABLE`) correctly route to database
 - **DECODE vectorization**: `DECODE(TRUE, cond1, val1, ..., default)` → nested `np.where()` chains; value-matching DECODE; handles IN() conditions and complex boolean nesting
 - **IS_SPACES vectorization**: `IS_SPACES(field)` → `field.str.strip().eq("")`
@@ -468,7 +502,7 @@ The generated `helper_functions.py` provides a complete runtime library:
 cd informatica_python
 pip install -e ".[dev]"
-# Run tests (663 tests)
+# Run tests (700 tests)
 pytest tests/ -v
 ```
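
The helper table above describes `resolve_env(value)` as resolving `${VAR}` placeholders from environment variables with a config fallback. The sketch below shows one way such a lookup order could work. The name `resolve_env_sketch`, the explicit `config` argument, and the choice to leave unresolved placeholders untouched are all assumptions made for illustration; the diff does not show the actual implementation.

```python
import os
import re

# Matches ${NAME} where NAME is a conventional environment-variable identifier.
_PLACEHOLDER = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")


def resolve_env_sketch(value, config=None):
    """Replace ${VAR} with os.environ[VAR], then config[VAR], else leave it as-is."""
    config = config or {}

    def _lookup(match):
        name = match.group(1)
        if name in os.environ:  # environment wins
            return os.environ[name]
        if name in config:  # config is the fallback
            return str(config[name])
        return match.group(0)  # assumption: unresolved placeholders stay untouched

    # Non-string values are passed through unchanged.
    return _PLACEHOLDER.sub(_lookup, value) if isinstance(value, str) else value


os.environ["DB_HOST"] = "db01"
print(resolve_env_sketch("${DB_HOST}:${DB_PORT}", {"DB_PORT": 1433}))  # db01:1433
```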

{informatica_python-1.9.7 → informatica_python-1.9.9}/informatica_python/utils/expression_converter.py RENAMED

@@ -883,8 +883,10 @@ def _vec_recursive(expr, df_var):
             v = _vec_recursive(p, df_var)
             if v.startswith("'") and v.endswith("'"):
                 vec_parts.append(v)
-            else:
+            elif v.startswith(df_var + '[') or v.startswith('pd.') or '.str.' in v:
                 vec_parts.append(f'{v}.fillna(\'\').astype(str)')
+            else:
+                vec_parts.append(f'str({v})')
         return " + ".join(vec_parts)
     for func_name in sorted(INFA_FUNC_MAP.keys(), key=lambda x: -len(x)):
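
The hunk above changes how each operand of a `||` concatenation is wrapped in the generated code. The following standalone sketch reproduces only the three-way branch shown in the diff so its effect can be seen in isolation; `coerce_concat_part` is a hypothetical name, and the surrounding `_vec_recursive` logic is not reproduced.

```python
def coerce_concat_part(v: str, df_var: str = "df") -> str:
    """Wrap one already-converted operand so it can be joined with '+'."""
    if v.startswith("'") and v.endswith("'"):
        return v  # string literal: used as-is
    if v.startswith(df_var + "[") or v.startswith("pd.") or ".str." in v:
        # Looks like a pandas Series expression: null-safe string cast.
        return f"{v}.fillna('').astype(str)"
    return f"str({v})"  # anything else is treated as a scalar


parts = ["'PER_'", 'df["X"]', 'resolve_builtin_variable("PMMappingName")']
print(" + ".join(coerce_concat_part(p) for p in parts))
# 'PER_' + df["X"].fillna('').astype(str) + str(resolve_builtin_variable("PMMappingName"))
```

Before this change, the final `else` branch applied `.fillna('').astype(str)` to every non-literal, which would fail at runtime for operands that evaluate to a plain Python value rather than a Series.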

{informatica_python-1.9.7 → informatica_python-1.9.9}/informatica_python.egg-info/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: informatica-python
-Version: 1.9.7
+Version: 1.9.9
 Summary: Convert Informatica PowerCenter workflow XML to Python/PySpark code
 Author: Nick
 License: MIT
@@ -124,7 +124,7 @@ The code generator produces real, runnable Python for these transformation types
 - **Expression** — Field-level expressions converted to vectorized pandas operations (`df["COL"]` style) with 40+ vectorized function handlers
 - **Filter** — Row filtering with vectorized converted conditions
 - **Joiner** — `pd.merge()` with join type and condition parsing (inner/left/right/outer)
-- **Lookup** — `pd.merge()` lookups with connection-aware DB reads, multiple match policies, default values, `$$PARAM` substitution
+- **Lookup** — `pd.merge()` lookups with connection-aware DB reads, multiple match policies, default values, `$$PARAM` substitution, SQL override support, table caching via `lookup_func()`
 - **Aggregator** — `groupby().agg()` with SUM/COUNT/AVG/MIN/MAX/FIRST/LAST, computed aggregates
 - **Sorter** — `sort_values()` with multi-key ascending/descending per-field direction from SORTDIRECTION attribute
 - **Router** — Multi-group conditional routing with named groups
@@ -196,7 +196,7 @@ Column-level pandas operations instead of row-level iteration. The expression co
 - `REPLACECHR/REPLACESTR` → `.str.replace()`
 - `REG_EXTRACT/REG_REPLACE` → `.str.extract()/.str.replace(regex=True)`
 - `CHR(code)` → `chr(int(code))`
-- `||` concatenation → `+` with `.astype(str)` on non-literals
+- `||` concatenation → `+` with smart coercion: `.fillna('').astype(str)` for Series, `str()` for scalars
 **Date/Time:**
 - `TO_DATE(val, fmt)` → `pd.to_datetime()` with Informatica→Python format conversion
@@ -343,10 +343,12 @@ Target field datatypes are mapped to pandas types and generate proper casting co
 - Decimals/Floats: `pd.to_numeric(errors='coerce')`
 - Booleans: `.astype('boolean')`
-### Flat File Handling (v1.3+)
+### Flat File Handling (v1.3+, enhanced v1.9.8)
 Parses FLATFILE metadata for delimiter, fixed-width, header lines, skip rows, quote/escape chars. Generates `pd.read_fwf()` for fixed-width or enriched `read_file()` for delimited.
+**Fixed-width enhancements (v1.9.8):** `OFFSET`, `PHYSICALLENGTH`, and `PHYSICALOFFSET` are parsed from `SOURCEFIELD` attributes. `physical_length` is preferred over `precision` for accurate column width calculations in `pd.read_fwf()`.
 ### Mapplet Inlining (v1.3+)
 Expands Mapplet instances into prefixed transforms, rewires connectors, and eliminates duplication.
@@ -371,12 +373,17 @@ The generated `helper_functions.py` provides a complete runtime library:
 ### Database Operations
 | Function | Description |
 |----------|-------------|
-| `get_db_connection(config, conn_name)` | Create DB connection (pyodbc/pymssql/sqlalchemy fallback for MSSQL) |
+| `get_db_connection(config, conn_name)` | SQLAlchemy-first DB connection with engine caching and connection pooling; DBAPI fallback for pyodbc/pymssql |
 | `read_from_db(config, query, conn_name)` | Execute SQL query and return DataFrame |
 | `write_to_db(config, df, table, conn_name)` | Write DataFrame to database table via `.to_sql()` |
-| `execute_sql(config, sql, conn_name)` | Execute DDL/DML statement (INSERT, UPDATE, DELETE) |
+| `execute_sql(config, sql, conn_name)` | Execute DDL/DML statement; auto-detects SQLAlchemy vs DBAPI via `dialect` attribute |
 | `write_with_update_strategy(config, df, table, ...)` | Split rows by `_update_strategy` column into INSERT/UPDATE/DELETE/REJECT operations |
 | `call_stored_procedure(config, proc, params, ...)` | Execute stored procedure with input/output parameter mapping (Oracle/MSSQL/generic) |
+| `lookup_func(table, *args)` | Full lookup implementation with table caching, condition parsing, and default value support |
+| `resolve_env(value)` | Resolve `${VAR}` placeholders from environment variables with config fallback |
+| `resolve_builtin_variable(var_name, ...)` | Resolve `$PMMappingName`, `$PMSessionName`, `$PMFolderName`, etc. |
+| `rename_with_duplicates(df, col_map)` | Safe column rename supporting one-source-to-many-target mapping |
+| `_safe_close(conn)` | Safe connection cleanup handling both SQLAlchemy and raw DBAPI connections |
 ### File Operations
 | Function | Description |
@@ -407,7 +414,34 @@ The generated `helper_functions.py` provides a complete runtime library:
 ## Changelog
-### v1.9.3 (Current)
+### v1.9.8 (Current)
+- **NOT(expr) function-call form**: `NOT(ISNULL(x))` now correctly converts to `~(df["x"].isna())` — handles both `NOT ` (with space) and `NOT(` (without space) forms
+- **AND/OR/NOT as field names fix**: Logical operators no longer mangled into `df["AND"]` / `df["OR"]` — conversion moved before field substitution in both `_vec_recursive` fallback and `_vectorize_simple`
+- **Condition tokenizer word-boundary fix**: `_split_condition_tokens` no longer splits on `OR` inside field names like `DeletedIndicator` — verifies preceding character is a real word boundary
+- **`$PMMappingName` in expressions**: `$PM*` built-in variables in expression context properly convert to `resolve_builtin_variable("PMMappingName")` instead of being mangled to `$df["PMMappingName"]`
+- **TO_CHAR arithmetic parenthesization**: `TO_CHAR(TO_INTEGER(x) - 1)` now produces `(pd.to_numeric(...) - 1).astype(str)` instead of incorrect `- 1.astype(str)` binding
+- **String literal early-return fix**: Expressions like `'PER_' || X || '_suffix'` no longer short-circuit as a single string literal
+- **Fixed-width file enhancements**: `OFFSET`, `PHYSICALLENGTH`, `PHYSICALOFFSET` parsed from SOURCEFIELD XML; `physical_length` preferred over `precision` for `read_fwf` column widths
+- **Smart concat coercion**: Scalar returns (e.g. `resolve_builtin_variable()`, `get_variable()`) use `str()` wrapping; Series use `.fillna('').astype(str)`
+- **700 tests** passing
+### v1.9.5 / v1.9.6
+- **`rename_with_duplicates`** helper for one-source-to-many-target column mapping
+- **`resolve_env()`** for `${VAR}` placeholder resolution (env → config fallback)
+- **`resolve_builtin_variable()`** for `$PMMappingName`, `$PMSessionName`, `$PMFolderName`, etc.
+- **SQLAlchemy-first `get_db_connection`**: Engine caching and connection pooling; DBAPI fallback for pyodbc/pymssql
+- **`_safe_close()`**: Safe connection cleanup handling both SQLAlchemy and raw DBAPI connections
+- **Full `lookup_func()` implementation**: Table caching, condition parsing, default value support
+- **Null-safe `||` concatenation**: `.fillna('').astype(str)` prevents "nan" strings in concatenation
+- **`$PM*` variable substitution in SQL Override queries**
+- **`execute_sql` dialect detection**: Uses `dialect` attribute to choose SQLAlchemy `text()` vs DBAPI `cursor.execute()`
+- **678 tests** passing
+### v1.9.4
+- Extended expression function coverage and edge-case fixes
+- Improved mapplet and connector handling
+### v1.9.3
 - **Smart target write detection**: Bare targets default to `write_to_db()` instead of `write_file()`; file extension allowlist (`.csv`, `.dat`, `.txt`, `.xml`, `.json`, `.parquet`, `.xlsx`, `.xls`, `.tsv`, `.avro`) for file targets; schema-qualified names (`dbo.TABLE`) correctly route to database
 - **DECODE vectorization**: `DECODE(TRUE, cond1, val1, ..., default)` → nested `np.where()` chains; value-matching DECODE; handles IN() conditions and complex boolean nesting
 - **IS_SPACES vectorization**: `IS_SPACES(field)` → `field.str.strip().eq("")`
@@ -495,7 +529,7 @@ The generated `helper_functions.py` provides a complete runtime library:
 cd informatica_python
 pip install -e ".[dev]"
-# Run tests (663 tests)
+# Run tests (700 tests)
 pytest tests/ -v
 ```
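
The changelog above states that `execute_sql` uses the `dialect` attribute to choose between SQLAlchemy `text()` and DBAPI `cursor.execute()`. A minimal sketch of that kind of duck-typed dispatch is shown here, exercised only on the DBAPI path with the standard-library `sqlite3` module. The function name `execute_sql_sketch` and its two-argument signature are assumptions; the packaged helper takes `(config, sql, conn_name)` and its body is not part of this diff.

```python
import sqlite3


def execute_sql_sketch(conn, sql):
    """Run one statement on either a SQLAlchemy or a raw DBAPI connection."""
    if hasattr(conn, "dialect"):
        # SQLAlchemy connections expose .dialect; raw SQL strings go through text().
        from sqlalchemy import text  # lazy import: the DBAPI path needs no SQLAlchemy
        return conn.execute(text(sql))
    # Raw DBAPI connection: open a cursor and execute directly.
    cursor = conn.cursor()
    cursor.execute(sql)
    return cursor


conn = sqlite3.connect(":memory:")  # a plain DBAPI connection without .dialect
execute_sql_sketch(conn, "CREATE TABLE t (id INTEGER)")
execute_sql_sketch(conn, "INSERT INTO t VALUES (1)")
rows = execute_sql_sketch(conn, "SELECT id FROM t").fetchall()
print(rows)  # [(1,)]
conn.close()
```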

{informatica_python-1.9.7 → informatica_python-1.9.9}/pyproject.toml RENAMED

@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 [project]
 name = "informatica-python"
-version = "1.9.7"
+version = "1.9.9"
 description = "Convert Informatica PowerCenter workflow XML to Python/PySpark code"
 readme = "README.md"
 license = {text = "MIT"}