pvw-cli 1.0.14__tar.gz → 1.2.2__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Potentially problematic release: this version of pvw-cli might be problematic.
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/PKG-INFO +6 -6
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/README.md +4 -4
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/__init__.py +1 -1
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/entity.py +137 -31
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/search.py +235 -33
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/unified_catalog.py +20 -6
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_entity.py +35 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_unified_catalog.py +7 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/pvw_cli.egg-info/PKG-INFO +6 -6
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/pvw_cli.egg-info/requires.txt +1 -1
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/pyproject.toml +2 -5
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/__main__.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/__init__.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/account.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/cli.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/collections.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/domain.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/glossary.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/health.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/insight.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/lineage.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/management.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/policystore.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/relationship.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/scan.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/share.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/types.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/workflow.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/__init__.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_account.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_collections.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_domain.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_glossary.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_health.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_insight.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_lineage.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_management.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_policystore.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_relationship.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_scan.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_search.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_share.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_types.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_workflow.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/api_client.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/business_rules.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/config.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/data_quality.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/endpoint.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/endpoints.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/exceptions.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/lineage_visualization.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/monitoring_dashboard.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/rate_limiter.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/retry_handler.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/scanning_operations.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/settings.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/sync_client.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/plugins/__init__.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/plugins/plugin_system.py +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/pvw_cli.egg-info/SOURCES.txt +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/pvw_cli.egg-info/dependency_links.txt +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/pvw_cli.egg-info/entry_points.txt +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/pvw_cli.egg-info/not-zip-safe +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/pvw_cli.egg-info/top_level.txt +0 -0
- {pvw_cli-1.0.14 → pvw_cli-1.2.2}/setup.cfg +0 -0

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/PKG-INFO

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: pvw-cli
-Version: 1.0.14
+Version: 1.2.2
 Summary: Microsoft Purview CLI with comprehensive automation capabilities
 Author-email: AYOUB KEBAILI <keayoub@msn.com>
 Maintainer-email: AYOUB KEBAILI <keayoub@msn.com>
@@ -34,7 +34,7 @@ Requires-Dist: rich>=12.0.0
 Requires-Dist: requests>=2.28.0
 Requires-Dist: pandas>=1.5.0
 Requires-Dist: aiohttp>=3.8.0
-Requires-Dist: pydantic<
+Requires-Dist: pydantic<2.12,>=1.10.0
 Requires-Dist: PyYAML>=6.0
 Requires-Dist: cryptography<46.0.0,>=41.0.5
 Provides-Extra: dev
@@ -56,7 +56,7 @@ Requires-Dist: pytest-asyncio>=0.20.0; extra == "test"
 Requires-Dist: pytest-cov>=4.0.0; extra == "test"
 Requires-Dist: requests-mock>=1.9.0; extra == "test"
 
-# PURVIEW CLI v1.
+# PURVIEW CLI v1.2.1 - Microsoft Purview Automation & Data Governance
 
 > **LATEST UPDATE (October 2025):**
 > - **NEW: Bulk Term Import/Export** - Import multiple terms from CSV/JSON with dry-run support
@@ -72,7 +72,7 @@ Requires-Dist: requests-mock>=1.9.0; extra == "test"
 
 ## What is PVW CLI?
 
-**PVW CLI v1.
+**PVW CLI v1.2.1** is a modern, full-featured command-line interface and Python library for Microsoft Purview. It enables automation and management of *all major Purview APIs* including:
 
 - **Unified Catalog (UC) Management** - Complete governance domains, glossary terms, data products, OKRs, CDEs
 - **Bulk Operations** - Import/export terms from CSV/JSON, bulk delete scripts with progress tracking
@@ -164,7 +164,7 @@ For more advanced usage, see the documentation in `doc/` or the project docs: ht
 
 ## Overview
 
-**PVW CLI v1.
+**PVW CLI v1.2.1** is a modern command-line interface and Python library for Microsoft Purview, enabling:
 
 - Advanced data catalog search and discovery
 - Bulk import/export of entities, glossary terms, and lineage
@@ -1203,6 +1203,6 @@ See [LICENSE](LICENSE) file for details.
 
 ---
 
-**PVW CLI v1.
+**PVW CLI v1.2.1 empowers data engineers, stewards, and architects to automate, scale, and enhance their Microsoft Purview experience with powerful command-line and programmatic capabilities.**
 
 **Latest Features:** Bulk term import/export, PowerShell integration, multiple output formats, and comprehensive bulk delete scripts with beautiful progress tracking.

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/README.md

@@ -1,4 +1,4 @@
-# PURVIEW CLI v1.
+# PURVIEW CLI v1.2.1 - Microsoft Purview Automation & Data Governance
 
 > **LATEST UPDATE (October 2025):**
 > - **NEW: Bulk Term Import/Export** - Import multiple terms from CSV/JSON with dry-run support
@@ -14,7 +14,7 @@
 
 ## What is PVW CLI?
 
-**PVW CLI v1.
+**PVW CLI v1.2.1** is a modern, full-featured command-line interface and Python library for Microsoft Purview. It enables automation and management of *all major Purview APIs* including:
 
 - **Unified Catalog (UC) Management** - Complete governance domains, glossary terms, data products, OKRs, CDEs
 - **Bulk Operations** - Import/export terms from CSV/JSON, bulk delete scripts with progress tracking
@@ -106,7 +106,7 @@ For more advanced usage, see the documentation in `doc/` or the project docs: ht
 
 ## Overview
 
-**PVW CLI v1.
+**PVW CLI v1.2.1** is a modern command-line interface and Python library for Microsoft Purview, enabling:
 
 - Advanced data catalog search and discovery
 - Bulk import/export of entities, glossary terms, and lineage
@@ -1145,6 +1145,6 @@ See [LICENSE](LICENSE) file for details.
 
 ---
 
-**PVW CLI v1.
+**PVW CLI v1.2.1 empowers data engineers, stewards, and architects to automate, scale, and enhance their Microsoft Purview experience with powerful command-line and programmatic capabilities.**
 
 **Latest Features:** Bulk term import/export, PowerShell integration, multiple output formats, and comprehensive bulk delete scripts with beautiful progress tracking.

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/entity.py

@@ -1689,44 +1689,150 @@ def bulk_update_csv(ctx, csv_file, batch_size, dry_run, error_csv):
             return
 
         df = pd.read_csv(csv_file)
-        if
-            console.print("[
+        if df.empty:
+            console.print("[yellow]No rows found in CSV. Exiting.[/yellow]")
             return
+
         entity_client = Entity()
         total = len(df)
         success, failed = 0, 0
         errors = []
         failed_rows = []
+
+        # Determine mode:
+        # - If CSV has both 'typeName' and 'qualifiedName' -> map rows to Purview entities and call bulk create-or-update
+        # - Else if CSV has 'guid' -> build guid-based payloads (preferred for partial attribute updates)
+        has_type_qn = ("typeName" in df.columns and "qualifiedName" in df.columns)
+        has_guid = "guid" in df.columns
+
         for i in range(0, total, batch_size):
-            batch = df.iloc[i:i+batch_size]
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
+            batch = df.iloc[i : i + batch_size]
+
+            if has_type_qn:
+                # Map flat rows to Purview entity objects using helper
+                from purviewcli.client._entity import map_flat_entity_to_purview_entity
+
+                entities = [map_flat_entity_to_purview_entity(row) for _, row in batch.iterrows()]
+                payload = {"entities": entities}
+
+                if dry_run:
+                    console.print(f"[blue]DRY RUN: Would bulk-create/update batch {i//batch_size+1} with {len(batch)} entities[/blue]")
+                    continue
+
+                with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False, encoding="utf-8") as tmpf:
+                    json.dump(payload, tmpf, indent=2)
+                    tmpf.flush()
+                    payload_file = tmpf.name
+
+                try:
+                    args = {"--payloadFile": payload_file}
+                    result = entity_client.entityCreateBulk(args)
+                    if result and (not isinstance(result, dict) or result.get("status") != "error"):
+                        success += len(batch)
+                    else:
+                        failed += len(batch)
+                        errors.append(f"Batch {i//batch_size+1}: {result}")
+                        failed_rows.extend(batch.to_dict(orient="records"))
+                except Exception as e:
                     failed += len(batch)
-                    errors.append(f"Batch {i//batch_size+1}: {
+                    errors.append(f"Batch {i//batch_size+1}: {str(e)}")
                     failed_rows.extend(batch.to_dict(orient="records"))
-
-
-
-
-
-
+                finally:
+                    try:
+                        os.remove(payload_file)
+                    except Exception:
+                        pass
+
+            elif has_guid:
+                # Build guid-based updates. If the CSV contains only guid + attr columns, we'll attempt to perform
+                # partial attribute updates by calling entityPartialUpdateAttribute where possible.
+                # If a row contains multiple attributes, we will call entityCreateBulk with a payload containing
+                # the guid and attributes (server supports bulk create-or-update by guid in some endpoints).
+
+                # Normalize rows into dicts
+                rows = [row.to_dict() for _, row in batch.iterrows()]
+
+                # Attempt to detect single-attribute update pattern: columns [guid, attrName, attrValue]
+                if set(["guid", "attrName", "attrValue"]).issubset(set(batch.columns)):
+                    # perform per-guid partial updates in batch
+                    for r in rows:
+                        guid = str(r.get("guid"))
+                        attr_name = r.get("attrName")
+                        attr_value = r.get("attrValue")
+                        if pd.isna(guid) or pd.isna(attr_name):
+                            failed += 1
+                            failed_rows.append(r)
+                            continue
+                        if dry_run:
+                            console.print(f"[blue]DRY RUN: Would update GUID {guid} set {attr_name}={attr_value}[/blue]")
+                            success += 1
+                            continue
+                        try:
+                            args = {"--guid": [guid], "--attrName": attr_name, "--attrValue": attr_value}
+                            result = entity_client.entityPartialUpdateAttribute(args)
+                            if result and (not isinstance(result, dict) or result.get("status") != "error"):
+                                success += 1
+                            else:
+                                failed += 1
+                                errors.append(f"GUID {guid}: {result}")
+                                failed_rows.append(r)
+                        except Exception as e:
+                            failed += 1
+                            errors.append(f"GUID {guid}: {str(e)}")
+                            failed_rows.append(r)
+
+                else:
+                    # Fallback: call bulk create-or-update with guid included in each entity object.
+                    # Map each row into an entity dict keeping non-null columns.
+                    entities = []
+                    for r in rows:
+                        if pd.isna(r.get("guid")):
+                            failed_rows.append(r)
+                            failed += 1
+                            continue
+                        ent = {k: v for k, v in r.items() if pd.notnull(v)}
+                        # ensure guid is string under top-level 'guid' field for server bulk endpoints
+                        ent["guid"] = str(ent.get("guid"))
+                        entities.append(ent)
+
+                    if not entities:
+                        continue
+
+                    payload = {"entities": entities}
+                    if dry_run:
+                        console.print(f"[blue]DRY RUN: Would bulk-update (by guid) batch {i//batch_size+1} with {len(entities)} entities[/blue]")
+                        success += len(entities)
+                        continue
+
+                    with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False, encoding="utf-8") as tmpf:
+                        json.dump(payload, tmpf, indent=2)
+                        tmpf.flush()
+                        payload_file = tmpf.name
+
+                    try:
+                        args = {"--payloadFile": payload_file}
+                        # Use the create-or-update bulk endpoint - server will use guid when present
+                        result = entity_client.entityCreateBulk(args)
+                        if result and (not isinstance(result, dict) or result.get("status") != "error"):
+                            success += len(entities)
+                        else:
+                            failed += len(entities)
+                            errors.append(f"Batch {i//batch_size+1}: {result}")
+                            failed_rows.extend(batch.to_dict(orient="records"))
+                    except Exception as e:
+                        failed += len(entities)
+                        errors.append(f"Batch {i//batch_size+1}: {str(e)}")
+                        failed_rows.extend(batch.to_dict(orient="records"))
+                    finally:
+                        try:
+                            os.remove(payload_file)
+                        except Exception:
+                            pass
+
+            else:
+                console.print(f"[red][X] CSV must contain either (typeName and qualifiedName) or guid column[/red]")
+                return
+
         console.print(f"[green][OK] Bulk update completed. Success: {success}, Failed: {failed}[/green]")
         if errors:
             console.print("[red]Errors:[/red]")
@@ -1734,7 +1840,7 @@ def bulk_update_csv(ctx, csv_file, batch_size, dry_run, error_csv):
                 console.print(f"[red]- {err}[/red]")
         if error_csv and failed_rows:
             pd.DataFrame(failed_rows).to_csv(error_csv, index=False)
-            console.print(f"[yellow]
+            console.print(f"[yellow]WARNING: Failed rows written to {error_csv}[/yellow]")
     except Exception as e:
         console.print(f"[red][X] Error executing entity bulk-update-csv: {str(e)}[/red]")
 
@@ -1774,7 +1880,7 @@ def bulk_delete_csv(ctx, csv_file, batch_size, dry_run, error_csv):
                 continue
             try:
                 args = {"--guid": guids}
-                result = entity_client.
+                result = entity_client.entityDeleteBulk(args)
                 if result and (not isinstance(result, dict) or result.get("status") != "error"):
                     success += len(guids)
                 else:
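
The rewritten `bulk-update-csv` dispatches on CSV shape: `typeName` plus `qualifiedName` columns go through the bulk create-or-update path, while a `guid` column (optionally with `attrName`/`attrValue`) goes through partial updates. A minimal sketch of both input shapes, built with pandas; the file names and values here are illustrative, only the column names come from the hunk above:

```python
# Sketch: the two CSV shapes bulk-update-csv dispatches on.
# Column names come from the hunk above; file names and values are illustrative.
import pandas as pd

# Shape 1: typeName + qualifiedName -> rows are mapped to full entities
# and sent through entityCreateBulk (bulk create-or-update).
pd.DataFrame(
    [{"typeName": "azure_sql_table",
      "qualifiedName": "mssql://server/database/SalesLT/Address",
      "description": "Customer address table"}]
).to_csv("entities_by_qualified_name.csv", index=False)

# Shape 2: guid + attrName + attrValue -> each row becomes one
# entityPartialUpdateAttribute call against an existing entity.
pd.DataFrame(
    [{"guid": "00000000-0000-0000-0000-000000000000",
      "attrName": "description",
      "attrValue": "Updated via CSV"}]
).to_csv("updates_by_guid.csv", index=False)
```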

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/search.py

@@ -139,12 +139,8 @@ def _format_search_results(data, show_ids=False):
         table = Table(title=f"Search Results ({len(items)} of {count} total)")
         table.add_column("Name", style="cyan", min_width=15, max_width=25)
         table.add_column("Type", style="green", min_width=15, max_width=20)
+        table.add_column("ID", style="yellow", min_width=36, max_width=36)
         table.add_column("Collection", style="blue", min_width=12, max_width=20)
-        table.add_column("Classifications", style="magenta", min_width=15, max_width=30)
-
-        if show_ids:
-            table.add_column("ID", style="yellow", min_width=36, max_width=36)
-
         table.add_column("Qualified Name", style="white", min_width=30)
 
         for item in items:
@@ -158,34 +154,17 @@ def _format_search_results(data, show_ids=False):
             if len(qualified_name) > 60:
                 qualified_name = qualified_name[:57] + "..."
 
-            # Handle collection
+            # Handle collection - try multiple sources
             collection = 'N/A'
             if 'collection' in item and item['collection']:
                 collection = item['collection'].get('name', 'N/A')
+            elif 'collectionId' in item:
+                collection = item.get('collectionId', 'N/A')
+            elif 'assetName' in item:
+                collection = item.get('assetName', 'N/A')
 
-            #
-
-            if 'classification' in item and item['classification']:
-                for cls in item['classification']:
-                    if isinstance(cls, dict):
-                        cls_name = cls.get('typeName', str(cls))
-                        # Simplify Microsoft classifications for display
-                        if cls_name.startswith('MICROSOFT.'):
-                            cls_name = cls_name.replace('MICROSOFT.', 'MS.')
-                        classifications.append(cls_name)
-                    else:
-                        classifications.append(str(cls))
-
-            # Truncate classifications if too long
-            cls_display = ", ".join(classifications) if classifications else ""
-            if len(cls_display) > 40:
-                cls_display = cls_display[:37] + "..."
-
-            # Build row data
-            row_data = [name, entity_type, collection, cls_display]
-            if show_ids:
-                row_data.append(entity_id)
-            row_data.append(qualified_name)
+            # Build row data with ID always shown
+            row_data = [name, entity_type, entity_id, collection, qualified_name]
 
             # Add row to table
             table.add_row(*row_data)
@@ -214,9 +193,9 @@ def _invoke_search_method(method_name, **kwargs):
     # Choose output format
     if output_json:
         _format_json_output(result)
-    elif detailed and method_name in ['searchQuery', 'searchBrowse', 'searchSuggest', '
+    elif detailed and method_name in ['searchQuery', 'searchBrowse', 'searchSuggest', 'searchAutocomplete', 'searchFaceted']:
        _format_detailed_output(result)
-    elif method_name in ['searchQuery', 'searchBrowse', 'searchSuggest', '
+    elif method_name in ['searchQuery', 'searchBrowse', 'searchSuggest', 'searchAutocomplete', 'searchFaceted']:
         _format_search_results(result, show_ids=show_ids)
     else:
         _format_json_output(result)
@@ -230,7 +209,7 @@ def _invoke_search_method(method_name, **kwargs):
 @click.option('--json', 'output_json', is_flag=True, help='Show full JSON details instead of table')
 def autocomplete(keywords, limit, filterfile, output_json):
     """Autocomplete search suggestions"""
-    _invoke_search_method('
+    _invoke_search_method('searchAutocomplete', keywords=keywords, limit=limit, filterFile=filterfile, output_json=output_json)
 
 @search.command()
 @click.option('--entityType', required=False)
@@ -305,7 +284,7 @@ def advanced(keywords, limit, offset, filterfile, facets_file, businessmetadata,
         with open(businessmetadata, 'r', encoding='utf-8') as f:
             business_metadata_content = json.load(f)
     _invoke_search_method(
-        '
+        'searchAdvanced',
         keywords=keywords,
         limit=limit,
         offset=offset,
@@ -316,4 +295,227 @@ def advanced(keywords, limit, offset, filterfile, facets_file, businessmetadata,
         termAssignments=termassignments
     )
 
+@search.command('find-table')
+@click.option('--name', required=False, help='Table name (e.g., Address)')
+@click.option('--schema', required=False, help='Schema name (e.g., SalesLT, dbo)')
+@click.option('--database', required=False, help='Database name (e.g., Adventureworks)')
+@click.option('--server', required=False, help='Server name (e.g., fabricdemos001.database.windows.net)')
+@click.option('--qualified-name', required=False, help='Full qualified name from Purview (e.g., mssql://server/database/schema/table)')
+@click.option('--entity-type', required=False, help='Entity type to search for (e.g., azure_sql_table, mssql_table)')
+@click.option('--limit', required=False, type=int, default=25, help='Maximum number of results to return')
+@click.option('--show-ids', is_flag=True, help='Show entity IDs in the results')
+@click.option('--json', 'output_json', is_flag=True, help='Show full JSON details')
+@click.option('--detailed', is_flag=True, help='Show detailed information')
+@click.option('--id-only', is_flag=True, help='Output only the GUID (useful for scripts and automation)')
+def find_table(name, schema, database, server, qualified_name, entity_type, limit, show_ids, output_json, detailed, id_only):
+    """Find a table by name, schema, database, or get all tables in a schema/database.
+
+    Perfect for getting the GUID of a data asset before updating it.
+    You can search for ONE specific table or ALL tables in a schema/database.
+
+    \b
+    SEARCH ONE SPECIFIC TABLE:
+        pvw search find-table --name Address --schema SalesLT --database Adventureworks
+        pvw search find-table --qualified-name "mssql://server/database/schema/table"
+
+    \b
+    SEARCH MULTIPLE TABLES:
+        pvw search find-table --schema SalesLT --database Adventureworks
+        pvw search find-table --database Adventureworks
+        pvw search find-table --schema SalesLT
+
+    \b
+    GET GUIDS FOR AUTOMATION:
+        pvw search find-table --name Address --schema SalesLT --database Adventureworks --id-only
+        pvw search find-table --schema SalesLT --database Adventureworks --id-only
+
+    \b
+    USE IN SCRIPTS (PowerShell):
+        $guid = pvw search find-table --name Address --schema SalesLT --database Adventureworks --id-only
+        pvw entity update --guid $guid --payload update.json
+
+        $guids = pvw search find-table --schema SalesLT --database Adventureworks --id-only
+        foreach ($guid in $guids) { pvw entity update --guid $guid --payload update.json }
+    """
+    search_client = Search()
+
+    # Validate that at least some search criteria is provided
+    if not name and not qualified_name and not schema and not database:
+        console.print("[red]ERROR:[/red] You must provide at least --name, --qualified-name, --schema, or --database")
+        return
+
+    # Build search pattern
+    search_pattern = qualified_name
+    if not search_pattern:
+        # Build pattern from components
+        # Try to build a full qualified name pattern that matches Purview's format
+        if server and database and schema and name:
+            # Full path with server: mssql://server/database/schema/table
+            search_pattern = f"mssql://{server}/{database}/{schema}/{name}"
+        elif database and schema and name:
+            # Database.schema.table format
+            search_pattern = f"{database}/{schema}/{name}"
+        elif database and schema:
+            # Database.schema format (all tables in schema)
+            search_pattern = f"{database}/{schema}"
+        elif schema and name:
+            # Schema.table format
+            search_pattern = f"{schema}/{name}"
+        elif database:
+            # Just database (all tables in database)
+            search_pattern = database
+        elif schema:
+            # Just schema (all tables in schema)
+            search_pattern = schema
+        elif name:
+            # Just the table name
+            search_pattern = name
+        else:
+            console.print("[red]ERROR:[/red] You must provide at least one search criterion")
+            return
+
+    # For keyword search, use different strategies based on what we have
+    if name:
+        search_keywords = name
+    elif schema:
+        search_keywords = schema
+    elif database:
+        search_keywords = database
+    else:
+        search_keywords = search_pattern.split('/')[-1]
+
+    # Build search arguments - use keywords that will match
+    args = {
+        '--keywords': search_keywords,
+        '--limit': limit,
+        '--offset': 0
+    }
+
+    # Create filter for entity type if specified
+    import tempfile
+    import json
+    import os
+
+    temp_filter_file = None
+
+    if entity_type:
+        filter_obj = {
+            'entityType': entity_type
+        }
+
+        # Write filter to temp file
+        with tempfile.NamedTemporaryFile(mode='w', suffix='.json', delete=False, encoding='utf-8') as f:
+            json.dump(filter_obj, f)
+            temp_filter_file = f.name
+
+        args['--filterFile'] = temp_filter_file
+
+    try:
+        # Execute search
+        result = search_client.searchQuery(args)
+
+        if not result:
+            console.print("[yellow]No results returned from search[/yellow]")
+            if temp_filter_file:
+                os.unlink(temp_filter_file)
+            return
+
+        # Filter results by qualified name match if provided
+        if result and 'value' in result and result['value']:
+            filtered_results = []
+            search_lower = search_pattern.lower()
+
+            for item in result.get('value', []):
+                item_qn = item.get('qualifiedName', '').lower()
+                item_name = item.get('name', '').lower()
+
+                # Build matching criteria
+                matches = False
+
+                # If we have all components, do strict matching
+                if name and schema and database:
+                    # Exact name match (not substring) - critical for precision
+                    name_match = name.lower() == item_name
+                    schema_match = schema.lower() in item_qn
+                    database_match = database.lower() in item_qn
+                    server_match = not server or server.lower() in item_qn
+                    matches = name_match and schema_match and database_match and server_match
+
+                # If we have database and schema (all tables in this schema)
+                elif database and schema and not name:
+                    schema_match = schema.lower() in item_qn
+                    database_match = database.lower() in item_qn
+                    server_match = not server or server.lower() in item_qn
+                    matches = schema_match and database_match and server_match
+
+                # If we have schema and name
+                elif name and schema:
+                    # Exact name match
+                    name_match = name.lower() == item_name
+                    schema_match = schema.lower() in item_qn
+                    matches = name_match and schema_match
+
+                # If we have just database (all tables in this database)
+                elif database and not name and not schema:
+                    database_match = database.lower() in item_qn
+                    server_match = not server or server.lower() in item_qn
+                    matches = database_match and server_match
+
+                # If we have just schema (all tables in this schema)
+                elif schema and not name and not database:
+                    schema_match = schema.lower() in item_qn
+                    matches = schema_match
+
+                # If we have just name or a qualified name pattern
+                elif name or qualified_name:
+                    # If qualified_name was provided, do exact match
+                    if qualified_name:
+                        # Check for exact match of the qualified name
+                        matches = search_lower == item_qn or item_qn.endswith('/' + search_keywords.lower())
+                    else:
+                        # Just name provided, match by name
+                        matches = search_keywords.lower() == item_name
+
+                if matches:
+                    filtered_results.append(item)
+
+            if filtered_results:
+                result['value'] = filtered_results
+                result['@search.count'] = len(filtered_results)
+            else:
+                console.print(f"[yellow]No results found matching '{search_pattern}'[/yellow]")
+                if temp_filter_file:
+                    os.unlink(temp_filter_file)
+                return
+
+        # Display results
+        if id_only:
+            # Output only the ID(s) for scripting purposes
+            if result and 'value' in result and result['value']:
+                for item in result['value']:
+                    print(item.get('id', ''))
+            else:
+                console.print("[yellow]No results found[/yellow]")
+        elif output_json:
+            _format_json_output(result)
+        elif detailed:
+            _format_detailed_output(result)
+        else:
+            _format_search_results(result, show_ids=show_ids)
+
+        # Clean up temp file
+        if temp_filter_file:
+            import os
+            os.unlink(temp_filter_file)
+
+    except Exception as e:
+        console.print(f"[red]ERROR:[/red] {str(e)}")
+        # Clean up temp file on error
+        if temp_filter_file:
+            import os
+            try:
+                os.unlink(temp_filter_file)
+            except:
+                pass
+
 __all__ = ['search']
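
When `--entity-type` is passed, `find-table` narrows the query by writing a one-key filter file and handing it to `searchQuery` via `--filterFile`, as the hunk above shows. A standalone sketch of that round trip; the entityType value is an example taken from the option help text, and the commented call assumes the `Search` client imported in this module:

```python
# Sketch: the filter-file round trip find-table performs for --entity-type.
# The entityType value is an example from the option help text.
import json
import os
import tempfile

with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False, encoding="utf-8") as f:
    json.dump({"entityType": "azure_sql_table"}, f)
    temp_filter_file = f.name

args = {"--keywords": "Address", "--limit": 25, "--offset": 0, "--filterFile": temp_filter_file}
# result = Search().searchQuery(args)  # same call the command issues
os.unlink(temp_filter_file)  # the command removes the temp file afterwards
```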

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/cli/unified_catalog.py

@@ -813,6 +813,7 @@ def term():
 @click.option("--name", required=True, help="Name of the glossary term")
 @click.option("--description", required=False, default="", help="Rich text description of the term")
 @click.option("--domain-id", required=True, help="Governance domain ID")
+@click.option("--parent-id", required=False, help="Parent term ID (for hierarchical terms)")
 @click.option(
     "--status",
     required=False,
@@ -834,7 +835,7 @@ def term():
 )
 @click.option("--resource-name", required=False, help="Resource name for additional reading (can be specified multiple times)", multiple=True)
 @click.option("--resource-url", required=False, help="Resource URL for additional reading (can be specified multiple times)", multiple=True)
-def create(name, description, domain_id, status, acronym, owner_id, resource_name, resource_url):
+def create(name, description, domain_id, parent_id, status, acronym, owner_id, resource_name, resource_url):
     """Create a new Unified Catalog term (Governance Domain term)."""
     try:
         client = UnifiedCatalogClient()
@@ -847,6 +848,8 @@ def create(name, description, domain_id, status, acronym, owner_id, resource_nam
             "--status": [status],
         }
 
+        if parent_id:
+            args["--parent-id"] = [parent_id]
         if acronym:
             args["--acronym"] = list(acronym)
         if owner_id:
@@ -1037,6 +1040,7 @@ def delete(term_id, force):
 @click.option("--name", required=False, help="Name of the glossary term")
 @click.option("--description", required=False, help="Rich text description of the term")
 @click.option("--domain-id", required=False, help="Governance domain ID")
+@click.option("--parent-id", required=False, help="Parent term ID (for hierarchical terms)")
 @click.option(
     "--status",
     required=False,
@@ -1059,7 +1063,7 @@ def delete(term_id, force):
 @click.option("--resource-url", required=False, help="Resource URL for additional reading (can be specified multiple times, replaces existing)", multiple=True)
 @click.option("--add-acronym", required=False, help="Add acronym to existing ones (can be specified multiple times)", multiple=True)
 @click.option("--add-owner-id", required=False, help="Add owner to existing ones (can be specified multiple times)", multiple=True)
-def update(term_id, name, description, domain_id, status, acronym, owner_id, resource_name, resource_url, add_acronym, add_owner_id):
+def update(term_id, name, description, domain_id, parent_id, status, acronym, owner_id, resource_name, resource_url, add_acronym, add_owner_id):
     """Update an existing Unified Catalog term."""
     try:
         client = UnifiedCatalogClient()
@@ -1073,6 +1077,8 @@ def update(term_id, name, description, domain_id, status, acronym, owner_id, res
             args["--description"] = [description]
         if domain_id:
             args["--governance-domain-id"] = [domain_id]
+        if parent_id:
+            args["--parent-id"] = [parent_id]
         if status:
             args["--status"] = [status]
 
@@ -1386,7 +1392,7 @@ def update_terms_from_csv(csv_file, dry_run):
     """Bulk update glossary terms from a CSV file.
 
     CSV Format:
-    term_id,name,description,status,acronyms,owner_ids,add_acronyms,add_owner_ids
+    term_id,name,description,status,parent_id,acronyms,owner_ids,add_acronyms,add_owner_ids
 
     Required:
     - term_id: The ID of the term to update
@@ -1395,15 +1401,16 @@ def update_terms_from_csv(csv_file, dry_run):
     - name: New term name (replaces existing)
     - description: New description (replaces existing)
     - status: New status (Draft, Published, Archived)
+    - parent_id: Parent term ID for hierarchical relationships (replaces existing)
     - acronyms: New acronyms separated by semicolons (replaces all existing)
     - owner_ids: New owner IDs separated by semicolons (replaces all existing)
     - add_acronyms: Acronyms to add separated by semicolons (preserves existing)
    - add_owner_ids: Owner IDs to add separated by semicolons (preserves existing)
 
     Example CSV:
-    term_id,name,description,status,add_acronyms,add_owner_ids
-    abc-123,,Updated description,Published,API;REST,user1@company.com
-    def-456,New Name,,,SQL,
+    term_id,name,description,status,parent_id,add_acronyms,add_owner_ids
+    abc-123,,Updated description,Published,parent-term-guid,API;REST,user1@company.com
+    def-456,New Name,,,parent-term-guid,SQL,
     """
     import csv
 
@@ -1440,6 +1447,8 @@ def update_terms_from_csv(csv_file, dry_run):
                 changes.append(f"desc: {update['description'][:50]}...")
             if update.get('status', '').strip():
                 changes.append(f"status: {update['status']}")
+            if update.get('parent_id', '').strip():
+                changes.append(f"parent: {update['parent_id'][:20]}...")
             if update.get('acronyms', '').strip():
                 changes.append(f"acronyms: {update['acronyms']}")
             if update.get('add_acronyms', '').strip():
@@ -1479,6 +1488,8 @@ def update_terms_from_csv(csv_file, dry_run):
                 args['--description'] = [update['description'].strip()]
             if update.get('status', '').strip():
                 args['--status'] = [update['status'].strip()]
+            if update.get('parent_id', '').strip():
+                args['--parent-id'] = [update['parent_id'].strip()]
             if update.get('acronyms', '').strip():
                 args['--acronym'] = [a.strip() for a in update['acronyms'].split(';') if a.strip()]
             if update.get('owner_ids', '').strip():
@@ -1537,6 +1548,7 @@ def update_terms_from_json(json_file, dry_run):
         "name": "New Name", // Optional: Replace name
         "description": "New description", // Optional: Replace description
         "status": "Published", // Optional: Change status
+        "parent_id": "parent-term-guid", // Optional: Set parent term (hierarchical)
         "acronyms": ["API", "REST"], // Optional: Replace all acronyms
         "owner_ids": ["user@company.com"], // Optional: Replace all owners
         "add_acronyms": ["SQL"], // Optional: Add acronyms (preserves existing)
@@ -1599,6 +1611,8 @@ def update_terms_from_json(json_file, dry_run):
             args['--description'] = [update['description']]
         if update.get('status'):
             args['--status'] = [update['status']]
+        if update.get('parent_id'):
+            args['--parent-id'] = [update['parent_id']]
         if update.get('acronyms'):
             args['--acronym'] = update['acronyms'] if isinstance(update['acronyms'], list) else [update['acronyms']]
         if update.get('owner_ids'):
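
With `parent_id` threaded through both bulk paths, term hierarchy can now be set in bulk. A minimal input sketch for `update-terms-from-csv`; the IDs are placeholders, and the column set follows the docstring in the hunks above:

```python
# Sketch: minimal CSV input exercising the new parent_id column.
# IDs are placeholders; columns follow the command docstring above.
import csv

rows = [
    {"term_id": "abc-123", "name": "", "description": "", "status": "",
     "parent_id": "parent-term-guid", "acronyms": "", "owner_ids": "",
     "add_acronyms": "", "add_owner_ids": ""},
]
with open("term_updates.csv", "w", newline="", encoding="utf-8") as f:
    writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
```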

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_entity.py

@@ -19,6 +19,41 @@ from .endpoint import Endpoint, decorator, get_json, no_api_call_decorator
 from .endpoints import ENDPOINTS, get_api_version_params
 
 
+def map_flat_entity_to_purview_entity(row):
+    """Map a flat row (pandas Series or dict) into a Purview entity dict.
+
+    Expected minimal input: { 'typeName': 'DataSet', 'qualifiedName': '...','attr1': 'v', ... }
+    Produces: { 'typeName': ..., 'attributes': { 'qualifiedName': ..., 'attr1': 'v', ... } }
+    """
+    try:
+        data = row.to_dict()
+    except Exception:
+        data = dict(row)
+
+    # pop typeName
+    type_name = data.pop("typeName", None)
+
+    # build attributes, skipping null-like values
+    attrs = {}
+    from math import isnan
+
+    for k, v in data.items():
+        # skip empty column names
+        if k is None or (isinstance(k, str) and k.strip() == ""):
+            continue
+        # treat NaN/None as missing
+        try:
+            if v is None:
+                continue
+            if isinstance(v, float) and isnan(v):
+                continue
+        except Exception:
+            pass
+        attrs[k] = v
+
+    return {"typeName": type_name, "attributes": attrs}
+
+
 class Entity(Endpoint):
     """Entity Management Operations - Complete Official API Implementation with 100% Coverage"""
 
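
Per its docstring, the new helper turns one flat row into the nested entity shape the bulk endpoint expects, dropping None/NaN cells along the way. A quick usage sketch, assuming the package layout above:

```python
# Sketch: what map_flat_entity_to_purview_entity produces for one flat row.
from purviewcli.client._entity import map_flat_entity_to_purview_entity

row = {"typeName": "DataSet",
       "qualifiedName": "mssql://server/database/SalesLT/Address",
       "owner": None}  # None/NaN cells are skipped
print(map_flat_entity_to_purview_entity(row))
# {'typeName': 'DataSet', 'attributes': {'qualifiedName': 'mssql://server/database/SalesLT/Address'}}
```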

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/purviewcli/client/_unified_catalog.py

@@ -411,6 +411,11 @@ class UnifiedCatalogClient(Endpoint):
             "status": status,
         }
 
+        # Add parent_id if provided
+        parent_id = args.get("--parent-id", [""])[0]
+        if parent_id:
+            payload["parentId"] = parent_id
+
         # Add optional fields
         if owners:
             payload["contacts"] = {"owner": owners}
@@ -450,6 +455,8 @@ class UnifiedCatalogClient(Endpoint):
             payload["description"] = args.get("--description", [""])[0]
         if args.get("--governance-domain-id"):
             payload["domain"] = args["--governance-domain-id"][0]
+        if args.get("--parent-id"):
+            payload["parentId"] = args["--parent-id"][0]
         if args.get("--status"):
             payload["status"] = args["--status"][0]
 
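
Both client changes land the parent in the request body as `parentId`. A minimal sketch of the resulting term payload when `--parent-id` is supplied; the field values are placeholders:

```python
# Sketch: how --parent-id reaches the term payload as parentId.
# Values are placeholders.
args = {"--parent-id": ["parent-term-guid"]}
payload = {"name": "Customer", "status": "Draft"}

parent_id = args.get("--parent-id", [""])[0]
if parent_id:
    payload["parentId"] = parent_id

print(payload)
# {'name': 'Customer', 'status': 'Draft', 'parentId': 'parent-term-guid'}
```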

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/pvw_cli.egg-info/PKG-INFO

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: pvw-cli
-Version: 1.0.14
+Version: 1.2.2
 Summary: Microsoft Purview CLI with comprehensive automation capabilities
 Author-email: AYOUB KEBAILI <keayoub@msn.com>
 Maintainer-email: AYOUB KEBAILI <keayoub@msn.com>
@@ -34,7 +34,7 @@ Requires-Dist: rich>=12.0.0
 Requires-Dist: requests>=2.28.0
 Requires-Dist: pandas>=1.5.0
 Requires-Dist: aiohttp>=3.8.0
-Requires-Dist: pydantic<
+Requires-Dist: pydantic<2.12,>=1.10.0
 Requires-Dist: PyYAML>=6.0
 Requires-Dist: cryptography<46.0.0,>=41.0.5
 Provides-Extra: dev
@@ -56,7 +56,7 @@ Requires-Dist: pytest-asyncio>=0.20.0; extra == "test"
 Requires-Dist: pytest-cov>=4.0.0; extra == "test"
 Requires-Dist: requests-mock>=1.9.0; extra == "test"
 
-# PURVIEW CLI v1.
+# PURVIEW CLI v1.2.1 - Microsoft Purview Automation & Data Governance
 
 > **LATEST UPDATE (October 2025):**
 > - **NEW: Bulk Term Import/Export** - Import multiple terms from CSV/JSON with dry-run support
@@ -72,7 +72,7 @@ Requires-Dist: requests-mock>=1.9.0; extra == "test"
 
 ## What is PVW CLI?
 
-**PVW CLI v1.
+**PVW CLI v1.2.1** is a modern, full-featured command-line interface and Python library for Microsoft Purview. It enables automation and management of *all major Purview APIs* including:
 
 - **Unified Catalog (UC) Management** - Complete governance domains, glossary terms, data products, OKRs, CDEs
 - **Bulk Operations** - Import/export terms from CSV/JSON, bulk delete scripts with progress tracking
@@ -164,7 +164,7 @@ For more advanced usage, see the documentation in `doc/` or the project docs: ht
 
 ## Overview
 
-**PVW CLI v1.
+**PVW CLI v1.2.1** is a modern command-line interface and Python library for Microsoft Purview, enabling:
 
 - Advanced data catalog search and discovery
 - Bulk import/export of entities, glossary terms, and lineage
@@ -1203,6 +1203,6 @@ See [LICENSE](LICENSE) file for details.
 
 ---
 
-**PVW CLI v1.
+**PVW CLI v1.2.1 empowers data engineers, stewards, and architects to automate, scale, and enhance their Microsoft Purview experience with powerful command-line and programmatic capabilities.**
 
 **Latest Features:** Bulk term import/export, PowerShell integration, multiple output formats, and comprehensive bulk delete scripts with beautiful progress tracking.

{pvw_cli-1.0.14 → pvw_cli-1.2.2}/pyproject.toml

@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 
 [project]
 name = "pvw-cli"
-version = "1.0.14"
+version = "1.2.2"
 description = "Microsoft Purview CLI with comprehensive automation capabilities"
 readme = "README.md"
 license = "MIT"
@@ -41,7 +41,7 @@ dependencies = [
     "requests>=2.28.0",
     "pandas>=1.5.0",
     "aiohttp>=3.8.0",
-    "pydantic>=1.10.0,<
+    "pydantic>=1.10.0,<2.12",
     "PyYAML>=6.0",
     "cryptography>=41.0.5,<46.0.0",
 ]
@@ -177,6 +177,3 @@ exclude_lines = [
     "class .*\\bProtocol\\):",
     "@(abc\\.)?abstractmethod",
 ]
-
-
-
|