datacontract-cli 0.10.10__py3-none-any.whl → 0.10.12__py3-none-any.whl

This diff shows the changes between these publicly released package versions as they appear in their respective public registries. It is provided for informational purposes only.

Potentially problematic release.



Files changed (39)
  1. datacontract/cli.py +19 -3
  2. datacontract/data_contract.py +17 -17
  3. datacontract/engines/fastjsonschema/check_jsonschema.py +15 -1
  4. datacontract/engines/fastjsonschema/s3/s3_read_files.py +2 -0
  5. datacontract/engines/soda/check_soda_execute.py +2 -8
  6. datacontract/engines/soda/connections/duckdb.py +23 -20
  7. datacontract/engines/soda/connections/kafka.py +81 -23
  8. datacontract/engines/soda/connections/snowflake.py +8 -5
  9. datacontract/export/avro_converter.py +12 -2
  10. datacontract/export/dbml_converter.py +42 -19
  11. datacontract/export/exporter.py +2 -1
  12. datacontract/export/exporter_factory.py +6 -0
  13. datacontract/export/jsonschema_converter.py +1 -4
  14. datacontract/export/spark_converter.py +4 -0
  15. datacontract/export/sql_type_converter.py +64 -29
  16. datacontract/export/sqlalchemy_converter.py +169 -0
  17. datacontract/imports/avro_importer.py +1 -0
  18. datacontract/imports/bigquery_importer.py +2 -2
  19. datacontract/imports/dbml_importer.py +112 -0
  20. datacontract/imports/dbt_importer.py +67 -91
  21. datacontract/imports/glue_importer.py +64 -54
  22. datacontract/imports/importer.py +3 -2
  23. datacontract/imports/importer_factory.py +5 -0
  24. datacontract/imports/jsonschema_importer.py +106 -120
  25. datacontract/imports/odcs_importer.py +1 -1
  26. datacontract/imports/spark_importer.py +29 -10
  27. datacontract/imports/sql_importer.py +5 -1
  28. datacontract/imports/unity_importer.py +1 -1
  29. datacontract/integration/{publish_datamesh_manager.py → datamesh_manager.py} +33 -5
  30. datacontract/integration/{publish_opentelemetry.py → opentelemetry.py} +1 -1
  31. datacontract/model/data_contract_specification.py +6 -2
  32. datacontract/templates/partials/model_field.html +10 -2
  33. {datacontract_cli-0.10.10.dist-info → datacontract_cli-0.10.12.dist-info}/METADATA +283 -113
  34. {datacontract_cli-0.10.10.dist-info → datacontract_cli-0.10.12.dist-info}/RECORD +38 -37
  35. {datacontract_cli-0.10.10.dist-info → datacontract_cli-0.10.12.dist-info}/WHEEL +1 -1
  36. datacontract/publish/publish.py +0 -32
  37. {datacontract_cli-0.10.10.dist-info → datacontract_cli-0.10.12.dist-info}/LICENSE +0 -0
  38. {datacontract_cli-0.10.10.dist-info → datacontract_cli-0.10.12.dist-info}/entry_points.txt +0 -0
  39. {datacontract_cli-0.10.10.dist-info → datacontract_cli-0.10.12.dist-info}/top_level.txt +0 -0
@@ -73,7 +73,7 @@ class Definition(pyd.BaseModel):
  exclusiveMaximum: int = None
  pii: bool = None
  classification: str = None
- fields: Dict[str, "Definition"] = {}
+ fields: Dict[str, "Field"] = {}
  tags: List[str] = []
  links: Dict[str, str] = {}
  example: str = None
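For context on the corrected type: a definition in a data contract can carry nested `fields`, which is why the dict now holds `Field` objects. A hedged `datacontract.yaml` sketch (the definition and field names are illustrative, not taken from this diff):

```yaml
definitions:
  address:
    type: object
    description: A reusable postal address definition
    fields:
      street:
        type: string
      city:
        type: string
```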
@@ -239,4 +239,8 @@ class DataContractSpecification(pyd.BaseModel):
  return DataContractSpecification(**data)

  def to_yaml(self):
- return yaml.dump(self.model_dump(exclude_defaults=True, exclude_none=True), sort_keys=False, allow_unicode=True)
+ return yaml.dump(
+ self.model_dump(exclude_defaults=True, exclude_none=True, by_alias=True),
+ sort_keys=False,
+ allow_unicode=True,
+ )
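The practical effect of the added `by_alias=True` is that pydantic fields declared with an alias are serialized under that alias instead of the attribute name. A minimal standalone sketch, not code from this package; the class and alias are made up:

```python
import yaml
import pydantic as pyd


class ExampleInfo(pyd.BaseModel):
    # "contractTitle" is a hypothetical alias, used only to show what by_alias=True changes
    model_config = pyd.ConfigDict(populate_by_name=True)
    title: str = pyd.Field(alias="contractTitle")


info = ExampleInfo(title="Orders")

# Attribute names as keys (the previous to_yaml behavior):
print(yaml.dump(info.model_dump(exclude_none=True), sort_keys=False))                 # title: Orders
# Declared aliases as keys (what by_alias=True adds):
print(yaml.dump(info.model_dump(exclude_none=True, by_alias=True), sort_keys=False))  # contractTitle: Orders
```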
@@ -110,5 +110,13 @@
  {% endif %}

  {% if field.items %}
- {{ render_nested_partial("item", field.items, level) }}
- {% endif %}
+ {{ render_nested_partial("items", field.items, level) }}
+ {% endif %}
+
+ {% if field.keys %}
+ {{ render_nested_partial("keys", field.keys, level) }}
+ {% endif %}
+
+ {% if field.values %}
+ {{ render_nested_partial("values", field.values, level) }}
+ {% endif %}
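The new `keys` and `values` branches cover map-typed fields in the HTML export. A hedged sketch of such a field in `datacontract.yaml` (the model and field names are made up):

```yaml
models:
  orders:
    fields:
      labels:
        type: map
        keys:
          type: string
        values:
          type: string
```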
@@ -1,6 +1,6 @@
  Metadata-Version: 2.1
  Name: datacontract-cli
- Version: 0.10.10
+ Version: 0.10.12
  Summary: The datacontract CLI is an open source command-line tool for working with Data Contracts. It uses data contract YAML files to lint the data contract, connect to data sources and execute schema and quality tests, detect breaking changes, and export to different formats. The tool is written in Python. It can be used as a standalone CLI tool, in a CI/CD pipeline, or directly as a Python library.
  Author-email: Jochen Christ <jochen.christ@innoq.com>, Stefan Negele <stefan.negele@innoq.com>, Simon Harrer <simon.harrer@innoq.com>
  Project-URL: Homepage, https://cli.datacontract.com
@@ -11,69 +11,68 @@ Classifier: Operating System :: OS Independent
  Requires-Python: >=3.10
  Description-Content-Type: text/markdown
  License-File: LICENSE
- Requires-Dist: typer[all] <0.13,>=0.9
- Requires-Dist: pydantic <2.9.0,>=2.8.2
- Requires-Dist: pyyaml ~=6.0.1
- Requires-Dist: requests <2.33,>=2.31
- Requires-Dist: fastapi ==0.111.1
- Requires-Dist: fastparquet ==2024.5.0
- Requires-Dist: python-multipart ==0.0.9
- Requires-Dist: rich ~=13.7.0
- Requires-Dist: simple-ddl-parser ==1.5.1
- Requires-Dist: soda-core-duckdb <3.4.0,>=3.3.1
- Requires-Dist: setuptools >=60
- Requires-Dist: duckdb ==1.0.0
- Requires-Dist: fastjsonschema <2.21.0,>=2.19.1
- Requires-Dist: python-dotenv ~=1.0.0
- Requires-Dist: rdflib ==7.0.0
- Requires-Dist: opentelemetry-exporter-otlp-proto-grpc ~=1.16
- Requires-Dist: opentelemetry-exporter-otlp-proto-http ~=1.16
- Requires-Dist: boto3 <1.34.137,>=1.34.41
- Requires-Dist: botocore <1.34.137,>=1.34.41
- Requires-Dist: jinja-partials >=0.2.1
+ Requires-Dist: typer<0.13,>=0.12
+ Requires-Dist: pydantic<2.9.0,>=2.8.2
+ Requires-Dist: pyyaml~=6.0.1
+ Requires-Dist: requests<2.33,>=2.31
+ Requires-Dist: fastapi==0.112.0
+ Requires-Dist: uvicorn==0.30.5
+ Requires-Dist: fastjsonschema<2.21.0,>=2.19.1
+ Requires-Dist: fastparquet==2024.5.0
+ Requires-Dist: python-multipart==0.0.9
+ Requires-Dist: rich~=13.7.0
+ Requires-Dist: simple-ddl-parser==1.6.0
+ Requires-Dist: duckdb==1.0.0
+ Requires-Dist: soda-core-duckdb<3.4.0,>=3.3.1
+ Requires-Dist: setuptools>=60
+ Requires-Dist: python-dotenv~=1.0.0
+ Requires-Dist: rdflib==7.0.0
+ Requires-Dist: opentelemetry-exporter-otlp-proto-grpc~=1.16
+ Requires-Dist: opentelemetry-exporter-otlp-proto-http~=1.16
+ Requires-Dist: boto3<1.35.6,>=1.34.41
+ Requires-Dist: jinja-partials>=0.2.1
  Provides-Extra: all
- Requires-Dist: datacontract-cli[bigquery,databricks,deltalake,kafka,postgres,s3,snowflake,sqlserver,trino] ; extra == 'all'
+ Requires-Dist: datacontract-cli[bigquery,databricks,dbml,dbt,kafka,postgres,s3,snowflake,sqlserver,trino]; extra == "all"
  Provides-Extra: avro
- Requires-Dist: avro ==1.11.3 ; extra == 'avro'
+ Requires-Dist: avro==1.12.0; extra == "avro"
  Provides-Extra: bigquery
- Requires-Dist: soda-core-bigquery <3.4.0,>=3.3.1 ; extra == 'bigquery'
+ Requires-Dist: soda-core-bigquery<3.4.0,>=3.3.1; extra == "bigquery"
  Provides-Extra: databricks
- Requires-Dist: soda-core-spark-df <3.4.0,>=3.3.1 ; extra == 'databricks'
- Requires-Dist: databricks-sql-connector <3.3.0,>=3.1.2 ; extra == 'databricks'
- Requires-Dist: soda-core-spark[databricks] <3.4.0,>=3.3.1 ; extra == 'databricks'
- Provides-Extra: deltalake
- Requires-Dist: deltalake <0.19,>=0.17 ; extra == 'deltalake'
+ Requires-Dist: soda-core-spark-df<3.4.0,>=3.3.1; extra == "databricks"
+ Requires-Dist: databricks-sql-connector<3.4.0,>=3.1.2; extra == "databricks"
+ Requires-Dist: soda-core-spark[databricks]<3.4.0,>=3.3.1; extra == "databricks"
+ Provides-Extra: dbml
+ Requires-Dist: pydbml>=1.1.1; extra == "dbml"
+ Provides-Extra: dbt
+ Requires-Dist: dbt-core>=1.8.0; extra == "dbt"
  Provides-Extra: dev
- Requires-Dist: datacontract-cli[all] ; extra == 'dev'
- Requires-Dist: httpx ==0.27.0 ; extra == 'dev'
- Requires-Dist: ruff ; extra == 'dev'
- Requires-Dist: pre-commit ~=3.7.1 ; extra == 'dev'
- Requires-Dist: pytest ; extra == 'dev'
- Requires-Dist: pytest-xdist ; extra == 'dev'
- Requires-Dist: moto ==5.0.11 ; extra == 'dev'
- Requires-Dist: pymssql ==2.3.0 ; extra == 'dev'
- Requires-Dist: kafka-python ; extra == 'dev'
- Requires-Dist: trino ==0.329.0 ; extra == 'dev'
- Requires-Dist: testcontainers <4.8,>=4.5 ; extra == 'dev'
- Requires-Dist: testcontainers[core] ; extra == 'dev'
- Requires-Dist: testcontainers[minio] ; extra == 'dev'
- Requires-Dist: testcontainers[postgres] ; extra == 'dev'
- Requires-Dist: testcontainers[kafka] ; extra == 'dev'
- Requires-Dist: testcontainers[mssql] ; extra == 'dev'
+ Requires-Dist: datacontract-cli[all]; extra == "dev"
+ Requires-Dist: httpx==0.27.2; extra == "dev"
+ Requires-Dist: kafka-python; extra == "dev"
+ Requires-Dist: moto==5.0.13; extra == "dev"
+ Requires-Dist: pandas>=2.1.0; extra == "dev"
+ Requires-Dist: pre-commit<3.9.0,>=3.7.1; extra == "dev"
+ Requires-Dist: pyarrow>=12.0.0; extra == "dev"
+ Requires-Dist: pytest; extra == "dev"
+ Requires-Dist: pytest-xdist; extra == "dev"
+ Requires-Dist: pymssql==2.3.1; extra == "dev"
+ Requires-Dist: ruff; extra == "dev"
+ Requires-Dist: testcontainers[kafka,minio,mssql,postgres]==4.8.1; extra == "dev"
+ Requires-Dist: trino==0.329.0; extra == "dev"
  Provides-Extra: kafka
- Requires-Dist: datacontract-cli[avro] ; extra == 'kafka'
- Requires-Dist: soda-core-spark-df <3.4.0,>=3.3.1 ; extra == 'kafka'
+ Requires-Dist: datacontract-cli[avro]; extra == "kafka"
+ Requires-Dist: soda-core-spark-df<3.4.0,>=3.3.1; extra == "kafka"
  Provides-Extra: postgres
- Requires-Dist: soda-core-postgres <3.4.0,>=3.3.1 ; extra == 'postgres'
+ Requires-Dist: soda-core-postgres<3.4.0,>=3.3.1; extra == "postgres"
  Provides-Extra: s3
- Requires-Dist: s3fs ==2024.6.1 ; extra == 's3'
+ Requires-Dist: s3fs==2024.6.1; extra == "s3"
  Provides-Extra: snowflake
- Requires-Dist: snowflake-connector-python[pandas] <3.12,>=3.6 ; extra == 'snowflake'
- Requires-Dist: soda-core-snowflake <3.4.0,>=3.3.1 ; extra == 'snowflake'
+ Requires-Dist: snowflake-connector-python[pandas]<3.13,>=3.6; extra == "snowflake"
+ Requires-Dist: soda-core-snowflake<3.4.0,>=3.3.1; extra == "snowflake"
  Provides-Extra: sqlserver
- Requires-Dist: soda-core-sqlserver <3.4.0,>=3.3.1 ; extra == 'sqlserver'
+ Requires-Dist: soda-core-sqlserver<3.4.0,>=3.3.1; extra == "sqlserver"
  Provides-Extra: trino
- Requires-Dist: soda-core-trino <3.4.0,>=3.3.1 ; extra == 'trino'
+ Requires-Dist: soda-core-trino<3.4.0,>=3.3.1; extra == "trino"

  # Data Contract CLI

@@ -82,7 +81,7 @@ Requires-Dist: soda-core-trino <3.4.0,>=3.3.1 ; extra == 'trino'
  <img alt="Test Workflow" src="https://img.shields.io/github/actions/workflow/status/datacontract/datacontract-cli/ci.yaml?branch=main"></a>
  <a href="https://github.com/datacontract/datacontract-cli">
  <img alt="Stars" src="https://img.shields.io/github/stars/datacontract/datacontract-cli" /></a>
- <a href="https://datacontract.com/slack" rel="nofollow"><img src="https://camo.githubusercontent.com/5ade1fd1e76a6ab860802cdd2941fe2501e2ca2cb534e5d8968dbf864c13d33d/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f736c61636b2d6a6f696e5f636861742d77686974652e7376673f6c6f676f3d736c61636b267374796c653d736f6369616c" alt="Slack Status" data-canonical-src="https://img.shields.io/badge/slack-join_chat-white.svg?logo=slack&amp;style=social" style="max-width: 100%;"></a>
+ <a href="https://datacontract.com/slack" rel="nofollow"><img src="https://img.shields.io/badge/slack-join_chat-white.svg?logo=slack&amp;style=social" alt="Slack Status" data-canonical-src="https://img.shields.io/badge/slack-join_chat-white.svg?logo=slack&amp;style=social" style="max-width: 100%;"></a>
  </p>

  The `datacontract` CLI is an open source command-line tool for working with [Data Contracts](https://datacontract.com/).
@@ -197,10 +196,10 @@ $ datacontract export --format html datacontract.yaml > datacontract.html
  # import avro (other formats: sql, glue, bigquery...)
  $ datacontract import --format avro --source avro_schema.avsc

- # find differences between to data contracts
+ # find differences between two data contracts
  $ datacontract diff datacontract-v1.yaml datacontract-v2.yaml

- # find differences between to data contracts categorized into error, warning, and info.
+ # find differences between two data contracts categorized into error, warning, and info.
  $ datacontract changelog datacontract-v1.yaml datacontract-v2.yaml

  # fail pipeline on breaking changes. Uses changelog internally and showing only error and warning.
@@ -267,13 +266,13 @@ A list of available extras:
  | Avro Support | `pip install datacontract-cli[avro]` |
  | Google BigQuery | `pip install datacontract-cli[bigquery]` |
  | Databricks Integration | `pip install datacontract-cli[databricks]` |
- | Deltalake Integration | `pip install datacontract-cli[deltalake]` |
  | Kafka Integration | `pip install datacontract-cli[kafka]` |
  | PostgreSQL Integration | `pip install datacontract-cli[postgres]` |
  | S3 Integration | `pip install datacontract-cli[s3]` |
  | Snowflake Integration | `pip install datacontract-cli[snowflake]` |
  | Microsoft SQL Server | `pip install datacontract-cli[sqlserver]` |
  | Trino | `pip install datacontract-cli[trino]` |
+ | Dbt | `pip install datacontract-cli[dbt]` |



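Extras can also be combined in a single install command. A small sketch (pick the extras that match your data platform):

```bash
# Install the CLI with the Snowflake extra plus the newly added dbt extra
pip install "datacontract-cli[snowflake,dbt]"
```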
@@ -385,7 +384,7 @@ Supported server types:
  - [sqlserver](#sqlserver)
  - [databricks](#databricks)
  - [databricks (programmatic)](#databricks-programmatic)
- - [dataframr (programmatic)](#dataframe-programmatic)
+ - [dataframe (programmatic)](#dataframe-programmatic)
  - [snowflake](#snowflake)
  - [kafka](#kafka)
  - [postgres](#postgres)
@@ -406,6 +405,12 @@ Feel free to create an [issue](https://github.com/datacontract/datacontract-cli/

  Data Contract CLI can test data that is stored in S3 buckets or any S3-compliant endpoints in various formats.

+ - CSV
+ - JSON
+ - Delta
+ - Parquet
+ - Iceberg (coming soon)
+
  #### Examples

  ##### JSON
@@ -444,6 +449,32 @@ servers:



+ ### Google Cloud Storage (GCS)
+
+ The [S3](#S3) integration also works with files on Google Cloud Storage through its [interoperability](https://cloud.google.com/storage/docs/interoperability).
+ Use `https://storage.googleapis.com` as the endpoint URL.
+
+ #### Example
+
+ datacontract.yaml
+ ```yaml
+ servers:
+ production:
+ type: s3
+ endpointUrl: https://storage.googleapis.com
+ location: s3://bucket-name/path/*/*.json # use s3:// schema instead of gs://
+ format: json
+ delimiter: new_line # new_line, array, or none
+ ```
+
+ #### Environment Variables
+
+ | Environment Variable | Example | Description |
+ |-------------------------------------|----------------|------------------------------------------------------------------------------------------|
+ | `DATACONTRACT_S3_ACCESS_KEY_ID` | `GOOG1EZZZ...` | The GCS [HMAC Key](https://cloud.google.com/storage/docs/authentication/hmackeys) Key ID |
+ | `DATACONTRACT_S3_SECRET_ACCESS_KEY` | `PDWWpb...` | The GCS [HMAC Key](https://cloud.google.com/storage/docs/authentication/hmackeys) Secret |
+
+
  ### BigQuery

  We support authentication to BigQuery using Service Account Key. The used Service Account should include the roles:
@@ -665,14 +696,31 @@ models:
  ```

  #### Environment Variables
-
- | Environment Variable | Example | Description |
- |------------------------------------|--------------------|-----------------------------------------------------|
- | `DATACONTRACT_SNOWFLAKE_USERNAME` | `datacontract` | Username |
- | `DATACONTRACT_SNOWFLAKE_PASSWORD` | `mysecretpassword` | Password |
- | `DATACONTRACT_SNOWFLAKE_ROLE` | `DATAVALIDATION` | The snowflake role to use. |
- | `DATACONTRACT_SNOWFLAKE_WAREHOUSE` | `COMPUTE_WH` | The Snowflake Warehouse to use executing the tests. |
-
+ All [parameters supported by Soda](https://docs.soda.io/soda/connect-snowflake.html), uppercased and prepended by `DATACONTRACT_SNOWFLAKE_` prefix.
+ For example:
+
+ | Soda parameter | Environment Variable |
+ |----------------------|---------------------------------------------|
+ | `username` | `DATACONTRACT_SNOWFLAKE_USERNAME` |
+ | `password` | `DATACONTRACT_SNOWFLAKE_PASSWORD` |
+ | `warehouse` | `DATACONTRACT_SNOWFLAKE_WAREHOUSE` |
+ | `role` | `DATACONTRACT_SNOWFLAKE_ROLE` |
+ | `connection_timeout` | `DATACONTRACT_SNOWFLAKE_CONNECTION_TIMEOUT` |
+
+ Beware, that parameters:
+ * `account`
+ * `database`
+ * `schema`
+
+ are obtained from the `servers` section of the YAML-file.
+ E.g. from the example above:
+ ```yaml
+ servers:
+ snowflake:
+ account: abcdefg-xn12345
+ database: ORDER_DB
+ schema: ORDERS_PII_V2
+ ```


  ### Kafka
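As an illustration of the naming convention described in the Snowflake hunk above (Soda parameter name, uppercased, prefixed with `DATACONTRACT_SNOWFLAKE_`), a hedged shell sketch; all values are placeholders:

```bash
# Soda connection parameters become environment variables:
# uppercase the parameter name and add the DATACONTRACT_SNOWFLAKE_ prefix.
export DATACONTRACT_SNOWFLAKE_USERNAME=datacontract
export DATACONTRACT_SNOWFLAKE_PASSWORD=mysecretpassword
export DATACONTRACT_SNOWFLAKE_WAREHOUSE=COMPUTE_WH
export DATACONTRACT_SNOWFLAKE_ROLE=DATAVALIDATION
export DATACONTRACT_SNOWFLAKE_CONNECTION_TIMEOUT=300

# account, database, and schema are read from the servers section of the data contract
datacontract test datacontract.yaml
```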
@@ -777,7 +825,7 @@ models:
  │ * --format [jsonschema|pydantic-model|sodacl|dbt|dbt-sources|db The export format. [default: None] [required] │
  │ t-staging-sql|odcs|rdf|avro|protobuf|great-expectati │
  │ ons|terraform|avro-idl|sql|sql-query|html|go|bigquer │
- │ y|dbml|spark]
+ │ y|dbml|spark|sqlalchemy]
  │ --output PATH Specify the file path where the exported data will be │
  │ saved. If no path is provided, the output will be │
  │ printed to stdout. │
@@ -828,6 +876,7 @@ Available export options:
  | `pydantic-model` | Export to pydantic models | ✅ |
  | `DBML` | Export to a DBML Diagram description | ✅ |
  | `spark` | Export to a Spark StructType | ✅ |
+ | `sqlalchemy` | Export to SQLAlchemy Models | ✅ |
  | Missing something? | Please create an issue on GitHub | TBD |

  #### Great Expectations
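The `sqlalchemy` export format added above is invoked like the other exporters; a minimal sketch (file names are placeholders):

```bash
# Print SQLAlchemy model classes for the data contract to stdout
datacontract export --format sqlalchemy datacontract.yaml

# Or write them to a file via --output
datacontract export --format sqlalchemy --output models.py datacontract.yaml
```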
@@ -901,6 +950,7 @@ models:
  description: Example for AVRO with Timestamp (microsecond precision) https://avro.apache.org/docs/current/spec.html#Local+timestamp+%28microsecond+precision%29
  type: long
  example: 1672534861000000 # Equivalent to 2023-01-01 01:01:01 in microseconds
+ required: true
  config:
  avroLogicalType: local-timestamp-micros
  avroDefault: 1672534861000000
@@ -915,6 +965,7 @@ models:
  - **description**: A textual description of the field.
  - **type**: The data type of the field. In this example, it is `long`.
  - **example**: An example value for the field.
+ - **required**: Is this a required field (as opposed to optional/nullable).
  - **config**: Section to specify custom Avro properties.
  - **avroLogicalType**: Specifies the logical type of the field in Avro. In this example, it is `local-timestamp-micros`.
  - **avroDefault**: Specifies the default value for the field in Avro. In this example, it is 1672534861000000 which corresponds to ` 2023-01-01 01:01:01 UTC`.
@@ -925,23 +976,42 @@ models:
  ```
  Usage: datacontract import [OPTIONS]

- Create a data contract from the given source location. Prints to stdout.
-
- ╭─ Options ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
- │ * --format [sql|avro|glue|bigquery|jsonschema| The format of the source file. [default: None] [required] |
- unity|spark] |
- --source TEXT The path to the file or Glue Database that should be imported.
- [default: None]
- --glue-table TEXT List of table ids to import from the Glue Database (repeat for
- multiple table ids, leave empty for all tables in the dataset).
- [default: None]
- --bigquery-project TEXT The bigquery project id. [default: None]
- --bigquery-dataset TEXT The bigquery dataset id. [default: None]
- --bigquery-table TEXT List of table ids to import from the bigquery API (repeat for
- multiple table ids, leave empty for all tables in the dataset).
- [default: None]
- │ --help Show this message and exit.
- ╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
+ Create a data contract from the given source location. Prints to stdout.
+
+ ╭─ Options ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
+ │ * --format [sql|avro|dbt|glue|jsonschema|bigquery|odcs The format of the source file.
+ |unity|spark] [default: None] │
+ [required]
+ --source TEXT The path to the file or Glue Database that
+ should be imported.
+ [default: None]
+ --glue-table TEXT List of table ids to import from the Glue
+ Database (repeat for multiple table ids,
+ leave empty for all tables in the dataset).
+ [default: None]
+ --bigquery-project TEXT The bigquery project id. [default: None]
+ --bigquery-dataset TEXT The bigquery dataset id. [default: None]
+ │ --bigquery-table TEXT List of table ids to import from the
+ │ bigquery API (repeat for multiple table ids, │
+ │ leave empty for all tables in the dataset). │
+ │ [default: None] │
+ │ --unity-table-full-name TEXT Full name of a table in the unity catalog │
+ │ [default: None] │
+ │ --dbt-model TEXT List of models names to import from the dbt │
+ │ manifest file (repeat for multiple models │
+ │ names, leave empty for all models in the │
+ │ dataset). │
+ │ [default: None] │
+ │ --dbml-schema TEXT List of schema names to import from the DBML │
+ │ file (repeat for multiple schema names, │
+ │ leave empty for all tables in the file). │
+ │ [default: None] │
+ │ --dbml-table TEXT List of table names to import from the DBML │
+ │ file (repeat for multiple table names, leave │
+ │ empty for all tables in the file). │
+ │ [default: None] │
+ │ --help Show this message and exit. │
+ ╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
  ```

  Example:
@@ -952,18 +1022,20 @@ datacontract import --format sql --source my_ddl.sql

  Available import options:

- | Type | Description | Status |
- |--------------------|------------------------------------------------|---------|
+ | Type | Description | Status |
+ |--------------------|------------------------------------------------|--------|
  | `sql` | Import from SQL DDL | ✅ |
  | `avro` | Import from AVRO schemas | ✅ |
  | `glue` | Import from AWS Glue DataCatalog | ✅ |
- | `protobuf` | Import from Protobuf schemas | TBD |
  | `jsonschema` | Import from JSON Schemas | ✅ |
  | `bigquery` | Import from BigQuery Schemas | ✅ |
  | `unity` | Import from Databricks Unity Catalog | partial |
- | `dbt` | Import from dbt models | TBD |
+ | `dbt` | Import from dbt models | |
  | `odcs` | Import from Open Data Contract Standard (ODCS) | ✅ |
- | Missing something? | Please create an issue on GitHub | TBD |
+ | `spark` | Import from Spark StructTypes | |
+ | `dbml` | Import from DBML models | ✅ |
+ | `protobuf` | Import from Protobuf schemas | TBD |
+ | Missing something? | Please create an issue on GitHub | TBD |


  #### BigQuery
@@ -1005,6 +1077,23 @@ export DATABRICKS_IMPORT_ACCESS_TOKEN=<token>
  datacontract import --format unity --unity-table-full-name <table_full_name>
  ```

+ #### dbt
+
+ Importing from dbt manifest file.
+ You may give the `dbt-model` parameter to enumerate the tables that should be imported. If no tables are given, _all_ available tables of the database will be imported.
+
+ Examples:
+
+ ```bash
+ # Example import from dbt manifest with specifying the tables to import
+ datacontract import --format dbt --source <manifest_path> --dbt-model <model_name_1> --dbt-model <model_name_2> --dbt-model <model_name_3>
+ ```
+
+ ```bash
+ # Example import from dbt manifest importing all tables in the database
+ datacontract import --format dbt --source <manifest_path>
+ ```
+
  #### Glue

  Importing from Glue reads the necessary Data directly off of the AWS API.
@@ -1032,6 +1121,38 @@ Example:
  datacontract import --format spark --source "users,orders"
  ```

+ #### DBML
+
+ Importing from DBML Documents.
+ **NOTE:** Since DBML does _not_ have strict requirements on the types of columns, this import _may_ create non-valid datacontracts, as not all types of fields can be properly mapped. In this case you will have to adapt the generated document manually.
+ We also assume, that the description for models and fields is stored in a Note within the DBML model.
+
+ You may give the `dbml-table` or `dbml-schema` parameter to enumerate the tables or schemas that should be imported.
+ If no tables are given, _all_ available tables of the source will be imported. Likewise, if no schema is given, _all_ schemas are imported.
+
+ Examples:
+
+ ```bash
+ # Example import from DBML file, importing everything
+ datacontract import --format dbml --source <file_path>
+ ```
+
+ ```bash
+ # Example import from DBML file, filtering for tables from specific schemas
+ datacontract import --format dbml --source <file_path> --dbml-schema <schema_1> --dbml-schema <schema_2>
+ ```
+
+ ```bash
+ # Example import from DBML file, filtering for tables with specific names
+ datacontract import --format dbml --source <file_path> --dbml-table <table_name_1> --dbml-table <table_name_2>
+ ```
+
+ ```bash
+ # Example import from DBML file, filtering for tables with specific names from a specific schema
+ datacontract import --format dbml --source <file_path> --dbml-table <table_name_1> --dbml-schema <schema_1>
+ ```
+
+
  ### breaking

  ```
@@ -1304,7 +1425,7 @@ if __name__ == "__main__":
  data_contract = DataContract(
  data_contract_file="/path/datacontract.yaml"
  )
- # call export
+ # Call export
  result = data_contract.export(
  export_format="custom", model="orders", server="production", custom_arg="my_custom_arg"
  )
@@ -1330,10 +1451,11 @@ Output
  Using the importer factory to add a new custom importer
  ```python

- from datacontract.model.data_contract_specification import DataContractSpecification
+ from datacontract.model.data_contract_specification import DataContractSpecification, Field, Model
  from datacontract.data_contract import DataContract
  from datacontract.imports.importer import Importer
  from datacontract.imports.importer_factory import importer_factory
+
  import json

  # Create a custom class that implements import_source method
@@ -1344,43 +1466,89 @@ class CustomImporter(Importer):
  source_dict = json.loads(source)
  data_contract_specification.id = source_dict.get("id_custom")
  data_contract_specification.info.title = source_dict.get("title")
+ data_contract_specification.info.version = source_dict.get("version")
  data_contract_specification.info.description = source_dict.get("description_from_app")
-
+
+ for model in source_dict.get("models", []):
+ fields = {}
+ for column in model.get('columns'):
+ field = Field(
+ description=column.get('column_description'),
+ type=column.get('type')
+ )
+ fields[column.get('name')] = field
+
+ dc_model = Model(
+ description=model.get('description'),
+ fields= fields
+ )
+
+ data_contract_specification.models[model.get('name')] = dc_model
  return data_contract_specification
-
+

  # Register the new custom class into factory
  importer_factory.register_importer("custom_company_importer", CustomImporter)


  if __name__ == "__main__":
- # get a custom da
- json_from_custom_app = '{"id_custom":"uuid-custom","version":"0.0.2", "title":"my_custom_imported_data", "description_from_app": "Custom contract description"}'
+ # Get a custom data from other app
+ json_from_custom_app = '''
+ {
+ "id_custom": "uuid-custom",
+ "version": "0.0.2",
+ "title": "my_custom_imported_data",
+ "description_from_app": "Custom contract description",
+ "models": [
+ {
+ "name": "model1",
+ "description": "model description from app",
+ "columns": [
+ {
+ "name": "columnA",
+ "type": "varchar",
+ "column_description": "my_column description"
+ },
+ {
+ "name": "columnB",
+ "type": "varchar",
+ "column_description": "my_columnB description"
+ }
+ ]
+ }
+ ]
+ }
+ '''
  # Create a DataContract instance
  data_contract = DataContract()

- # call import_from
+ # Call import_from_source
  result = data_contract.import_from_source(
- format="custom_company_importer", data_contract_specification=DataContract.init(), source=json_from_custom_app
- )
- print(dict(result))
-
+ format="custom_company_importer",
+ data_contract_specification=DataContract.init(),
+ source=json_from_custom_app
+ )
+ print(result.to_yaml() )
  ```
  Output
+
+ ```yaml
+ dataContractSpecification: 0.9.3
+ id: uuid-custom
+ info:
+ title: my_custom_imported_data
+ version: 0.0.2
+ description: Custom contract description
+ models:
+ model1:
+ fields:
+ columnA:
+ type: varchar
+ description: my_column description
+ columnB:
+ type: varchar
+ description: my_columnB description

- ```python
- {
- 'dataContractSpecification': '0.9.3',
- 'id': 'uuid-custom',
- 'info': Info(title='my_custom_imported_data', version='0.0.1', status=None, description='Custom contract description', owner=None, contact=None),
- 'servers': {},
- 'terms': None,
- 'models': {},
- 'definitions': {},
- 'examples': [],
- 'quality': None,
- 'servicelevels': None
- }
  ```
  ## Development Setup

@@ -1469,6 +1637,7 @@ We are happy to receive your contributions. Propose your change in an issue or d
  ## Companies using this tool

  - [INNOQ](https://innoq.com)
+ - [Data Catering](https://data.catering/)
  - And many more. To add your company, please create a pull request.

  ## Related Tools
@@ -1476,6 +1645,7 @@ We are happy to receive your contributions. Propose your change in an issue or d
  - [Data Contract Manager](https://www.datacontract-manager.com/) is a commercial tool to manage data contracts. It contains a web UI, access management, and data governance for a full enterprise data marketplace.
  - [Data Contract GPT](https://gpt.datacontract.com) is a custom GPT that can help you write data contracts.
  - [Data Contract Editor](https://editor.datacontract.com) is an editor for Data Contracts, including a live html preview.
+ - [Data Contract Playground](https://data-catering.github.io/data-contract-playground/) allows you to validate and export your data contract to different formats within your browser.

  ## License