airbyte-cdk 6.9.1.dev0__py3-none-any.whl → 6.9.1.dev2__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- airbyte_cdk/sources/declarative/concurrent_declarative_source.py +31 -25
- airbyte_cdk/sources/declarative/declarative_component_schema.yaml +2 -100
- airbyte_cdk/sources/declarative/manifest_declarative_source.py +2 -53
- airbyte_cdk/sources/declarative/models/declarative_component_schema.py +2 -95
- airbyte_cdk/sources/declarative/parsers/manifest_component_transformer.py +0 -6
- airbyte_cdk/sources/declarative/parsers/model_to_component_factory.py +21 -95
- airbyte_cdk/sources/declarative/partition_routers/__init__.py +1 -2
- airbyte_cdk/test/utils/manifest_only_fixtures.py +79 -0
- airbyte_cdk-6.9.1.dev2.dist-info/METADATA +306 -0
- {airbyte_cdk-6.9.1.dev0.dist-info → airbyte_cdk-6.9.1.dev2.dist-info}/RECORD +13 -15
- airbyte_cdk/sources/declarative/resolvers/__init__.py +0 -13
- airbyte_cdk/sources/declarative/resolvers/components_resolver.py +0 -55
- airbyte_cdk/sources/declarative/resolvers/http_components_resolver.py +0 -106
- airbyte_cdk-6.9.1.dev0.dist-info/METADATA +0 -108
- {airbyte_cdk-6.9.1.dev0.dist-info → airbyte_cdk-6.9.1.dev2.dist-info}/LICENSE.txt +0 -0
- {airbyte_cdk-6.9.1.dev0.dist-info → airbyte_cdk-6.9.1.dev2.dist-info}/WHEEL +0 -0
- {airbyte_cdk-6.9.1.dev0.dist-info → airbyte_cdk-6.9.1.dev2.dist-info}/entry_points.txt +0 -0
| @@ -1,108 +0,0 @@ | |
| 1 | 
            -
            Metadata-Version: 2.1
         | 
| 2 | 
            -
            Name: airbyte-cdk
         | 
| 3 | 
            -
            Version: 6.9.1.dev0
         | 
| 4 | 
            -
            Summary: A framework for writing Airbyte Connectors.
         | 
| 5 | 
            -
            Home-page: https://airbyte.com
         | 
| 6 | 
            -
            License: MIT
         | 
| 7 | 
            -
            Keywords: airbyte,connector-development-kit,cdk
         | 
| 8 | 
            -
            Author: Airbyte
         | 
| 9 | 
            -
            Author-email: contact@airbyte.io
         | 
| 10 | 
            -
            Requires-Python: >=3.10,<3.13
         | 
| 11 | 
            -
            Classifier: Development Status :: 3 - Alpha
         | 
| 12 | 
            -
            Classifier: Intended Audience :: Developers
         | 
| 13 | 
            -
            Classifier: License :: OSI Approved :: MIT License
         | 
| 14 | 
            -
            Classifier: Programming Language :: Python :: 3
         | 
| 15 | 
            -
            Classifier: Programming Language :: Python :: 3.10
         | 
| 16 | 
            -
            Classifier: Programming Language :: Python :: 3.11
         | 
| 17 | 
            -
            Classifier: Programming Language :: Python :: 3.12
         | 
| 18 | 
            -
            Classifier: Topic :: Scientific/Engineering
         | 
| 19 | 
            -
            Classifier: Topic :: Software Development :: Libraries :: Python Modules
         | 
| 20 | 
            -
            Provides-Extra: file-based
         | 
| 21 | 
            -
            Provides-Extra: sql
         | 
| 22 | 
            -
            Provides-Extra: vector-db-based
         | 
| 23 | 
            -
            Requires-Dist: Jinja2 (>=3.1.2,<3.2.0)
         | 
| 24 | 
            -
            Requires-Dist: PyYAML (>=6.0.1,<7.0.0)
         | 
| 25 | 
            -
            Requires-Dist: airbyte-protocol-models-dataclasses (>=0.14,<0.15)
         | 
| 26 | 
            -
            Requires-Dist: avro (>=1.11.2,<1.12.0) ; extra == "file-based"
         | 
| 27 | 
            -
            Requires-Dist: backoff
         | 
| 28 | 
            -
            Requires-Dist: cachetools
         | 
| 29 | 
            -
            Requires-Dist: cohere (==4.21) ; extra == "vector-db-based"
         | 
| 30 | 
            -
            Requires-Dist: cryptography (>=42.0.5,<44.0.0)
         | 
| 31 | 
            -
            Requires-Dist: dpath (>=2.1.6,<3.0.0)
         | 
| 32 | 
            -
            Requires-Dist: dunamai (>=1.22.0,<2.0.0)
         | 
| 33 | 
            -
            Requires-Dist: fastavro (>=1.8.0,<1.9.0) ; extra == "file-based"
         | 
| 34 | 
            -
            Requires-Dist: genson (==1.3.0)
         | 
| 35 | 
            -
            Requires-Dist: isodate (>=0.6.1,<0.7.0)
         | 
| 36 | 
            -
            Requires-Dist: jsonref (>=0.2,<0.3)
         | 
| 37 | 
            -
            Requires-Dist: jsonschema (>=4.17.3,<4.18.0)
         | 
| 38 | 
            -
            Requires-Dist: langchain (==0.1.16) ; extra == "vector-db-based"
         | 
| 39 | 
            -
            Requires-Dist: langchain_core (==0.1.42)
         | 
| 40 | 
            -
            Requires-Dist: markdown ; extra == "file-based"
         | 
| 41 | 
            -
            Requires-Dist: nltk (==3.9.1)
         | 
| 42 | 
            -
            Requires-Dist: numpy (<2)
         | 
| 43 | 
            -
            Requires-Dist: openai[embeddings] (==0.27.9) ; extra == "vector-db-based"
         | 
| 44 | 
            -
            Requires-Dist: orjson (>=3.10.7,<4.0.0)
         | 
| 45 | 
            -
            Requires-Dist: pandas (==2.2.2)
         | 
| 46 | 
            -
            Requires-Dist: pdf2image (==1.16.3) ; extra == "file-based"
         | 
| 47 | 
            -
            Requires-Dist: pdfminer.six (==20221105) ; extra == "file-based"
         | 
| 48 | 
            -
            Requires-Dist: pendulum (<3.0.0)
         | 
| 49 | 
            -
            Requires-Dist: psutil (==6.1.0)
         | 
| 50 | 
            -
            Requires-Dist: pyarrow (>=15.0.0,<15.1.0) ; extra == "file-based"
         | 
| 51 | 
            -
            Requires-Dist: pydantic (>=2.7,<3.0)
         | 
| 52 | 
            -
            Requires-Dist: pyjwt (>=2.8.0,<3.0.0)
         | 
| 53 | 
            -
            Requires-Dist: pyrate-limiter (>=3.1.0,<3.2.0)
         | 
| 54 | 
            -
            Requires-Dist: pytesseract (==0.3.10) ; extra == "file-based"
         | 
| 55 | 
            -
            Requires-Dist: python-calamine (==0.2.3) ; extra == "file-based"
         | 
| 56 | 
            -
            Requires-Dist: python-dateutil
         | 
| 57 | 
            -
            Requires-Dist: python-snappy (==0.7.3) ; extra == "file-based"
         | 
| 58 | 
            -
            Requires-Dist: python-ulid (>=3.0.0,<4.0.0)
         | 
| 59 | 
            -
            Requires-Dist: pytz (==2024.1)
         | 
| 60 | 
            -
            Requires-Dist: rapidfuzz (>=3.10.1,<4.0.0)
         | 
| 61 | 
            -
            Requires-Dist: requests
         | 
| 62 | 
            -
            Requires-Dist: requests_cache
         | 
| 63 | 
            -
            Requires-Dist: serpyco-rs (>=1.10.2,<2.0.0)
         | 
| 64 | 
            -
            Requires-Dist: sqlalchemy (>=2.0,<3.0,!=2.0.36) ; extra == "sql"
         | 
| 65 | 
            -
            Requires-Dist: tiktoken (==0.8.0) ; extra == "vector-db-based"
         | 
| 66 | 
            -
            Requires-Dist: unstructured.pytesseract (>=0.3.12) ; extra == "file-based"
         | 
| 67 | 
            -
            Requires-Dist: unstructured[docx,pptx] (==0.10.27) ; extra == "file-based"
         | 
| 68 | 
            -
            Requires-Dist: wcmatch (==10.0)
         | 
| 69 | 
            -
            Requires-Dist: xmltodict (>=0.13.0,<0.14.0)
         | 
| 70 | 
            -
            Project-URL: Documentation, https://docs.airbyte.io/
         | 
| 71 | 
            -
            Project-URL: Repository, https://github.com/airbytehq/airbyte-python-cdk
         | 
| 72 | 
            -
            Description-Content-Type: text/markdown
         | 
| 73 | 
            -
             | 
| 74 | 
            -
            # Airbyte Python CDK and Low-Code CDK
         | 
| 75 | 
            -
             | 
| 76 | 
            -
            Airbyte Python CDK is a framework for building Airbyte API Source Connectors. It provides a set of
         | 
| 77 | 
            -
            classes and helpers that make it easy to build a connector against an HTTP API (REST, GraphQL, etc),
         | 
| 78 | 
            -
            or a generic Python source connector.
         | 
| 79 | 
            -
             | 
| 80 | 
            -
            ## Building Connectors with the CDK
         | 
| 81 | 
            -
             | 
| 82 | 
            -
            If you're looking to build a connector, we highly recommend that you first
         | 
| 83 | 
            -
            [start with the Connector Builder](https://docs.airbyte.com/connector-development/connector-builder-ui/overview).
         | 
| 84 | 
            -
            It should be enough for 90% connectors out there. For more flexible and complex connectors, use the
         | 
| 85 | 
            -
            [low-code CDK and `SourceDeclarativeManifest`](https://docs.airbyte.com/connector-development/config-based/low-code-cdk-overview).
         | 
| 86 | 
            -
             | 
| 87 | 
            -
            For more information on building connectors, please see the [Connector Development](https://docs.airbyte.com/connector-development/) guide on [docs.airbyte.com](https://docs.airbyte.com).
         | 
| 88 | 
            -
             | 
| 89 | 
            -
            ## Python CDK Overview
         | 
| 90 | 
            -
             | 
| 91 | 
            -
            Airbyte CDK code is within `airbyte_cdk` directory. Here's a high level overview of what's inside:
         | 
| 92 | 
            -
             | 
| 93 | 
            -
            - `airbyte_cdk/connector_builder`. Internal wrapper that helps the Connector Builder platform run a declarative manifest (low-code connector). You should not use this code directly. If you need to run a `SourceDeclarativeManifest`, take a look at [`source-declarative-manifest`](https://github.com/airbytehq/airbyte/tree/master/airbyte-integrations/connectors/source-declarative-manifest) connector implementation instead.
         | 
| 94 | 
            -
            - `airbyte_cdk/cli/source_declarative_manifest`. This module defines the `source-declarative-manifest` (aka "SDM") connector execution logic and associated CLI.
         | 
| 95 | 
            -
            - `airbyte_cdk/destinations`. Basic Destination connector support! If you're building a Destination connector in Python, try that. Some of our vector DB destinations like `destination-pinecone` are using that code.
         | 
| 96 | 
            -
            - `airbyte_cdk/models` expose `airbyte_protocol.models` as a part of `airbyte_cdk` package.
         | 
| 97 | 
            -
            - `airbyte_cdk/sources/concurrent_source` is the Concurrent CDK implementation. It supports reading data from streams concurrently per slice / partition, useful for connectors with high throughput and high number of records.
         | 
| 98 | 
            -
            - `airbyte_cdk/sources/declarative` is the low-code CDK. It works on top of Airbyte Python CDK, but provides a declarative manifest language to define streams, operations, etc. This makes it easier to build connectors without writing Python code.
         | 
| 99 | 
            -
            - `airbyte_cdk/sources/file_based` is the CDK for file-based sources. Examples include S3, Azure, GCS, etc.
         | 
| 100 | 
            -
             | 
| 101 | 
            -
            ## Contributing
         | 
| 102 | 
            -
             | 
| 103 | 
            -
            For instructions on how to contribute, please see our [Contributing Guide](docs/CONTRIBUTING.md).
         | 
| 104 | 
            -
             | 
| 105 | 
            -
            ## Release Management
         | 
| 106 | 
            -
             | 
| 107 | 
            -
            Please see the [Release Management](docs/RELEASES.md) guide for information on how to perform releases and pre-releases.
         | 
| 108 | 
            -
             | 
| 
            File without changes
         | 
| 
            File without changes
         | 
| 
            File without changes
         |