airbyte-cdk 6.9.1.dev0__py3-none-any.whl → 6.9.1.dev2__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,108 +0,0 @@
1
- Metadata-Version: 2.1
2
- Name: airbyte-cdk
3
- Version: 6.9.1.dev0
4
- Summary: A framework for writing Airbyte Connectors.
5
- Home-page: https://airbyte.com
6
- License: MIT
7
- Keywords: airbyte,connector-development-kit,cdk
8
- Author: Airbyte
9
- Author-email: contact@airbyte.io
10
- Requires-Python: >=3.10,<3.13
11
- Classifier: Development Status :: 3 - Alpha
12
- Classifier: Intended Audience :: Developers
13
- Classifier: License :: OSI Approved :: MIT License
14
- Classifier: Programming Language :: Python :: 3
15
- Classifier: Programming Language :: Python :: 3.10
16
- Classifier: Programming Language :: Python :: 3.11
17
- Classifier: Programming Language :: Python :: 3.12
18
- Classifier: Topic :: Scientific/Engineering
19
- Classifier: Topic :: Software Development :: Libraries :: Python Modules
20
- Provides-Extra: file-based
21
- Provides-Extra: sql
22
- Provides-Extra: vector-db-based
23
- Requires-Dist: Jinja2 (>=3.1.2,<3.2.0)
24
- Requires-Dist: PyYAML (>=6.0.1,<7.0.0)
25
- Requires-Dist: airbyte-protocol-models-dataclasses (>=0.14,<0.15)
26
- Requires-Dist: avro (>=1.11.2,<1.12.0) ; extra == "file-based"
27
- Requires-Dist: backoff
28
- Requires-Dist: cachetools
29
- Requires-Dist: cohere (==4.21) ; extra == "vector-db-based"
30
- Requires-Dist: cryptography (>=42.0.5,<44.0.0)
31
- Requires-Dist: dpath (>=2.1.6,<3.0.0)
32
- Requires-Dist: dunamai (>=1.22.0,<2.0.0)
33
- Requires-Dist: fastavro (>=1.8.0,<1.9.0) ; extra == "file-based"
34
- Requires-Dist: genson (==1.3.0)
35
- Requires-Dist: isodate (>=0.6.1,<0.7.0)
36
- Requires-Dist: jsonref (>=0.2,<0.3)
37
- Requires-Dist: jsonschema (>=4.17.3,<4.18.0)
38
- Requires-Dist: langchain (==0.1.16) ; extra == "vector-db-based"
39
- Requires-Dist: langchain_core (==0.1.42)
40
- Requires-Dist: markdown ; extra == "file-based"
41
- Requires-Dist: nltk (==3.9.1)
42
- Requires-Dist: numpy (<2)
43
- Requires-Dist: openai[embeddings] (==0.27.9) ; extra == "vector-db-based"
44
- Requires-Dist: orjson (>=3.10.7,<4.0.0)
45
- Requires-Dist: pandas (==2.2.2)
46
- Requires-Dist: pdf2image (==1.16.3) ; extra == "file-based"
47
- Requires-Dist: pdfminer.six (==20221105) ; extra == "file-based"
48
- Requires-Dist: pendulum (<3.0.0)
49
- Requires-Dist: psutil (==6.1.0)
50
- Requires-Dist: pyarrow (>=15.0.0,<15.1.0) ; extra == "file-based"
51
- Requires-Dist: pydantic (>=2.7,<3.0)
52
- Requires-Dist: pyjwt (>=2.8.0,<3.0.0)
53
- Requires-Dist: pyrate-limiter (>=3.1.0,<3.2.0)
54
- Requires-Dist: pytesseract (==0.3.10) ; extra == "file-based"
55
- Requires-Dist: python-calamine (==0.2.3) ; extra == "file-based"
56
- Requires-Dist: python-dateutil
57
- Requires-Dist: python-snappy (==0.7.3) ; extra == "file-based"
58
- Requires-Dist: python-ulid (>=3.0.0,<4.0.0)
59
- Requires-Dist: pytz (==2024.1)
60
- Requires-Dist: rapidfuzz (>=3.10.1,<4.0.0)
61
- Requires-Dist: requests
62
- Requires-Dist: requests_cache
63
- Requires-Dist: serpyco-rs (>=1.10.2,<2.0.0)
64
- Requires-Dist: sqlalchemy (>=2.0,<3.0,!=2.0.36) ; extra == "sql"
65
- Requires-Dist: tiktoken (==0.8.0) ; extra == "vector-db-based"
66
- Requires-Dist: unstructured.pytesseract (>=0.3.12) ; extra == "file-based"
67
- Requires-Dist: unstructured[docx,pptx] (==0.10.27) ; extra == "file-based"
68
- Requires-Dist: wcmatch (==10.0)
69
- Requires-Dist: xmltodict (>=0.13.0,<0.14.0)
70
- Project-URL: Documentation, https://docs.airbyte.io/
71
- Project-URL: Repository, https://github.com/airbytehq/airbyte-python-cdk
72
- Description-Content-Type: text/markdown
73
-
74
- # Airbyte Python CDK and Low-Code CDK
75
-
76
- Airbyte Python CDK is a framework for building Airbyte API Source Connectors. It provides a set of
77
- classes and helpers that make it easy to build a connector against an HTTP API (REST, GraphQL, etc),
78
- or a generic Python source connector.
79
-
80
- ## Building Connectors with the CDK
81
-
82
- If you're looking to build a connector, we highly recommend that you first
83
- [start with the Connector Builder](https://docs.airbyte.com/connector-development/connector-builder-ui/overview).
84
- It should be enough for 90% connectors out there. For more flexible and complex connectors, use the
85
- [low-code CDK and `SourceDeclarativeManifest`](https://docs.airbyte.com/connector-development/config-based/low-code-cdk-overview).
86
-
87
- For more information on building connectors, please see the [Connector Development](https://docs.airbyte.com/connector-development/) guide on [docs.airbyte.com](https://docs.airbyte.com).
88
-
89
- ## Python CDK Overview
90
-
91
- Airbyte CDK code is within `airbyte_cdk` directory. Here's a high level overview of what's inside:
92
-
93
- - `airbyte_cdk/connector_builder`. Internal wrapper that helps the Connector Builder platform run a declarative manifest (low-code connector). You should not use this code directly. If you need to run a `SourceDeclarativeManifest`, take a look at [`source-declarative-manifest`](https://github.com/airbytehq/airbyte/tree/master/airbyte-integrations/connectors/source-declarative-manifest) connector implementation instead.
94
- - `airbyte_cdk/cli/source_declarative_manifest`. This module defines the `source-declarative-manifest` (aka "SDM") connector execution logic and associated CLI.
95
- - `airbyte_cdk/destinations`. Basic Destination connector support! If you're building a Destination connector in Python, try that. Some of our vector DB destinations like `destination-pinecone` are using that code.
96
- - `airbyte_cdk/models` expose `airbyte_protocol.models` as a part of `airbyte_cdk` package.
97
- - `airbyte_cdk/sources/concurrent_source` is the Concurrent CDK implementation. It supports reading data from streams concurrently per slice / partition, useful for connectors with high throughput and high number of records.
98
- - `airbyte_cdk/sources/declarative` is the low-code CDK. It works on top of Airbyte Python CDK, but provides a declarative manifest language to define streams, operations, etc. This makes it easier to build connectors without writing Python code.
99
- - `airbyte_cdk/sources/file_based` is the CDK for file-based sources. Examples include S3, Azure, GCS, etc.
100
-
101
- ## Contributing
102
-
103
- For instructions on how to contribute, please see our [Contributing Guide](docs/CONTRIBUTING.md).
104
-
105
- ## Release Management
106
-
107
- Please see the [Release Management](docs/RELEASES.md) guide for information on how to perform releases and pre-releases.
108
-