recce-nightly 1.1.0.20250423__py3-none-any.whl → 1.1.0.20250427__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Potentially problematic release.


This version of recce-nightly might be problematic. Click here for more details.

@@ -1,311 +0,0 @@
1
- Metadata-Version: 2.4
2
- Name: recce-nightly
3
- Version: 1.1.0.20250423
4
- Summary: Environment diff tool for dbt
5
- Home-page: https://github.com/InfuseAI/recce
6
- Author: InfuseAI Dev Team
7
- Author-email: dev@infuseai.io
8
- Project-URL: Bug Tracker, https://github.com/InfuseAI/recce/issues
9
- Classifier: Programming Language :: Python :: 3.9
10
- Classifier: Programming Language :: Python :: 3.10
11
- Classifier: Programming Language :: Python :: 3.11
12
- Classifier: License :: OSI Approved :: Apache Software License
13
- Classifier: Operating System :: OS Independent
14
- Classifier: Development Status :: 4 - Beta
15
- Requires-Python: >=3.9
16
- Description-Content-Type: text/markdown
17
- License-File: LICENSE
18
- Requires-Dist: boto3
19
- Requires-Dist: requests>=2.28.1
20
- Requires-Dist: ruamel.yaml>=0.18.6
21
- Requires-Dist: click>=7.1
22
- Requires-Dist: deepdiff<8.0,>=7.0
23
- Requires-Dist: portalocker
24
- Requires-Dist: fastapi
25
- Requires-Dist: itsdangerous
26
- Requires-Dist: uvicorn
27
- Requires-Dist: pydantic
28
- Requires-Dist: jinja2
29
- Requires-Dist: requests>=2.28.1
30
- Requires-Dist: rich>=12.0.0
31
- Requires-Dist: sentry-sdk
32
- Requires-Dist: watchdog
33
- Requires-Dist: websockets
34
- Requires-Dist: py-markdown-table
35
- Requires-Dist: python-dateutil
36
- Requires-Dist: python-multipart
37
- Requires-Dist: GitPython
38
- Requires-Dist: PyGithub
39
- Requires-Dist: sqlglot
40
- Provides-Extra: dev
41
- Requires-Dist: pytest>=4.6; extra == "dev"
42
- Requires-Dist: pytest-flake8; extra == "dev"
43
- Requires-Dist: flake8==3.9.2; extra == "dev"
44
- Requires-Dist: pytest-mypy; extra == "dev"
45
- Requires-Dist: pytest-cov; extra == "dev"
46
- Requires-Dist: twine; extra == "dev"
47
- Requires-Dist: tox; extra == "dev"
48
- Requires-Dist: pandas; extra == "dev"
49
- Requires-Dist: httpx; extra == "dev"
50
- Dynamic: author
51
- Dynamic: author-email
52
- Dynamic: classifier
53
- Dynamic: description
54
- Dynamic: description-content-type
55
- Dynamic: home-page
56
- Dynamic: license-file
57
- Dynamic: project-url
58
- Dynamic: provides-extra
59
- Dynamic: requires-dist
60
- Dynamic: requires-python
61
- Dynamic: summary
62
-
63
- <p align="center">
64
- <a href="https://datarecce.io">
65
- <picture>
66
- <source media="(prefers-color-scheme: dark)" srcset="https://datarecce.io/assets/images/recce-logo-stacked.png">
67
- <source media="(prefers-color-scheme: light)" srcset="https://datarecce.io/assets/images/recce-logo-stacked.png">
68
- <img alt="Recce: DataRecce.io" src="https://datarecce.io/assets/images/recce-logo-stacked.png" width="200" style="width: 100%; max-width: 200px; display: block; margin: 0 auto 20px;">
69
- </picture>
70
- </a>
71
- </p>
72
-
73
- <h3 align="center">The&nbsp;data&nbsp;validation&nbsp;toolkit<br />for teams that care about building better data</h3>
74
-
75
- <p align="center">
76
- <a href="https://pypi.org/project/recce/"><img src="https://img.shields.io/badge/pip_install-recce-006DAD?style=flat-square" alt="install"></a> &nbsp;
77
- <a href="https://pypi.org/project/recce/"><img src="https://img.shields.io/pypi/v/recce?style=flat-square" alt="pipy"></a> &nbsp;
78
- <a href="https://pypi.org/project/recce/"><img src="https://img.shields.io/pypi/pyversions/recce?style=flat-square" alt="Python"></a> &nbsp;
79
- <a href="https://pypi.org/project/recce/#files"><img src="https://img.shields.io/pypi/dw/recce?style=flat-square" alt="downloads"></a> &nbsp;
80
- <a href="https://github.com/DataRecce/recce/blob/main/LICENSE"><img src="https://img.shields.io/github/license/DataRecce/recce?style=flat-square" alt="license"></a> &nbsp;
81
- <a href="https://getdbt.slack.com/archives/C05C28V7CPP"><img src="https://img.shields.io/badge/Slack-4A154B?style=flat-square&amp;logo=slack&amp;logoColor=white" alt="Slack"></a> &nbsp;
82
- <a href="https://discord.com/invite/5zb2aK9KBV"><img src="https://img.shields.io/discord/664381609771925514?color=%237289DA&amp;label=chat&amp;logo=discord&amp;logoColor=white&amp;style=flat-square" alt="InfuseAI Discord Invite"></a> &nbsp;
83
- </p>
84
-
85
- <p align="center">
86
- <a href="https://cal.com/team/recce/chat?utm_source=banner&utm_campaign=oss">
87
- <img alt="Book us with Cal.com" src="https://cal.com/book-with-cal-light.svg" />
88
- </a>
89
- </p>
90
-
91
- ## Introduction
92
-
93
- `Recce` is data validation toolkit for pull request (PR) review in dbt projects. Get enhanced visibility into how your
94
- team’s dbt modeling changes impact data by comparing your dev branch with stable production data. Run manual data checks
95
- during development, and automate checks in CI for PR review.
96
-
97
- ## Quick Start
98
-
99
- You can launch Recce in any dbt project in just two commands:
100
-
101
- ```Bash
102
- # cd into your dbt project
103
- pip install -U recce
104
- recce server
105
- ```
106
-
107
- To use the full suite of diffing tools in Recce, please prepare two environments to compare against. Follow
108
- our [5-minute Jaffle Shop tutorial](https://docs.datarecce.io/get-started-jaffle-shop/) to try it out for yourself.
109
-
110
- ## What you get
111
-
112
- `recce server` launches a web UI that shows you the area of your lineage that is impacted by the branch changes.
113
-
114
- <a href="https://datarecce.io"><img src="https://datarecce.io/assets/images/readme/recce-overview-screenshot.png" style="width: 100%; max-width: 600px; display: block; margin: 0 auto 20px;" /></a>
115
-
116
- ### Using Recce for Impact Assessment in dbt PR Review
117
-
118
- - Select nodes in the lineage to perform Checks (diffs) as part of your impact assessment during development or PR
119
- review.
120
- - Add Checks to your Checklist to note observed impacts.
121
- - Share your Checklist with the PR reviewer.
122
- - (`Recce Cloud`) Automatically sync Check status between Recce Instances
123
- - (`Recce Cloud`) Block PR merging until all Recce Checks have been approved
124
-
125
- Read more about using [Recce for Impact Assessment](https://datarecce.io/blog/hands-on-data-impact-analysis-recce/) on
126
- the Recce blog.
127
-
128
- ## Try the Online Demo
129
-
130
- We provide three online Recce demos (based on Jaffle Shop), each is related to a specific pull request. Use these demos
131
- to inspect the data impact caused by the modeling changes in the PR.
132
-
133
- For each demo, review the following:
134
-
135
- - The pull request comment
136
- - The code changes
137
- - How the lineage and data has changed in `Recce`
138
-
139
- This will enable you to validate if the intention of the PR has been successfully implemented without unintended impact.
140
-
141
- > [!TIP]
142
- > Don't forget to click the Checks tab to view the Recce Checklist, and perform your own Checks for further
143
- > investigation.
144
- >
145
-
146
- ### Demo 1: Calculation logic change
147
-
148
- This pull request adjusts the **logic** for how customer lifetime value is calculated:
149
-
150
- - [Demo #1](https://pr1.demo.datarecce.io/)
151
- - [Pull request #1](https://github.com/DataRecce/jaffle_shop_duckdb/pull/1)
152
-
153
- ### Demo 2: Refactoring
154
-
155
- This pull request performs some **refactoring** on the customers model by turning two CTEs into intermediate models,
156
- enhancing readability and maintainability:
157
-
158
- - [Demo #2](https://pr2.demo.datarecce.io/)
159
- - [Pull request #2](https://github.com/DataRecce/jaffle_shop_duckdb/pull/2)
160
-
161
- ### Demo 3: Analysis
162
-
163
- This pull request introduces a new Rounding Effect **Analysis** feature, aimed at analyzing and reporting the impacts of
164
- rounding in our data processing.
165
-
166
- - [Demo #3](https://pr3.demo.datarecce.io/)
167
- - [Pull request #3](https://github.com/DataRecce/jaffle_shop_duckdb/pull/3)
168
-
169
- ### Demo 4: Enhancing the existing model
170
-
171
- This pull request **enhances** the existing `customers` model by categorizing whether a customer has placed an order as
172
- part of a promotion.
173
-
174
- - [Demo #4](https://pr44.demo.datarecce.io/)
175
- - [Pull request #44](https://github.com/DataRecce/jaffle_shop_duckdb/pull/44)
176
-
177
- ## Why `Recce`
178
-
179
- [dbt](https://www.getdbt.com/) has brought many software best practices to data projects, such as:
180
-
181
- - Version controlled code
182
- - Modular SQL
183
- - Reproducible pipelines
184
-
185
- Even so, 'bad merges' still happen and erroneous data and silent errors make their way into prod data. As self-serve
186
- analytics opens dbt projects to many roles, and the size of dbt projects increase, the job of reviewing data modeling
187
- changes is even more critical.
188
-
189
- The only way to understand the impact of code changes on data is to compare the data before-and-after the changes.
190
-
191
- ## Features
192
-
193
- `Recce` provides a data review environment for data teams to check their work during development, and then again as part
194
- of PR review. The suite of tools and diffs in Recce are specifically geared towards surfacing, understanding, and
195
- recording data impact from code changes.
196
-
197
- ### Lineage Diff
198
-
199
- [Lineage Diff](https://docs.datarecce.io/features/lineage/) is the main interface to `Recce` and shows which nodes in
200
- the lineage have been added, removed, or modified.
201
-
202
- ### Structural Diffs
203
-
204
- - [Schema Diff](https://docs.datarecce.io/features/lineage/#schema-diff) - Show the struture of the table including
205
- added or removed columns
206
- - [Row Count Diff](https://docs.datarecce.io/features/lineage/#row-count-diff) - Compares the row count for tables
207
-
208
- ### Statistical Diffs
209
-
210
- Advanced Diffs provide high level statistics about data change:
211
-
212
- - [Profile Diff](https://docs.datarecce.io/features/lineage/#profile-diff): Compares stats such as count, distinct
213
- count, min, max, average.
214
- - [Value Diff](https://docs.datarecce.io/features/lineage/#value-diff): The matched count and percentage for each column
215
- in the table.
216
- - [Top-K Diff](https://docs.datarecce.io/features/lineage/#top-k-diff): Compares the distribution of a categorical
217
- column.
218
- - [Histogram Diff](https://docs.datarecce.io/features/lineage/#histogram-diff): Compares the distribution of a numeric
219
- column in an overlay histogram chart.
220
-
221
- ### Query Diff
222
-
223
- [Query Diff](https://docs.datarecce.io/features/query/) compares the results of any ad-hoc query, and supports the use
224
- of dbt macros.
225
-
226
- ### Checklist
227
-
228
- The [checklist](https://docs.datarecce.io/features/checklist/) provides a way to record the results of your data
229
- validation process.
230
-
231
- - Save the results of checks
232
- - Re-run checks
233
- - Annotate checks to add context
234
- - Share the results of checks
235
- - (`Recce Cloud`) Sync checks and check results across Recce instances
236
- - (`Recce Cloud`) Block PR merging until checks have been approved
237
-
238
- ## Who's using `Recce`?
239
-
240
- `Recce` is useful for validating your own work or the work of others, and can also be used to share data impact with
241
- non-technical stakeholders to approve data checks.
242
-
243
- - **Data engineers** can use `Recce` to ensure the structural integrity of the data and understand the scope of impact
244
- before merging.
245
- - **Analysts** can use `Recce` to self-review and understand how data modeling changes have changed the data.
246
- - **Stakeholders** can use `Recce` to sign-off on data after updates have been made
247
-
248
- ## Documentation / How to use `Recce`
249
-
250
- The [Recce Documentation](https://docs.datarecce.io/) covers everything you need to get started.
251
-
252
- We’d advise first following the [5-minute tutorial](https://docs.datarecce.io/get-started-jaffle-shop/) that uses Jaffle
253
- Shop and then [trying out Recce](https://docs.datarecce.io/get-started/) in your own project.
254
-
255
- For advice on best practices in preparing dbt environments to enable effective PR review, check
256
- out [Best Practices for Preparing Environments](https://docs.datarecce.io/guides/best-practices-prep-env/).
257
-
258
- ## Recce Cloud
259
-
260
- `Recce Cloud` provides a backbone of supporting services that make Recce usage more suitable for teams reviewing
261
- multiple pull requests.
262
-
263
- With `Recce Cloud`:
264
-
265
- - `Recce` Instances can be launched **directly from a PR**
266
- - **Checks** are automatically **synced** across `Recce` Instances
267
- - **Blocked merging** until all checks are approved
268
-
269
- [Recce Cloud](https://datarecce.io/cloud) is currently in early-access private beta.
270
-
271
- To find out how you can get access
272
- please [book an appointment](https://cal.com/team/recce/chat?utm_source=banner&utm_campaign=oss) for a short meeting.
273
-
274
- <a href="https://cal.com/team/recce/chat?utm_source=banner&utm_campaign=oss">
275
- <img alt="Book us with [Cal.com](http://cal.com/)" src="https://cal.com/book-with-cal-light.svg" />
276
- </a>
277
-
278
- ## Data Security
279
-
280
- `Recce` consists of a local server application that you run on your own device or compute services.
281
-
282
- - Diffs or queries that are performed by `Recce` happen either in your data warehouse, or in the browser itself.
283
- - `Recce` does not store your data.
284
-
285
- For `Recce Cloud` users:
286
-
287
- - An encrypted version of your `Recce` [state file](https://docs.datarecce.io/features/state-file/) is storedon
288
- `Recce Cloud`. This file is encrypted *before* transmission.
289
-
290
- ## Community & Support
291
-
292
- Here's where you can get in touch with the `Recce` team and find support:
293
-
294
- - [dbt Slack](https://www.getdbt.com/community/join-the-community) in
295
- the [#tools-recce](https://getdbt.slack.com/archives/C05C28V7CPP) channel
296
- - [Recce Discord](https://discord.gg/VpwXRC34jz)
297
- - Email us [product@datarecce.io](mailto:product@datarecce.io)
298
-
299
- If you believe you have found a bug, or there is some missing functionality in Recce, please open
300
- a [GitHub Issue](https://github.com/DataRecce/recce/issues).
301
-
302
- ## Recce on the web
303
-
304
- You can follow along with news about `Recce` and blogs from our team in the following places:
305
-
306
- - [DataRecce.io](https://DataRecce.io)
307
- - [LinkedIn](https://www.linkedin.com/company/datarecce)
308
- - [Medium blog](https://medium.com/inthepipeline)
309
- - [@datarecce](https://x.com/DataRecce) on Twitter/X
310
- - [@DataRecce@mastodon.social](https://mastodon.social/@DataRecce) on Mastodon
311
- - [@datarecce.bsky.social](https://bsky.app/profile/datarecce.bsky.social) on BlueSky