pyleotups 0.0.1__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- pyleotups-0.0.1/PKG-INFO +16 -0
- pyleotups-0.0.1/README.md +49 -0
- pyleotups-0.0.1/pyleotups/__init__.py +6 -0
- pyleotups-0.0.1/pyleotups/core/Dataset.py +803 -0
- pyleotups-0.0.1/pyleotups/core/__init__.py +2 -0
- pyleotups-0.0.1/pyleotups/tests/__init__.py +0 -0
- pyleotups-0.0.1/pyleotups/tests/helpers/__init__.py +0 -0
- pyleotups-0.0.1/pyleotups/tests/helpers/mock_study_response.py +8 -0
- pyleotups-0.0.1/pyleotups/tests/test_Dataset.py +393 -0
- pyleotups-0.0.1/pyleotups/tests/test_NonStandardParser.py +66 -0
- pyleotups-0.0.1/pyleotups/utils/NOAADataset.py +160 -0
- pyleotups-0.0.1/pyleotups/utils/PaleoData.py +120 -0
- pyleotups-0.0.1/pyleotups/utils/Parser/NonStandardParser.py +389 -0
- pyleotups-0.0.1/pyleotups/utils/Parser/StandardParser.py +940 -0
- pyleotups-0.0.1/pyleotups/utils/Parser/__init__.py +2 -0
- pyleotups-0.0.1/pyleotups/utils/Publication.py +129 -0
- pyleotups-0.0.1/pyleotups/utils/Site.py +68 -0
- pyleotups-0.0.1/pyleotups/utils/__init__.py +6 -0
- pyleotups-0.0.1/pyleotups/utils/helpers.py +49 -0
- pyleotups-0.0.1/pyleotups.egg-info/PKG-INFO +16 -0
- pyleotups-0.0.1/pyleotups.egg-info/SOURCES.txt +24 -0
- pyleotups-0.0.1/pyleotups.egg-info/dependency_links.txt +1 -0
- pyleotups-0.0.1/pyleotups.egg-info/requires.txt +7 -0
- pyleotups-0.0.1/pyleotups.egg-info/top_level.txt +1 -0
- pyleotups-0.0.1/pyproject.toml +33 -0
- pyleotups-0.0.1/setup.cfg +4 -0
pyleotups-0.0.1/PKG-INFO
ADDED
@@ -0,0 +1,16 @@
+Metadata-Version: 2.4
+Name: pyleotups
+Version: 0.0.1
+Summary: Automated Paleoclimate Data Extraction and Processing
+Author-email: Dhiren Oswal <linkedearth@gmail.com>, Deborah Khider <linkedearth@gmail.com>, Jay Pujara <linkedearth@gmail.com>
+License: Apache License 2.0
+Project-URL: Homepage, https://github.com/LinkedEarth/pyleotups/pyleotups
+Keywords: pyleotups,Table Understanding,Data Analysis
+Requires-Python: >=3.8
+Requires-Dist: pandas==2.1.4
+Requires-Dist: requests
+Requires-Dist: numpy
+Requires-Dist: pybtex
+Requires-Dist: setuptools
+Requires-Dist: tqdm
+Requires-Dist: nbstripout
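The PKG-INFO fields above follow the header-style core metadata layout, so they can be read with the standard library's email parser. The following is a minimal sketch, assuming an abridged copy of the fields shown in the diff; the helper name `read_core_metadata` is hypothetical and is not part of pyleotups.

```python
from email.parser import Parser

# Abridged copy of the PKG-INFO fields shown in the diff above.
PKG_INFO = """\
Metadata-Version: 2.4
Name: pyleotups
Version: 0.0.1
Requires-Python: >=3.8
Requires-Dist: pandas==2.1.4
Requires-Dist: requests
Requires-Dist: numpy
"""


def read_core_metadata(text):
    """Hypothetical helper: parse header-style metadata into a dict."""
    msg = Parser().parsestr(text)
    return {
        "name": msg["Name"],
        "version": msg["Version"],
        "requires_python": msg["Requires-Python"],
        # Requires-Dist repeats, so collect every occurrence.
        "requires_dist": msg.get_all("Requires-Dist", []),
    }


meta = read_core_metadata(PKG_INFO)
# Requirements pinned to an exact version with "==".
pinned = [req for req in meta["requires_dist"] if "==" in req]
print(meta["name"], meta["version"], pinned)
```

In this release only pandas is pinned to an exact version; the other requirements listed in the metadata are unconstrained.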
pyleotups-0.0.1/README.md
ADDED
@@ -0,0 +1,49 @@
+[]()
+[](https://www.nsf.gov/awardsearch/showAward?AWD_ID=2411267)
+[](https://doi.org/10.5281/zenodo.16009164)
+
+
+<p align="center">
+<img src="https://github.com/LinkedEarth/Logos/blob/master/PyleoTUPS/pyleotups_logo.png?raw=true" width="50%">
+</p>
+
+
+# PyleoTUPS: Automated Paleoclimate Data Extraction and Processing
+
+PyleoTUPS is a Python package designed to streamline paleoclimate data workflows by automating the extraction and processing of datasets from major paleoclimate repositories. The package addresses a critical bottleneck in paleoclimate research: the time-consuming manual process of accessing, extracting, and formatting data from diverse file formats and repositories.
+
+## Key Features
+
+* **Automated Data Extraction**: Leverages table understanding techniques to automatically extract data tables from complex text files, including NOAA Paleoclimate templates that have evolved over decades with varying formats and structures.
+* **Multi-Repository Access**: Provides unified access to datasets from two major paleoclimate repositories - NOAA NCEI Paleoclimate and PANGAEA (coming soon!) - through their respective APIs and direct file processing capabilities.
+* **Format Flexibility**: Handles multiple input formats including structured text files, CSV, and Excel files, automatically parsing embedded metadata and data tables regardless of template variations.
+* **Scientific Python Integration**: Returns extracted data as pandas DataFrames with preserved metadata attributes, ensuring seamless integration with the broader Python scientific ecosystem including NumPy, SciPy, and specialized paleoclimate libraries.
+* **Metadata Preservation**: Maintains comprehensive metadata linkage, storing dataset-level information (location, authors, publications) as dictionaries while preserving column-level metadata as DataFrame attributes.
+* **FAIR Data Compliance**: Supports community standards for Findable, Accessible, Interoperable, and Reusable (FAIR) data practices, with built-in compatibility with the [Linked Paleo Data (LiPD)](https://lipd.net) format and [NOAA PaST Thesaurus vocabulary](https://www.ncei.noaa.gov/products/paleoclimatology/paleoenvironmental-standard-terms-thesaurus).
+
+## Target Users
+
+PyleoTUPS is designed for paleoclimate researchers, Earth system modelers, and data scientists working with paleoclimate observations. Whether you're conducting systematic data synthesis, model evaluation, or exploratory analysis, PyleoTUPS reduces the technical barriers to accessing and utilizing paleoclimate datasets.
+
+## Versions
+
+See our [releases page](https://github.com/LinkedEarth/PyleoTUPS/releases) for details on what's included in each version.
+
+
+## Development
+
+PyleoTUPS development takes place on GitHub: https://github.com/LinkedEarth/PyleoTUPS
+
+Please submit any reproducible bugs you encounter to the [issue tracker](https://github.com/LinkedEarth/PyleoTUPS/issues). For usage questions, please use [Discourse](https://discourse.linked.earth).
+
+
+## License
+
+The project is licensed under the Apache 2.0 license. Please refer to the file called license.
+If you use the code in publications, please credit the work using the citation file.
+
+
+### Disclaimer
+
+This material is based upon work supported by the National Science Foundation under Grant Number CSSI-2411267. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the investigators and do not necessarily reflect the views of the National Science Foundation.
+
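The README describes dataset-level metadata stored as dictionaries and column-level metadata kept as DataFrame attributes. The sketch below illustrates that general pattern with plain pandas only. It does not use the pyleotups API, which is not shown in this part of the diff, and every name and value in it (`dataset_meta`, `units_of`, the site and units) is a hypothetical placeholder. Note that pandas documents `DataFrame.attrs` as experimental.

```python
import pandas as pd

# Hypothetical dataset-level metadata, kept as a plain dictionary.
dataset_meta = {
    "site": "Example Lake",
    "authors": ["A. Researcher"],
}

# Hypothetical data table with two columns.
df = pd.DataFrame({
    "age": [100, 200, 300],
    "d18O": [-1.2, -1.5, -1.1],
})

# Attach metadata to the frame through DataFrame.attrs (a dict-like store).
df.attrs["dataset"] = dataset_meta
df.attrs["columns"] = {
    "age": {"units": "yr BP"},
    "d18O": {"units": "permil"},
}


def units_of(frame, column):
    """Hypothetical helper: look up the units recorded for a column."""
    return frame.attrs["columns"][column]["units"]


print(units_of(df, "age"))
print(df.attrs["dataset"]["site"])
```

Keeping the metadata on the frame itself means a function that receives only the DataFrame can still find the units and provenance. Whether `attrs` survives a given pandas operation depends on the pandas version, so it is safest to read the metadata from the original frame.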