simcats-datasets 2.5.0__py3-none-any.whl → 2.6.0__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- simcats_datasets/__init__.py +1 -1
- simcats_datasets/generation/_create_dataset.py +50 -0
- simcats_datasets/generation/_create_simulated_dataset.py +168 -69
- simcats_datasets/loading/_load_dataset.py +24 -0
- simcats_datasets/loading/load_ground_truth.py +213 -2
- simcats_datasets/loading/pytorch.py +9 -2
- simcats_datasets/support_functions/clip_line_to_rectangle.py +15 -4
- simcats_datasets/support_functions/convert_lines.py +34 -0
- simcats_datasets/support_functions/data_preprocessing.py +112 -1
- simcats_datasets/support_functions/get_coulomb_oscillation_area_boundaries.py +471 -0
- simcats_datasets/support_functions/metadata_utils.py +62 -0
- simcats_datasets-2.6.0.dist-info/METADATA +163 -0
- simcats_datasets-2.6.0.dist-info/RECORD +22 -0
- {simcats_datasets-2.5.0.dist-info → simcats_datasets-2.6.0.dist-info}/WHEEL +1 -1
- simcats_datasets-2.5.0.dist-info/METADATA +0 -837
- simcats_datasets-2.5.0.dist-info/RECORD +0 -20
- {simcats_datasets-2.5.0.dist-info → simcats_datasets-2.6.0.dist-info/licenses}/LICENSE +0 -0
- {simcats_datasets-2.5.0.dist-info → simcats_datasets-2.6.0.dist-info}/top_level.txt +0 -0
|
@@ -0,0 +1,163 @@
|
|
|
1
|
+
Metadata-Version: 2.4
|
|
2
|
+
Name: simcats-datasets
|
|
3
|
+
Version: 2.6.0
|
|
4
|
+
Summary: SimCATS-Datasets is a Python package that simplifies the creation and loading of SimCATS datasets.
|
|
5
|
+
Author-email: Fabian Hader <f.hader@fz-juelich.de>, Fabian Fuchs <f.fuchs@fz-juelich.de>, Karin Havemann <k.havemann@fz-juelich.de>, Sarah Fleitmann <s.fleitmann@fz-juelich.de>, Jan Vogelbruch <j.vogelbruch@fz-juelich.de>
|
|
6
|
+
License-Expression: GPL-3.0-or-later
|
|
7
|
+
Project-URL: homepage, https://github.com/f-hader/SimCATS-Datasets
|
|
8
|
+
Project-URL: documentation, https://simcats-datasets.readthedocs.io
|
|
9
|
+
Project-URL: source, https://github.com/f-hader/SimCATS-Datasets
|
|
10
|
+
Project-URL: tracker, https://github.com/f-hader/SimCATS-Datasets/issues
|
|
11
|
+
Classifier: Development Status :: 5 - Production/Stable
|
|
12
|
+
Classifier: Intended Audience :: Science/Research
|
|
13
|
+
Classifier: Programming Language :: Python
|
|
14
|
+
Classifier: Programming Language :: Python :: 3
|
|
15
|
+
Classifier: Programming Language :: Python :: 3.7
|
|
16
|
+
Classifier: Programming Language :: Python :: 3.8
|
|
17
|
+
Classifier: Programming Language :: Python :: 3.9
|
|
18
|
+
Classifier: Programming Language :: Python :: 3.10
|
|
19
|
+
Classifier: Programming Language :: Python :: 3.11
|
|
20
|
+
Classifier: Topic :: Scientific/Engineering
|
|
21
|
+
Classifier: Typing :: Typed
|
|
22
|
+
Requires-Python: >=3.7
|
|
23
|
+
Description-Content-Type: text/markdown
|
|
24
|
+
License-File: LICENSE
|
|
25
|
+
Requires-Dist: bezier
|
|
26
|
+
Requires-Dist: bm3d
|
|
27
|
+
Requires-Dist: h5py
|
|
28
|
+
Requires-Dist: hdf5storage
|
|
29
|
+
Requires-Dist: numpy
|
|
30
|
+
Requires-Dist: opencv-python
|
|
31
|
+
Requires-Dist: pandas
|
|
32
|
+
Requires-Dist: parallelbar
|
|
33
|
+
Requires-Dist: parse
|
|
34
|
+
Requires-Dist: scikit-image
|
|
35
|
+
Requires-Dist: simcats>=2.0.0
|
|
36
|
+
Requires-Dist: torch
|
|
37
|
+
Requires-Dist: tqdm
|
|
38
|
+
Requires-Dist: xarray
|
|
39
|
+
Dynamic: license-file
|
|
40
|
+
|
|
41
|
+
<h1 align="center">
|
|
42
|
+
<img src="https://raw.githubusercontent.com/f-hader/SimCATS-Datasets/main/SimCATS-Datasets_symbol.svg" alt="SimCATS logo">
|
|
43
|
+
<br>
|
|
44
|
+
</h1>
|
|
45
|
+
|
|
46
|
+
<div align="center">
|
|
47
|
+
<a href="https://github.com/f-hader/SimCATS-Datasets/blob/main/LICENSE">
|
|
48
|
+
<img src="https://img.shields.io/badge/License-GPLv3-blue.svg" alt="License: GPLv3"/>
|
|
49
|
+
</a>
|
|
50
|
+
<a href="https://pypi.org/project/simcats-datasets/">
|
|
51
|
+
<img src="https://img.shields.io/pypi/v/simcats-datasets.svg" alt="PyPi Latest Release"/>
|
|
52
|
+
</a>
|
|
53
|
+
<a href="https://simcats-datasets.readthedocs.io/en/latest/">
|
|
54
|
+
<img src="https://img.shields.io/readthedocs/simcats-datasets" alt="Read the Docs"/>
|
|
55
|
+
</a>
|
|
56
|
+
<a href="https://doi.org/10.1109/TQE.2024.3445967">
|
|
57
|
+
<img src="https://img.shields.io/badge/DOI (SimCATS Paper)-10.1109/TQE.2024.3445967-007ec6.svg" alt="DOI Paper"/>
|
|
58
|
+
</a>
|
|
59
|
+
<a href="https://doi.org/10.5281/zenodo.13862231">
|
|
60
|
+
<img src="https://img.shields.io/badge/DOI (Code)-10.5281/zenodo.13862231-007ec6.svg" alt="DOI Code"/>
|
|
61
|
+
</a>
|
|
62
|
+
</div>
|
|
63
|
+
|
|
64
|
+
# SimCATS-Datasets
|
|
65
|
+
|
|
66
|
+
`SimCATS-Datasets` is a Python package that simplifies the creation and loading of `SimCATS` datasets. Please have a look at
|
|
67
|
+
[this repository](https://github.com/f-hader/SimCATS) regarding `SimCATS` itself.
|
|
68
|
+
|
|
69
|
+
## Installation
|
|
70
|
+
|
|
71
|
+
The framework supports Python versions 3.7 - 3.11 and installs via pip:
|
|
72
|
+
```
|
|
73
|
+
pip install simcats-datasets
|
|
74
|
+
```
|
|
75
|
+
|
|
76
|
+
Alternatively, the `SimCATS-Datasets` package can be installed by cloning the GitHub repository, navigating to the
|
|
77
|
+
folder containing the `setup.py` file, and executing
|
|
78
|
+
```
|
|
79
|
+
pip install .
|
|
80
|
+
```
|
|
81
|
+
|
|
82
|
+
For installation in development/editable mode, use the option `-e`.
|
|
83
|
+
|
|
84
|
+
<!-- start sec:documentation -->
|
|
85
|
+
## Documentation
|
|
86
|
+
|
|
87
|
+
The official documentation is hosted on [ReadtheDocs](https://simcats-datasets.readthedocs.io) but can also be built
|
|
88
|
+
locally. To do this, first install the packages `sphinx`, `sphinx-rtd-theme`, `sphinx-autoapi`, `myst-nb `, and
|
|
89
|
+
`jupytext` with
|
|
90
|
+
|
|
91
|
+
```
|
|
92
|
+
pip install sphinx sphinx-rtd-theme sphinx-autoapi myst-nb jupytext
|
|
93
|
+
```
|
|
94
|
+
|
|
95
|
+
and then, in the `docs` folder, execute the following command:
|
|
96
|
+
|
|
97
|
+
```
|
|
98
|
+
.\make html
|
|
99
|
+
```
|
|
100
|
+
|
|
101
|
+
To view the generated HTML documentation, open the file `docs\build\html\index.html`.
|
|
102
|
+
<!-- end sec:documentation -->
|
|
103
|
+
|
|
104
|
+
|
|
105
|
+
## Loading Datasets
|
|
106
|
+
|
|
107
|
+
Datasets created with `SimCATS-Datasets` are stored in HDF5 files. These datasets can be loaded using the function
|
|
108
|
+
`load_dataset` from `simcats_datasets.loading`.
|
|
109
|
+
|
|
110
|
+
The return value of the function is a named tuple. The fields can be accessed by their name or index. As with normal
|
|
111
|
+
tuples, it is also possible to unpack the returned fields directly into separate variables. The available fields
|
|
112
|
+
depend on which data was specified to be loaded. Please look at the docstring for further information.
|
|
113
|
+
|
|
114
|
+
Additionally, `SimCATS-Datasets` offers a pytorch dataset (see `torch.utils.data.Dataset`) implementation called
|
|
115
|
+
`SimcatsDataset`. It allows the direct use of `SimCATS` datasets for machine learning purposes with Torch and can be
|
|
116
|
+
imported from `simcats_datasets.loading.pytorch`.
|
|
117
|
+
|
|
118
|
+
## Creating Datasets
|
|
119
|
+
|
|
120
|
+
To create a simulated dataset, import `create_simulated_dataset` from `simcats_datasets.generation`. This function
|
|
121
|
+
allows the creation of simulated CSDs with ground truth very easily. It is also possible to add further CSDs to already
|
|
122
|
+
existing datasets. The function will detect the existing dataset automatically. For the function's usage, please have a
|
|
123
|
+
look at its docstring.
|
|
124
|
+
|
|
125
|
+
| :warning: WARNING |
|
|
126
|
+
|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
|
127
|
+
| The functionalities for creating and extending simulated datasets using SimCATS expect that the SimCATS simulation uses the IdealCSDInterface implementation called IdealCSDGeometric. Other implementations might cause problems because the expected information for creating labeled lines etc. might be unavailable. |
|
|
128
|
+
|
|
129
|
+
|
|
130
|
+
Alternatively, to using `create_simulated_dataset` and directly simulating a dataset with `SimCATS`, it is also possible
|
|
131
|
+
to create a `SimCATS-Dataset` compatible dataset with existing data (for example, experimentally measured data or data
|
|
132
|
+
simulated with other frameworks). This can be done using `create_dataset` from `simcats_datasets.generation`.
|
|
133
|
+
|
|
134
|
+
## Citations
|
|
135
|
+
|
|
136
|
+
```bibtex
|
|
137
|
+
@article{hader2024simcats,
|
|
138
|
+
author={Hader, Fabian and Fleitmann, Sarah and Vogelbruch, Jan and Geck, Lotte and Waasen, Stefan van},
|
|
139
|
+
journal={IEEE Transactions on Quantum Engineering},
|
|
140
|
+
title={Simulation of Charge Stability Diagrams for Automated Tuning Solutions (SimCATS)},
|
|
141
|
+
year={2024},
|
|
142
|
+
volume={5},
|
|
143
|
+
pages={1-14},
|
|
144
|
+
doi={10.1109/TQE.2024.3445967}
|
|
145
|
+
}
|
|
146
|
+
```
|
|
147
|
+
|
|
148
|
+
## License, CLA, and Copyright
|
|
149
|
+
|
|
150
|
+
[![CC BY-NC-SA 4.0][gplv3-shield]][gplv3]
|
|
151
|
+
|
|
152
|
+
This work is licensed under a
|
|
153
|
+
[GNU General Public License 3][gplv3].
|
|
154
|
+
|
|
155
|
+
[![GPLv3][gplv3-image]][gplv3]
|
|
156
|
+
|
|
157
|
+
[gplv3]: https://www.gnu.org/licenses/gpl-3.0.html
|
|
158
|
+
[gplv3-image]: https://www.gnu.org/graphics/gplv3-127x51.png
|
|
159
|
+
[gplv3-shield]: https://img.shields.io/badge/License-GPLv3-blue.svg
|
|
160
|
+
|
|
161
|
+
Contributions must follow the Contributor License Agreement. For more information, see the [CONTRIBUTING.md](https://github.com/f-hader/SimCATS-Datasets/blob/main/CONTRIBUTING.md) file at the top of the GitHub repository.
|
|
162
|
+
|
|
163
|
+
Copyright © 2026 Peter Grünberg Institute - Integrated Computing Architectures (ICA / PGI-4), Forschungszentrum Jülich GmbH
|
|
@@ -0,0 +1,22 @@
|
|
|
1
|
+
simcats_datasets/__init__.py,sha256=CGdvmMJ1FurXbest3M1UNpqxe3IPeIGGoUgoSHbViRQ,37
|
|
2
|
+
simcats_datasets/generation/__init__.py,sha256=yjmfaDuZVqAD2EhLkmNIP1HEiuPTXgBIZVhBO8otGTE,344
|
|
3
|
+
simcats_datasets/generation/_create_dataset.py,sha256=0La9eIqFIYxg6HypjhO7TpXdgJBzqkcp_KGIn9PklbU,19947
|
|
4
|
+
simcats_datasets/generation/_create_simulated_dataset.py,sha256=Mt8jU7bfeUrsqVAeZnrJ-q_J-JDoONL6OxjLW_QgOak,24603
|
|
5
|
+
simcats_datasets/loading/__init__.py,sha256=KiaJAJX0uIBg_bbZ8yYmolanfd6oV8JbOBxnct3U3TQ,287
|
|
6
|
+
simcats_datasets/loading/_load_dataset.py,sha256=gVqe7HHg4OixRMZo8tn1oMXTliWyPE3Tje9Ra9RFnmQ,12207
|
|
7
|
+
simcats_datasets/loading/load_ground_truth.py,sha256=4MVbusWOeIzz4aoTOk7x2ux-gywGTgZaVDh8naMDMhs,45268
|
|
8
|
+
simcats_datasets/loading/pytorch.py,sha256=P8FLmQRIrzbwYnQe0yWKJRFFjjZdVdxbeGT36XfOdus,28990
|
|
9
|
+
simcats_datasets/support_functions/__init__.py,sha256=mWmt41IjosziXPd86JiR-O7A_k06IMnYej2h1YsNMUo,73
|
|
10
|
+
simcats_datasets/support_functions/_json_encoders.py,sha256=BZrVXtUqwmqT0ki-jnWraYUnNMy0l18tJWdFjzJSgqg,1435
|
|
11
|
+
simcats_datasets/support_functions/clip_line_to_rectangle.py,sha256=120YsOAbqSoB5uC45NLWfiZx4G5JwBAN5zFQCoU4dJc,9399
|
|
12
|
+
simcats_datasets/support_functions/convert_lines.py,sha256=qD1faASp4p6JwjMLyXYvUq-rcLVCc2oH5_duA3U-uFo,6534
|
|
13
|
+
simcats_datasets/support_functions/data_preprocessing.py,sha256=yBUsQJvpl3_ivJXUH48bgXi_yRF55G05Sj9ADtKonyI,17611
|
|
14
|
+
simcats_datasets/support_functions/get_coulomb_oscillation_area_boundaries.py,sha256=l9-AbFYYHIFV4ADGLXxyJUBDISQ_9AhUk4RtQK678JE,26862
|
|
15
|
+
simcats_datasets/support_functions/get_lead_transition_labels.py,sha256=-TnTxmJuxokD1Vprb9-IuE62Ys82HvlJ8VEQXCx7tY0,5691
|
|
16
|
+
simcats_datasets/support_functions/metadata_utils.py,sha256=d4f7fCdoOft74GZywRXt1AQaSQ5MwILqaIomCrp9rTI,2479
|
|
17
|
+
simcats_datasets/support_functions/pytorch_format_output.py,sha256=o6v73-s-u8YhPmeZxtxLpBKQ4dRasnHard9DoMlEUpI,8337
|
|
18
|
+
simcats_datasets-2.6.0.dist-info/licenses/LICENSE,sha256=IwGE9guuL-ryRPEKi6wFPI_zOhg7zDZbTYuHbSt_SAk,35823
|
|
19
|
+
simcats_datasets-2.6.0.dist-info/METADATA,sha256=gaY2C_Fck0ohMcB0DP-87V0hHDTgGLU0ZJUV4rdwT1w,7653
|
|
20
|
+
simcats_datasets-2.6.0.dist-info/WHEEL,sha256=wUyA8OaulRlbfwMtmQsvNngGrxQHAvkKcvRmdizlJi0,92
|
|
21
|
+
simcats_datasets-2.6.0.dist-info/top_level.txt,sha256=6PQ9YF0TvHYNZHES-hfT1RCExihiv4o5J_O2JyYea74,17
|
|
22
|
+
simcats_datasets-2.6.0.dist-info/RECORD,,
|