PyPI - h5yaml - Versions diffs - 0.0.3__tar.gz → 0.0.5__tar.gz - Mend

h5yaml 0.0.3tar.gz → 0.0.5tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

{h5yaml-0.0.3 → h5yaml-0.0.5}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: h5yaml
-Version: 0.0.3
+Version: 0.0.5
 Summary: Use YAML configuration file to generate HDF5/netCDF4 formated files.
 Project-URL: Homepage, https://github.com/rmvanhees/h5_yaml
 Project-URL: Source, https://github.com/rmvanhees/h5_yaml
@@ -14,28 +14,51 @@ Classifier: Intended Audience :: Developers
 Classifier: Intended Audience :: Science/Research
 Classifier: Operating System :: OS Independent
 Classifier: Programming Language :: Python :: 3 :: Only
+Classifier: Programming Language :: Python :: 3.9
+Classifier: Programming Language :: Python :: 3.10
+Classifier: Programming Language :: Python :: 3.11
 Classifier: Programming Language :: Python :: 3.12
 Classifier: Programming Language :: Python :: 3.13
 Classifier: Topic :: Scientific/Engineering :: Atmospheric Science
-Requires-Python: >=3.12
+Requires-Python: >=3.9
 Requires-Dist: h5py>=3.13
 Requires-Dist: netcdf4>=1.7
 Requires-Dist: numpy>=2.2
 Requires-Dist: pyyaml>=6.0
 Description-Content-Type: text/markdown
-# H5_YAML
+# H5YAML
+[![image](https://img.shields.io/pypi/v/h5yaml.svg?label=release)](https://github.com/rmvanhees/h5yaml/)
+[![image](https://img.shields.io/pypi/l/h5yaml.svg)](https://github.com/rmvanhees/h5yaml/LICENSE)
+[![image](https://img.shields.io/pypi/dm/h5yaml.svg)](https://pypi.org/project/h5yaml/)
+[![image](https://img.shields.io/pypi/status/h5yaml.svg?label=status)](https://pypi.org/project/h5yaml/)
 ## Description
-Use YAML configuration file to generate HDF5/netCDF4 formated files.
+This package let you generate [HDF5](https://docs.h5py.org/en/stable/)/[netCDF4](https://unidata.github.io/netcdf4-python/)
+formatted files as defined in a [YAML](https://yaml.org/) configuration file. This has several advantages:
-The class `NcYaml` must be used when strict conformance to the netCDF4 format is
-required. However, the python netCDF4 implementation does not allow variable-length
-data to have a compound data-type. The class `H5Yaml` does not have this restiction
-and will generate HDF5 formated files which can be read by netCDF4 software.
+ * you define the layout of your HDF5/netCDF4 file using YAML which is human-readable and has intuitive syntax.
+ * you can reuse the YAML configuration file to to have all your product have a consistent layout.
+ * you can make updates by only changing the YAML configuration file
+ * you can have the layout of your HDF5/netCDF4 file as a python dictionary, thus without accessing any HDF5/netCDF4 file
+The `H5YAML` package has two classes to generate a HDF5/netCDF4 formatted file.
+ 1. The class `H5Yaml` uses the [h5py](https://pypi.org/project/h5py/) package, which is a Pythonic interface to
+    the HDF5 binary data format.
+    Let 'h5_def.yaml' be your YAML configuration file then ```H5Yaml("h5_def.yaml").create("foo.h5")``` will create
+	the HDF5 file 'foo.h5'. This can be read by netCDF4 software, because it uses dimension-scales to each dataset.
+ 2. The class `NcYaml` uses the [netCDF4](https://pypi.org/project/netCDF4/) package, which provides an object-oriented
+    python interface to the netCDF version 4 library.
+    Let 'nc_def.yaml' be your YAML configuration file then ```NcYaml("nc_def.yaml").create("foo.nc")``` will create
+	the netCDF4/HDF5 file 'foo.nc'
+The class `NcYaml` must be used when strict conformance to the netCDF4 format is required.
+However, package `netCDF4` has some limitations, which `h5py` has not, for example it does
+not allow variable-length variables to have a compound data-type.
 ## Installation
-Relases of the code, starting from version 0.1, will be made available via PyPi.
+Releases of the code, starting from version 0.1, will be made available via PyPI.
 ## Usage
@@ -54,7 +77,7 @@ The YAML file should be structured as follows:
      - science_data
    ```
- * The section 'dimensions' is obligatory, you shouold define the dimensions for each
+ * The section 'dimensions' is obligatory, you should define the dimensions for each
    variable in your file. The 'dimensions' section may look like this:
    ```
@@ -144,7 +167,7 @@ The YAML file should be structured as follows:
 ### Notes and ToDo:
  * The usage of older versions of h5py may result in broken netCDF4 files
- * Explain usage of parameter '_chunks', which is currently not correcly implemented.
+ * Explain usage of parameter '_chunks', which is currently not correctly implemented.
  * Explain that the usage of variable length data-sets may break netCDF4 compatibility
 ## Support [TBW]
@@ -161,6 +184,3 @@ The code is developed by R.M. van Hees (SRON)
 * Copyright: SRON (https://www.sron.nl).
 * License: BSD-3-clause
-## Project status
-Beta

{h5yaml-0.0.3 → h5yaml-0.0.5}/README.md RENAMED Viewed

@@ -1,15 +1,35 @@
-# H5_YAML
+# H5YAML
+[![image](https://img.shields.io/pypi/v/h5yaml.svg?label=release)](https://github.com/rmvanhees/h5yaml/)
+[![image](https://img.shields.io/pypi/l/h5yaml.svg)](https://github.com/rmvanhees/h5yaml/LICENSE)
+[![image](https://img.shields.io/pypi/dm/h5yaml.svg)](https://pypi.org/project/h5yaml/)
+[![image](https://img.shields.io/pypi/status/h5yaml.svg?label=status)](https://pypi.org/project/h5yaml/)
 ## Description
-Use YAML configuration file to generate HDF5/netCDF4 formated files.
+This package let you generate [HDF5](https://docs.h5py.org/en/stable/)/[netCDF4](https://unidata.github.io/netcdf4-python/)
+formatted files as defined in a [YAML](https://yaml.org/) configuration file. This has several advantages:
-The class `NcYaml` must be used when strict conformance to the netCDF4 format is
-required. However, the python netCDF4 implementation does not allow variable-length
-data to have a compound data-type. The class `H5Yaml` does not have this restiction
-and will generate HDF5 formated files which can be read by netCDF4 software.
+ * you define the layout of your HDF5/netCDF4 file using YAML which is human-readable and has intuitive syntax.
+ * you can reuse the YAML configuration file to to have all your product have a consistent layout.
+ * you can make updates by only changing the YAML configuration file
+ * you can have the layout of your HDF5/netCDF4 file as a python dictionary, thus without accessing any HDF5/netCDF4 file
+The `H5YAML` package has two classes to generate a HDF5/netCDF4 formatted file.
+ 1. The class `H5Yaml` uses the [h5py](https://pypi.org/project/h5py/) package, which is a Pythonic interface to
+    the HDF5 binary data format.
+    Let 'h5_def.yaml' be your YAML configuration file then ```H5Yaml("h5_def.yaml").create("foo.h5")``` will create
+	the HDF5 file 'foo.h5'. This can be read by netCDF4 software, because it uses dimension-scales to each dataset.
+ 2. The class `NcYaml` uses the [netCDF4](https://pypi.org/project/netCDF4/) package, which provides an object-oriented
+    python interface to the netCDF version 4 library.
+    Let 'nc_def.yaml' be your YAML configuration file then ```NcYaml("nc_def.yaml").create("foo.nc")``` will create
+	the netCDF4/HDF5 file 'foo.nc'
+The class `NcYaml` must be used when strict conformance to the netCDF4 format is required.
+However, package `netCDF4` has some limitations, which `h5py` has not, for example it does
+not allow variable-length variables to have a compound data-type.
 ## Installation
-Relases of the code, starting from version 0.1, will be made available via PyPi.
+Releases of the code, starting from version 0.1, will be made available via PyPI.
 ## Usage
@@ -28,7 +48,7 @@ The YAML file should be structured as follows:
      - science_data
    ```
- * The section 'dimensions' is obligatory, you shouold define the dimensions for each
+ * The section 'dimensions' is obligatory, you should define the dimensions for each
    variable in your file. The 'dimensions' section may look like this:
    ```
@@ -118,7 +138,7 @@ The YAML file should be structured as follows:
 ### Notes and ToDo:
  * The usage of older versions of h5py may result in broken netCDF4 files
- * Explain usage of parameter '_chunks', which is currently not correcly implemented.
+ * Explain usage of parameter '_chunks', which is currently not correctly implemented.
  * Explain that the usage of variable length data-sets may break netCDF4 compatibility
 ## Support [TBW]
@@ -135,6 +155,3 @@ The code is developed by R.M. van Hees (SRON)
 * Copyright: SRON (https://www.sron.nl).
 * License: BSD-3-clause
-## Project status
-Beta

{h5yaml-0.0.3 → h5yaml-0.0.5}/pyproject.toml RENAMED Viewed

@@ -14,33 +14,35 @@ license = "BSD-3-Clause"
 authors = [
   {name = "Richard van Hees", email = "r.m.van.hees@sron.nl"}
 ]
-requires-python = ">=3.12"
+requires-python = ">=3.9"
 classifiers = [
-  "Development Status :: 4 - Beta",
-  "Intended Audience :: Developers",
-  "Intended Audience :: Science/Research",
-  "Operating System :: OS Independent",
-  "Programming Language :: Python :: 3 :: Only",
-  "Programming Language :: Python :: 3.12",
-  "Programming Language :: Python :: 3.13",
-  "Topic :: Scientific/Engineering :: Atmospheric Science",
+   "Development Status :: 4 - Beta",
+   "Intended Audience :: Developers",
+   "Intended Audience :: Science/Research",
+   "Operating System :: OS Independent",
+   "Programming Language :: Python :: 3 :: Only",
+   "Programming Language :: Python :: 3.9",
+   "Programming Language :: Python :: 3.10",
+   "Programming Language :: Python :: 3.11",
+   "Programming Language :: Python :: 3.12",
+   "Programming Language :: Python :: 3.13",
+   "Topic :: Scientific/Engineering :: Atmospheric Science",
 ]
 keywords = [
-  "HDF5", "netCDF4", "YAML"
+   "HDF5", "netCDF4", "YAML"
 ]
 dynamic = [
-  "version",
+   "version",
 ]
 dependencies = [
-  "h5py>=3.13",
-  "netCDF4>=1.7",
-  "numpy>=2.2",
-  "pyYAML>=6.0",
+   "h5py>=3.13",
+   "netCDF4>=1.7",
+   "numpy>=2.2",
+   "pyYAML>=6.0",
 ]
 [project.scripts]
 [project.urls]
 Homepage = "https://github.com/rmvanhees/h5_yaml"
 Source = "https://github.com/rmvanhees/h5_yaml"
@@ -72,25 +74,25 @@ target-version = "py312"
 [tool.ruff.lint]
 select = [
-  "D",    # pydocstyle
-  "E",    # pycodestyle
-  "F",    # pyflakes
-  "I",    # isort
-  "N",    # pep8-naming
-  "W",    # pycodestyle
-  "ANN",  # flake8-annotations
-  "B",    # flake8-bugbear
-  "ISC",  # flake8-implicit-str-concat
-  "PGH",  # flake8-pie
-  "PYI",  # flake8-pyi
-  "Q",    # flake8-quotes
-  "SIM",  # flake8-simplify
-  "TID",  # flake8-tidy-imports
-  "TCH",  # flake8-type-checking
-  "NPY",  # NumPy-specific
-  "PERF", # Perflint
-  "RUF",  # Ruff Specific
-  "UP",   # pyupgrade
+   "D",    # pydocstyle
+   "E",    # pycodestyle
+   "F",    # pyflakes
+   "I",    # isort
+   "N",    # pep8-naming
+   "W",    # pycodestyle
+   "ANN",  # flake8-annotations
+   "B",    # flake8-bugbear
+   "ISC",  # flake8-implicit-str-concat
+   "PGH",  # flake8-pie
+   "PYI",  # flake8-pyi
+   "Q",    # flake8-quotes
+   "SIM",  # flake8-simplify
+   "TID",  # flake8-tidy-imports
+   "TCH",  # flake8-type-checking
+   "NPY",  # NumPy-specific
+   "PERF", # Perflint
+   "RUF",  # Ruff Specific
+   "UP",   # pyupgrade
 ]
 ignore = ["D203", "D213"]

h5yaml-0.0.5/src/h5yaml/Data/h5_testing.yaml ADDED Viewed

@@ -0,0 +1,103 @@
+# YAML
+#
+# Configuration file to test the implementation of classes H5Yaml and NcYaml
+#
+# This file is part of h5_yaml:
+#    https://github.com/rmvanhees/h5_yaml.git
+#
+# Copyright (c) 2025 SRON
+#    All Rights Reserved
+#
+# License:  BSD-3-Clause
+#
+# Define groups
+groups:
+  - group_00
+  - group_01
+  - group_02
+# Define dimensions
+# Note dimensions with an attribute 'long_name' will also be generated as variable
+dimensions:
+  number_of_images:
+    _dtype: u2
+    _size: 0
+  samples_per_image:
+    _dtype: u4
+    _size: 203500
+  column:
+    _dtype: u2
+    _size: 640
+  row:
+    _dtype: u2
+    _size: 512
+  time:
+    _dtype: f8
+    _size: 0
+    _FillValue: -32767
+    long_name: Attitude sample time (seconds of day)
+    calendar: proleptic_gregorian
+    units: seconds since %Y-%m-%d %H:%M:%S
+    valid_min: 0
+    valid_max: 92400
+# Define compound types
+# - compound elements must have a data-type, and can have a unit and long_name
+compounds:
+  stats_dtype:
+    time: [u8, seconds since 1970-01-01T00:00:00, timestamp]
+    index: [u2, '1', index]
+    tbl_id: [u1, '1', binning id]
+    saa: [u1, '1', saa-flag]
+    coad: [u1, '1', co-addings]
+    texp: [f4, ms, exposure time]
+    lat: [f4, degree, latitude]
+    lon: [f4, degree, longitude]
+    avg: [f4, '1', '$S - S_{ref}$']
+    unc: [f4, '1', '\u03c3($S - S_{ref}$)']
+    dark_offs: [f4, '1', dark-offset]
+  geo_dtype:
+    lat: [f4, latitude]
+    lon: [f4, longitude]
+# Define variables
+variables:
+  /group_00/detector_images:
+    _dtype: u2
+    _dims: [number_of_images, column, row]
+    _FillValue: 65535
+    long_name: Detector pixel values
+    comment: unbinned full-frame data
+    units: '1'
+    valid_min: 0
+    valid_max: 65534
+  /group_01/detector_images:
+    _dtype: u2
+    _dims: [number_of_images, samples_per_image]
+    _FillValue: 65535
+    _compression: 1
+    long_name: Detector pixel values
+    comment: variable binned data (filled to the largest samples_per_image)
+    units: '1'
+    valid_min: 0
+    valid_max: 65534
+  /group_01/stats:
+    _dtype: stats_dtype
+    _dims: [time]
+    comment: detector map statistics
+  /group_02/detector_images:
+    _dtype: u2
+    _dims: [number_of_images]
+    _vlen: True
+    _FillValue: 65535
+    long_name: Detector pixel values
+    comment: variable binned (vlen) data
+    units: '1'
+    valid_min: 0
+    valid_max: 65534
+  /group_02/stats:
+    _dtype: stats_dtype
+    _vlen: True
+    _dims: [time]
+    comment: detector map statistics (vlen)

h5yaml-0.0.3/src/h5yaml/Data/h5_testing.yaml → h5yaml-0.0.5/src/h5yaml/Data/nc_testing.yaml RENAMED Viewed

@@ -57,10 +57,6 @@ compounds:
     unc: [f4, '1', '\u03c3($S - S_{ref}$)']
     dark_offs: [f4, '1', dark-offset]
-  geo_dtype:
-    lat: [f4, latitude]
-    lon: [f4, longitude]
 # Define variables
 variables:
   /group_00/detector_images:
@@ -84,7 +80,6 @@ variables:
     valid_max: 65534
   /group_01/stats:
     _dtype: stats_dtype
-    _vlen: True
     _dims: [time]
     comment: detector map statistics
   /group_02/detector_images:

{h5yaml-0.0.3 → h5yaml-0.0.5}/src/h5yaml/lib/chunksizes.py RENAMED Viewed

@@ -35,20 +35,28 @@ def guess_chunks(dims: ArrayLike[int], dtype_sz: int) -> str | tuple[int]:
     """
     fixed_size = dtype_sz
-    for val in [x for x in dims if x > 0]:
-        fixed_size *= val
-    if 0 in dims:  # variable with an unlimited dimension
-        udim = dims.index(0)
-    else:  # variable has no unlimited dimension
-        udim = 0
-        if fixed_size < 65536:
+    if len(dims) > 1:
+        for val in [x for x in dims[1:] if x > 0]:
+            fixed_size *= val
+    # first variables without an unlimited dimension
+    if 0 not in dims:
+        if fixed_size < 400000:
             return "contiguous"
+        res = list(dims)
+        res[0] = max(1, 2048000 // fixed_size)
+        return tuple(res)
+    # then variables with an unlimited dimension
     if len(dims) == 1:
         return (1024,)
+    udim = dims.index(0)
     res = list(dims)
-    res[udim] = min(1024, (2048 * 1024) // (fixed_size // max(1, dims[0])))
+    if fixed_size < 400000:
+        res[udim] = 1024
+    else:
+        res[udim] = max(1, 2048000 // fixed_size)
     return tuple(res)

h5yaml-0.0.3/src/h5yaml/yaml_h5py.py → h5yaml-0.0.5/src/h5yaml/yaml_h5.py RENAMED Viewed

@@ -23,6 +23,8 @@ import numpy as np
 from h5yaml.conf_from_yaml import conf_from_yaml
 from h5yaml.lib.chunksizes import guess_chunks
+# - helper function ------------------------------------
 # - class definition -----------------------------------
 class H5Yaml:
@@ -53,22 +55,20 @@ class H5Yaml:
     def __dimensions(self: H5Yaml, fid: h5py.File) -> None:
         """Add dimensions to HDF5 product."""
-        for key, value in self.h5_def["dimensions"].items():
+        for key, val in self.h5_def["dimensions"].items():
             fillvalue = None
-            if "_FillValue" in value:
+            if "_FillValue" in val:
                 fillvalue = (
-                    np.nan if value["_FillValue"] == "NaN" else int(value["_FillValue"])
+                    np.nan if val["_FillValue"] == "NaN" else int(val["_FillValue"])
                 )
-            if value["_size"] == 0:
-                ds_chunk = value.get("_chunks", (50,))
+            if val["_size"] == 0:
+                ds_chunk = val.get("_chunks", (50,))
                 dset = fid.create_dataset(
                     key,
                     shape=(0,),
                     dtype=(
-                        h5py.string_dtype()
-                        if value["_dtype"] == "str"
-                        else value["_dtype"]
+                        h5py.string_dtype() if val["_dtype"] == "str" else val["_dtype"]
                     ),
                     chunks=ds_chunk if isinstance(ds_chunk, tuple) else tuple(ds_chunk),
                     maxshape=(None,),
@@ -77,21 +77,48 @@ class H5Yaml:
             else:
                 dset = fid.create_dataset(
                     key,
-                    shape=(value["_size"],),
-                    dtype=value["_dtype"],
+                    shape=(val["_size"],),
+                    dtype=val["_dtype"],
                 )
-                if "_values" in value:
-                    dset[:] = value["_values"]
+                if "_values" in val:
+                    dset[:] = val["_values"]
             dset.make_scale(
                 Path(key).name
-                if "long_name" in value
+                if "long_name" in val
                 else "This is a netCDF dimension but not a netCDF variable."
             )
-            for attr, attr_val in value.items():
+            for attr, attr_val in val.items():
                 if attr.startswith("_"):
                     continue
-                dset.attrs[attr] = attr_val
+                if attr in ("valid_min", "valid_max"):
+                    match val["_dtype"]:
+                        case "i1":
+                            dset.attrs[attr] = np.int8(attr_val)
+                        case "i2":
+                            dset.attrs[attr] = np.int16(attr_val)
+                        case "i4":
+                            dset.attrs[attr] = np.int32(attr_val)
+                        case "i8":
+                            dset.attrs[attr] = np.int64(attr_val)
+                        case "u1":
+                            dset.attrs[attr] = np.uint8(attr_val)
+                        case "u2":
+                            dset.attrs[attr] = np.uint16(attr_val)
+                        case "u4":
+                            dset.attrs[attr] = np.uint32(attr_val)
+                        case "u8":
+                            dset.attrs[attr] = np.uint64(attr_val)
+                        case "f2":
+                            dset.attrs[attr] = np.float16(attr_val)
+                        case "f4":
+                            dset.attrs[attr] = np.float32(attr_val)
+                        case "f8":
+                            dset.attrs[attr] = np.float64(attr_val)
+                        case _:
+                            dset.attrs[attr] = attr_val
+                else:
+                    dset.attrs[attr] = attr_val
     def __compounds(self: H5Yaml, fid: h5py.File) -> dict[str, str | int | float]:
         """Add compound datatypes to HDF5 product."""
@@ -112,14 +139,14 @@ class H5Yaml:
                 for key, value in res.items():
                     self.h5_def["compounds"][key] = value
-        for key, value in self.h5_def["compounds"].items():
+        for key, val in self.h5_def["compounds"].items():
             compounds[key] = {
                 "dtype": [],
                 "units": [],
                 "names": [],
             }
-            for _key, _val in value.items():
+            for _key, _val in val.items():
                 compounds[key]["dtype"].append((_key, _val[0]))
                 if len(_val) == 3:
                     compounds[key]["units"].append(_val[1])
@@ -156,12 +183,19 @@ class H5Yaml:
                     np.nan if val["_FillValue"] == "NaN" else int(val["_FillValue"])
                 )
-            compression = None
-            shuffle = False
-            # currently only gzip compression is supported
-            if "_compression" in val:
-                compression = val["_compression"]
-                shuffle = True
+            # check for scalar dataset
+            if val["_dims"][0] == "scalar":
+                dset = fid.create_dataset(
+                    key,
+                    (),
+                    dtype=ds_dtype,
+                    fillvalue=fillvalue,
+                )
+                for attr, attr_val in val.items():
+                    if attr.startswith("_"):
+                        continue
+                    dset.attrs[attr] = attr_val
+                continue
             n_udim = 0
             ds_shape = ()
@@ -194,8 +228,22 @@ class H5Yaml:
                     fillvalue=fillvalue,
                 )
             else:
+                compression = None
+                shuffle = False
+                # currently only gzip compression is supported
+                if "_compression" in val:
+                    compression = val["_compression"]
+                    shuffle = True
                 if val.get("_vlen"):
-                    ds_dtype = h5py.vlen_dtype(ds_dtype)
+                    ds_name = (
+                        val["_dtype"].split("_")[0]
+                        if "_" in val["_dtype"]
+                        else val["_dtype"]
+                    ) + "_vlen"
+                    if ds_name not in fid:
+                        fid[ds_name] = h5py.vlen_dtype(ds_dtype)
+                    ds_dtype = fid[ds_name]
                     fillvalue = None
                     if ds_maxshape == (None,):
                         ds_chunk = (16,)
@@ -217,7 +265,36 @@ class H5Yaml:
             for attr, attr_val in val.items():
                 if attr.startswith("_"):
                     continue
-                dset.attrs[attr] = attr_val
+                if attr in ("valid_min", "valid_max"):
+                    match val["_dtype"]:
+                        case "i1":
+                            dset.attrs[attr] = np.int8(attr_val)
+                        case "i2":
+                            dset.attrs[attr] = np.int16(attr_val)
+                        case "i4":
+                            dset.attrs[attr] = np.int32(attr_val)
+                        case "i8":
+                            dset.attrs[attr] = np.int64(attr_val)
+                        case "u1":
+                            dset.attrs[attr] = np.uint8(attr_val)
+                        case "u2":
+                            dset.attrs[attr] = np.uint16(attr_val)
+                        case "u4":
+                            dset.attrs[attr] = np.uint32(attr_val)
+                        case "u8":
+                            dset.attrs[attr] = np.uint64(attr_val)
+                        case "f2":
+                            dset.attrs[attr] = np.float16(attr_val)
+                        case "f4":
+                            dset.attrs[attr] = np.float32(attr_val)
+                        case "f8":
+                            dset.attrs[attr] = np.float64(attr_val)
+                        case _:
+                            dset.attrs[attr] = attr_val
+                elif attr == "flag_values":
+                    dset.attrs[attr] = np.array(attr_val, dtype="u1")
+                else:
+                    dset.attrs[attr] = attr_val
             if compounds is not None and val["_dtype"] in compounds:
                 if compounds[val["_dtype"]]["units"]:

{h5yaml-0.0.3 → h5yaml-0.0.5}/src/h5yaml/yaml_nc.py RENAMED Viewed

@@ -15,6 +15,7 @@ __all__ = ["NcYaml"]
 import logging
 from importlib.resources import files
+from pathlib import PurePosixPath
 from typing import TYPE_CHECKING
 import numpy as np
@@ -47,12 +48,20 @@ class NcYaml:
     def __groups(self: NcYaml, fid: Dataset) -> None:
         """Create groups in HDF5 product."""
         for key in self.h5_def["groups"]:
-            _ = fid.createGroup(key)
+            pkey = PurePosixPath(key)
+            if pkey.is_absolute():
+                _ = fid[pkey.parent].createGroup(pkey.name)
+            else:
+                _ = fid.createGroup(key)
     def __dimensions(self: NcYaml, fid: Dataset) -> None:
         """Add dimensions to HDF5 product."""
         for key, value in self.h5_def["dimensions"].items():
-            _ = fid.createDimension(key, value["_size"])
+            pkey = PurePosixPath(key)
+            if pkey.is_absolute():
+                _ = fid[pkey.parent].createDimension(pkey.name, value["_size"])
+            else:
+                _ = fid.createDimension(key, value["_size"])
             if "long_name" not in value:
                 continue
@@ -63,13 +72,22 @@ class NcYaml:
                     np.nan if value["_FillValue"] == "NaN" else int(value["_FillValue"])
                 )
-            dset = fid.createVariable(
-                key,
-                value["_dtype"],
-                dimensions=(key,),
-                fill_value=fillvalue,
-                contiguous=value["_size"] != 0,
-            )
+            if pkey.is_absolute():
+                dset = fid[pkey.parent].createVariable(
+                    pkey.name,
+                    value["_dtype"],
+                    dimensions=(pkey.name,),
+                    fill_value=fillvalue,
+                    contiguous=value["_size"] != 0,
+                )
+            else:
+                dset = fid.createVariable(
+                    key,
+                    value["_dtype"],
+                    dimensions=(key,),
+                    fill_value=fillvalue,
+                    contiguous=value["_size"] != 0,
+                )
             dset.setncatts({k: v for k, v in value.items() if not k.startswith("_")})
     def __compounds(self: NcYaml, fid: Dataset) -> dict[str, str | int | float]:
@@ -145,11 +163,17 @@ class NcYaml:
                 compression = "zlib"
                 complevel = val["_compression"]
+            var_dims = []
             n_udim = 0
             ds_shape = ()
             ds_maxshape = ()
             for coord in val["_dims"]:
-                dim_sz = fid.dimensions[coord].size
+                pcoord = PurePosixPath(coord)
+                var_dims.append(pcoord.name if pcoord.is_absolute() else coord)
+                if pcoord.is_absolute():
+                    dim_sz = fid[pcoord.parent].dimensions[pcoord.name].size
+                else:
+                    dim_sz = fid.dimensions[coord].size
                 n_udim += int(dim_sz == 0)
                 ds_shape += (dim_sz,)
                 ds_maxshape += (dim_sz if dim_sz > 0 else None,)
@@ -163,12 +187,18 @@ class NcYaml:
                 val["_chunks"] if "_chunks" in val else guess_chunks(ds_shape, sz_dtype)
             )
+            pkey = PurePosixPath(key)
+            var_grp = fid[pkey.parent] if pkey.is_absolute() else fid
+            var_name = pkey.name if pkey.is_absolute() else key
+            if val["_dtype"] in fid.cmptypes:
+                val["_dtype"] = fid.cmptypes[val["_dtype"]]
             # create the variable
             if ds_chunk == "contiguous":
-                dset = fid.createVariable(
-                    key,
+                dset = var_grp.createVariable(
+                    var_name,
                     val["_dtype"],
-                    dimensions=(key,),
+                    dimensions=var_dims,
                     fill_value=fillvalue,
                     contiguous=True,
                 )
@@ -181,13 +211,10 @@ class NcYaml:
                     if ds_maxshape == (None,):
                         ds_chunk = (16,)
-                if key in fid.cmptypes:
-                    val["_dtype"] = fid.cmptypes[key]
-                dset = fid.createVariable(
-                    key,
+                dset = var_grp.createVariable(
+                    var_name,
                     val["_dtype"],
-                    dimensions=val["_dims"],
+                    dimensions=var_dims,
                     fill_value=fillvalue,
                     contiguous=False,
                     compression=compression,
@@ -231,7 +258,7 @@ class NcYaml:
 def tests() -> None:
     """..."""
     print("Calling NcYaml")
-    NcYaml(files("h5yaml.Data") / "h5_testing.yaml").create("test_yaml.nc")
+    NcYaml(files("h5yaml.Data") / "nc_testing.yaml").create("test_yaml.nc")
 if __name__ == "__main__":

{h5yaml-0.0.3 → h5yaml-0.0.5}/.gitignore RENAMED Viewed

File without changes

{h5yaml-0.0.3 → h5yaml-0.0.5}/LICENSE RENAMED Viewed

File without changes

{h5yaml-0.0.3 → h5yaml-0.0.5}/MANIFEST.in RENAMED Viewed

File without changes

{h5yaml-0.0.3 → h5yaml-0.0.5}/src/h5yaml/conf_from_yaml.py RENAMED Viewed

File without changes

h5yaml 0.0.3__tar.gz → 0.0.5__tar.gz

h5yaml 0.0.3tar.gz → 0.0.5tar.gz