PyPI - iterable-io - Versions diffs - 1.0.0__tar.gz - Mend

iterable-io 1.0.0__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (10) hide show

iterable-io-1.0.0/PKG-INFO +101 -0
iterable-io-1.0.0/README.md +79 -0
iterable-io-1.0.0/iterable_io.egg-info/PKG-INFO +101 -0
iterable-io-1.0.0/iterable_io.egg-info/SOURCES.txt +8 -0
iterable-io-1.0.0/iterable_io.egg-info/dependency_links.txt +1 -0
iterable-io-1.0.0/iterable_io.egg-info/top_level.txt +1 -0
iterable-io-1.0.0/iterableio.py +168 -0
iterable-io-1.0.0/setup.cfg +4 -0
iterable-io-1.0.0/setup.py +42 -0
iterable-io-1.0.0/tests/test_iteratorio.py +170 -0

iterable-io-1.0.0/PKG-INFO ADDED Viewed

@@ -0,0 +1,101 @@
+Metadata-Version: 2.1
+Name: iterable-io
+Version: 1.0.0
+Summary: Adapt generators and other iterables to a file-like interface
+Home-page: https://github.com/pR0Ps/iterable-io
+License: LGPLv3
+Project-URL: Source, https://github.com/pR0Ps/iterable-io
+Project-URL: Changelog, https://github.com/pR0Ps/iterable-io/blob/master/CHANGELOG.md
+Classifier: Programming Language :: Python :: 3
+Classifier: Programming Language :: Python :: 3.5
+Classifier: Programming Language :: Python :: 3.6
+Classifier: Programming Language :: Python :: 3.7
+Classifier: Programming Language :: Python :: 3.8
+Classifier: Programming Language :: Python :: 3.9
+Classifier: Programming Language :: Python :: 3.10
+Classifier: Programming Language :: Python :: 3.11
+Classifier: Programming Language :: Python :: 3.12
+Classifier: Operating System :: OS Independent
+Classifier: License :: OSI Approved :: GNU Lesser General Public License v3 (LGPLv3)
+Requires-Python: >=3.5
+Description-Content-Type: text/markdown
+iterable-io
+===========
+[![Status](https://github.com/pR0Ps/iterable-io/workflows/tests/badge.svg)](https://github.com/pR0Ps/iterable-io/actions/workflows/tests.yml)
+[![Version](https://img.shields.io/pypi/v/iterable-io.svg)](https://pypi.org/project/iterable-io/)
+![Python](https://img.shields.io/pypi/pyversions/iterable-io.svg)
+`iterable-io` is a small Python library that provides an adapter so that it's possible to read from
+[iterable](https://docs.python.org/3/glossary.html#term-iterable) objects in the same way as
+[file-like](https://docs.python.org/3/glossary.html#term-file-object) objects.
+It is primarily useful as "glue" between two incompatible interfaces. As an example, in the case
+where one interface expects a file-like object to call `.read()` on, and the other only provides a
+generator of bytes.
+One way to solve this issue would be to write all the bytes in the generator to a temporary file,
+then provide that file instead, but if the generator produces a large amount of data then this is
+both slow to start, and resource-intensive.
+This library allows streaming data between these two incompatible interfaces so as data is requested
+by `.read()`, it's pulled from the iterable. This keeps resource usage low and removes the startup
+delay.
+Installation
+------------
+```
+pip install iterable-io
+```
+Documentation
+-------------
+The functionality of this library is accessed via a single function: `open_iterable()`.
+`open_iterable()` is designed to work the same was as the builtin `open()`, except that it takes an
+iterable to "open" instead of a file. For example, it can open the iterable in binary or text mode,
+has options for buffering, encoding, etc. See the docstring of `open_iterable` for more detailed
+documentation.
+Simple examples
+---------------
+The following examples should be enough to understand in which cases `open_iterable()` would be
+useful and get a high-level understanding of how to use it:
+Read bytes from a generator of bytes:
+```python
+gen = generate_bytes()
+# adapt the generator to a file-like object in binary mode
+# (fp.read() will return bytes)
+fp = open_iterable(gen, "rb")
+while chunk := fp.read(4096):
+    process_chunk(chunk)
+```
+Read lines of text from a generator of bytes:
+```python
+gen = generate_bytes()
+# adapt the generator to a file-like object in text mode
+# (fp.read() will return a string, fp.readline is also available)
+fp = open_iterable(gen, "rt", encoding="utf-8")
+for line in fp:
+    process_line_of_text(line)
+```
+Tests
+-----
+This package contains extensive tests. To run them, install `pytest` (`pip install pytest`) and run
+`py.test` in the project directory.
+License
+-------
+Licensed under the [GNU LGPLv3](https://www.gnu.org/licenses/lgpl-3.0.html).

iterable-io-1.0.0/README.md ADDED Viewed

@@ -0,0 +1,79 @@
+iterable-io
+===========
+[![Status](https://github.com/pR0Ps/iterable-io/workflows/tests/badge.svg)](https://github.com/pR0Ps/iterable-io/actions/workflows/tests.yml)
+[![Version](https://img.shields.io/pypi/v/iterable-io.svg)](https://pypi.org/project/iterable-io/)
+![Python](https://img.shields.io/pypi/pyversions/iterable-io.svg)
+`iterable-io` is a small Python library that provides an adapter so that it's possible to read from
+[iterable](https://docs.python.org/3/glossary.html#term-iterable) objects in the same way as
+[file-like](https://docs.python.org/3/glossary.html#term-file-object) objects.
+It is primarily useful as "glue" between two incompatible interfaces. As an example, in the case
+where one interface expects a file-like object to call `.read()` on, and the other only provides a
+generator of bytes.
+One way to solve this issue would be to write all the bytes in the generator to a temporary file,
+then provide that file instead, but if the generator produces a large amount of data then this is
+both slow to start, and resource-intensive.
+This library allows streaming data between these two incompatible interfaces so as data is requested
+by `.read()`, it's pulled from the iterable. This keeps resource usage low and removes the startup
+delay.
+Installation
+------------
+```
+pip install iterable-io
+```
+Documentation
+-------------
+The functionality of this library is accessed via a single function: `open_iterable()`.
+`open_iterable()` is designed to work the same was as the builtin `open()`, except that it takes an
+iterable to "open" instead of a file. For example, it can open the iterable in binary or text mode,
+has options for buffering, encoding, etc. See the docstring of `open_iterable` for more detailed
+documentation.
+Simple examples
+---------------
+The following examples should be enough to understand in which cases `open_iterable()` would be
+useful and get a high-level understanding of how to use it:
+Read bytes from a generator of bytes:
+```python
+gen = generate_bytes()
+# adapt the generator to a file-like object in binary mode
+# (fp.read() will return bytes)
+fp = open_iterable(gen, "rb")
+while chunk := fp.read(4096):
+    process_chunk(chunk)
+```
+Read lines of text from a generator of bytes:
+```python
+gen = generate_bytes()
+# adapt the generator to a file-like object in text mode
+# (fp.read() will return a string, fp.readline is also available)
+fp = open_iterable(gen, "rt", encoding="utf-8")
+for line in fp:
+    process_line_of_text(line)
+```
+Tests
+-----
+This package contains extensive tests. To run them, install `pytest` (`pip install pytest`) and run
+`py.test` in the project directory.
+License
+-------
+Licensed under the [GNU LGPLv3](https://www.gnu.org/licenses/lgpl-3.0.html).

iterable-io-1.0.0/iterable_io.egg-info/PKG-INFO ADDED Viewed

@@ -0,0 +1,101 @@
+Metadata-Version: 2.1
+Name: iterable-io
+Version: 1.0.0
+Summary: Adapt generators and other iterables to a file-like interface
+Home-page: https://github.com/pR0Ps/iterable-io
+License: LGPLv3
+Project-URL: Source, https://github.com/pR0Ps/iterable-io
+Project-URL: Changelog, https://github.com/pR0Ps/iterable-io/blob/master/CHANGELOG.md
+Classifier: Programming Language :: Python :: 3
+Classifier: Programming Language :: Python :: 3.5
+Classifier: Programming Language :: Python :: 3.6
+Classifier: Programming Language :: Python :: 3.7
+Classifier: Programming Language :: Python :: 3.8
+Classifier: Programming Language :: Python :: 3.9
+Classifier: Programming Language :: Python :: 3.10
+Classifier: Programming Language :: Python :: 3.11
+Classifier: Programming Language :: Python :: 3.12
+Classifier: Operating System :: OS Independent
+Classifier: License :: OSI Approved :: GNU Lesser General Public License v3 (LGPLv3)
+Requires-Python: >=3.5
+Description-Content-Type: text/markdown
+iterable-io
+===========
+[![Status](https://github.com/pR0Ps/iterable-io/workflows/tests/badge.svg)](https://github.com/pR0Ps/iterable-io/actions/workflows/tests.yml)
+[![Version](https://img.shields.io/pypi/v/iterable-io.svg)](https://pypi.org/project/iterable-io/)
+![Python](https://img.shields.io/pypi/pyversions/iterable-io.svg)
+`iterable-io` is a small Python library that provides an adapter so that it's possible to read from
+[iterable](https://docs.python.org/3/glossary.html#term-iterable) objects in the same way as
+[file-like](https://docs.python.org/3/glossary.html#term-file-object) objects.
+It is primarily useful as "glue" between two incompatible interfaces. As an example, in the case
+where one interface expects a file-like object to call `.read()` on, and the other only provides a
+generator of bytes.
+One way to solve this issue would be to write all the bytes in the generator to a temporary file,
+then provide that file instead, but if the generator produces a large amount of data then this is
+both slow to start, and resource-intensive.
+This library allows streaming data between these two incompatible interfaces so as data is requested
+by `.read()`, it's pulled from the iterable. This keeps resource usage low and removes the startup
+delay.
+Installation
+------------
+```
+pip install iterable-io
+```
+Documentation
+-------------
+The functionality of this library is accessed via a single function: `open_iterable()`.
+`open_iterable()` is designed to work the same was as the builtin `open()`, except that it takes an
+iterable to "open" instead of a file. For example, it can open the iterable in binary or text mode,
+has options for buffering, encoding, etc. See the docstring of `open_iterable` for more detailed
+documentation.
+Simple examples
+---------------
+The following examples should be enough to understand in which cases `open_iterable()` would be
+useful and get a high-level understanding of how to use it:
+Read bytes from a generator of bytes:
+```python
+gen = generate_bytes()
+# adapt the generator to a file-like object in binary mode
+# (fp.read() will return bytes)
+fp = open_iterable(gen, "rb")
+while chunk := fp.read(4096):
+    process_chunk(chunk)
+```
+Read lines of text from a generator of bytes:
+```python
+gen = generate_bytes()
+# adapt the generator to a file-like object in text mode
+# (fp.read() will return a string, fp.readline is also available)
+fp = open_iterable(gen, "rt", encoding="utf-8")
+for line in fp:
+    process_line_of_text(line)
+```
+Tests
+-----
+This package contains extensive tests. To run them, install `pytest` (`pip install pytest`) and run
+`py.test` in the project directory.
+License
+-------
+Licensed under the [GNU LGPLv3](https://www.gnu.org/licenses/lgpl-3.0.html).

iterable-io-1.0.0/iterable_io.egg-info/SOURCES.txt ADDED Viewed

@@ -0,0 +1,8 @@
+README.md
+iterableio.py
+setup.py
+iterable_io.egg-info/PKG-INFO
+iterable_io.egg-info/SOURCES.txt
+iterable_io.egg-info/dependency_links.txt
+iterable_io.egg-info/top_level.txt
+tests/test_iteratorio.py

iterable-io-1.0.0/iterable_io.egg-info/dependency_links.txt ADDED Viewed

	@@ -0,0 +1 @@
1	+

iterable-io-1.0.0/iterable_io.egg-info/top_level.txt ADDED Viewed

	@@ -0,0 +1 @@
1	+ iterableio

iterable-io-1.0.0/iterableio.py ADDED Viewed

@@ -0,0 +1,168 @@
+#!/usr/bin/env python
+import io
+class RawIterableReader(io.RawIOBase):
+    """A io.RawIOBase implemention for an iterable of bytes
+    In most cases, this class should not be used directly. See the included
+    `open_iterable` function for a high-level interface.
+    """
+    def __init__(self, iterable):
+        self._iter = iter(iterable)
+        self._extra = bytearray()
+        self._total = 0
+    def readable(self):
+        return True
+    def close(self):
+        self._iter = None
+        super().close()
+    def tell(self):
+        """The total number of bytes that have been read"""
+        self._checkClosed()
+        return self._total - len(self._extra)
+    def readinto(self, b):
+        """Read bytes into a pre-allocated bytes-like object b
+        Returns the number of bytes read, 0 indicates EOF
+        """
+        self._checkClosed()
+        num = len(b)
+        if self._iter is not None:
+            while len(self._extra) < num:
+                try:
+                    new = next(self._iter)
+                except StopIteration:
+                    self._iter = None
+                    break
+                else:
+                    self._total += len(new)
+                    self._extra += new
+        ret, self._extra = self._extra[:num], self._extra[num:]
+        lret = len(ret)
+        b[:lret] = ret
+        return lret
+def open_iterable(iterable, mode="r", buffering=-1, encoding=None, errors=None, newline=None):
+    """Open an iterable of bytes to read from it using a file-like interface
+    The `iterable` must be an iterable of bytes.
+    mode is an optional string that specifies the mode in which the file is
+    opened. It defaults to 'rt' which means open for reading in text mode. In
+    text mode, if encoding is not specified the encoding used is platform
+    dependent. (For reading raw bytes use binary mode and leave encoding
+    unspecified.) The available modes are:
+    ========= ===============================================================
+    Character Meaning
+    --------- ---------------------------------------------------------------
+    'r'       open for reading (default)
+    'b'       binary mode
+    't'       text mode (default)
+    ========= ===============================================================
+    Iterables opened in binary mode (appending 'b' to the mode argument) return
+    contents as bytes objects without any decoding. In text mode (the default),
+    the contents of the iterable are returned as strings, the bytes having been
+    first decoded using a platform-dependent encoding or using the specified
+    encoding if given.
+    buffering is an optional integer used to set the buffering policy. Pass 0
+    to switch buffering off (only allowed in binary mode), and an integer > 0
+    to indicate the size of a fixed-size chunk buffer. When no buffering
+    argument is given, `io.DEFAULT_BUFFER_SIZE` will be used. On many systems,
+    the buffer will typically be 4096 or 8192 bytes long.
+    encoding is the str name of the encoding used to decode or encode the
+    file. This should only be used in text mode. The default encoding is
+    platform dependent, but any encoding supported by Python can be
+    passed. See the codecs module for the list of supported encodings.
+    errors is an optional string that specifies how encoding errors are to
+    be handled---this argument should not be used in binary mode. Pass
+    'strict' to raise a ValueError exception if there is an encoding error
+    (the default of None has the same effect), or pass 'ignore' to ignore
+    errors. Note that ignoring encoding errors can lead to data loss.
+    See the documentation for codecs.register for a list of the permitted
+    encoding error strings.
+    newline is a string controlling how universal newlines works (it only
+    applies to text mode). It can be None, '', '\n', '\r', and '\r\n'. It works
+    as follows:
+    * On input, if newline is None, universal newlines mode is
+      enabled. Lines in the input can end in '\n', '\r', or '\r\n', and
+      these are translated into '\n' before being returned to the
+      caller. If it is '', universal newline mode is enabled, but line
+      endings are returned to the caller untranslated. If it has any of
+      the other legal values, input lines are only terminated by the given
+      string, and the line ending is returned to the caller untranslated.
+    * On output, if newline is None, any '\n' characters written are
+      translated to the system default line separator, os.linesep. If
+      newline is '', no translation takes place. If newline is any of the
+      other legal values, any '\n' characters written are translated to
+      the given string.
+    open_iterable() returns a file object whose type depends on the mode, and
+    through which the standard file operations such as read() are performed.
+    When open_iterable() is used to open an iterable in a text mode ('rt'), it
+    returns an io.TextIOWrapper. When used to open an iterable in a binary
+    mode, the returned class varies: For unbuffered access, a RawIterableReader
+    is returned and in buffered mode it returns an io.BufferedReader.
+    """
+    # This function is modeled after `io.open`, found in `Lib/_pyio.py`
+    modes = set(mode)
+    if modes - set("rtb") or len(mode) > len(modes):
+        raise ValueError("invalid mode: '{}'".format(mode))
+    reading = "r" in modes
+    binary = "b" in modes
+    text = "t" in modes or (reading and not binary)
+    if not reading:
+        raise ValueError("Must specify read mode")
+    if text and binary:
+        raise ValueError("can't have text and binary mode at once")
+    if binary and encoding is not None:
+        raise ValueError("binary mode doesn't take an encoding argument")
+    if binary and errors is not None:
+        raise ValueError("binary mode doesn't take an errors argument")
+    if binary and newline is not None:
+        raise ValueError("binary mode doesn't take a newline argument")
+    if text and buffering == 0:
+        raise ValueError("can't have unbuffered text I/O")
+    ret = RawIterableReader(iterable)
+    try:
+        if buffering == 0:
+            # unbuffered binary mode
+            return ret
+        if buffering < 0:
+            buffering = io.DEFAULT_BUFFER_SIZE
+        ret = io.BufferedReader(ret, buffering)
+        if binary:
+            # buffered binary mode
+            return ret
+        # buffered text mode
+        ret = io.TextIOWrapper(ret, encoding, errors, newline)
+        ret.mode = mode
+        return ret
+    except:
+        ret.close()
+        raise

iterable-io-1.0.0/setup.cfg ADDED Viewed

@@ -0,0 +1,4 @@
+[egg_info]
+tag_build =
+tag_date = 0

iterable-io-1.0.0/setup.py ADDED Viewed

@@ -0,0 +1,42 @@
+#!/usr/bin/env python
+from setuptools import setup
+import os.path
+try:
+    DIR = os.path.abspath(os.path.dirname(__file__))
+    with open(os.path.join(DIR, "README.md"), encoding='utf-8') as f:
+        long_description = f.read()
+except Exception:
+    long_description=None
+setup(
+    name="iterable-io",
+    version="1.0.0",
+    description="Adapt generators and other iterables to a file-like interface",
+    long_description=long_description,
+    long_description_content_type="text/markdown",
+    url="https://github.com/pR0Ps/iterable-io",
+    project_urls={
+        "Source": "https://github.com/pR0Ps/iterable-io",
+        "Changelog": "https://github.com/pR0Ps/iterable-io/blob/master/CHANGELOG.md",
+    },
+    license="LGPLv3",
+    classifiers=[
+        "Programming Language :: Python :: 3",
+        "Programming Language :: Python :: 3.5",
+        "Programming Language :: Python :: 3.6",
+        "Programming Language :: Python :: 3.7",
+        "Programming Language :: Python :: 3.8",
+        "Programming Language :: Python :: 3.9",
+        "Programming Language :: Python :: 3.10",
+        "Programming Language :: Python :: 3.11",
+        "Programming Language :: Python :: 3.12",
+        "Operating System :: OS Independent",
+        "License :: OSI Approved :: GNU Lesser General Public License v3 (LGPLv3)"
+    ],
+    py_modules=["iterableio"],
+    python_requires=">=3.5",
+)

iterable-io-1.0.0/tests/test_iteratorio.py ADDED Viewed

@@ -0,0 +1,170 @@
+#!/usr/bin/env python
+import io
+from iterableio import RawIterableReader, open_iterable
+import pytest
+@pytest.mark.parametrize("mode, buffering, encoding, errors, newline",[
+    # bad modes
+    ("", -1, None, None, None),
+    ("abc", -1, None, None, None),
+    ("rtb", -1, None, None, None),
+    ("rt", 0, None, None, None),  # need buffering
+    ("rt", "bad int", None, None, None),  # invalid buffering int
+    # can't provide text decoding params in binary mode
+    ("rb", 0, "utf-8", None, None),
+    ("rb", 0, None, "ignore", None),
+    ("rb", 0, None, None, "\n"),
+])
+def test_invalid_input(mode, buffering, encoding, errors, newline):
+    """Test that invalid params are caught"""
+    with pytest.raises((ValueError, TypeError, LookupError)):
+        open_iterable([], mode, buffering, encoding, errors, newline)
+@pytest.mark.parametrize("buffering", (0, -1, 1))
+def test_reading(buffering):
+    def gen():
+        yield from (
+            b'\x01\x02\x03\x04\x05',
+            b"abcde",
+            b"fghij",
+            b"klmno",
+            b"qrstu",
+            b"vwxyz",
+            b'\x06\x07\x08\x09\x10',
+        )
+    _data = b"".join(gen())
+    with open_iterable(gen(), "rb", buffering=buffering) as i:
+        assert i.readable()
+        assert not i.seekable()
+        assert not i.writable()
+        cnt = 0
+        for amt in (0, 1, 2, 3, 4, 5, 10, 1, 1, 0):
+            d = i.read(amt)
+            assert len(d) == amt
+            assert d == _data[cnt:cnt+amt]
+            cnt += amt
+            assert i.tell() == cnt
+        assert i.read() == _data[cnt:]
+        assert i.read() == b""
+        assert i.tell() == len(_data)
+def test_returned_class():
+    """Test that the correct class is returned depending on the mode and buffering spec"""
+    assert isinstance(open_iterable([], "rb", buffering=0), RawIterableReader)
+    assert isinstance(open_iterable([], "rb", buffering=-1), io.BufferedReader)
+    assert isinstance(open_iterable([], "rb", buffering=1), io.BufferedReader)
+    assert isinstance(open_iterable([], "rt", buffering=-1), io.TextIOWrapper)
+    assert isinstance(open_iterable([], "rt", buffering=1), io.TextIOWrapper)
+@pytest.mark.parametrize("mode, buffering",[
+    ("rb", 0),
+    ("rb", -1),
+    ("rt", -1),
+])
+def test_contextmgr_close(mode, buffering):
+    with open_iterable([], mode, buffering) as i:
+        assert not i.closed
+    assert i.closed
+@pytest.mark.parametrize("mode, buffering",[
+    ("rb", 0),
+    ("rb", -1),
+    ("rt", -1),
+])
+def test_unreadable_after_close(mode, buffering):
+    i = open_iterable([b"12345"], mode, buffering)
+    assert not i.read(0)
+    assert i.read(1) in (b"1", "1")
+    assert not i.closed
+    i.close()
+    assert i.closed
+    with pytest.raises(ValueError, match="closed"):
+        i.read()
+    with pytest.raises(ValueError, match="closed"):
+        i.tell()
+def test_yield_empty_bytes():
+    """Test that a generator is only 'done' when it stops yielding, not when it yields empty bytes"""
+    def gen():
+        yield from (
+            b"1",
+            b"", b"", b"", b"", b"", b"", b"",
+            b"2", b"3",
+            b"", b"", b"", b"", b"", b"",
+            b"4",
+        )
+    i = RawIterableReader(gen())
+    out = []
+    while True:
+        b = i.read(1)
+        if not b:
+            break
+        out.append(b)
+    assert len(out) == 4
+    assert b"".join(out) == b"1234"
+def test_read_text():
+    def gen():
+        # 9 lines yielded in non-line chunks
+        yield from (
+            x.encode("utf-8") for x in (
+                "this is a line\n",
+                "",
+                "",
+                "_a",
+                "another line\n",
+                "another line1\n",
+                "another line2\n",
+                "another line_",
+                "a",
+                "aaaaaaa\nbbbbbbbb",
+                "_",
+                "1",
+                "2",
+                "3",
+                "4",
+                "5",
+                "_line line line another line actually\n",
+                "another line\n",
+                "ending line\n",
+                "actual ending line no trailing newline",
+            )
+        )
+    real = "".join(x.decode("utf-8") for x in gen())
+    # read across chunks and lines
+    with open_iterable(gen(), encoding="utf-8") as i:
+        assert i.read(10) == real[:10]
+        assert i.read(10) == real[10:20]
+    with open_iterable(gen(), encoding="utf-8") as i:
+        lines = list(i)
+    with open_iterable(gen(), encoding="utf-8") as i:
+        assert lines == i.readlines()
+    assert len(lines) == len(real.splitlines()) == 9
+    assert "".join(lines) ==  real