polyleven 0.9.0__cp311-cp311-macosx_11_0_arm64.whl

@@ -0,0 +1,24 @@
1
+ Copyright (c) 2021 Fujimoto Seiji <fujimoto@ceptord.net>
2
+ Copyright (c) 2021 Max Bachmann <kontakt@maxbachmann.de>
3
+ Copyright (c) 2022 Nick Mazuk
4
+ Copyright (c) 2022 Michael Weiss <code@mweiss.ch>
5
+ Copyright (c) 2024 Alex Morgan <lexyym@gmail.com>
6
+
7
+ Permission is hereby granted, free of charge, to any person obtaining a copy
8
+ of this software and associated documentation files (the "Software"), to deal
9
+ in the Software without restriction, including without limitation the rights
10
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
11
+ copies of the Software, and to permit persons to whom the Software is
12
+ furnished to do so, subject to the following conditions:
13
+
14
+ The above copyright notice and this permission notice shall be included in all
15
+ copies or substantial portions of the Software.
16
+
17
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
18
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
19
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
20
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
21
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
22
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
23
+ SOFTWARE.
24
+
@@ -0,0 +1,124 @@
1
+ Metadata-Version: 2.2
2
+ Name: polyleven
3
+ Version: 0.9.0
4
+ Summary: A fast C-implemented library for Levenshtein distance
5
+ Maintainer-email: Fujimoto Seiji <fujimoto@ceptord.net>
6
+ Project-URL: github, https://github.com/fujimotos/polyleven
7
+ Keywords: Levenshtein distance
8
+ Classifier: Development Status :: 5 - Production/Stable
9
+ Classifier: Operating System :: OS Independent
10
+ Classifier: Programming Language :: Python :: 3
11
+ Classifier: Programming Language :: C
12
+ Requires-Python: >=3.8
13
+ Description-Content-Type: text/x-rst
14
+ License-File: LICENSE
15
+
16
+ ==============================================
17
+ Polyleven -- Fast Pythonic Levenshtein Library
18
+ ==============================================
19
+
20
+ :License: MIT License
21
+
22
+ 1. Introduction
23
+ ===============
24
+
25
+ polyleven is a Pythonic Levenshtein distance library that:
26
+
27
+ - Is *fast* regardless of input type, so it can be used for both
28
+ short inputs (like English words) and long inputs (like DNA
29
+ sequences).
30
+
31
+ - Is *stand-alone* depending only on core Python packages (such as
32
+ ``setuptools`` for installation).
33
+
34
+ - Is not covered by a restrictive license such as the GPL, and
35
+ hence can be used freely in private code.
36
+
37
+ 2. How to install
38
+ =================
39
+
40
+ The official package is available on PyPI::
41
+
42
+ $ pip install polyleven
43
+
44
+ 3. How to use
45
+ =============
46
+
47
+ Polyleven provides a single interface function ``levenshtein()``. You
48
+ can use this function to measure the similarity of two strings.
49
+
50
+ >>> from polyleven import levenshtein
51
+ >>> levenshtein('aaa', 'ccc')
52
+ 3
53
+
54
+ If you only care about distances under a certain threshold, you can
55
+ pass the maximum threshold as the third argument.
56
+
57
+ >>> levenshtein('acc', 'ccc', 1)
58
+ 1
59
+ >>> levenshtein('aaa', 'ccc', 1)
60
+ 2
61
+
62
+ In general, you can gain a noticeable speed boost with threshold
63
+ :math:`k < 3`.
64
+
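To make the threshold semantics concrete, here is a minimal pure-Python sketch of a bounded edit distance. It is not polyleven's C implementation. The name ``bounded_levenshtein`` and the default ``k=-1`` (meaning "no threshold") are hypothetical, and the "return ``k + 1`` once the distance exceeds ``k``" rule is an assumption inferred from the example above.

```python
def bounded_levenshtein(a: str, b: str, k: int = -1) -> int:
    """Textbook dynamic-programming edit distance with an optional cap.

    Assumption (inferred from the README example, not from polyleven's
    source): when k >= 0 and the true distance exceeds k, k + 1 is returned.
    """
    # Keep the shorter string in the inner loop to use less memory.
    if len(a) < len(b):
        a, b = b, a
    # The length difference is a lower bound on the distance, so we can
    # stop early when it already exceeds the threshold.
    if 0 <= k < len(a) - len(b):
        return k + 1
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    dist = prev[-1]
    return k + 1 if 0 <= k < dist else dist


print(bounded_levenshtein("aaa", "ccc"))     # 3
print(bounded_levenshtein("acc", "ccc", 1))  # 1
print(bounded_levenshtein("aaa", "ccc", 1))  # 2
```

This sketch always fills the full table, so it gains nothing from a small threshold except the early exit on the length difference. A real bounded implementation can skip most of the table, which is presumably where the speed-up for small ``k`` comes from.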
65
+ 4. Benchmark
66
+ ============
67
+
68
+ 4.1. English Words
69
+ ------------------
70
+
71
+ To compare Polyleven with other Pythonic edit distance libraries,
72
+ a million word pairs were generated from `SCOWL`_.
73
+
74
+ .. _SCOWL: http://wordlist.aspell.net/
75
+
76
+ Each library was timed on how long it takes to evaluate all of
77
+ these word pairs. The following table summarises the results:
78
+
79
+ ============================== ============ ================
80
+ Function Name TIME[sec] SPEED[pairs/s]
81
+ ============================== ============ ================
82
+ edlib 4.763 208216
83
+ editdistance 1.943 510450
84
+ jellyfish.levenshtein_distance 0.722 1374081
85
+ distance.levenshtein 0.623 1591396
86
+ Levenshtein.distance 0.500 1982764
87
+ polyleven.levenshtein 0.431 2303420
88
+ ============================== ============ ================
89
+
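A measurement of this kind could be taken roughly as follows. This is only a sketch, not the harness used for the table: ``time_pairs`` is a hypothetical helper, and the stand-in distance function exists only so the snippet runs without any third-party package installed.

```python
import time
from typing import Callable, Iterable, Tuple


def time_pairs(func: Callable[[str, str], int],
               pairs: Iterable[Tuple[str, str]]) -> Tuple[float, float]:
    """Return (elapsed seconds, pairs per second) for func over all pairs."""
    pairs = list(pairs)  # materialise first so iteration cost is not timed
    start = time.perf_counter()
    for a, b in pairs:
        func(a, b)
    elapsed = time.perf_counter() - start
    speed = len(pairs) / elapsed if elapsed > 0 else float("inf")
    return elapsed, speed


# Stand-in for a real distance function (hypothetical; swap in the
# library function you want to measure).
def length_gap(a: str, b: str) -> int:
    return abs(len(a) - len(b))


elapsed, speed = time_pairs(length_gap, [("apple", "apply")] * 10_000)
print(f"{elapsed:.3f} sec, {speed:.0f} pairs/s")
```

Passing each library's distance function to the same helper keeps the loop overhead identical across libraries, so the differences in the results reflect the functions themselves.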
90
+ 4.2. Longer Inputs
91
+ ------------------
92
+
93
+ To evaluate the efficiency for longer inputs, I created 5000 pairs
94
+ of random strings of size 16, 32, 64, 128, 256, 512 and 1024.
95
+
96
+ Each library was timed on how fast it can process these entries. [#fn1]_
97
+
98
+ ============ ===== ===== ===== ===== ===== ===== ======
99
+ Library N=16 N=32 N=64 N=128 N=256 N=512 N=1024
100
+ ============ ===== ===== ===== ===== ===== ===== ======
101
+ edlib 0.040 0.063 0.094 0.205 0.432 0.908 2.089
102
+ editdistance 0.027 0.049 0.086 0.178 0.336 0.740 58.139
103
+ jellyfish 0.009 0.032 0.118 0.470 1.874 8.877 42.848
104
+ distance 0.007 0.029 0.109 0.431 1.726 6.950 27.998
105
+ Levenshtein 0.006 0.022 0.085 0.336 1.328 5.286 21.097
106
+ polyleven 0.003 0.005 0.010 0.043 0.149 0.550 2.109
107
+ ============ ===== ===== ===== ===== ===== ===== ======
108
+
109
+ 4.3. List of Libraries
110
+ ----------------------
111
+
112
+ ============ ======= ==========================================
113
+ Library Version URL
114
+ ============ ======= ==========================================
115
+ edlib v1.2.1 https://github.com/Martinsos/edlib
116
+ editdistance v0.4 https://github.com/aflc/editdistance
117
+ jellyfish v0.5.6 https://github.com/jamesturk/jellyfish
118
+ distance v0.1.3 https://github.com/doukremt/distance
119
+ Levenshtein v0.12 https://github.com/ztane/python-Levenshtein
120
+ polyleven v0.3 https://github.com/fujimotos/polyleven
121
+ ============ ======= ==========================================
122
+
123
+ .. [#fn1] Measured using Python 3.5.3 on Debian Jessie with Intel Core
124
+ i3-4010U (1.70GHz)
@@ -0,0 +1,6 @@
1
+ polyleven.cpython-311-darwin.so,sha256=jU7_9YVNiu3G80y5kbyI8K8QDOGw-hd_ZWyIs5WvnQ0,51168
2
+ polyleven-0.9.0.dist-info/RECORD,,
3
+ polyleven-0.9.0.dist-info/LICENSE,sha256=T_T9pykid2jtdVaFe-RY949YdsV8Lz7c9efEdPgdE7w,1269
4
+ polyleven-0.9.0.dist-info/WHEEL,sha256=L-bHjLRny89F_LWf23FhUoYKNLvf3bh7q-angPNsNnI,109
5
+ polyleven-0.9.0.dist-info/top_level.txt,sha256=12GbQ6DLcEtqgc30L3CguDVut0T-AYu2LoAm0fY4-cY,21
6
+ polyleven-0.9.0.dist-info/METADATA,sha256=l7A8e0nKPH9WSJ_Q4GSzFqkjkl1t3IvrlAKd-eNo-fA,4309
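Each RECORD entry above has the form ``path,sha256=<digest>,<size>``, where the digest is URL-safe base64 with the trailing ``=`` padding removed. The sketch below shows how such an entry could be checked against a file's bytes. ``record_hash`` and ``check_record_line`` are hypothetical helper names, and the sample entry uses an empty file rather than any file from this wheel.

```python
import base64
import hashlib


def record_hash(data: bytes) -> str:
    """Format a SHA-256 digest the way wheel RECORD files store it."""
    digest = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(digest).rstrip(b"=")
    return "sha256=" + encoded.decode("ascii")


def check_record_line(line: str, data: bytes) -> bool:
    """Check one 'path,hash,size' entry against the file's bytes."""
    # Split from the right so a comma inside the path would not break parsing.
    _path, hash_field, size_field = line.rsplit(",", 2)
    if not hash_field:
        # RECORD lists itself with empty hash and size fields.
        return True
    return hash_field == record_hash(data) and int(size_field) == len(data)


# An empty file, used here purely as an illustration.
print(record_hash(b""))  # sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU
```

To check a real entry, read the corresponding file from the unpacked wheel in binary mode and pass its bytes together with the RECORD line.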
@@ -0,0 +1,5 @@
1
+ Wheel-Version: 1.0
2
+ Generator: setuptools (75.8.1)
3
+ Root-Is-Purelib: false
4
+ Tag: cp311-cp311-macosx_11_0_arm64
5
+
@@ -0,0 +1,2 @@
1
+ polyleven
2
+ wheelhouse
Binary file