polyleven 0.8__cp38-cp38-win32.whl → 0.10.0__cp38-cp38-win32.whl

@@ -2,6 +2,8 @@ Copyright (c) 2021 Fujimoto Seiji <fujimoto@ceptord.net>
2
2
  Copyright (c) 2021 Max Bachmann <kontakt@maxbachmann.de>
3
3
  Copyright (c) 2022 Nick Mazuk
4
4
  Copyright (c) 2022 Michael Weiss <code@mweiss.ch>
5
+ Copyright (c) 2024 Alex Morgan <lexyym@gmail.com>
6
+ Copyright (c) 2026 Michael Mok <pmmmwh@gmail.com>
5
7
 
6
8
  Permission is hereby granted, free of charge, to any person obtaining a copy
7
9
  of this software and associated documentation files (the "Software"), to deal
@@ -1,130 +1,122 @@
1
- Metadata-Version: 2.1
2
- Name: polyleven
3
- Version: 0.8
4
- Summary: A fast C-implemented library for Levenshtein distance
5
- Home-page: https://ceptord.net/
6
- Author: Fujimoto Seiji
7
- Author-email: fujimoto@ceptord.net
8
- License: MIT License
9
- Project-URL: Documentation, https://ceptord.net/
10
- Project-URL: GitHub Mirror, https://github.com/fujimotos/polyleven
11
- Keywords: Levenshtein distance
12
- Classifier: Development Status :: 5 - Production/Stable
13
- Classifier: Operating System :: OS Independent
14
- Classifier: Programming Language :: Python :: 3
15
- Classifier: Programming Language :: C
16
- Classifier: License :: OSI Approved :: MIT License
17
- Requires-Python: >=3.4
18
- Description-Content-Type: text/x-rst
19
- License-File: LICENSE
20
-
21
- ==============================================
22
- Polyleven -- Fast Pythonic Levenshtein Library
23
- ==============================================
24
-
25
- :Website: https://ceptord.net/
26
- :Latest Release: v0.8 (2022-10-02)
27
- :License: MIT License
28
-
29
- 1. Introduction
30
- ===============
31
-
32
- polyleven is a Pythonic Levenshtein distance library that:
33
-
34
- - Is *fast* independent of input types, and hence can be used for
35
- both short (like English words) and long input types (like DNA
36
- sequences).
37
-
38
- - Can be used readily in a manner not covered by restrictive
39
- licenses such as GPL, hence can be used freely in private codes.
40
-
41
- - Supports Python 3.x.
42
-
43
- 2. How to install
44
- =================
45
-
46
- The official package is available on PyPI::
47
-
48
- $ pip install polyleven
49
-
50
- 3. How to use
51
- =============
52
-
53
- Polyleven provides a single interface function ``levenshtein()``. You
54
- can use this function to measure the similarity of two strings.
55
-
56
- >>> from polyleven import levenshtein
57
- >>> levenshtein('aaa', 'ccc')
58
- 3
59
-
60
- If you only care about distances under a certain threshold, you can
61
- pass the max threshold to the third argument.
62
-
63
- >>> levenshtein('acc', 'ccc', 1)
64
- 1
65
- >>> levenshtein('aaa', 'ccc', 1)
66
- 2
67
-
68
- In general, you can gain a noticeable speed boost with threshold
69
- :math:`k < 3`.
70
-
71
- 4. Benchmark
72
- ============
73
-
74
- 4.1 English Words
75
- ------------------
76
-
77
- To compare Polyleven with other Pythonic edit distance libraries,
78
- a million word pairs was generated from `SCOWL`_.
79
-
80
- .. _SCOWL: http://wordlist.aspell.net/
81
-
82
- Each library was measured how long it takes to evaluate all of
83
- these words. The following table summarises the result:
84
-
85
- ============================== ============ ================
86
- Function Name TIME[sec] SPEED[pairs/s]
87
- ============================== ============ ================
88
- edlib 4.763 208216
89
- editdistance 1.943 510450
90
- jellyfish.levenshtein_distance 0.722 1374081
91
- distance.levenshtein 0.623 1591396
92
- Levenshtein.distance 0.500 1982764
93
- polyleven.levenshtein 0.431 2303420
94
- ============================== ============ ================
95
-
96
- 4.2. Longer Inputs
97
- ------------------
98
-
99
- To evaluate the efficiency for longer inputs, I created 5000 pairs
100
- of random strings of size 16, 32, 64, 128, 256, 512 and 1024.
101
-
102
- Each library was measured how fast it can process these entries. [#fn1]_
103
-
104
- ============ ===== ===== ===== ===== ===== ===== ======
105
- Library N=16 N=32 N=64 N=128 N=256 N=512 N=1024
106
- ============ ===== ===== ===== ===== ===== ===== ======
107
- edlib 0.040 0.063 0.094 0.205 0.432 0.908 2.089
108
- editdistance 0.027 0.049 0.086 0.178 0.336 0.740 58.139
109
- jellyfish 0.009 0.032 0.118 0.470 1.874 8.877 42.848
110
- distance 0.007 0.029 0.109 0.431 1.726 6.950 27.998
111
- Levenshtein 0.006 0.022 0.085 0.336 1.328 5.286 21.097
112
- polyleven 0.003 0.005 0.010 0.043 0.149 0.550 2.109
113
- ============ ===== ===== ===== ===== ===== ===== ======
114
-
115
- 3.3. List of Libraries
116
- ----------------------
117
-
118
- ============ ======= ==========================================
119
- Library Version URL
120
- ============ ======= ==========================================
121
- edlib v1.2.1 https://github.com/Martinsos/edlib
122
- editdistance v0.4 https://github.com/aflc/editdistance
123
- jellyfish v0.5.6 https://github.com/jamesturk/jellyfish
124
- distance v0.1.3 https://github.com/doukremt/distance
125
- Levenshtein v0.12 https://github.com/ztane/python-Levenshtein
126
- polyleven v0.3 https://github.com/fujimotos/polyleven
127
- ============ ======= ==========================================
128
-
129
- .. [#fn1] Measured using Python 3.5.3 on Debian Jessie with Intel Core
130
- i3-4010U (1.70GHz)
1
+ Metadata-Version: 2.1
2
+ Name: polyleven
3
+ Version: 0.10.0
4
+ Summary: A fast C-implemented library for Levenshtein distance
5
+ Maintainer-email: Fujimoto Seiji <fujimoto@ceptord.net>
6
+ Project-URL: github, https://github.com/fujimotos/polyleven
7
+ Keywords: Levenshtein distance
8
+ Classifier: Development Status :: 5 - Production/Stable
9
+ Classifier: Operating System :: OS Independent
10
+ Classifier: Programming Language :: Python :: 3
11
+ Classifier: Programming Language :: C
12
+ Requires-Python: >=3.8
13
+ Description-Content-Type: text/x-rst
14
+ License-File: LICENSE
15
+
16
+ ==============================================
17
+ Polyleven -- Fast Pythonic Levenshtein Library
18
+ ==============================================
19
+
20
+ :License: MIT License
21
+
22
+ 1. Introduction
23
+ ===============
24
+
25
+ polyleven is a Pythonic Levenshtein distance library that:
26
+
27
+ - Is **fast** independent of input types, and hence can be applied to
28
+ both short (like English words) and long inputs (like DNA sequences).
29
+
30
+ - Is **stand-alone** depending only on core Python packages.
31
+
32
+ - Is distributed under the **MIT License**, hence can be used freely
33
+ in private projects.
34
+
35
+ 2. How to install
36
+ =================
37
+
38
+ The official package is available on PyPI::
39
+
40
+ $ pip install polyleven
41
+
42
+ 3. How to use
43
+ =============
44
+
45
+ Polyleven provides a single interface function ``levenshtein()``. You
46
+ can use this function to measure the similarity of two strings.
47
+
48
+ >>> from polyleven import levenshtein
49
+ >>> levenshtein('aaa', 'ccc')
50
+ 3
51
+
52
+ If you only care about distances under a certain threshold, you can
53
+ pass the max threshold to the third argument.
54
+
55
+ >>> levenshtein('acc', 'ccc', 1)
56
+ 1
57
+ >>> levenshtein('aaa', 'ccc', 1)
58
+ 2
59
+
60
+ In general, you can gain a noticeable speed boost with threshold
61
+ :math:`k < 3`.
62
+
63
+ 4. Benchmark
64
+ ============
65
+
66
+ 4.1 English Words
67
+ ------------------
68
+
69
+ To compare Polyleven with other Pythonic edit distance libraries,
70
+ a million word pairs was generated from `SCOWL`_.
71
+
72
+ .. _SCOWL: http://wordlist.aspell.net/
73
+
74
+ Each library was measured how long it takes to evaluate all of
75
+ these words. The following table summarises the result:
76
+
77
+ ============================== ============ ================
78
+ Function Name TIME[sec] SPEED[pairs/s]
79
+ ============================== ============ ================
80
+ edlib 4.763 208216
81
+ editdistance 1.943 510450
82
+ jellyfish.levenshtein_distance 0.722 1374081
83
+ distance.levenshtein 0.623 1591396
84
+ Levenshtein.distance 0.500 1982764
85
+ polyleven.levenshtein 0.431 2303420
86
+ ============================== ============ ================
87
+
88
+ 4.2. Longer Inputs
89
+ ------------------
90
+
91
+ To evaluate the efficiency for longer inputs, I created 5000 pairs
92
+ of random strings of size 16, 32, 64, 128, 256, 512 and 1024.
93
+
94
+ Each library was measured how fast it can process these entries. [#fn1]_
95
+
96
+ ============ ===== ===== ===== ===== ===== ===== ======
97
+ Library N=16 N=32 N=64 N=128 N=256 N=512 N=1024
98
+ ============ ===== ===== ===== ===== ===== ===== ======
99
+ edlib 0.040 0.063 0.094 0.205 0.432 0.908 2.089
100
+ editdistance 0.027 0.049 0.086 0.178 0.336 0.740 58.139
101
+ jellyfish 0.009 0.032 0.118 0.470 1.874 8.877 42.848
102
+ distance 0.007 0.029 0.109 0.431 1.726 6.950 27.998
103
+ Levenshtein 0.006 0.022 0.085 0.336 1.328 5.286 21.097
104
+ polyleven 0.003 0.005 0.010 0.043 0.149 0.550 2.109
105
+ ============ ===== ===== ===== ===== ===== ===== ======
106
+
107
+ 3.3. List of Libraries
108
+ ----------------------
109
+
110
+ ============ ======= ==========================================
111
+ Library Version URL
112
+ ============ ======= ==========================================
113
+ edlib v1.2.1 https://github.com/Martinsos/edlib
114
+ editdistance v0.4 https://github.com/aflc/editdistance
115
+ jellyfish v0.5.6 https://github.com/jamesturk/jellyfish
116
+ distance v0.1.3 https://github.com/doukremt/distance
117
+ Levenshtein v0.12 https://github.com/ztane/python-Levenshtein
118
+ polyleven v0.3 https://github.com/fujimotos/polyleven
119
+ ============ ======= ==========================================
120
+
121
+ .. [#fn1] Measured using Python 3.5.3 on Debian Jessie with Intel Core
122
+ i3-4010U (1.70GHz)
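
Note on the threshold argument described in the README above: the documented examples show that when the true distance exceeds the threshold `k`, the call returns `k + 1` rather than the full distance (`levenshtein('aaa', 'ccc', 1)` gives 2, not 3). The following is a minimal pure-Python sketch of that contract using the textbook dynamic-programming recurrence. It is not polyleven's C implementation; the name `bounded_levenshtein` and the convention that a negative `k` means "no threshold" are assumptions made for this illustration.

```python
def bounded_levenshtein(a: str, b: str, k: int = -1) -> int:
    """Edit distance between a and b, capped at k + 1 when k >= 0.

    Hypothetical reference helper, not part of polyleven.
    A negative k (an assumption of this sketch) disables the cap.
    """
    # Keep the shorter string in the inner loop so each row is small.
    if len(a) < len(b):
        a, b = b, a
    # The length difference is a lower bound on the distance.
    if k >= 0 and len(a) - len(b) > k:
        return k + 1

    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        # Row minima never decrease, so once a whole row exceeds k
        # the final distance must exceed k as well.
        if k >= 0 and min(curr) > k:
            return k + 1
        prev = curr

    dist = prev[-1]
    return min(dist, k + 1) if k >= 0 else dist


if __name__ == "__main__":
    print(bounded_levenshtein("aaa", "ccc"))     # 3
    print(bounded_levenshtein("acc", "ccc", 1))  # 1
    print(bounded_levenshtein("aaa", "ccc", 1))  # 2
```

The early exit is where a threshold can save work in this sketch: with a small `k`, most dissimilar pairs are rejected after a few rows instead of filling the whole table.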
@@ -0,0 +1,6 @@
1
+ polyleven.cp38-win32.pyd,sha256=6UNMCc3uDAIl8ZHNT7M0ojA6lCwTi7xykcTDkbtEb_4,14336
2
+ polyleven-0.10.0.dist-info/LICENSE,sha256=xnZjXO5s8lo3gIlvxuEZIT0bkSbZWKupIH9qtavDHwE,1344
3
+ polyleven-0.10.0.dist-info/METADATA,sha256=bHSFVGGyLBg2ZJAWvNBxpMHtmvoXRScMJJ-IDrfWphw,4351
4
+ polyleven-0.10.0.dist-info/WHEEL,sha256=eVYmGj5rXRtpsCfsI3rXVwEX_vwxTLMXmwf_caP9MfU,95
5
+ polyleven-0.10.0.dist-info/top_level.txt,sha256=12GbQ6DLcEtqgc30L3CguDVut0T-AYu2LoAm0fY4-cY,21
6
+ polyleven-0.10.0.dist-info/RECORD,,
@@ -1,5 +1,5 @@
1
1
  Wheel-Version: 1.0
2
- Generator: bdist_wheel (0.37.1)
2
+ Generator: setuptools (75.3.3)
3
3
  Root-Is-Purelib: false
4
4
  Tag: cp38-cp38-win32
5
5
 
@@ -0,0 +1,2 @@
1
+ polyleven
2
+ wheelhouse
polyleven.cp38-win32.pyd CHANGED
Binary file
@@ -1,6 +0,0 @@
1
- polyleven.cp38-win32.pyd,sha256=_KpmtUuOh4ALLaAVaNcj3G5DrFYLXOZn1pY_zyJ0cJw,13312
2
- polyleven-0.8.dist-info/LICENSE,sha256=b_7rLdReW-_Bj1D0M7xcb5qG9KZuLSDeOB3Vqwk3Z1Q,1242
3
- polyleven-0.8.dist-info/METADATA,sha256=XEutlMdLtn6kTOhXgyvOKE6kvhNCoI9qoUcJdhcac_c,4455
4
- polyleven-0.8.dist-info/WHEEL,sha256=f5A3Ypo9PM0E06rgvH7-s12WthkA89SedEqLArPnjKU,96
5
- polyleven-0.8.dist-info/top_level.txt,sha256=v9yjpvGjpSenYL_Mbe2BIS_vq01x_pd07i352hs88RQ,10
6
- polyleven-0.8.dist-info/RECORD,,
@@ -1 +0,0 @@
1
- polyleven
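
Note on the RECORD hunks above: each entry has the form `path,sha256=<digest>,<size>`, where the digest is the URL-safe base64 encoding of the SHA-256 hash with trailing `=` padding removed, and the RECORD file lists itself with empty hash and size fields. The following is a minimal sketch of how one such entry could be checked against file contents taken from an unpacked wheel. The helper names are hypothetical, and only the Python standard library is used.

```python
import base64
import hashlib
from typing import Optional, Tuple


def record_digest(data: bytes) -> str:
    """Return the hash field as it appears in a wheel RECORD entry."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def parse_record_line(line: str) -> Tuple[str, str, Optional[int]]:
    """Split one RECORD line into (path, hash field, size or None)."""
    # Split from the right so a comma inside the path would not break parsing.
    path, hash_field, size = line.strip().rsplit(",", 2)
    return path, hash_field, int(size) if size else None


def matches_record(line: str, data: bytes) -> bool:
    """Check file contents against a RECORD entry (hypothetical helper)."""
    _path, hash_field, size = parse_record_line(line)
    if not hash_field:
        # The RECORD file itself is listed without hash or size.
        return True
    return hash_field == record_digest(data) and size == len(data)


if __name__ == "__main__":
    entry = "polyleven-0.10.0.dist-info/top_level.txt,sha256=12GbQ6DLcEtqgc30L3CguDVut0T-AYu2LoAm0fY4-cY,21"
    print(parse_record_line(entry))
```

The size field alone already gives a quick consistency check: the new `top_level.txt` entry records 21 bytes, which is the length of the two names `polyleven` and `wheelhouse` with one newline after each.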