pvleopard 2.0.1__tar.gz → 2.0.3__tar.gz
- {pvleopard-2.0.1 → pvleopard-2.0.3}/MANIFEST.in +0 -1
- pvleopard-2.0.3/PKG-INFO +104 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/README.md +19 -10
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/_util.py +10 -7
- pvleopard-2.0.3/pvleopard/lib/raspberry-pi/cortex-a76/libpv_leopard.so +0 -0
- pvleopard-2.0.3/pvleopard/lib/raspberry-pi/cortex-a76-aarch64/libpv_leopard.so +0 -0
- pvleopard-2.0.3/pvleopard.egg-info/PKG-INFO +104 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard.egg-info/SOURCES.txt +2 -1
- {pvleopard-2.0.1 → pvleopard-2.0.3}/setup.py +4 -4
- pvleopard-2.0.1/PKG-INFO +0 -97
- pvleopard-2.0.1/pvleopard/lib/jetson/cortex-a57-aarch64/libpv_leopard.so +0 -0
- pvleopard-2.0.1/pvleopard.egg-info/PKG-INFO +0 -97
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/LICENSE +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/__init__.py +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/_factory.py +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/_leopard.py +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/common/leopard_params.pv +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/linux/x86_64/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/mac/arm64/libpv_leopard.dylib +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/mac/x86_64/libpv_leopard.dylib +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/raspberry-pi/cortex-a53/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/raspberry-pi/cortex-a72/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/windows/amd64/libpv_leopard.dll +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard.egg-info/dependency_links.txt +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard.egg-info/top_level.txt +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.3}/setup.cfg +0 -0
pvleopard-2.0.3/PKG-INFO
ADDED
@@ -0,0 +1,104 @@
|
|
1
|
+
Metadata-Version: 2.1
|
2
|
+
Name: pvleopard
|
3
|
+
Version: 2.0.3
|
4
|
+
Summary: Leopard Speech-to-Text Engine.
|
5
|
+
Home-page: https://github.com/Picovoice/leopard
|
6
|
+
Author: Picovoice
|
7
|
+
Author-email: hello@picovoice.ai
|
8
|
+
Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
|
9
|
+
Classifier: Development Status :: 5 - Production/Stable
|
10
|
+
Classifier: Intended Audience :: Developers
|
11
|
+
Classifier: License :: OSI Approved :: Apache Software License
|
12
|
+
Classifier: Operating System :: OS Independent
|
13
|
+
Classifier: Programming Language :: Python :: 3
|
14
|
+
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
|
15
|
+
Requires-Python: >=3.8
|
16
|
+
Description-Content-Type: text/markdown
|
17
|
+
|
18
|
+
# Leopard Binding for Python
|
19
|
+
|
20
|
+
## Leopard Speech-to-Text Engine
|
21
|
+
|
22
|
+
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
|
23
|
+
|
24
|
+
Leopard is an on-device speech-to-text engine. Leopard is:
|
25
|
+
|
26
|
+
- Private; All voice processing runs locally.
|
27
|
+
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
28
|
+
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
29
|
+
- Cross-Platform:
|
30
|
+
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
|
31
|
+
- Android and iOS
|
32
|
+
- Chrome, Safari, Firefox, and Edge
|
33
|
+
- Raspberry Pi (3, 4, 5)
|
34
|
+
|
35
|
+
## Compatibility
|
36
|
+
|
37
|
+
- Python 3.8+
|
38
|
+
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), and Raspberry Pi (3, 4, 5).
|
39
|
+
|
40
|
+
## Installation
|
41
|
+
|
42
|
+
```console
|
43
|
+
pip3 install pvleopard
|
44
|
+
```
|
45
|
+
|
46
|
+
## AccessKey
|
47
|
+
|
48
|
+
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
|
49
|
+
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
50
|
+
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
51
|
+
|
52
|
+
## Usage
|
53
|
+
|
54
|
+
Create an instance of the engine and transcribe an audio file:
|
55
|
+
|
56
|
+
```python
|
57
|
+
import pvleopard
|
58
|
+
|
59
|
+
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
60
|
+
|
61
|
+
transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
|
62
|
+
print(transcript)
|
63
|
+
for word in words:
|
64
|
+
print(
|
65
|
+
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
|
66
|
+
% (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
|
67
|
+
```
|
68
|
+
|
69
|
+
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
70
|
+
`${AUDIO_FILE_PATH}` to the path an audio file.
|
71
|
+
|
72
|
+
Finally, when done be sure to explicitly release the resources:
|
73
|
+
```python
|
74
|
+
leopard.delete()
|
75
|
+
```
|
76
|
+
|
77
|
+
### Language Model
|
78
|
+
|
79
|
+
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
80
|
+
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
81
|
+
|
82
|
+
Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
|
83
|
+
language models with custom vocabulary and boost words in the existing vocabulary.
|
84
|
+
|
85
|
+
Pass in the `.pv` file via the `model_path` argument:
|
86
|
+
```python
|
87
|
+
leopard = pvleopard.create(
|
88
|
+
access_key='${ACCESS_KEY}',
|
89
|
+
model_path='${MODEL_FILE_PATH}')
|
90
|
+
```
|
91
|
+
|
92
|
+
### Word Metadata
|
93
|
+
|
94
|
+
Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
|
95
|
+
|
96
|
+
- **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
|
97
|
+
- **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
|
98
|
+
- **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
|
99
|
+
- **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
|
100
|
+
|
101
|
+
## Demos
|
102
|
+
|
103
|
+
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
104
|
+
Leopard.
|
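The Word Metadata section added to the README above describes a `speaker_tag` field alongside timing and confidence. As a rough sketch of how that metadata might be consumed, the snippet below groups words by speaker. It does not call pvleopard: `Word` is a stand-in namedtuple that borrows the field names from the README's usage example, and the helper `group_by_speaker` is hypothetical, so treat both as assumptions for illustration only.

```python
from collections import defaultdict, namedtuple

# Hypothetical stand-in for the word objects returned by process_file();
# the field names follow the README's usage example.
Word = namedtuple("Word", ["word", "start_sec", "end_sec", "confidence", "speaker_tag"])


def group_by_speaker(words):
    """Join the words of each speaker tag into one string.

    Per the README, a tag of -1 means diarization was not enabled
    and 0 is reserved for unknown speakers.
    """
    grouped = defaultdict(list)
    for w in words:
        grouped[w.speaker_tag].append(w.word)
    return {tag: " ".join(ws) for tag, ws in grouped.items()}


# Made-up sample data, not real engine output.
sample = [
    Word("hello", 0.00, 0.40, 0.98, 1),
    Word("there", 0.45, 0.80, 0.95, 1),
    Word("hi", 1.10, 1.30, 0.91, 2),
]
print(group_by_speaker(sample))
```

With real output, the `words` list returned by `leopard.process_file(...)` would presumably be passed in place of `sample`, provided the engine was created with speaker diarization enabled.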
@@ -13,12 +13,12 @@ Leopard is an on-device speech-to-text engine. Leopard is:
|
|
13
13
|
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
|
14
14
|
- Android and iOS
|
15
15
|
- Chrome, Safari, Firefox, and Edge
|
16
|
-
- Raspberry Pi (4,
|
16
|
+
- Raspberry Pi (3, 4, 5)
|
17
17
|
|
18
18
|
## Compatibility
|
19
19
|
|
20
|
-
- Python 3.
|
21
|
-
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4,
|
20
|
+
- Python 3.8+
|
21
|
+
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), and Raspberry Pi (3, 4, 5).
|
22
22
|
|
23
23
|
## Installation
|
24
24
|
|
@@ -32,7 +32,7 @@ Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` ac
|
|
32
32
|
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
33
33
|
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
34
34
|
|
35
|
-
|
35
|
+
## Usage
|
36
36
|
|
37
37
|
Create an instance of the engine and transcribe an audio file:
|
38
38
|
|
@@ -41,23 +41,23 @@ import pvleopard
|
|
41
41
|
|
42
42
|
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
43
43
|
|
44
|
-
transcript, words = leopard.process_file('${
|
44
|
+
transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
|
45
45
|
print(transcript)
|
46
46
|
for word in words:
|
47
47
|
print(
|
48
|
-
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
|
49
|
-
% (word.word, word.start_sec, word.end_sec, word.confidence))
|
48
|
+
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
|
49
|
+
% (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
|
50
50
|
```
|
51
51
|
|
52
52
|
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
53
|
-
`${
|
53
|
+
`${AUDIO_FILE_PATH}` to the path an audio file.
|
54
54
|
|
55
55
|
Finally, when done be sure to explicitly release the resources:
|
56
56
|
```python
|
57
57
|
leopard.delete()
|
58
58
|
```
|
59
59
|
|
60
|
-
|
60
|
+
### Language Model
|
61
61
|
|
62
62
|
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
63
63
|
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
@@ -69,9 +69,18 @@ Pass in the `.pv` file via the `model_path` argument:
|
|
69
69
|
```python
|
70
70
|
leopard = pvleopard.create(
|
71
71
|
access_key='${ACCESS_KEY}',
|
72
|
-
model_path='${
|
72
|
+
model_path='${MODEL_FILE_PATH}')
|
73
73
|
```
|
74
74
|
|
75
|
+
### Word Metadata
|
76
|
+
|
77
|
+
Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
|
78
|
+
|
79
|
+
- **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
|
80
|
+
- **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
|
81
|
+
- **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
|
82
|
+
- **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
|
83
|
+
|
75
84
|
## Demos
|
76
85
|
|
77
86
|
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
@@ -1,5 +1,5 @@
|
|
1
1
|
#
|
2
|
-
# Copyright 2022-
|
2
|
+
# Copyright 2022-2024 Picovoice Inc.
|
3
3
|
#
|
4
4
|
# You may not use this file except in compliance with the license. A copy of the license is located in the "LICENSE"
|
5
5
|
# file accompanying this source.
|
@@ -37,16 +37,21 @@ def _linux_machine():
|
|
37
37
|
|
38
38
|
if '0xd03' == cpu_part:
|
39
39
|
return 'cortex-a53' + arch_info
|
40
|
-
elif '0xd07' == cpu_part:
|
41
|
-
return 'cortex-a57' + arch_info
|
42
40
|
elif '0xd08' == cpu_part:
|
43
41
|
return 'cortex-a72' + arch_info
|
42
|
+
elif "0xd0b" == cpu_part:
|
43
|
+
return "cortex-a76" + arch_info
|
44
44
|
else:
|
45
45
|
raise NotImplementedError("Unsupported CPU: `%s`." % cpu_part)
|
46
46
|
|
47
47
|
|
48
|
-
_RASPBERRY_PI_MACHINES = {
|
49
|
-
|
48
|
+
_RASPBERRY_PI_MACHINES = {
|
49
|
+
"cortex-a53",
|
50
|
+
"cortex-a72",
|
51
|
+
"cortex-a76",
|
52
|
+
"cortex-a53-aarch64",
|
53
|
+
"cortex-a72-aarch64",
|
54
|
+
"cortex-a76-aarch64"}
|
50
55
|
|
51
56
|
|
52
57
|
def default_library_path(relative):
|
@@ -59,8 +64,6 @@ def default_library_path(relative):
|
|
59
64
|
linux_machine = _linux_machine()
|
60
65
|
if linux_machine == 'x86_64':
|
61
66
|
return os.path.join(os.path.dirname(__file__), relative, 'lib/linux/x86_64/libpv_leopard.so')
|
62
|
-
elif linux_machine in _JETSON_MACHINES:
|
63
|
-
return os.path.join(os.path.dirname(__file__), relative, 'lib/jetson/%s/libpv_leopard.so' % linux_machine)
|
64
67
|
elif linux_machine in _RASPBERRY_PI_MACHINES:
|
65
68
|
return os.path.join(
|
66
69
|
os.path.dirname(__file__),
|
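The `_util.py` hunks above change which ARM CPU part IDs `_linux_machine()` recognizes: `0xd07` (Cortex-A57, which backed the removed `lib/jetson` library) is dropped and `0xd0b` (Cortex-A76) is added. A minimal, self-contained sketch of the resulting lookup follows. The names `CPU_PART_TO_CORE` and `machine_for_cpu_part` are illustrative and do not appear in the package, and the real function computes `cpu_part` and `arch_info` in code not shown in this diff rather than taking them as arguments.

```python
# Illustrative sketch only; not the package's actual implementation.

# CPU part IDs recognized after the 2.0.3 change, as shown in the hunk.
CPU_PART_TO_CORE = {
    "0xd03": "cortex-a53",
    "0xd08": "cortex-a72",
    "0xd0b": "cortex-a76",  # added in 2.0.3
}

# Mirrors the _RASPBERRY_PI_MACHINES set from the 2.0.3 side of the diff.
RASPBERRY_PI_MACHINES = {
    "cortex-a53",
    "cortex-a72",
    "cortex-a76",
    "cortex-a53-aarch64",
    "cortex-a72-aarch64",
    "cortex-a76-aarch64",
}


def machine_for_cpu_part(cpu_part, arch_info=""):
    """Map a CPU part ID to a library directory name, or raise if unsupported."""
    try:
        core = CPU_PART_TO_CORE[cpu_part]
    except KeyError:
        raise NotImplementedError("Unsupported CPU: `%s`." % cpu_part) from None
    return core + arch_info
```

Under this sketch, a part ID of `0xd07` raises `NotImplementedError`, which matches the removal of the Jetson branch from `default_library_path` in the next hunk.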
Binary file
|
@@ -0,0 +1,104 @@
|
|
1
|
+
Metadata-Version: 2.1
|
2
|
+
Name: pvleopard
|
3
|
+
Version: 2.0.3
|
4
|
+
Summary: Leopard Speech-to-Text Engine.
|
5
|
+
Home-page: https://github.com/Picovoice/leopard
|
6
|
+
Author: Picovoice
|
7
|
+
Author-email: hello@picovoice.ai
|
8
|
+
Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
|
9
|
+
Classifier: Development Status :: 5 - Production/Stable
|
10
|
+
Classifier: Intended Audience :: Developers
|
11
|
+
Classifier: License :: OSI Approved :: Apache Software License
|
12
|
+
Classifier: Operating System :: OS Independent
|
13
|
+
Classifier: Programming Language :: Python :: 3
|
14
|
+
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
|
15
|
+
Requires-Python: >=3.8
|
16
|
+
Description-Content-Type: text/markdown
|
17
|
+
|
18
|
+
# Leopard Binding for Python
|
19
|
+
|
20
|
+
## Leopard Speech-to-Text Engine
|
21
|
+
|
22
|
+
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
|
23
|
+
|
24
|
+
Leopard is an on-device speech-to-text engine. Leopard is:
|
25
|
+
|
26
|
+
- Private; All voice processing runs locally.
|
27
|
+
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
28
|
+
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
29
|
+
- Cross-Platform:
|
30
|
+
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
|
31
|
+
- Android and iOS
|
32
|
+
- Chrome, Safari, Firefox, and Edge
|
33
|
+
- Raspberry Pi (3, 4, 5)
|
34
|
+
|
35
|
+
## Compatibility
|
36
|
+
|
37
|
+
- Python 3.8+
|
38
|
+
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), and Raspberry Pi (3, 4, 5).
|
39
|
+
|
40
|
+
## Installation
|
41
|
+
|
42
|
+
```console
|
43
|
+
pip3 install pvleopard
|
44
|
+
```
|
45
|
+
|
46
|
+
## AccessKey
|
47
|
+
|
48
|
+
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
|
49
|
+
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
50
|
+
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
51
|
+
|
52
|
+
## Usage
|
53
|
+
|
54
|
+
Create an instance of the engine and transcribe an audio file:
|
55
|
+
|
56
|
+
```python
|
57
|
+
import pvleopard
|
58
|
+
|
59
|
+
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
60
|
+
|
61
|
+
transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
|
62
|
+
print(transcript)
|
63
|
+
for word in words:
|
64
|
+
print(
|
65
|
+
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
|
66
|
+
% (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
|
67
|
+
```
|
68
|
+
|
69
|
+
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
70
|
+
`${AUDIO_FILE_PATH}` to the path an audio file.
|
71
|
+
|
72
|
+
Finally, when done be sure to explicitly release the resources:
|
73
|
+
```python
|
74
|
+
leopard.delete()
|
75
|
+
```
|
76
|
+
|
77
|
+
### Language Model
|
78
|
+
|
79
|
+
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
80
|
+
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
81
|
+
|
82
|
+
Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
|
83
|
+
language models with custom vocabulary and boost words in the existing vocabulary.
|
84
|
+
|
85
|
+
Pass in the `.pv` file via the `model_path` argument:
|
86
|
+
```python
|
87
|
+
leopard = pvleopard.create(
|
88
|
+
access_key='${ACCESS_KEY}',
|
89
|
+
model_path='${MODEL_FILE_PATH}')
|
90
|
+
```
|
91
|
+
|
92
|
+
### Word Metadata
|
93
|
+
|
94
|
+
Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
|
95
|
+
|
96
|
+
- **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
|
97
|
+
- **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
|
98
|
+
- **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
|
99
|
+
- **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
|
100
|
+
|
101
|
+
## Demos
|
102
|
+
|
103
|
+
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
104
|
+
Leopard.
|
@@ -11,7 +11,6 @@ pvleopard.egg-info/SOURCES.txt
|
|
11
11
|
pvleopard.egg-info/dependency_links.txt
|
12
12
|
pvleopard.egg-info/top_level.txt
|
13
13
|
pvleopard/lib/common/leopard_params.pv
|
14
|
-
pvleopard/lib/jetson/cortex-a57-aarch64/libpv_leopard.so
|
15
14
|
pvleopard/lib/linux/x86_64/libpv_leopard.so
|
16
15
|
pvleopard/lib/mac/arm64/libpv_leopard.dylib
|
17
16
|
pvleopard/lib/mac/x86_64/libpv_leopard.dylib
|
@@ -19,4 +18,6 @@ pvleopard/lib/raspberry-pi/cortex-a53/libpv_leopard.so
|
|
19
18
|
pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so
|
20
19
|
pvleopard/lib/raspberry-pi/cortex-a72/libpv_leopard.so
|
21
20
|
pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so
|
21
|
+
pvleopard/lib/raspberry-pi/cortex-a76/libpv_leopard.so
|
22
|
+
pvleopard/lib/raspberry-pi/cortex-a76-aarch64/libpv_leopard.so
|
22
23
|
pvleopard/lib/windows/amd64/libpv_leopard.dll
|
@@ -1,5 +1,5 @@
|
|
1
1
|
#
|
2
|
-
# Copyright 2022-
|
2
|
+
# Copyright 2022-2024 Picovoice Inc.
|
3
3
|
#
|
4
4
|
# You may not use this file except in compliance with the license. A copy of the license is located in the "LICENSE"
|
5
5
|
# file accompanying this source.
|
@@ -15,7 +15,7 @@ import shutil
|
|
15
15
|
import setuptools
|
16
16
|
|
17
17
|
INCLUDE_FILES = ('../../LICENSE', '__init__.py', '_factory.py', '_leopard.py', '_util.py')
|
18
|
-
INCLUDE_LIBS = ('linux', 'mac', 'windows', '
|
18
|
+
INCLUDE_LIBS = ('linux', 'mac', 'windows', 'raspberry-pi')
|
19
19
|
|
20
20
|
os.system('git clean -dfx')
|
21
21
|
|
@@ -48,7 +48,7 @@ with open(os.path.join(os.path.dirname(__file__), 'README.md'), 'r') as f:
|
|
48
48
|
|
49
49
|
setuptools.setup(
|
50
50
|
name="pvleopard",
|
51
|
-
version="2.0.
|
51
|
+
version="2.0.3",
|
52
52
|
author="Picovoice",
|
53
53
|
author_email="hello@picovoice.ai",
|
54
54
|
description="Leopard Speech-to-Text Engine.",
|
@@ -65,6 +65,6 @@ setuptools.setup(
|
|
65
65
|
"Programming Language :: Python :: 3",
|
66
66
|
"Topic :: Multimedia :: Sound/Audio :: Speech"
|
67
67
|
],
|
68
|
-
python_requires='>=3.
|
68
|
+
python_requires='>=3.8',
|
69
69
|
keywords="Speech-to-Text, Speech Recognition, Voice Recognition, ASR, Automatic Speech Recognition",
|
70
70
|
)
|
pvleopard-2.0.1/PKG-INFO
DELETED
@@ -1,97 +0,0 @@
|
|
1
|
-
Metadata-Version: 2.1
|
2
|
-
Name: pvleopard
|
3
|
-
Version: 2.0.1
|
4
|
-
Summary: Leopard Speech-to-Text Engine.
|
5
|
-
Home-page: https://github.com/Picovoice/leopard
|
6
|
-
Author: Picovoice
|
7
|
-
Author-email: hello@picovoice.ai
|
8
|
-
License: UNKNOWN
|
9
|
-
Description: # Leopard Binding for Python
|
10
|
-
|
11
|
-
## Leopard Speech-to-Text Engine
|
12
|
-
|
13
|
-
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
|
14
|
-
|
15
|
-
Leopard is an on-device speech-to-text engine. Leopard is:
|
16
|
-
|
17
|
-
- Private; All voice processing runs locally.
|
18
|
-
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
19
|
-
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
20
|
-
- Cross-Platform:
|
21
|
-
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
|
22
|
-
- Android and iOS
|
23
|
-
- Chrome, Safari, Firefox, and Edge
|
24
|
-
- Raspberry Pi (4, 3) and NVIDIA Jetson Nano
|
25
|
-
|
26
|
-
## Compatibility
|
27
|
-
|
28
|
-
- Python 3.5+
|
29
|
-
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
|
30
|
-
|
31
|
-
## Installation
|
32
|
-
|
33
|
-
```console
|
34
|
-
pip3 install pvleopard
|
35
|
-
```
|
36
|
-
|
37
|
-
## AccessKey
|
38
|
-
|
39
|
-
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
|
40
|
-
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
41
|
-
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
42
|
-
|
43
|
-
### Usage
|
44
|
-
|
45
|
-
Create an instance of the engine and transcribe an audio file:
|
46
|
-
|
47
|
-
```python
|
48
|
-
import pvleopard
|
49
|
-
|
50
|
-
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
51
|
-
|
52
|
-
transcript, words = leopard.process_file('${AUDIO_PATH}')
|
53
|
-
print(transcript)
|
54
|
-
for word in words:
|
55
|
-
print(
|
56
|
-
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
|
57
|
-
% (word.word, word.start_sec, word.end_sec, word.confidence))
|
58
|
-
```
|
59
|
-
|
60
|
-
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
61
|
-
`${AUDIO_PATH}` to the path an audio file.
|
62
|
-
|
63
|
-
Finally, when done be sure to explicitly release the resources:
|
64
|
-
```python
|
65
|
-
leopard.delete()
|
66
|
-
```
|
67
|
-
|
68
|
-
## Language Model
|
69
|
-
|
70
|
-
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
71
|
-
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
72
|
-
|
73
|
-
Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
|
74
|
-
language models with custom vocabulary and boost words in the existing vocabulary.
|
75
|
-
|
76
|
-
Pass in the `.pv` file via the `model_path` argument:
|
77
|
-
```python
|
78
|
-
leopard = pvleopard.create(
|
79
|
-
access_key='${ACCESS_KEY}',
|
80
|
-
model_path='${MODEL_PATH}')
|
81
|
-
```
|
82
|
-
|
83
|
-
## Demos
|
84
|
-
|
85
|
-
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
86
|
-
Leopard.
|
87
|
-
|
88
|
-
Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
|
89
|
-
Platform: UNKNOWN
|
90
|
-
Classifier: Development Status :: 5 - Production/Stable
|
91
|
-
Classifier: Intended Audience :: Developers
|
92
|
-
Classifier: License :: OSI Approved :: Apache Software License
|
93
|
-
Classifier: Operating System :: OS Independent
|
94
|
-
Classifier: Programming Language :: Python :: 3
|
95
|
-
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
|
96
|
-
Requires-Python: >=3.5
|
97
|
-
Description-Content-Type: text/markdown
|
Binary file
|
@@ -1,97 +0,0 @@
|
|
1
|
-
Metadata-Version: 2.1
|
2
|
-
Name: pvleopard
|
3
|
-
Version: 2.0.1
|
4
|
-
Summary: Leopard Speech-to-Text Engine.
|
5
|
-
Home-page: https://github.com/Picovoice/leopard
|
6
|
-
Author: Picovoice
|
7
|
-
Author-email: hello@picovoice.ai
|
8
|
-
License: UNKNOWN
|
9
|
-
Description: # Leopard Binding for Python
|
10
|
-
|
11
|
-
## Leopard Speech-to-Text Engine
|
12
|
-
|
13
|
-
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
|
14
|
-
|
15
|
-
Leopard is an on-device speech-to-text engine. Leopard is:
|
16
|
-
|
17
|
-
- Private; All voice processing runs locally.
|
18
|
-
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
19
|
-
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
20
|
-
- Cross-Platform:
|
21
|
-
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
|
22
|
-
- Android and iOS
|
23
|
-
- Chrome, Safari, Firefox, and Edge
|
24
|
-
- Raspberry Pi (4, 3) and NVIDIA Jetson Nano
|
25
|
-
|
26
|
-
## Compatibility
|
27
|
-
|
28
|
-
- Python 3.5+
|
29
|
-
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
|
30
|
-
|
31
|
-
## Installation
|
32
|
-
|
33
|
-
```console
|
34
|
-
pip3 install pvleopard
|
35
|
-
```
|
36
|
-
|
37
|
-
## AccessKey
|
38
|
-
|
39
|
-
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
|
40
|
-
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
41
|
-
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
42
|
-
|
43
|
-
### Usage
|
44
|
-
|
45
|
-
Create an instance of the engine and transcribe an audio file:
|
46
|
-
|
47
|
-
```python
|
48
|
-
import pvleopard
|
49
|
-
|
50
|
-
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
51
|
-
|
52
|
-
transcript, words = leopard.process_file('${AUDIO_PATH}')
|
53
|
-
print(transcript)
|
54
|
-
for word in words:
|
55
|
-
print(
|
56
|
-
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
|
57
|
-
% (word.word, word.start_sec, word.end_sec, word.confidence))
|
58
|
-
```
|
59
|
-
|
60
|
-
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
61
|
-
`${AUDIO_PATH}` to the path an audio file.
|
62
|
-
|
63
|
-
Finally, when done be sure to explicitly release the resources:
|
64
|
-
```python
|
65
|
-
leopard.delete()
|
66
|
-
```
|
67
|
-
|
68
|
-
## Language Model
|
69
|
-
|
70
|
-
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
71
|
-
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
72
|
-
|
73
|
-
Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
|
74
|
-
language models with custom vocabulary and boost words in the existing vocabulary.
|
75
|
-
|
76
|
-
Pass in the `.pv` file via the `model_path` argument:
|
77
|
-
```python
|
78
|
-
leopard = pvleopard.create(
|
79
|
-
access_key='${ACCESS_KEY}',
|
80
|
-
model_path='${MODEL_PATH}')
|
81
|
-
```
|
82
|
-
|
83
|
-
## Demos
|
84
|
-
|
85
|
-
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
86
|
-
Leopard.
|
87
|
-
|
88
|
-
Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
|
89
|
-
Platform: UNKNOWN
|
90
|
-
Classifier: Development Status :: 5 - Production/Stable
|
91
|
-
Classifier: Intended Audience :: Developers
|
92
|
-
Classifier: License :: OSI Approved :: Apache Software License
|
93
|
-
Classifier: Operating System :: OS Independent
|
94
|
-
Classifier: Programming Language :: Python :: 3
|
95
|
-
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
|
96
|
-
Requires-Python: >=3.5
|
97
|
-
Description-Content-Type: text/markdown
|
|