pvleopard 2.0.1.tar.gz → 2.0.4.tar.gz
- {pvleopard-2.0.1 → pvleopard-2.0.4}/MANIFEST.in +0 -1
- pvleopard-2.0.4/PKG-INFO +104 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/README.md +20 -11
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/_util.py +14 -8
- pvleopard-2.0.4/pvleopard/lib/raspberry-pi/cortex-a76/libpv_leopard.so +0 -0
- pvleopard-2.0.4/pvleopard/lib/raspberry-pi/cortex-a76-aarch64/libpv_leopard.so +0 -0
- pvleopard-2.0.4/pvleopard/lib/windows/arm64/libpv_leopard.dll +0 -0
- pvleopard-2.0.4/pvleopard.egg-info/PKG-INFO +104 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard.egg-info/SOURCES.txt +4 -2
- {pvleopard-2.0.1 → pvleopard-2.0.4}/setup.py +4 -4
- pvleopard-2.0.1/PKG-INFO +0 -97
- pvleopard-2.0.1/pvleopard/lib/jetson/cortex-a57-aarch64/libpv_leopard.so +0 -0
- pvleopard-2.0.1/pvleopard.egg-info/PKG-INFO +0 -97
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/LICENSE +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/__init__.py +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/_factory.py +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/_leopard.py +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/common/leopard_params.pv +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/linux/x86_64/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/mac/arm64/libpv_leopard.dylib +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/mac/x86_64/libpv_leopard.dylib +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a53/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a72/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/windows/amd64/libpv_leopard.dll +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard.egg-info/dependency_links.txt +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard.egg-info/top_level.txt +0 -0
- {pvleopard-2.0.1 → pvleopard-2.0.4}/setup.cfg +0 -0
pvleopard-2.0.4/PKG-INFO
ADDED
@@ -0,0 +1,104 @@
|
|
1
|
+
Metadata-Version: 2.1
|
2
|
+
Name: pvleopard
|
3
|
+
Version: 2.0.4
|
4
|
+
Summary: Leopard Speech-to-Text Engine.
|
5
|
+
Home-page: https://github.com/Picovoice/leopard
|
6
|
+
Author: Picovoice
|
7
|
+
Author-email: hello@picovoice.ai
|
8
|
+
Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
|
9
|
+
Classifier: Development Status :: 5 - Production/Stable
|
10
|
+
Classifier: Intended Audience :: Developers
|
11
|
+
Classifier: License :: OSI Approved :: Apache Software License
|
12
|
+
Classifier: Operating System :: OS Independent
|
13
|
+
Classifier: Programming Language :: Python :: 3
|
14
|
+
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
|
15
|
+
Requires-Python: >=3.8
|
16
|
+
Description-Content-Type: text/markdown
|
17
|
+
|
18
|
+
# Leopard Binding for Python
|
19
|
+
|
20
|
+
## Leopard Speech-to-Text Engine
|
21
|
+
|
22
|
+
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
|
23
|
+
|
24
|
+
Leopard is an on-device speech-to-text engine. Leopard is:
|
25
|
+
|
26
|
+
- Private; All voice processing runs locally.
|
27
|
+
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
28
|
+
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
29
|
+
- Cross-Platform:
|
30
|
+
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64)
|
31
|
+
- Android and iOS
|
32
|
+
- Chrome, Safari, Firefox, and Edge
|
33
|
+
- Raspberry Pi (3, 4, 5)
|
34
|
+
|
35
|
+
## Compatibility
|
36
|
+
|
37
|
+
- Python 3.8+
|
38
|
+
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64), and Raspberry Pi (3, 4, 5).
|
39
|
+
|
40
|
+
## Installation
|
41
|
+
|
42
|
+
```console
|
43
|
+
pip3 install pvleopard
|
44
|
+
```
|
45
|
+
|
46
|
+
## AccessKey
|
47
|
+
|
48
|
+
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
|
49
|
+
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
50
|
+
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
51
|
+
|
52
|
+
## Usage
|
53
|
+
|
54
|
+
Create an instance of the engine and transcribe an audio file:
|
55
|
+
|
56
|
+
```python
|
57
|
+
import pvleopard
|
58
|
+
|
59
|
+
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
60
|
+
|
61
|
+
transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
|
62
|
+
print(transcript)
|
63
|
+
for word in words:
|
64
|
+
print(
|
65
|
+
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
|
66
|
+
% (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
|
67
|
+
```
|
68
|
+
|
69
|
+
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
70
|
+
`${AUDIO_FILE_PATH}` to the path an audio file.
|
71
|
+
|
72
|
+
Finally, when done be sure to explicitly release the resources:
|
73
|
+
```python
|
74
|
+
leopard.delete()
|
75
|
+
```
|
76
|
+
|
77
|
+
### Language Model
|
78
|
+
|
79
|
+
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
80
|
+
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
81
|
+
|
82
|
+
Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
|
83
|
+
language models with custom vocabulary and boost words in the existing vocabulary.
|
84
|
+
|
85
|
+
Pass in the `.pv` file via the `model_path` argument:
|
86
|
+
```python
|
87
|
+
leopard = pvleopard.create(
|
88
|
+
access_key='${ACCESS_KEY}',
|
89
|
+
model_path='${MODEL_FILE_PATH}')
|
90
|
+
```
|
91
|
+
|
92
|
+
### Word Metadata
|
93
|
+
|
94
|
+
Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
|
95
|
+
|
96
|
+
- **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
|
97
|
+
- **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
|
98
|
+
- **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
|
99
|
+
- **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
|
100
|
+
|
101
|
+
## Demos
|
102
|
+
|
103
|
+
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
104
|
+
Leopard.
|
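Note on the Word Metadata section added to PKG-INFO above: the new `speaker_tag` field makes it possible to split a transcript by speaker. The following is a minimal sketch, not part of the package. The `Word` namedtuple is a hypothetical stand-in for the word objects returned by `process_file` (its field names are taken from the README snippet), and `group_by_speaker` and `min_confidence` are illustrative names, so the example runs without an `AccessKey` or an audio file.

```python
from collections import defaultdict, namedtuple

# Hypothetical stand-in for the word objects returned by `process_file`;
# the field names mirror the ones used in the README usage example.
Word = namedtuple("Word", ["word", "start_sec", "end_sec", "confidence", "speaker_tag"])


def group_by_speaker(words, min_confidence=0.0):
    """Return {speaker_tag: "joined words"} for words at or above min_confidence."""
    grouped = defaultdict(list)
    for w in words:
        if w.confidence >= min_confidence:
            grouped[w.speaker_tag].append(w.word)
    return {tag: " ".join(tokens) for tag, tokens in grouped.items()}


# Made-up sample data for illustration only.
sample = [
    Word("hello", 0.10, 0.42, 0.95, 1),
    Word("there", 0.45, 0.80, 0.40, 1),
    Word("hi", 1.00, 1.20, 0.90, 2),
]
print(group_by_speaker(sample, min_confidence=0.5))
```

Per the README, every word carries a tag of `-1` when diarization is not enabled, so the same helper would then return a single group keyed by `-1`.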
@@ -10,15 +10,15 @@ Leopard is an on-device speech-to-text engine. Leopard is:
|
|
10
10
|
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
11
11
|
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
12
12
|
- Cross-Platform:
|
13
|
-
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
|
13
|
+
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64)
|
14
14
|
- Android and iOS
|
15
15
|
- Chrome, Safari, Firefox, and Edge
|
16
|
-
- Raspberry Pi (4,
|
16
|
+
- Raspberry Pi (3, 4, 5)
|
17
17
|
|
18
18
|
## Compatibility
|
19
19
|
|
20
|
-
- Python 3.
|
21
|
-
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4,
|
20
|
+
- Python 3.8+
|
21
|
+
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64), and Raspberry Pi (3, 4, 5).
|
22
22
|
|
23
23
|
## Installation
|
24
24
|
|
@@ -32,7 +32,7 @@ Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` ac
|
|
32
32
|
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
33
33
|
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
34
34
|
|
35
|
-
|
35
|
+
## Usage
|
36
36
|
|
37
37
|
Create an instance of the engine and transcribe an audio file:
|
38
38
|
|
@@ -41,23 +41,23 @@ import pvleopard
|
|
41
41
|
|
42
42
|
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
43
43
|
|
44
|
-
transcript, words = leopard.process_file('${
|
44
|
+
transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
|
45
45
|
print(transcript)
|
46
46
|
for word in words:
|
47
47
|
print(
|
48
|
-
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
|
49
|
-
% (word.word, word.start_sec, word.end_sec, word.confidence))
|
48
|
+
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
|
49
|
+
% (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
|
50
50
|
```
|
51
51
|
|
52
52
|
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
53
|
-
`${
|
53
|
+
`${AUDIO_FILE_PATH}` to the path an audio file.
|
54
54
|
|
55
55
|
Finally, when done be sure to explicitly release the resources:
|
56
56
|
```python
|
57
57
|
leopard.delete()
|
58
58
|
```
|
59
59
|
|
60
|
-
|
60
|
+
### Language Model
|
61
61
|
|
62
62
|
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
63
63
|
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
@@ -69,9 +69,18 @@ Pass in the `.pv` file via the `model_path` argument:
|
|
69
69
|
```python
|
70
70
|
leopard = pvleopard.create(
|
71
71
|
access_key='${ACCESS_KEY}',
|
72
|
-
model_path='${
|
72
|
+
model_path='${MODEL_FILE_PATH}')
|
73
73
|
```
|
74
74
|
|
75
|
+
### Word Metadata
|
76
|
+
|
77
|
+
Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
|
78
|
+
|
79
|
+
- **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
|
80
|
+
- **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
|
81
|
+
- **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
|
82
|
+
- **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
|
83
|
+
|
75
84
|
## Demos
|
76
85
|
|
77
86
|
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
@@ -1,5 +1,5 @@
|
|
1
1
|
#
|
2
|
-
# Copyright 2022-
|
2
|
+
# Copyright 2022-2024 Picovoice Inc.
|
3
3
|
#
|
4
4
|
# You may not use this file except in compliance with the license. A copy of the license is located in the "LICENSE"
|
5
5
|
# file accompanying this source.
|
@@ -37,16 +37,21 @@ def _linux_machine():
|
|
37
37
|
|
38
38
|
if '0xd03' == cpu_part:
|
39
39
|
return 'cortex-a53' + arch_info
|
40
|
-
elif '0xd07' == cpu_part:
|
41
|
-
return 'cortex-a57' + arch_info
|
42
40
|
elif '0xd08' == cpu_part:
|
43
41
|
return 'cortex-a72' + arch_info
|
42
|
+
elif "0xd0b" == cpu_part:
|
43
|
+
return "cortex-a76" + arch_info
|
44
44
|
else:
|
45
45
|
raise NotImplementedError("Unsupported CPU: `%s`." % cpu_part)
|
46
46
|
|
47
47
|
|
48
|
-
_RASPBERRY_PI_MACHINES = {
|
49
|
-
|
48
|
+
_RASPBERRY_PI_MACHINES = {
|
49
|
+
"cortex-a53",
|
50
|
+
"cortex-a72",
|
51
|
+
"cortex-a76",
|
52
|
+
"cortex-a53-aarch64",
|
53
|
+
"cortex-a72-aarch64",
|
54
|
+
"cortex-a76-aarch64"}
|
50
55
|
|
51
56
|
|
52
57
|
def default_library_path(relative):
|
@@ -59,15 +64,16 @@ def default_library_path(relative):
|
|
59
64
|
linux_machine = _linux_machine()
|
60
65
|
if linux_machine == 'x86_64':
|
61
66
|
return os.path.join(os.path.dirname(__file__), relative, 'lib/linux/x86_64/libpv_leopard.so')
|
62
|
-
elif linux_machine in _JETSON_MACHINES:
|
63
|
-
return os.path.join(os.path.dirname(__file__), relative, 'lib/jetson/%s/libpv_leopard.so' % linux_machine)
|
64
67
|
elif linux_machine in _RASPBERRY_PI_MACHINES:
|
65
68
|
return os.path.join(
|
66
69
|
os.path.dirname(__file__),
|
67
70
|
relative,
|
68
71
|
'lib/raspberry-pi/%s/libpv_leopard.so' % linux_machine)
|
69
72
|
elif platform.system() == 'Windows':
|
70
|
-
|
73
|
+
if platform.machine().lower() == 'amd64':
|
74
|
+
return os.path.join(os.path.dirname(__file__), relative, 'lib/windows/amd64/libpv_leopard.dll')
|
75
|
+
elif platform.machine().lower() == 'arm64':
|
76
|
+
return os.path.join(os.path.dirname(__file__), relative, 'lib/windows/arm64/libpv_leopard.dll')
|
71
77
|
|
72
78
|
raise NotImplementedError('Unsupported platform.')
|
73
79
|
|
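Note on the `_util.py` hunks above: the net effect of 2.0.4 is that the Jetson (`0xd07`, cortex-a57) branch is gone, Raspberry Pi 5 (`0xd0b`, cortex-a76) is recognized, and Windows now selects between `amd64` and `arm64`. The sketch below restates that selection logic as a pure function so it can be read and tested in isolation. It is an illustration, not the package's code: `relative_library_path` and its parameters are hypothetical, the macOS branch is omitted because it does not appear in the hunks, and the way the `-aarch64` suffix is derived from `machine` is an assumption, since the diff only shows `arch_info` being appended.

```python
# CPU part IDs and core names copied from the 2.0.4 `_linux_machine` hunk.
_CPU_PARTS = {"0xd03": "cortex-a53", "0xd08": "cortex-a72", "0xd0b": "cortex-a76"}


def relative_library_path(system, machine, cpu_part=None):
    """Hypothetical helper: map a platform description to a bundled library path."""
    machine = machine.lower()
    if system == "Windows":
        if machine in ("amd64", "arm64"):
            return "lib/windows/%s/libpv_leopard.dll" % machine
    elif system == "Linux":
        if machine == "x86_64":
            return "lib/linux/x86_64/libpv_leopard.so"
        if cpu_part in _CPU_PARTS:
            # Assumption: 64-bit ARM builds live in the "-aarch64" directories.
            arch_info = "-aarch64" if machine == "aarch64" else ""
            return "lib/raspberry-pi/%s%s/libpv_leopard.so" % (_CPU_PARTS[cpu_part], arch_info)
    # Anything else, including the removed Jetson part 0xd07, is unsupported.
    raise NotImplementedError("Unsupported platform.")


print(relative_library_path("Linux", "aarch64", "0xd0b"))
print(relative_library_path("Windows", "ARM64"))
```

Each returned path corresponds to an entry in the 2.0.4 file list at the top of this page.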
Binary file
|
Binary file
|
@@ -0,0 +1,104 @@
|
|
1
|
+
Metadata-Version: 2.1
|
2
|
+
Name: pvleopard
|
3
|
+
Version: 2.0.4
|
4
|
+
Summary: Leopard Speech-to-Text Engine.
|
5
|
+
Home-page: https://github.com/Picovoice/leopard
|
6
|
+
Author: Picovoice
|
7
|
+
Author-email: hello@picovoice.ai
|
8
|
+
Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
|
9
|
+
Classifier: Development Status :: 5 - Production/Stable
|
10
|
+
Classifier: Intended Audience :: Developers
|
11
|
+
Classifier: License :: OSI Approved :: Apache Software License
|
12
|
+
Classifier: Operating System :: OS Independent
|
13
|
+
Classifier: Programming Language :: Python :: 3
|
14
|
+
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
|
15
|
+
Requires-Python: >=3.8
|
16
|
+
Description-Content-Type: text/markdown
|
17
|
+
|
18
|
+
# Leopard Binding for Python
|
19
|
+
|
20
|
+
## Leopard Speech-to-Text Engine
|
21
|
+
|
22
|
+
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
|
23
|
+
|
24
|
+
Leopard is an on-device speech-to-text engine. Leopard is:
|
25
|
+
|
26
|
+
- Private; All voice processing runs locally.
|
27
|
+
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
28
|
+
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
29
|
+
- Cross-Platform:
|
30
|
+
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64)
|
31
|
+
- Android and iOS
|
32
|
+
- Chrome, Safari, Firefox, and Edge
|
33
|
+
- Raspberry Pi (3, 4, 5)
|
34
|
+
|
35
|
+
## Compatibility
|
36
|
+
|
37
|
+
- Python 3.8+
|
38
|
+
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64), and Raspberry Pi (3, 4, 5).
|
39
|
+
|
40
|
+
## Installation
|
41
|
+
|
42
|
+
```console
|
43
|
+
pip3 install pvleopard
|
44
|
+
```
|
45
|
+
|
46
|
+
## AccessKey
|
47
|
+
|
48
|
+
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
|
49
|
+
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
50
|
+
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
51
|
+
|
52
|
+
## Usage
|
53
|
+
|
54
|
+
Create an instance of the engine and transcribe an audio file:
|
55
|
+
|
56
|
+
```python
|
57
|
+
import pvleopard
|
58
|
+
|
59
|
+
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
60
|
+
|
61
|
+
transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
|
62
|
+
print(transcript)
|
63
|
+
for word in words:
|
64
|
+
print(
|
65
|
+
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
|
66
|
+
% (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
|
67
|
+
```
|
68
|
+
|
69
|
+
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
70
|
+
`${AUDIO_FILE_PATH}` to the path an audio file.
|
71
|
+
|
72
|
+
Finally, when done be sure to explicitly release the resources:
|
73
|
+
```python
|
74
|
+
leopard.delete()
|
75
|
+
```
|
76
|
+
|
77
|
+
### Language Model
|
78
|
+
|
79
|
+
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
80
|
+
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
81
|
+
|
82
|
+
Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
|
83
|
+
language models with custom vocabulary and boost words in the existing vocabulary.
|
84
|
+
|
85
|
+
Pass in the `.pv` file via the `model_path` argument:
|
86
|
+
```python
|
87
|
+
leopard = pvleopard.create(
|
88
|
+
access_key='${ACCESS_KEY}',
|
89
|
+
model_path='${MODEL_FILE_PATH}')
|
90
|
+
```
|
91
|
+
|
92
|
+
### Word Metadata
|
93
|
+
|
94
|
+
Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
|
95
|
+
|
96
|
+
- **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
|
97
|
+
- **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
|
98
|
+
- **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
|
99
|
+
- **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
|
100
|
+
|
101
|
+
## Demos
|
102
|
+
|
103
|
+
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
104
|
+
Leopard.
|
@@ -11,7 +11,6 @@ pvleopard.egg-info/SOURCES.txt
|
|
11
11
|
pvleopard.egg-info/dependency_links.txt
|
12
12
|
pvleopard.egg-info/top_level.txt
|
13
13
|
pvleopard/lib/common/leopard_params.pv
|
14
|
-
pvleopard/lib/jetson/cortex-a57-aarch64/libpv_leopard.so
|
15
14
|
pvleopard/lib/linux/x86_64/libpv_leopard.so
|
16
15
|
pvleopard/lib/mac/arm64/libpv_leopard.dylib
|
17
16
|
pvleopard/lib/mac/x86_64/libpv_leopard.dylib
|
@@ -19,4 +18,7 @@ pvleopard/lib/raspberry-pi/cortex-a53/libpv_leopard.so
|
|
19
18
|
pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so
|
20
19
|
pvleopard/lib/raspberry-pi/cortex-a72/libpv_leopard.so
|
21
20
|
pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so
|
22
|
-
pvleopard/lib/
|
21
|
+
pvleopard/lib/raspberry-pi/cortex-a76/libpv_leopard.so
|
22
|
+
pvleopard/lib/raspberry-pi/cortex-a76-aarch64/libpv_leopard.so
|
23
|
+
pvleopard/lib/windows/amd64/libpv_leopard.dll
|
24
|
+
pvleopard/lib/windows/arm64/libpv_leopard.dll
|
@@ -1,5 +1,5 @@
|
|
1
1
|
#
|
2
|
-
# Copyright 2022-
|
2
|
+
# Copyright 2022-2024 Picovoice Inc.
|
3
3
|
#
|
4
4
|
# You may not use this file except in compliance with the license. A copy of the license is located in the "LICENSE"
|
5
5
|
# file accompanying this source.
|
@@ -15,7 +15,7 @@ import shutil
|
|
15
15
|
import setuptools
|
16
16
|
|
17
17
|
INCLUDE_FILES = ('../../LICENSE', '__init__.py', '_factory.py', '_leopard.py', '_util.py')
|
18
|
-
INCLUDE_LIBS = ('linux', 'mac', 'windows', '
|
18
|
+
INCLUDE_LIBS = ('linux', 'mac', 'windows', 'raspberry-pi')
|
19
19
|
|
20
20
|
os.system('git clean -dfx')
|
21
21
|
|
@@ -48,7 +48,7 @@ with open(os.path.join(os.path.dirname(__file__), 'README.md'), 'r') as f:
|
|
48
48
|
|
49
49
|
setuptools.setup(
|
50
50
|
name="pvleopard",
|
51
|
-
version="2.0.
|
51
|
+
version="2.0.4",
|
52
52
|
author="Picovoice",
|
53
53
|
author_email="hello@picovoice.ai",
|
54
54
|
description="Leopard Speech-to-Text Engine.",
|
@@ -65,6 +65,6 @@ setuptools.setup(
|
|
65
65
|
"Programming Language :: Python :: 3",
|
66
66
|
"Topic :: Multimedia :: Sound/Audio :: Speech"
|
67
67
|
],
|
68
|
-
python_requires='>=3.
|
68
|
+
python_requires='>=3.8',
|
69
69
|
keywords="Speech-to-Text, Speech Recognition, Voice Recognition, ASR, Automatic Speech Recognition",
|
70
70
|
)
|
pvleopard-2.0.1/PKG-INFO
DELETED
@@ -1,97 +0,0 @@
|
|
1
|
-
Metadata-Version: 2.1
|
2
|
-
Name: pvleopard
|
3
|
-
Version: 2.0.1
|
4
|
-
Summary: Leopard Speech-to-Text Engine.
|
5
|
-
Home-page: https://github.com/Picovoice/leopard
|
6
|
-
Author: Picovoice
|
7
|
-
Author-email: hello@picovoice.ai
|
8
|
-
License: UNKNOWN
|
9
|
-
Description: # Leopard Binding for Python
|
10
|
-
|
11
|
-
## Leopard Speech-to-Text Engine
|
12
|
-
|
13
|
-
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
|
14
|
-
|
15
|
-
Leopard is an on-device speech-to-text engine. Leopard is:
|
16
|
-
|
17
|
-
- Private; All voice processing runs locally.
|
18
|
-
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
19
|
-
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
20
|
-
- Cross-Platform:
|
21
|
-
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
|
22
|
-
- Android and iOS
|
23
|
-
- Chrome, Safari, Firefox, and Edge
|
24
|
-
- Raspberry Pi (4, 3) and NVIDIA Jetson Nano
|
25
|
-
|
26
|
-
## Compatibility
|
27
|
-
|
28
|
-
- Python 3.5+
|
29
|
-
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
|
30
|
-
|
31
|
-
## Installation
|
32
|
-
|
33
|
-
```console
|
34
|
-
pip3 install pvleopard
|
35
|
-
```
|
36
|
-
|
37
|
-
## AccessKey
|
38
|
-
|
39
|
-
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
|
40
|
-
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
41
|
-
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
42
|
-
|
43
|
-
### Usage
|
44
|
-
|
45
|
-
Create an instance of the engine and transcribe an audio file:
|
46
|
-
|
47
|
-
```python
|
48
|
-
import pvleopard
|
49
|
-
|
50
|
-
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
51
|
-
|
52
|
-
transcript, words = leopard.process_file('${AUDIO_PATH}')
|
53
|
-
print(transcript)
|
54
|
-
for word in words:
|
55
|
-
print(
|
56
|
-
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
|
57
|
-
% (word.word, word.start_sec, word.end_sec, word.confidence))
|
58
|
-
```
|
59
|
-
|
60
|
-
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
61
|
-
`${AUDIO_PATH}` to the path an audio file.
|
62
|
-
|
63
|
-
Finally, when done be sure to explicitly release the resources:
|
64
|
-
```python
|
65
|
-
leopard.delete()
|
66
|
-
```
|
67
|
-
|
68
|
-
## Language Model
|
69
|
-
|
70
|
-
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
71
|
-
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
72
|
-
|
73
|
-
Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
|
74
|
-
language models with custom vocabulary and boost words in the existing vocabulary.
|
75
|
-
|
76
|
-
Pass in the `.pv` file via the `model_path` argument:
|
77
|
-
```python
|
78
|
-
leopard = pvleopard.create(
|
79
|
-
access_key='${ACCESS_KEY}',
|
80
|
-
model_path='${MODEL_PATH}')
|
81
|
-
```
|
82
|
-
|
83
|
-
## Demos
|
84
|
-
|
85
|
-
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
86
|
-
Leopard.
|
87
|
-
|
88
|
-
Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
|
89
|
-
Platform: UNKNOWN
|
90
|
-
Classifier: Development Status :: 5 - Production/Stable
|
91
|
-
Classifier: Intended Audience :: Developers
|
92
|
-
Classifier: License :: OSI Approved :: Apache Software License
|
93
|
-
Classifier: Operating System :: OS Independent
|
94
|
-
Classifier: Programming Language :: Python :: 3
|
95
|
-
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
|
96
|
-
Requires-Python: >=3.5
|
97
|
-
Description-Content-Type: text/markdown
|
Binary file
|
@@ -1,97 +0,0 @@
|
|
1
|
-
Metadata-Version: 2.1
|
2
|
-
Name: pvleopard
|
3
|
-
Version: 2.0.1
|
4
|
-
Summary: Leopard Speech-to-Text Engine.
|
5
|
-
Home-page: https://github.com/Picovoice/leopard
|
6
|
-
Author: Picovoice
|
7
|
-
Author-email: hello@picovoice.ai
|
8
|
-
License: UNKNOWN
|
9
|
-
Description: # Leopard Binding for Python
|
10
|
-
|
11
|
-
## Leopard Speech-to-Text Engine
|
12
|
-
|
13
|
-
Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
|
14
|
-
|
15
|
-
Leopard is an on-device speech-to-text engine. Leopard is:
|
16
|
-
|
17
|
-
- Private; All voice processing runs locally.
|
18
|
-
- [Accurate](https://picovoice.ai/docs/benchmark/stt/)
|
19
|
-
- [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
|
20
|
-
- Cross-Platform:
|
21
|
-
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
|
22
|
-
- Android and iOS
|
23
|
-
- Chrome, Safari, Firefox, and Edge
|
24
|
-
- Raspberry Pi (4, 3) and NVIDIA Jetson Nano
|
25
|
-
|
26
|
-
## Compatibility
|
27
|
-
|
28
|
-
- Python 3.5+
|
29
|
-
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
|
30
|
-
|
31
|
-
## Installation
|
32
|
-
|
33
|
-
```console
|
34
|
-
pip3 install pvleopard
|
35
|
-
```
|
36
|
-
|
37
|
-
## AccessKey
|
38
|
-
|
39
|
-
Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
|
40
|
-
You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
|
41
|
-
Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
|
42
|
-
|
43
|
-
### Usage
|
44
|
-
|
45
|
-
Create an instance of the engine and transcribe an audio file:
|
46
|
-
|
47
|
-
```python
|
48
|
-
import pvleopard
|
49
|
-
|
50
|
-
leopard = pvleopard.create(access_key='${ACCESS_KEY}')
|
51
|
-
|
52
|
-
transcript, words = leopard.process_file('${AUDIO_PATH}')
|
53
|
-
print(transcript)
|
54
|
-
for word in words:
|
55
|
-
print(
|
56
|
-
"{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
|
57
|
-
% (word.word, word.start_sec, word.end_sec, word.confidence))
|
58
|
-
```
|
59
|
-
|
60
|
-
Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
|
61
|
-
`${AUDIO_PATH}` to the path an audio file.
|
62
|
-
|
63
|
-
Finally, when done be sure to explicitly release the resources:
|
64
|
-
```python
|
65
|
-
leopard.delete()
|
66
|
-
```
|
67
|
-
|
68
|
-
## Language Model
|
69
|
-
|
70
|
-
The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
|
71
|
-
Default models for other supported languages can be found in [lib/common](../../lib/common).
|
72
|
-
|
73
|
-
Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
|
74
|
-
language models with custom vocabulary and boost words in the existing vocabulary.
|
75
|
-
|
76
|
-
Pass in the `.pv` file via the `model_path` argument:
|
77
|
-
```python
|
78
|
-
leopard = pvleopard.create(
|
79
|
-
access_key='${ACCESS_KEY}',
|
80
|
-
model_path='${MODEL_PATH}')
|
81
|
-
```
|
82
|
-
|
83
|
-
## Demos
|
84
|
-
|
85
|
-
[pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
|
86
|
-
Leopard.
|
87
|
-
|
88
|
-
Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
|
89
|
-
Platform: UNKNOWN
|
90
|
-
Classifier: Development Status :: 5 - Production/Stable
|
91
|
-
Classifier: Intended Audience :: Developers
|
92
|
-
Classifier: License :: OSI Approved :: Apache Software License
|
93
|
-
Classifier: Operating System :: OS Independent
|
94
|
-
Classifier: Programming Language :: Python :: 3
|
95
|
-
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
|
96
|
-
Requires-Python: >=3.5
|
97
|
-
Description-Content-Type: text/markdown
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|
{pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so
RENAMED
File without changes
|
File without changes
|
{pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so
RENAMED
File without changes
|
File without changes
|
File without changes
|
File without changes
|
File without changes
|