pvleopard 2.0.1__tar.gz → 2.0.3__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (28) hide show
  1. {pvleopard-2.0.1 → pvleopard-2.0.3}/MANIFEST.in +0 -1
  2. pvleopard-2.0.3/PKG-INFO +104 -0
  3. {pvleopard-2.0.1 → pvleopard-2.0.3}/README.md +19 -10
  4. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/_util.py +10 -7
  5. pvleopard-2.0.3/pvleopard/lib/raspberry-pi/cortex-a76/libpv_leopard.so +0 -0
  6. pvleopard-2.0.3/pvleopard/lib/raspberry-pi/cortex-a76-aarch64/libpv_leopard.so +0 -0
  7. pvleopard-2.0.3/pvleopard.egg-info/PKG-INFO +104 -0
  8. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard.egg-info/SOURCES.txt +2 -1
  9. {pvleopard-2.0.1 → pvleopard-2.0.3}/setup.py +4 -4
  10. pvleopard-2.0.1/PKG-INFO +0 -97
  11. pvleopard-2.0.1/pvleopard/lib/jetson/cortex-a57-aarch64/libpv_leopard.so +0 -0
  12. pvleopard-2.0.1/pvleopard.egg-info/PKG-INFO +0 -97
  13. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/LICENSE +0 -0
  14. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/__init__.py +0 -0
  15. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/_factory.py +0 -0
  16. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/_leopard.py +0 -0
  17. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/common/leopard_params.pv +0 -0
  18. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/linux/x86_64/libpv_leopard.so +0 -0
  19. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/mac/arm64/libpv_leopard.dylib +0 -0
  20. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/mac/x86_64/libpv_leopard.dylib +0 -0
  21. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/raspberry-pi/cortex-a53/libpv_leopard.so +0 -0
  22. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so +0 -0
  23. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/raspberry-pi/cortex-a72/libpv_leopard.so +0 -0
  24. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so +0 -0
  25. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard/lib/windows/amd64/libpv_leopard.dll +0 -0
  26. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard.egg-info/dependency_links.txt +0 -0
  27. {pvleopard-2.0.1 → pvleopard-2.0.3}/pvleopard.egg-info/top_level.txt +0 -0
  28. {pvleopard-2.0.1 → pvleopard-2.0.3}/setup.cfg +0 -0
@@ -7,5 +7,4 @@ include pvleopard/lib/common/leopard_params.pv
7
7
  recursive-include pvleopard/lib/linux *
8
8
  recursive-include pvleopard/lib/mac *
9
9
  recursive-include pvleopard/lib/windows *
10
- recursive-include pvleopard/lib/jetson *
11
10
  recursive-include pvleopard/lib/raspberry-pi *
@@ -0,0 +1,104 @@
1
+ Metadata-Version: 2.1
2
+ Name: pvleopard
3
+ Version: 2.0.3
4
+ Summary: Leopard Speech-to-Text Engine.
5
+ Home-page: https://github.com/Picovoice/leopard
6
+ Author: Picovoice
7
+ Author-email: hello@picovoice.ai
8
+ Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
9
+ Classifier: Development Status :: 5 - Production/Stable
10
+ Classifier: Intended Audience :: Developers
11
+ Classifier: License :: OSI Approved :: Apache Software License
12
+ Classifier: Operating System :: OS Independent
13
+ Classifier: Programming Language :: Python :: 3
14
+ Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
15
+ Requires-Python: >=3.8
16
+ Description-Content-Type: text/markdown
17
+
18
+ # Leopard Binding for Python
19
+
20
+ ## Leopard Speech-to-Text Engine
21
+
22
+ Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
23
+
24
+ Leopard is an on-device speech-to-text engine. Leopard is:
25
+
26
+ - Private; All voice processing runs locally.
27
+ - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
28
+ - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
29
+ - Cross-Platform:
30
+ - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
31
+ - Android and iOS
32
+ - Chrome, Safari, Firefox, and Edge
33
+ - Raspberry Pi (3, 4, 5)
34
+
35
+ ## Compatibility
36
+
37
+ - Python 3.8+
38
+ - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), and Raspberry Pi (3, 4, 5).
39
+
40
+ ## Installation
41
+
42
+ ```console
43
+ pip3 install pvleopard
44
+ ```
45
+
46
+ ## AccessKey
47
+
48
+ Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
49
+ You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
50
+ Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
51
+
52
+ ## Usage
53
+
54
+ Create an instance of the engine and transcribe an audio file:
55
+
56
+ ```python
57
+ import pvleopard
58
+
59
+ leopard = pvleopard.create(access_key='${ACCESS_KEY}')
60
+
61
+ transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
62
+ print(transcript)
63
+ for word in words:
64
+ print(
65
+ "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
66
+ % (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
67
+ ```
68
+
69
+ Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
70
+ `${AUDIO_FILE_PATH}` to the path an audio file.
71
+
72
+ Finally, when done be sure to explicitly release the resources:
73
+ ```python
74
+ leopard.delete()
75
+ ```
76
+
77
+ ### Language Model
78
+
79
+ The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
80
+ Default models for other supported languages can be found in [lib/common](../../lib/common).
81
+
82
+ Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
83
+ language models with custom vocabulary and boost words in the existing vocabulary.
84
+
85
+ Pass in the `.pv` file via the `model_path` argument:
86
+ ```python
87
+ leopard = pvleopard.create(
88
+ access_key='${ACCESS_KEY}',
89
+ model_path='${MODEL_FILE_PATH}')
90
+ ```
91
+
92
+ ### Word Metadata
93
+
94
+ Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
95
+
96
+ - **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
97
+ - **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
98
+ - **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
99
+ - **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
100
+
101
+ ## Demos
102
+
103
+ [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
104
+ Leopard.
@@ -13,12 +13,12 @@ Leopard is an on-device speech-to-text engine. Leopard is:
13
13
  - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
14
14
  - Android and iOS
15
15
  - Chrome, Safari, Firefox, and Edge
16
- - Raspberry Pi (4, 3) and NVIDIA Jetson Nano
16
+ - Raspberry Pi (3, 4, 5)
17
17
 
18
18
  ## Compatibility
19
19
 
20
- - Python 3.5+
21
- - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
20
+ - Python 3.8+
21
+ - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), and Raspberry Pi (3, 4, 5).
22
22
 
23
23
  ## Installation
24
24
 
@@ -32,7 +32,7 @@ Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` ac
32
32
  You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
33
33
  Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
34
34
 
35
- ### Usage
35
+ ## Usage
36
36
 
37
37
  Create an instance of the engine and transcribe an audio file:
38
38
 
@@ -41,23 +41,23 @@ import pvleopard
41
41
 
42
42
  leopard = pvleopard.create(access_key='${ACCESS_KEY}')
43
43
 
44
- transcript, words = leopard.process_file('${AUDIO_PATH}')
44
+ transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
45
45
  print(transcript)
46
46
  for word in words:
47
47
  print(
48
- "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
49
- % (word.word, word.start_sec, word.end_sec, word.confidence))
48
+ "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
49
+ % (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
50
50
  ```
51
51
 
52
52
  Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
53
- `${AUDIO_PATH}` to the path an audio file.
53
+ `${AUDIO_FILE_PATH}` to the path an audio file.
54
54
 
55
55
  Finally, when done be sure to explicitly release the resources:
56
56
  ```python
57
57
  leopard.delete()
58
58
  ```
59
59
 
60
- ## Language Model
60
+ ### Language Model
61
61
 
62
62
  The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
63
63
  Default models for other supported languages can be found in [lib/common](../../lib/common).
@@ -69,9 +69,18 @@ Pass in the `.pv` file via the `model_path` argument:
69
69
  ```python
70
70
  leopard = pvleopard.create(
71
71
  access_key='${ACCESS_KEY}',
72
- model_path='${MODEL_PATH}')
72
+ model_path='${MODEL_FILE_PATH}')
73
73
  ```
74
74
 
75
+ ### Word Metadata
76
+
77
+ Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
78
+
79
+ - **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
80
+ - **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
81
+ - **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
82
+ - **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
83
+
75
84
  ## Demos
76
85
 
77
86
  [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
@@ -1,5 +1,5 @@
1
1
  #
2
- # Copyright 2022-2023 Picovoice Inc.
2
+ # Copyright 2022-2024 Picovoice Inc.
3
3
  #
4
4
  # You may not use this file except in compliance with the license. A copy of the license is located in the "LICENSE"
5
5
  # file accompanying this source.
@@ -37,16 +37,21 @@ def _linux_machine():
37
37
 
38
38
  if '0xd03' == cpu_part:
39
39
  return 'cortex-a53' + arch_info
40
- elif '0xd07' == cpu_part:
41
- return 'cortex-a57' + arch_info
42
40
  elif '0xd08' == cpu_part:
43
41
  return 'cortex-a72' + arch_info
42
+ elif "0xd0b" == cpu_part:
43
+ return "cortex-a76" + arch_info
44
44
  else:
45
45
  raise NotImplementedError("Unsupported CPU: `%s`." % cpu_part)
46
46
 
47
47
 
48
- _RASPBERRY_PI_MACHINES = {'cortex-a53', 'cortex-a72', 'cortex-a53-aarch64', 'cortex-a72-aarch64'}
49
- _JETSON_MACHINES = {'cortex-a57-aarch64'}
48
+ _RASPBERRY_PI_MACHINES = {
49
+ "cortex-a53",
50
+ "cortex-a72",
51
+ "cortex-a76",
52
+ "cortex-a53-aarch64",
53
+ "cortex-a72-aarch64",
54
+ "cortex-a76-aarch64"}
50
55
 
51
56
 
52
57
  def default_library_path(relative):
@@ -59,8 +64,6 @@ def default_library_path(relative):
59
64
  linux_machine = _linux_machine()
60
65
  if linux_machine == 'x86_64':
61
66
  return os.path.join(os.path.dirname(__file__), relative, 'lib/linux/x86_64/libpv_leopard.so')
62
- elif linux_machine in _JETSON_MACHINES:
63
- return os.path.join(os.path.dirname(__file__), relative, 'lib/jetson/%s/libpv_leopard.so' % linux_machine)
64
67
  elif linux_machine in _RASPBERRY_PI_MACHINES:
65
68
  return os.path.join(
66
69
  os.path.dirname(__file__),
@@ -0,0 +1,104 @@
1
+ Metadata-Version: 2.1
2
+ Name: pvleopard
3
+ Version: 2.0.3
4
+ Summary: Leopard Speech-to-Text Engine.
5
+ Home-page: https://github.com/Picovoice/leopard
6
+ Author: Picovoice
7
+ Author-email: hello@picovoice.ai
8
+ Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
9
+ Classifier: Development Status :: 5 - Production/Stable
10
+ Classifier: Intended Audience :: Developers
11
+ Classifier: License :: OSI Approved :: Apache Software License
12
+ Classifier: Operating System :: OS Independent
13
+ Classifier: Programming Language :: Python :: 3
14
+ Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
15
+ Requires-Python: >=3.8
16
+ Description-Content-Type: text/markdown
17
+
18
+ # Leopard Binding for Python
19
+
20
+ ## Leopard Speech-to-Text Engine
21
+
22
+ Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
23
+
24
+ Leopard is an on-device speech-to-text engine. Leopard is:
25
+
26
+ - Private; All voice processing runs locally.
27
+ - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
28
+ - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
29
+ - Cross-Platform:
30
+ - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
31
+ - Android and iOS
32
+ - Chrome, Safari, Firefox, and Edge
33
+ - Raspberry Pi (3, 4, 5)
34
+
35
+ ## Compatibility
36
+
37
+ - Python 3.8+
38
+ - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), and Raspberry Pi (3, 4, 5).
39
+
40
+ ## Installation
41
+
42
+ ```console
43
+ pip3 install pvleopard
44
+ ```
45
+
46
+ ## AccessKey
47
+
48
+ Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
49
+ You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
50
+ Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
51
+
52
+ ## Usage
53
+
54
+ Create an instance of the engine and transcribe an audio file:
55
+
56
+ ```python
57
+ import pvleopard
58
+
59
+ leopard = pvleopard.create(access_key='${ACCESS_KEY}')
60
+
61
+ transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
62
+ print(transcript)
63
+ for word in words:
64
+ print(
65
+ "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
66
+ % (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
67
+ ```
68
+
69
+ Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
70
+ `${AUDIO_FILE_PATH}` to the path an audio file.
71
+
72
+ Finally, when done be sure to explicitly release the resources:
73
+ ```python
74
+ leopard.delete()
75
+ ```
76
+
77
+ ### Language Model
78
+
79
+ The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
80
+ Default models for other supported languages can be found in [lib/common](../../lib/common).
81
+
82
+ Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
83
+ language models with custom vocabulary and boost words in the existing vocabulary.
84
+
85
+ Pass in the `.pv` file via the `model_path` argument:
86
+ ```python
87
+ leopard = pvleopard.create(
88
+ access_key='${ACCESS_KEY}',
89
+ model_path='${MODEL_FILE_PATH}')
90
+ ```
91
+
92
+ ### Word Metadata
93
+
94
+ Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
95
+
96
+ - **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
97
+ - **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
98
+ - **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
99
+ - **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
100
+
101
+ ## Demos
102
+
103
+ [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
104
+ Leopard.
@@ -11,7 +11,6 @@ pvleopard.egg-info/SOURCES.txt
11
11
  pvleopard.egg-info/dependency_links.txt
12
12
  pvleopard.egg-info/top_level.txt
13
13
  pvleopard/lib/common/leopard_params.pv
14
- pvleopard/lib/jetson/cortex-a57-aarch64/libpv_leopard.so
15
14
  pvleopard/lib/linux/x86_64/libpv_leopard.so
16
15
  pvleopard/lib/mac/arm64/libpv_leopard.dylib
17
16
  pvleopard/lib/mac/x86_64/libpv_leopard.dylib
@@ -19,4 +18,6 @@ pvleopard/lib/raspberry-pi/cortex-a53/libpv_leopard.so
19
18
  pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so
20
19
  pvleopard/lib/raspberry-pi/cortex-a72/libpv_leopard.so
21
20
  pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so
21
+ pvleopard/lib/raspberry-pi/cortex-a76/libpv_leopard.so
22
+ pvleopard/lib/raspberry-pi/cortex-a76-aarch64/libpv_leopard.so
22
23
  pvleopard/lib/windows/amd64/libpv_leopard.dll
@@ -1,5 +1,5 @@
1
1
  #
2
- # Copyright 2022-2023 Picovoice Inc.
2
+ # Copyright 2022-2024 Picovoice Inc.
3
3
  #
4
4
  # You may not use this file except in compliance with the license. A copy of the license is located in the "LICENSE"
5
5
  # file accompanying this source.
@@ -15,7 +15,7 @@ import shutil
15
15
  import setuptools
16
16
 
17
17
  INCLUDE_FILES = ('../../LICENSE', '__init__.py', '_factory.py', '_leopard.py', '_util.py')
18
- INCLUDE_LIBS = ('linux', 'mac', 'windows', 'jetson', 'raspberry-pi')
18
+ INCLUDE_LIBS = ('linux', 'mac', 'windows', 'raspberry-pi')
19
19
 
20
20
  os.system('git clean -dfx')
21
21
 
@@ -48,7 +48,7 @@ with open(os.path.join(os.path.dirname(__file__), 'README.md'), 'r') as f:
48
48
 
49
49
  setuptools.setup(
50
50
  name="pvleopard",
51
- version="2.0.1",
51
+ version="2.0.3",
52
52
  author="Picovoice",
53
53
  author_email="hello@picovoice.ai",
54
54
  description="Leopard Speech-to-Text Engine.",
@@ -65,6 +65,6 @@ setuptools.setup(
65
65
  "Programming Language :: Python :: 3",
66
66
  "Topic :: Multimedia :: Sound/Audio :: Speech"
67
67
  ],
68
- python_requires='>=3.5',
68
+ python_requires='>=3.8',
69
69
  keywords="Speech-to-Text, Speech Recognition, Voice Recognition, ASR, Automatic Speech Recognition",
70
70
  )
pvleopard-2.0.1/PKG-INFO DELETED
@@ -1,97 +0,0 @@
1
- Metadata-Version: 2.1
2
- Name: pvleopard
3
- Version: 2.0.1
4
- Summary: Leopard Speech-to-Text Engine.
5
- Home-page: https://github.com/Picovoice/leopard
6
- Author: Picovoice
7
- Author-email: hello@picovoice.ai
8
- License: UNKNOWN
9
- Description: # Leopard Binding for Python
10
-
11
- ## Leopard Speech-to-Text Engine
12
-
13
- Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
14
-
15
- Leopard is an on-device speech-to-text engine. Leopard is:
16
-
17
- - Private; All voice processing runs locally.
18
- - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
19
- - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
20
- - Cross-Platform:
21
- - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
22
- - Android and iOS
23
- - Chrome, Safari, Firefox, and Edge
24
- - Raspberry Pi (4, 3) and NVIDIA Jetson Nano
25
-
26
- ## Compatibility
27
-
28
- - Python 3.5+
29
- - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
30
-
31
- ## Installation
32
-
33
- ```console
34
- pip3 install pvleopard
35
- ```
36
-
37
- ## AccessKey
38
-
39
- Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
40
- You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
41
- Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
42
-
43
- ### Usage
44
-
45
- Create an instance of the engine and transcribe an audio file:
46
-
47
- ```python
48
- import pvleopard
49
-
50
- leopard = pvleopard.create(access_key='${ACCESS_KEY}')
51
-
52
- transcript, words = leopard.process_file('${AUDIO_PATH}')
53
- print(transcript)
54
- for word in words:
55
- print(
56
- "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
57
- % (word.word, word.start_sec, word.end_sec, word.confidence))
58
- ```
59
-
60
- Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
61
- `${AUDIO_PATH}` to the path an audio file.
62
-
63
- Finally, when done be sure to explicitly release the resources:
64
- ```python
65
- leopard.delete()
66
- ```
67
-
68
- ## Language Model
69
-
70
- The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
71
- Default models for other supported languages can be found in [lib/common](../../lib/common).
72
-
73
- Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
74
- language models with custom vocabulary and boost words in the existing vocabulary.
75
-
76
- Pass in the `.pv` file via the `model_path` argument:
77
- ```python
78
- leopard = pvleopard.create(
79
- access_key='${ACCESS_KEY}',
80
- model_path='${MODEL_PATH}')
81
- ```
82
-
83
- ## Demos
84
-
85
- [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
86
- Leopard.
87
-
88
- Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
89
- Platform: UNKNOWN
90
- Classifier: Development Status :: 5 - Production/Stable
91
- Classifier: Intended Audience :: Developers
92
- Classifier: License :: OSI Approved :: Apache Software License
93
- Classifier: Operating System :: OS Independent
94
- Classifier: Programming Language :: Python :: 3
95
- Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
96
- Requires-Python: >=3.5
97
- Description-Content-Type: text/markdown
@@ -1,97 +0,0 @@
1
- Metadata-Version: 2.1
2
- Name: pvleopard
3
- Version: 2.0.1
4
- Summary: Leopard Speech-to-Text Engine.
5
- Home-page: https://github.com/Picovoice/leopard
6
- Author: Picovoice
7
- Author-email: hello@picovoice.ai
8
- License: UNKNOWN
9
- Description: # Leopard Binding for Python
10
-
11
- ## Leopard Speech-to-Text Engine
12
-
13
- Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
14
-
15
- Leopard is an on-device speech-to-text engine. Leopard is:
16
-
17
- - Private; All voice processing runs locally.
18
- - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
19
- - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
20
- - Cross-Platform:
21
- - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
22
- - Android and iOS
23
- - Chrome, Safari, Firefox, and Edge
24
- - Raspberry Pi (4, 3) and NVIDIA Jetson Nano
25
-
26
- ## Compatibility
27
-
28
- - Python 3.5+
29
- - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
30
-
31
- ## Installation
32
-
33
- ```console
34
- pip3 install pvleopard
35
- ```
36
-
37
- ## AccessKey
38
-
39
- Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
40
- You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
41
- Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
42
-
43
- ### Usage
44
-
45
- Create an instance of the engine and transcribe an audio file:
46
-
47
- ```python
48
- import pvleopard
49
-
50
- leopard = pvleopard.create(access_key='${ACCESS_KEY}')
51
-
52
- transcript, words = leopard.process_file('${AUDIO_PATH}')
53
- print(transcript)
54
- for word in words:
55
- print(
56
- "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
57
- % (word.word, word.start_sec, word.end_sec, word.confidence))
58
- ```
59
-
60
- Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
61
- `${AUDIO_PATH}` to the path an audio file.
62
-
63
- Finally, when done be sure to explicitly release the resources:
64
- ```python
65
- leopard.delete()
66
- ```
67
-
68
- ## Language Model
69
-
70
- The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
71
- Default models for other supported languages can be found in [lib/common](../../lib/common).
72
-
73
- Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
74
- language models with custom vocabulary and boost words in the existing vocabulary.
75
-
76
- Pass in the `.pv` file via the `model_path` argument:
77
- ```python
78
- leopard = pvleopard.create(
79
- access_key='${ACCESS_KEY}',
80
- model_path='${MODEL_PATH}')
81
- ```
82
-
83
- ## Demos
84
-
85
- [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
86
- Leopard.
87
-
88
- Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
89
- Platform: UNKNOWN
90
- Classifier: Development Status :: 5 - Production/Stable
91
- Classifier: Intended Audience :: Developers
92
- Classifier: License :: OSI Approved :: Apache Software License
93
- Classifier: Operating System :: OS Independent
94
- Classifier: Programming Language :: Python :: 3
95
- Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
96
- Requires-Python: >=3.5
97
- Description-Content-Type: text/markdown
File without changes
File without changes