pvleopard 2.0.1__tar.gz → 2.0.4__tar.gz

Files changed (29)
  1. {pvleopard-2.0.1 → pvleopard-2.0.4}/MANIFEST.in +0 -1
  2. pvleopard-2.0.4/PKG-INFO +104 -0
  3. {pvleopard-2.0.1 → pvleopard-2.0.4}/README.md +20 -11
  4. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/_util.py +14 -8
  5. pvleopard-2.0.4/pvleopard/lib/raspberry-pi/cortex-a76/libpv_leopard.so +0 -0
  6. pvleopard-2.0.4/pvleopard/lib/raspberry-pi/cortex-a76-aarch64/libpv_leopard.so +0 -0
  7. pvleopard-2.0.4/pvleopard/lib/windows/arm64/libpv_leopard.dll +0 -0
  8. pvleopard-2.0.4/pvleopard.egg-info/PKG-INFO +104 -0
  9. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard.egg-info/SOURCES.txt +4 -2
  10. {pvleopard-2.0.1 → pvleopard-2.0.4}/setup.py +4 -4
  11. pvleopard-2.0.1/PKG-INFO +0 -97
  12. pvleopard-2.0.1/pvleopard/lib/jetson/cortex-a57-aarch64/libpv_leopard.so +0 -0
  13. pvleopard-2.0.1/pvleopard.egg-info/PKG-INFO +0 -97
  14. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/LICENSE +0 -0
  15. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/__init__.py +0 -0
  16. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/_factory.py +0 -0
  17. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/_leopard.py +0 -0
  18. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/common/leopard_params.pv +0 -0
  19. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/linux/x86_64/libpv_leopard.so +0 -0
  20. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/mac/arm64/libpv_leopard.dylib +0 -0
  21. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/mac/x86_64/libpv_leopard.dylib +0 -0
  22. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a53/libpv_leopard.so +0 -0
  23. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so +0 -0
  24. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a72/libpv_leopard.so +0 -0
  25. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so +0 -0
  26. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard/lib/windows/amd64/libpv_leopard.dll +0 -0
  27. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard.egg-info/dependency_links.txt +0 -0
  28. {pvleopard-2.0.1 → pvleopard-2.0.4}/pvleopard.egg-info/top_level.txt +0 -0
  29. {pvleopard-2.0.1 → pvleopard-2.0.4}/setup.cfg +0 -0
@@ -7,5 +7,4 @@ include pvleopard/lib/common/leopard_params.pv
  recursive-include pvleopard/lib/linux *
  recursive-include pvleopard/lib/mac *
  recursive-include pvleopard/lib/windows *
- recursive-include pvleopard/lib/jetson *
  recursive-include pvleopard/lib/raspberry-pi *
@@ -0,0 +1,104 @@
+ Metadata-Version: 2.1
+ Name: pvleopard
+ Version: 2.0.4
+ Summary: Leopard Speech-to-Text Engine.
+ Home-page: https://github.com/Picovoice/leopard
+ Author: Picovoice
+ Author-email: hello@picovoice.ai
+ Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
+ Classifier: Development Status :: 5 - Production/Stable
+ Classifier: Intended Audience :: Developers
+ Classifier: License :: OSI Approved :: Apache Software License
+ Classifier: Operating System :: OS Independent
+ Classifier: Programming Language :: Python :: 3
+ Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
+ Requires-Python: >=3.8
+ Description-Content-Type: text/markdown
+
+ # Leopard Binding for Python
+
+ ## Leopard Speech-to-Text Engine
+
+ Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
+
+ Leopard is an on-device speech-to-text engine. Leopard is:
+
+ - Private; All voice processing runs locally.
+ - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
+ - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
+ - Cross-Platform:
+     - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64)
+     - Android and iOS
+     - Chrome, Safari, Firefox, and Edge
+     - Raspberry Pi (3, 4, 5)
+
+ ## Compatibility
+
+ - Python 3.8+
+ - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64), and Raspberry Pi (3, 4, 5).
+
+ ## Installation
+
+ ```console
+ pip3 install pvleopard
+ ```
+
+ ## AccessKey
+
+ Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
+ You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
+ Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
+
+ ## Usage
+
+ Create an instance of the engine and transcribe an audio file:
+
+ ```python
+ import pvleopard
+
+ leopard = pvleopard.create(access_key='${ACCESS_KEY}')
+
+ transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
+ print(transcript)
+ for word in words:
+     print(
+         "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
+         % (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
+ ```
+
+ Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
+ `${AUDIO_FILE_PATH}` to the path an audio file.
+
+ Finally, when done be sure to explicitly release the resources:
+ ```python
+ leopard.delete()
+ ```
+
+ ### Language Model
+
+ The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
+ Default models for other supported languages can be found in [lib/common](../../lib/common).
+
+ Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
+ language models with custom vocabulary and boost words in the existing vocabulary.
+
+ Pass in the `.pv` file via the `model_path` argument:
+ ```python
+ leopard = pvleopard.create(
+     access_key='${ACCESS_KEY}',
+     model_path='${MODEL_FILE_PATH}')
+ ```
+
+ ### Word Metadata
+
+ Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
+
+ - **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
+ - **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
+ - **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
+ - **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
+
+ ## Demos
+
+ [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
+ Leopard.
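The Word Metadata section added in 2.0.4 documents a `speaker_tag` on every word. As a rough illustration of how that field might be consumed, the sketch below groups words by speaker. `Word` is a hypothetical stand-in namedtuple that only borrows the field names printed in the README example, and `words_by_speaker` is an illustrative helper, not part of the SDK; with the real package the list would come from `leopard.process_file(...)`.

```python
from collections import defaultdict, namedtuple

# Hypothetical stand-in for the word objects returned by process_file();
# the field names mirror the README's print example.
Word = namedtuple("Word", ["word", "start_sec", "end_sec", "confidence", "speaker_tag"])


def words_by_speaker(words):
    """Join words per speaker tag, preserving the original word order.

    Per the README: -1 means diarization was not enabled and 0 marks an
    unknown speaker, so callers may want to treat those tags specially.
    """
    grouped = defaultdict(list)
    for w in words:
        grouped[w.speaker_tag].append(w.word)
    return {tag: " ".join(parts) for tag, parts in grouped.items()}


# Made-up sample data, only to exercise the helper.
sample = [
    Word("hello", 0.10, 0.42, 0.97, 1),
    Word("there", 0.45, 0.80, 0.93, 1),
    Word("hi", 1.10, 1.30, 0.88, 2),
]
print(words_by_speaker(sample))
```

If every tag comes back as `-1`, diarization was not enabled when the engine was created, which is the case the README calls out.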
@@ -10,15 +10,15 @@ Leopard is an on-device speech-to-text engine. Leopard is:
  - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
  - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
  - Cross-Platform:
-     - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
+     - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64)
      - Android and iOS
      - Chrome, Safari, Firefox, and Edge
-     - Raspberry Pi (4, 3) and NVIDIA Jetson Nano
+     - Raspberry Pi (3, 4, 5)

  ## Compatibility

- - Python 3.5+
- - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
+ - Python 3.8+
+ - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64), and Raspberry Pi (3, 4, 5).

  ## Installation

@@ -32,7 +32,7 @@ Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` ac
  You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
  Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.

- ### Usage
+ ## Usage

  Create an instance of the engine and transcribe an audio file:

@@ -41,23 +41,23 @@ import pvleopard

  leopard = pvleopard.create(access_key='${ACCESS_KEY}')

- transcript, words = leopard.process_file('${AUDIO_PATH}')
+ transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
  print(transcript)
  for word in words:
      print(
-         "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
-         % (word.word, word.start_sec, word.end_sec, word.confidence))
+         "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
+         % (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
  ```

  Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
- `${AUDIO_PATH}` to the path an audio file.
+ `${AUDIO_FILE_PATH}` to the path an audio file.

  Finally, when done be sure to explicitly release the resources:
  ```python
  leopard.delete()
  ```

- ## Language Model
+ ### Language Model

  The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
  Default models for other supported languages can be found in [lib/common](../../lib/common).
@@ -69,9 +69,18 @@ Pass in the `.pv` file via the `model_path` argument:
  ```python
  leopard = pvleopard.create(
      access_key='${ACCESS_KEY}',
-     model_path='${MODEL_PATH}')
+     model_path='${MODEL_FILE_PATH}')
  ```

+ ### Word Metadata
+
+ Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
+
+ - **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
+ - **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
+ - **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
+ - **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
+
  ## Demos

  [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
@@ -1,5 +1,5 @@
  #
- # Copyright 2022-2023 Picovoice Inc.
+ # Copyright 2022-2024 Picovoice Inc.
  #
  # You may not use this file except in compliance with the license. A copy of the license is located in the "LICENSE"
  # file accompanying this source.
@@ -37,16 +37,21 @@ def _linux_machine():

      if '0xd03' == cpu_part:
          return 'cortex-a53' + arch_info
-     elif '0xd07' == cpu_part:
-         return 'cortex-a57' + arch_info
      elif '0xd08' == cpu_part:
          return 'cortex-a72' + arch_info
+     elif "0xd0b" == cpu_part:
+         return "cortex-a76" + arch_info
      else:
          raise NotImplementedError("Unsupported CPU: `%s`." % cpu_part)


- _RASPBERRY_PI_MACHINES = {'cortex-a53', 'cortex-a72', 'cortex-a53-aarch64', 'cortex-a72-aarch64'}
- _JETSON_MACHINES = {'cortex-a57-aarch64'}
+ _RASPBERRY_PI_MACHINES = {
+     "cortex-a53",
+     "cortex-a72",
+     "cortex-a76",
+     "cortex-a53-aarch64",
+     "cortex-a72-aarch64",
+     "cortex-a76-aarch64"}


  def default_library_path(relative):
@@ -59,15 +64,16 @@ def default_library_path(relative):
          linux_machine = _linux_machine()
          if linux_machine == 'x86_64':
              return os.path.join(os.path.dirname(__file__), relative, 'lib/linux/x86_64/libpv_leopard.so')
-         elif linux_machine in _JETSON_MACHINES:
-             return os.path.join(os.path.dirname(__file__), relative, 'lib/jetson/%s/libpv_leopard.so' % linux_machine)
          elif linux_machine in _RASPBERRY_PI_MACHINES:
              return os.path.join(
                  os.path.dirname(__file__),
                  relative,
                  'lib/raspberry-pi/%s/libpv_leopard.so' % linux_machine)
      elif platform.system() == 'Windows':
-         return os.path.join(os.path.dirname(__file__), relative, 'lib/windows/amd64/libpv_leopard.dll')
+         if platform.machine().lower() == 'amd64':
+             return os.path.join(os.path.dirname(__file__), relative, 'lib/windows/amd64/libpv_leopard.dll')
+         elif platform.machine().lower() == 'arm64':
+             return os.path.join(os.path.dirname(__file__), relative, 'lib/windows/arm64/libpv_leopard.dll')

      raise NotImplementedError('Unsupported platform.')

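The `_util.py` hunks change which bundled library is selected per platform. The sketch below restates only the branches visible in the diff as a pure function so the 2.0.4 behavior can be checked without the target hardware. `library_relative_path` and its parameters are hypothetical names, not part of the package, and the macOS branch is omitted because the hunks do not show it.

```python
# Raspberry Pi machine strings accepted by 2.0.4, copied from the diff.
RASPBERRY_PI_MACHINES = {
    "cortex-a53",
    "cortex-a72",
    "cortex-a76",
    "cortex-a53-aarch64",
    "cortex-a72-aarch64",
    "cortex-a76-aarch64",
}


def library_relative_path(system, machine, linux_machine=None):
    """Return the library path relative to the package root.

    Hypothetical helper: `system` and `machine` stand in for
    platform.system() and platform.machine(); `linux_machine` stands in
    for the string produced by `_linux_machine()` in the real module.
    """
    if system == "Linux":
        if linux_machine == "x86_64":
            return "lib/linux/x86_64/libpv_leopard.so"
        if linux_machine in RASPBERRY_PI_MACHINES:
            return "lib/raspberry-pi/%s/libpv_leopard.so" % linux_machine
    elif system == "Windows":
        arch = machine.lower()
        if arch in ("amd64", "arm64"):
            return "lib/windows/%s/libpv_leopard.dll" % arch
    # Jetson's cortex-a57-aarch64 ends up here now that its library is gone.
    raise NotImplementedError("Unsupported platform.")


print(library_relative_path("Windows", "ARM64"))
print(library_relative_path("Linux", "aarch64", "cortex-a76-aarch64"))
```

One consequence worth noting: in the real module a Jetson Nano (CPU part `0xd07`) is rejected earlier, inside `_linux_machine()`, with the "Unsupported CPU" error, since that branch was removed along with the `lib/jetson` binaries.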
@@ -0,0 +1,104 @@
+ Metadata-Version: 2.1
+ Name: pvleopard
+ Version: 2.0.4
+ Summary: Leopard Speech-to-Text Engine.
+ Home-page: https://github.com/Picovoice/leopard
+ Author: Picovoice
+ Author-email: hello@picovoice.ai
+ Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
+ Classifier: Development Status :: 5 - Production/Stable
+ Classifier: Intended Audience :: Developers
+ Classifier: License :: OSI Approved :: Apache Software License
+ Classifier: Operating System :: OS Independent
+ Classifier: Programming Language :: Python :: 3
+ Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
+ Requires-Python: >=3.8
+ Description-Content-Type: text/markdown
+
+ # Leopard Binding for Python
+
+ ## Leopard Speech-to-Text Engine
+
+ Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
+
+ Leopard is an on-device speech-to-text engine. Leopard is:
+
+ - Private; All voice processing runs locally.
+ - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
+ - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
+ - Cross-Platform:
+     - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64)
+     - Android and iOS
+     - Chrome, Safari, Firefox, and Edge
+     - Raspberry Pi (3, 4, 5)
+
+ ## Compatibility
+
+ - Python 3.8+
+ - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64), and Raspberry Pi (3, 4, 5).
+
+ ## Installation
+
+ ```console
+ pip3 install pvleopard
+ ```
+
+ ## AccessKey
+
+ Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
+ You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
+ Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
+
+ ## Usage
+
+ Create an instance of the engine and transcribe an audio file:
+
+ ```python
+ import pvleopard
+
+ leopard = pvleopard.create(access_key='${ACCESS_KEY}')
+
+ transcript, words = leopard.process_file('${AUDIO_FILE_PATH}')
+ print(transcript)
+ for word in words:
+     print(
+         "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f speaker_tag=%d}"
+         % (word.word, word.start_sec, word.end_sec, word.confidence, word.speaker_tag))
+ ```
+
+ Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
+ `${AUDIO_FILE_PATH}` to the path an audio file.
+
+ Finally, when done be sure to explicitly release the resources:
+ ```python
+ leopard.delete()
+ ```
+
+ ### Language Model
+
+ The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
+ Default models for other supported languages can be found in [lib/common](../../lib/common).
+
+ Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
+ language models with custom vocabulary and boost words in the existing vocabulary.
+
+ Pass in the `.pv` file via the `model_path` argument:
+ ```python
+ leopard = pvleopard.create(
+     access_key='${ACCESS_KEY}',
+     model_path='${MODEL_FILE_PATH}')
+ ```
+
+ ### Word Metadata
+
+ Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
+
+ - **Start Time:** Indicates when the word started in the transcribed audio. Value is in seconds.
+ - **End Time:** Indicates when the word ended in the transcribed audio. Value is in seconds.
+ - **Confidence:** Leopard's confidence that the transcribed word is accurate. It is a number within `[0, 1]`.
+ - **Speaker Tag:** If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with `0` reserved for unknown speakers. If speaker diarization is not enabled, the value will always be `-1`.
+
+ ## Demos
+
+ [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
+ Leopard.
@@ -11,7 +11,6 @@ pvleopard.egg-info/SOURCES.txt
  pvleopard.egg-info/dependency_links.txt
  pvleopard.egg-info/top_level.txt
  pvleopard/lib/common/leopard_params.pv
- pvleopard/lib/jetson/cortex-a57-aarch64/libpv_leopard.so
  pvleopard/lib/linux/x86_64/libpv_leopard.so
  pvleopard/lib/mac/arm64/libpv_leopard.dylib
  pvleopard/lib/mac/x86_64/libpv_leopard.dylib
@@ -19,4 +18,7 @@ pvleopard/lib/raspberry-pi/cortex-a53/libpv_leopard.so
  pvleopard/lib/raspberry-pi/cortex-a53-aarch64/libpv_leopard.so
  pvleopard/lib/raspberry-pi/cortex-a72/libpv_leopard.so
  pvleopard/lib/raspberry-pi/cortex-a72-aarch64/libpv_leopard.so
- pvleopard/lib/windows/amd64/libpv_leopard.dll
+ pvleopard/lib/raspberry-pi/cortex-a76/libpv_leopard.so
+ pvleopard/lib/raspberry-pi/cortex-a76-aarch64/libpv_leopard.so
+ pvleopard/lib/windows/amd64/libpv_leopard.dll
+ pvleopard/lib/windows/arm64/libpv_leopard.dll
@@ -1,5 +1,5 @@
  #
- # Copyright 2022-2023 Picovoice Inc.
+ # Copyright 2022-2024 Picovoice Inc.
  #
  # You may not use this file except in compliance with the license. A copy of the license is located in the "LICENSE"
  # file accompanying this source.
@@ -15,7 +15,7 @@ import shutil
  import setuptools

  INCLUDE_FILES = ('../../LICENSE', '__init__.py', '_factory.py', '_leopard.py', '_util.py')
- INCLUDE_LIBS = ('linux', 'mac', 'windows', 'jetson', 'raspberry-pi')
+ INCLUDE_LIBS = ('linux', 'mac', 'windows', 'raspberry-pi')

  os.system('git clean -dfx')

@@ -48,7 +48,7 @@ with open(os.path.join(os.path.dirname(__file__), 'README.md'), 'r') as f:

  setuptools.setup(
      name="pvleopard",
-     version="2.0.1",
+     version="2.0.4",
      author="Picovoice",
      author_email="hello@picovoice.ai",
      description="Leopard Speech-to-Text Engine.",
@@ -65,6 +65,6 @@ setuptools.setup(
          "Programming Language :: Python :: 3",
          "Topic :: Multimedia :: Sound/Audio :: Speech"
      ],
-     python_requires='>=3.5',
+     python_requires='>=3.8',
      keywords="Speech-to-Text, Speech Recognition, Voice Recognition, ASR, Automatic Speech Recognition",
  )
pvleopard-2.0.1/PKG-INFO DELETED
@@ -1,97 +0,0 @@
- Metadata-Version: 2.1
- Name: pvleopard
- Version: 2.0.1
- Summary: Leopard Speech-to-Text Engine.
- Home-page: https://github.com/Picovoice/leopard
- Author: Picovoice
- Author-email: hello@picovoice.ai
- License: UNKNOWN
- Description: # Leopard Binding for Python
-
- ## Leopard Speech-to-Text Engine
-
- Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
-
- Leopard is an on-device speech-to-text engine. Leopard is:
-
- - Private; All voice processing runs locally.
- - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
- - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
- - Cross-Platform:
-     - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
-     - Android and iOS
-     - Chrome, Safari, Firefox, and Edge
-     - Raspberry Pi (4, 3) and NVIDIA Jetson Nano
-
- ## Compatibility
-
- - Python 3.5+
- - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
-
- ## Installation
-
- ```console
- pip3 install pvleopard
- ```
-
- ## AccessKey
-
- Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
- You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
- Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
-
- ### Usage
-
- Create an instance of the engine and transcribe an audio file:
-
- ```python
- import pvleopard
-
- leopard = pvleopard.create(access_key='${ACCESS_KEY}')
-
- transcript, words = leopard.process_file('${AUDIO_PATH}')
- print(transcript)
- for word in words:
-     print(
-         "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
-         % (word.word, word.start_sec, word.end_sec, word.confidence))
- ```
-
- Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
- `${AUDIO_PATH}` to the path an audio file.
-
- Finally, when done be sure to explicitly release the resources:
- ```python
- leopard.delete()
- ```
-
- ## Language Model
-
- The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
- Default models for other supported languages can be found in [lib/common](../../lib/common).
-
- Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
- language models with custom vocabulary and boost words in the existing vocabulary.
-
- Pass in the `.pv` file via the `model_path` argument:
- ```python
- leopard = pvleopard.create(
-     access_key='${ACCESS_KEY}',
-     model_path='${MODEL_PATH}')
- ```
-
- ## Demos
-
- [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
- Leopard.
-
- Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
- Platform: UNKNOWN
- Classifier: Development Status :: 5 - Production/Stable
- Classifier: Intended Audience :: Developers
- Classifier: License :: OSI Approved :: Apache Software License
- Classifier: Operating System :: OS Independent
- Classifier: Programming Language :: Python :: 3
- Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
- Requires-Python: >=3.5
- Description-Content-Type: text/markdown
@@ -1,97 +0,0 @@
- Metadata-Version: 2.1
- Name: pvleopard
- Version: 2.0.1
- Summary: Leopard Speech-to-Text Engine.
- Home-page: https://github.com/Picovoice/leopard
- Author: Picovoice
- Author-email: hello@picovoice.ai
- License: UNKNOWN
- Description: # Leopard Binding for Python
-
- ## Leopard Speech-to-Text Engine
-
- Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)
-
- Leopard is an on-device speech-to-text engine. Leopard is:
-
- - Private; All voice processing runs locally.
- - [Accurate](https://picovoice.ai/docs/benchmark/stt/)
- - [Compact and Computationally-Efficient](https://github.com/Picovoice/speech-to-text-benchmark#rtf)
- - Cross-Platform:
-     - Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
-     - Android and iOS
-     - Chrome, Safari, Firefox, and Edge
-     - Raspberry Pi (4, 3) and NVIDIA Jetson Nano
-
- ## Compatibility
-
- - Python 3.5+
- - Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
-
- ## Installation
-
- ```console
- pip3 install pvleopard
- ```
-
- ## AccessKey
-
- Leopard requires a valid Picovoice `AccessKey` at initialization. `AccessKey` acts as your credentials when using Leopard SDKs.
- You can get your `AccessKey` for free. Make sure to keep your `AccessKey` secret.
- Signup or Login to [Picovoice Console](https://console.picovoice.ai/) to get your `AccessKey`.
-
- ### Usage
-
- Create an instance of the engine and transcribe an audio file:
-
- ```python
- import pvleopard
-
- leopard = pvleopard.create(access_key='${ACCESS_KEY}')
-
- transcript, words = leopard.process_file('${AUDIO_PATH}')
- print(transcript)
- for word in words:
-     print(
-         "{word=\"%s\" start_sec=%.2f end_sec=%.2f confidence=%.2f}"
-         % (word.word, word.start_sec, word.end_sec, word.confidence))
- ```
-
- Replace `${ACCESS_KEY}` with yours obtained from [Picovoice Console](https://console.picovoice.ai/) and
- `${AUDIO_PATH}` to the path an audio file.
-
- Finally, when done be sure to explicitly release the resources:
- ```python
- leopard.delete()
- ```
-
- ## Language Model
-
- The Leopard Python SDK comes preloaded with a default English language model (`.pv` file).
- Default models for other supported languages can be found in [lib/common](../../lib/common).
-
- Create custom language models using the [Picovoice Console](https://console.picovoice.ai/). Here you can train
- language models with custom vocabulary and boost words in the existing vocabulary.
-
- Pass in the `.pv` file via the `model_path` argument:
- ```python
- leopard = pvleopard.create(
-     access_key='${ACCESS_KEY}',
-     model_path='${MODEL_PATH}')
- ```
-
- ## Demos
-
- [pvleoparddemo](https://pypi.org/project/pvleoparddemo/) provides command-line utilities for processing audio using
- Leopard.
-
- Keywords: Speech-to-Text,Speech Recognition,Voice Recognition,ASR,Automatic Speech Recognition
- Platform: UNKNOWN
- Classifier: Development Status :: 5 - Production/Stable
- Classifier: Intended Audience :: Developers
- Classifier: License :: OSI Approved :: Apache Software License
- Classifier: Operating System :: OS Independent
- Classifier: Programming Language :: Python :: 3
- Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
- Requires-Python: >=3.5
- Description-Content-Type: text/markdown
File without changes