scribe-cli 0.12.3__tar.gz → 0.13.0__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- {scribe_cli-0.12.3/scribe_cli.egg-info → scribe_cli-0.13.0}/PKG-INFO +6 -6
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/README.md +4 -4
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/pyproject.toml +4 -1
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/_version.py +2 -2
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/app.py +2 -2
- {scribe_cli-0.12.3 → scribe_cli-0.13.0/scribe_cli.egg-info}/PKG-INFO +6 -6
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/.github/workflows/pypi.yml +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/.gitignore +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/LICENSE +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/icon.xcf +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/__init__.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/audio.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/install_desktop.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/keyboard.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/models.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/models.toml +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/saverecording.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/testpynput.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe/util.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_cli.egg-info/SOURCES.txt +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_cli.egg-info/dependency_links.txt +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_cli.egg-info/entry_points.txt +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_cli.egg-info/requires.txt +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_cli.egg-info/top_level.txt +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_data/__init__.py +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_data/share/icon.png +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_data/share/icon_recording.png +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_data/share/icon_writing.png +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scribe_data/templates/scribe.desktop +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/scripts/test_python_versions_install.sh +0 -0
- {scribe_cli-0.12.3 → scribe_cli-0.13.0}/setup.cfg +0 -0
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.2
|
|
2
2
|
Name: scribe-cli
|
|
3
|
-
Version: 0.
|
|
3
|
+
Version: 0.13.0
|
|
4
4
|
Summary: scribe is a local speech recognition tool that provides real-time transcription using vosk and whisper AI, with the goal of serving as a virtual keyboard on a computer
|
|
5
5
|
Author-email: Mahé Perrette <mahe.perrette@gmail.com>
|
|
6
6
|
License: MIT License
|
|
@@ -33,7 +33,7 @@ License: MIT License
|
|
|
33
33
|
licenses of all dependencies before using or distributing this software to
|
|
34
34
|
ensure compliance with their respective terms.
|
|
35
35
|
Project-URL: Homepage, https://github.com/perrette/scribe
|
|
36
|
-
Keywords: speech recognition,transcription,AI,
|
|
36
|
+
Keywords: speech-to-text,speech recognition,transcription,language,AI,local,API,vosk,whisper,openai,keyboard,clipboard
|
|
37
37
|
Classifier: Programming Language :: Python :: 3.9
|
|
38
38
|
Classifier: Programming Language :: Python :: 3.10
|
|
39
39
|
Classifier: Programming Language :: Python :: 3.11
|
|
@@ -73,7 +73,7 @@ Requires-Dist: pystray; extra == "all"
|
|
|
73
73
|
[](https://pypi.org/project/scribe-cli)
|
|
74
74
|

|
|
75
75
|
|
|
76
|
-
# Scribe <img src="
|
|
76
|
+
# Scribe <img src="https://github.com/perrette/bard/raw/main/bard_data/share/icon.png" width=48px>
|
|
77
77
|
|
|
78
78
|
`scribe` is a speech recognition tool that provides real-time transcription using cutting-edge AI models, with the goal of serving as a virtual keyboard on a computer.
|
|
79
79
|
|
|
@@ -171,8 +171,8 @@ You can interrupt the recording via Ctrl + C and start again or change model.
|
|
|
171
171
|
|
|
172
172
|
The default (`whisper`) is excellent at transcribing a full-length audio sequences in [many languages](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages). It is really impressive,
|
|
173
173
|
but it cannot do real-time, and depending on the model can have relatively long execution time, especially with the `turbo` model (at least on my laptop with CPU only). The `small` model is also excellent and runs much faster. It is selected as default in `scribe` for that reason.
|
|
174
|
-
With the `whisper` model
|
|
175
|
-
|
|
174
|
+
With the `whisper` model (`whisper` and `openaiapi` backends) the registration continues for 2 minutes until you stop the registration manually to trigger the transcription (Stop in the app, Ctrl + C in the terminal).
|
|
175
|
+
These parameters can be changed. There is also the possibility to interrupt after a silence is detected. You would do: `--silence -40 --duration-duration 2` to interrupt the registration when a silence (less than -40 db recorded) lasts for more than 2 seconds. This is experimental, and the default is an exceedingly low silence threshold of -200 db and a silence duration of 120 s to effectively disable that feature and keep full manual control.
|
|
176
176
|
|
|
177
177
|
The `vosk` backend is much faster and very good at doing real-time transcription for one language, but tended to make more mistakes in my tests and it does not do punctuation.
|
|
178
178
|
It becomes really powerful when used for longer or interactive typing session with the [keyboard](#virtual-keyboard-experimental) option, e.g. to make notes or chat with an AI.
|
|
@@ -233,7 +233,7 @@ sudo HOME=$HOME XDG_RUNTIME_DIR=$XDG_RUNTIME_DIR PYNPUT_BACKEND_KEYBOARD=uinput
|
|
|
233
233
|
```
|
|
234
234
|
You're on the right path :)
|
|
235
235
|
|
|
236
|
-
## System tray icon (experimental) <img src="
|
|
236
|
+
## System tray icon (experimental) <img src="https://github.com/perrette/bard/raw/main/bard_data/share/icon.png" width=48px>
|
|
237
237
|
|
|
238
238
|
<img src=https://github.com/user-attachments/assets/4c97f4b1-1a65-4d49-9f5a-a9f4287cfa5a width=300px>
|
|
239
239
|
|
|
@@ -1,7 +1,7 @@
|
|
|
1
1
|
[](https://pypi.org/project/scribe-cli)
|
|
2
2
|

|
|
3
3
|
|
|
4
|
-
# Scribe <img src="
|
|
4
|
+
# Scribe <img src="https://github.com/perrette/bard/raw/main/bard_data/share/icon.png" width=48px>
|
|
5
5
|
|
|
6
6
|
`scribe` is a speech recognition tool that provides real-time transcription using cutting-edge AI models, with the goal of serving as a virtual keyboard on a computer.
|
|
7
7
|
|
|
@@ -99,8 +99,8 @@ You can interrupt the recording via Ctrl + C and start again or change model.
|
|
|
99
99
|
|
|
100
100
|
The default (`whisper`) is excellent at transcribing a full-length audio sequences in [many languages](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages). It is really impressive,
|
|
101
101
|
but it cannot do real-time, and depending on the model can have relatively long execution time, especially with the `turbo` model (at least on my laptop with CPU only). The `small` model is also excellent and runs much faster. It is selected as default in `scribe` for that reason.
|
|
102
|
-
With the `whisper` model
|
|
103
|
-
|
|
102
|
+
With the `whisper` model (`whisper` and `openaiapi` backends) the registration continues for 2 minutes until you stop the registration manually to trigger the transcription (Stop in the app, Ctrl + C in the terminal).
|
|
103
|
+
These parameters can be changed. There is also the possibility to interrupt after a silence is detected. You would do: `--silence -40 --duration-duration 2` to interrupt the registration when a silence (less than -40 db recorded) lasts for more than 2 seconds. This is experimental, and the default is an exceedingly low silence threshold of -200 db and a silence duration of 120 s to effectively disable that feature and keep full manual control.
|
|
104
104
|
|
|
105
105
|
The `vosk` backend is much faster and very good at doing real-time transcription for one language, but tended to make more mistakes in my tests and it does not do punctuation.
|
|
106
106
|
It becomes really powerful when used for longer or interactive typing session with the [keyboard](#virtual-keyboard-experimental) option, e.g. to make notes or chat with an AI.
|
|
@@ -161,7 +161,7 @@ sudo HOME=$HOME XDG_RUNTIME_DIR=$XDG_RUNTIME_DIR PYNPUT_BACKEND_KEYBOARD=uinput
|
|
|
161
161
|
```
|
|
162
162
|
You're on the right path :)
|
|
163
163
|
|
|
164
|
-
## System tray icon (experimental) <img src="
|
|
164
|
+
## System tray icon (experimental) <img src="https://github.com/perrette/bard/raw/main/bard_data/share/icon.png" width=48px>
|
|
165
165
|
|
|
166
166
|
<img src=https://github.com/user-attachments/assets/4c97f4b1-1a65-4d49-9f5a-a9f4287cfa5a width=300px>
|
|
167
167
|
|
|
@@ -202,8 +202,8 @@ def get_parser():
|
|
|
202
202
|
|
|
203
203
|
group = parser.add_argument_group("whisper options")
|
|
204
204
|
group.add_argument("--duration", default=120, type=float, help="Max duration of the whisper recording (default %(default)s s)")
|
|
205
|
-
group.add_argument("--silence", default=
|
|
206
|
-
group.add_argument("--silence-db", default=-
|
|
205
|
+
group.add_argument("--silence", default=120, type=float, help="silence duration (default %(default)s s)")
|
|
206
|
+
group.add_argument("--silence-db", default=-200, type=float, help="silence magnitude in decibel (default %(default)s db)")
|
|
207
207
|
group.add_argument("-a", "--restart-after-silence", action="store_true", help="Restart the recording after a transcription triggered by a silence")
|
|
208
208
|
group.add_argument("--download-folder-whisper", help="Folder to store Whisper models.")
|
|
209
209
|
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
Metadata-Version: 2.2
|
|
2
2
|
Name: scribe-cli
|
|
3
|
-
Version: 0.
|
|
3
|
+
Version: 0.13.0
|
|
4
4
|
Summary: scribe is a local speech recognition tool that provides real-time transcription using vosk and whisper AI, with the goal of serving as a virtual keyboard on a computer
|
|
5
5
|
Author-email: Mahé Perrette <mahe.perrette@gmail.com>
|
|
6
6
|
License: MIT License
|
|
@@ -33,7 +33,7 @@ License: MIT License
|
|
|
33
33
|
licenses of all dependencies before using or distributing this software to
|
|
34
34
|
ensure compliance with their respective terms.
|
|
35
35
|
Project-URL: Homepage, https://github.com/perrette/scribe
|
|
36
|
-
Keywords: speech recognition,transcription,AI,
|
|
36
|
+
Keywords: speech-to-text,speech recognition,transcription,language,AI,local,API,vosk,whisper,openai,keyboard,clipboard
|
|
37
37
|
Classifier: Programming Language :: Python :: 3.9
|
|
38
38
|
Classifier: Programming Language :: Python :: 3.10
|
|
39
39
|
Classifier: Programming Language :: Python :: 3.11
|
|
@@ -73,7 +73,7 @@ Requires-Dist: pystray; extra == "all"
|
|
|
73
73
|
[](https://pypi.org/project/scribe-cli)
|
|
74
74
|

|
|
75
75
|
|
|
76
|
-
# Scribe <img src="
|
|
76
|
+
# Scribe <img src="https://github.com/perrette/bard/raw/main/bard_data/share/icon.png" width=48px>
|
|
77
77
|
|
|
78
78
|
`scribe` is a speech recognition tool that provides real-time transcription using cutting-edge AI models, with the goal of serving as a virtual keyboard on a computer.
|
|
79
79
|
|
|
@@ -171,8 +171,8 @@ You can interrupt the recording via Ctrl + C and start again or change model.
|
|
|
171
171
|
|
|
172
172
|
The default (`whisper`) is excellent at transcribing a full-length audio sequences in [many languages](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages). It is really impressive,
|
|
173
173
|
but it cannot do real-time, and depending on the model can have relatively long execution time, especially with the `turbo` model (at least on my laptop with CPU only). The `small` model is also excellent and runs much faster. It is selected as default in `scribe` for that reason.
|
|
174
|
-
With the `whisper` model
|
|
175
|
-
|
|
174
|
+
With the `whisper` model (`whisper` and `openaiapi` backends) the registration continues for 2 minutes until you stop the registration manually to trigger the transcription (Stop in the app, Ctrl + C in the terminal).
|
|
175
|
+
These parameters can be changed. There is also the possibility to interrupt after a silence is detected. You would do: `--silence -40 --duration-duration 2` to interrupt the registration when a silence (less than -40 db recorded) lasts for more than 2 seconds. This is experimental, and the default is an exceedingly low silence threshold of -200 db and a silence duration of 120 s to effectively disable that feature and keep full manual control.
|
|
176
176
|
|
|
177
177
|
The `vosk` backend is much faster and very good at doing real-time transcription for one language, but tended to make more mistakes in my tests and it does not do punctuation.
|
|
178
178
|
It becomes really powerful when used for longer or interactive typing session with the [keyboard](#virtual-keyboard-experimental) option, e.g. to make notes or chat with an AI.
|
|
@@ -233,7 +233,7 @@ sudo HOME=$HOME XDG_RUNTIME_DIR=$XDG_RUNTIME_DIR PYNPUT_BACKEND_KEYBOARD=uinput
|
|
|
233
233
|
```
|
|
234
234
|
You're on the right path :)
|
|
235
235
|
|
|
236
|
-
## System tray icon (experimental) <img src="
|
|
236
|
+
## System tray icon (experimental) <img src="https://github.com/perrette/bard/raw/main/bard_data/share/icon.png" width=48px>
|
|
237
237
|
|
|
238
238
|
<img src=https://github.com/user-attachments/assets/4c97f4b1-1a65-4d49-9f5a-a9f4287cfa5a width=300px>
|
|
239
239
|
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|