scribe-cli 0.12.4__tar.gz → 0.13.0__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (31) hide show
  1. {scribe_cli-0.12.4/scribe_cli.egg-info → scribe_cli-0.13.0}/PKG-INFO +4 -4
  2. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/README.md +2 -2
  3. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/pyproject.toml +4 -1
  4. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/_version.py +2 -2
  5. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/app.py +2 -2
  6. {scribe_cli-0.12.4 → scribe_cli-0.13.0/scribe_cli.egg-info}/PKG-INFO +4 -4
  7. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/.github/workflows/pypi.yml +0 -0
  8. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/.gitignore +0 -0
  9. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/LICENSE +0 -0
  10. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/icon.xcf +0 -0
  11. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/__init__.py +0 -0
  12. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/audio.py +0 -0
  13. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/install_desktop.py +0 -0
  14. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/keyboard.py +0 -0
  15. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/models.py +0 -0
  16. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/models.toml +0 -0
  17. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/saverecording.py +0 -0
  18. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/testpynput.py +0 -0
  19. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe/util.py +0 -0
  20. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_cli.egg-info/SOURCES.txt +0 -0
  21. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_cli.egg-info/dependency_links.txt +0 -0
  22. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_cli.egg-info/entry_points.txt +0 -0
  23. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_cli.egg-info/requires.txt +0 -0
  24. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_cli.egg-info/top_level.txt +0 -0
  25. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_data/__init__.py +0 -0
  26. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_data/share/icon.png +0 -0
  27. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_data/share/icon_recording.png +0 -0
  28. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_data/share/icon_writing.png +0 -0
  29. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scribe_data/templates/scribe.desktop +0 -0
  30. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/scripts/test_python_versions_install.sh +0 -0
  31. {scribe_cli-0.12.4 → scribe_cli-0.13.0}/setup.cfg +0 -0
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.2
2
2
  Name: scribe-cli
3
- Version: 0.12.4
3
+ Version: 0.13.0
4
4
  Summary: scribe is a local speech recognition tool that provides real-time transcription using vosk and whisper AI, with the goal of serving as a virtual keyboard on a computer
5
5
  Author-email: Mahé Perrette <mahe.perrette@gmail.com>
6
6
  License: MIT License
@@ -33,7 +33,7 @@ License: MIT License
33
33
  licenses of all dependencies before using or distributing this software to
34
34
  ensure compliance with their respective terms.
35
35
  Project-URL: Homepage, https://github.com/perrette/scribe
36
- Keywords: speech recognition,transcription,AI,language,vosk,whisper,openai,keyboard,clipboard
36
+ Keywords: speech-to-text,speech recognition,transcription,language,AI,local,API,vosk,whisper,openai,keyboard,clipboard
37
37
  Classifier: Programming Language :: Python :: 3.9
38
38
  Classifier: Programming Language :: Python :: 3.10
39
39
  Classifier: Programming Language :: Python :: 3.11
@@ -171,8 +171,8 @@ You can interrupt the recording via Ctrl + C and start again or change model.
171
171
 
172
172
  The default (`whisper`) is excellent at transcribing a full-length audio sequences in [many languages](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages). It is really impressive,
173
173
  but it cannot do real-time, and depending on the model can have relatively long execution time, especially with the `turbo` model (at least on my laptop with CPU only). The `small` model is also excellent and runs much faster. It is selected as default in `scribe` for that reason.
174
- With the `whisper` model the registration stops after a 2-second silence is detected. You can also stop the registration manually before the transcription occurs (Ctrl + C or Stop in the `--app` mode).
175
- By default, the recording will only last 120 seconds. You can fine-tune this behaviour via the `--silence`, `--duration` and `--restart-after-silence` parameters.
174
+ With the `whisper` model (`whisper` and `openaiapi` backends) the registration continues for 2 minutes until you stop the registration manually to trigger the transcription (Stop in the app, Ctrl + C in the terminal).
175
+ These parameters can be changed. There is also the possibility to interrupt after a silence is detected. You would do: `--silence -40 --duration-duration 2` to interrupt the registration when a silence (less than -40 db recorded) lasts for more than 2 seconds. This is experimental, and the default is an exceedingly low silence threshold of -200 db and a silence duration of 120 s to effectively disable that feature and keep full manual control.
176
176
 
177
177
  The `vosk` backend is much faster and very good at doing real-time transcription for one language, but tended to make more mistakes in my tests and it does not do punctuation.
178
178
  It becomes really powerful when used for longer or interactive typing session with the [keyboard](#virtual-keyboard-experimental) option, e.g. to make notes or chat with an AI.
@@ -99,8 +99,8 @@ You can interrupt the recording via Ctrl + C and start again or change model.
99
99
 
100
100
  The default (`whisper`) is excellent at transcribing a full-length audio sequences in [many languages](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages). It is really impressive,
101
101
  but it cannot do real-time, and depending on the model can have relatively long execution time, especially with the `turbo` model (at least on my laptop with CPU only). The `small` model is also excellent and runs much faster. It is selected as default in `scribe` for that reason.
102
- With the `whisper` model the registration stops after a 2-second silence is detected. You can also stop the registration manually before the transcription occurs (Ctrl + C or Stop in the `--app` mode).
103
- By default, the recording will only last 120 seconds. You can fine-tune this behaviour via the `--silence`, `--duration` and `--restart-after-silence` parameters.
102
+ With the `whisper` model (`whisper` and `openaiapi` backends) the registration continues for 2 minutes until you stop the registration manually to trigger the transcription (Stop in the app, Ctrl + C in the terminal).
103
+ These parameters can be changed. There is also the possibility to interrupt after a silence is detected. You would do: `--silence -40 --duration-duration 2` to interrupt the registration when a silence (less than -40 db recorded) lasts for more than 2 seconds. This is experimental, and the default is an exceedingly low silence threshold of -200 db and a silence duration of 120 s to effectively disable that feature and keep full manual control.
104
104
 
105
105
  The `vosk` backend is much faster and very good at doing real-time transcription for one language, but tended to make more mistakes in my tests and it does not do punctuation.
106
106
  It becomes really powerful when used for longer or interactive typing session with the [keyboard](#virtual-keyboard-experimental) option, e.g. to make notes or chat with an AI.
@@ -32,10 +32,13 @@ classifiers = [
32
32
  ]
33
33
 
34
34
  keywords = [
35
+ "speech-to-text",
35
36
  "speech recognition",
36
37
  "transcription",
37
- "AI",
38
38
  "language",
39
+ "AI",
40
+ "local",
41
+ "API",
39
42
  "vosk",
40
43
  "whisper",
41
44
  "openai",
@@ -17,5 +17,5 @@ __version__: str
17
17
  __version_tuple__: VERSION_TUPLE
18
18
  version_tuple: VERSION_TUPLE
19
19
 
20
- __version__ = version = '0.12.4'
21
- __version_tuple__ = version_tuple = (0, 12, 4)
20
+ __version__ = version = '0.13.0'
21
+ __version_tuple__ = version_tuple = (0, 13, 0)
@@ -202,8 +202,8 @@ def get_parser():
202
202
 
203
203
  group = parser.add_argument_group("whisper options")
204
204
  group.add_argument("--duration", default=120, type=float, help="Max duration of the whisper recording (default %(default)s s)")
205
- group.add_argument("--silence", default=2, type=float, help="silence duration (default %(default)s s)")
206
- group.add_argument("--silence-db", default=-40, type=float, help="silence magnitude in decibel (default %(default)s db)")
205
+ group.add_argument("--silence", default=120, type=float, help="silence duration (default %(default)s s)")
206
+ group.add_argument("--silence-db", default=-200, type=float, help="silence magnitude in decibel (default %(default)s db)")
207
207
  group.add_argument("-a", "--restart-after-silence", action="store_true", help="Restart the recording after a transcription triggered by a silence")
208
208
  group.add_argument("--download-folder-whisper", help="Folder to store Whisper models.")
209
209
 
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.2
2
2
  Name: scribe-cli
3
- Version: 0.12.4
3
+ Version: 0.13.0
4
4
  Summary: scribe is a local speech recognition tool that provides real-time transcription using vosk and whisper AI, with the goal of serving as a virtual keyboard on a computer
5
5
  Author-email: Mahé Perrette <mahe.perrette@gmail.com>
6
6
  License: MIT License
@@ -33,7 +33,7 @@ License: MIT License
33
33
  licenses of all dependencies before using or distributing this software to
34
34
  ensure compliance with their respective terms.
35
35
  Project-URL: Homepage, https://github.com/perrette/scribe
36
- Keywords: speech recognition,transcription,AI,language,vosk,whisper,openai,keyboard,clipboard
36
+ Keywords: speech-to-text,speech recognition,transcription,language,AI,local,API,vosk,whisper,openai,keyboard,clipboard
37
37
  Classifier: Programming Language :: Python :: 3.9
38
38
  Classifier: Programming Language :: Python :: 3.10
39
39
  Classifier: Programming Language :: Python :: 3.11
@@ -171,8 +171,8 @@ You can interrupt the recording via Ctrl + C and start again or change model.
171
171
 
172
172
  The default (`whisper`) is excellent at transcribing a full-length audio sequences in [many languages](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages). It is really impressive,
173
173
  but it cannot do real-time, and depending on the model can have relatively long execution time, especially with the `turbo` model (at least on my laptop with CPU only). The `small` model is also excellent and runs much faster. It is selected as default in `scribe` for that reason.
174
- With the `whisper` model the registration stops after a 2-second silence is detected. You can also stop the registration manually before the transcription occurs (Ctrl + C or Stop in the `--app` mode).
175
- By default, the recording will only last 120 seconds. You can fine-tune this behaviour via the `--silence`, `--duration` and `--restart-after-silence` parameters.
174
+ With the `whisper` model (`whisper` and `openaiapi` backends) the registration continues for 2 minutes until you stop the registration manually to trigger the transcription (Stop in the app, Ctrl + C in the terminal).
175
+ These parameters can be changed. There is also the possibility to interrupt after a silence is detected. You would do: `--silence -40 --duration-duration 2` to interrupt the registration when a silence (less than -40 db recorded) lasts for more than 2 seconds. This is experimental, and the default is an exceedingly low silence threshold of -200 db and a silence duration of 120 s to effectively disable that feature and keep full manual control.
176
176
 
177
177
  The `vosk` backend is much faster and very good at doing real-time transcription for one language, but tended to make more mistakes in my tests and it does not do punctuation.
178
178
  It becomes really powerful when used for longer or interactive typing session with the [keyboard](#virtual-keyboard-experimental) option, e.g. to make notes or chat with an AI.
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes