stttype 2.0.2__tar.gz → 2.0.4__tar.gz

@@ -1,6 +1,6 @@
  Metadata-Version: 2.4
  Name: stttype
- Version: 2.0.2
+ Version: 2.0.4
  Summary: Cross-platform voice-to-text typing assistant with GPU acceleration and automatic CPU fallback
  Author: LucasApps
  License: MIT
@@ -37,44 +37,54 @@ Provides-Extra: gpu
  Requires-Dist: torch>=2.0.0; extra == "gpu"
  Requires-Dist: torchaudio>=2.0.0; extra == "gpu"
 
- # STT Type v1.0.0
+ # STT Type v2.0.3
 
- **Python based STT module running on GPU.**
+ **Cross-platform voice-to-text typing assistant with GPU acceleration and automatic CPU fallback.**
 
- Hold **F2** to record your voice, release to transcribe and type the text automatically at your cursor position.
+ Hold **F2** (configurable) to record your voice, release to transcribe and type the text automatically at your cursor position.
 
  ## Features
 
  - **Cross-platform** - Works on Windows, Linux, and macOS
- - **Hold F2 to record** - Audio captures while key is held
- - **Visual indicator** - Semi-transparent red dot with "F2" label appears in top-right corner while recording
+ - **Hold hotkey to record** - Audio captures while key is held (default: F2)
+ - **Visual indicator** - Transparent red dot with hotkey label appears in top-right corner while recording
  - **Bell sounds** - Audio feedback when recording starts/stops
  - **GPU-accelerated STT** - Uses faster-whisper on your NVIDIA GPU
- - **Auto-typing** - Transcribed text is typed at cursor position
+ - **Auto CPU fallback** - Automatically falls back to CPU if GPU is not available
+ - **Clipboard typing** - Types text via Ctrl+V paste for reliability (no key interference)
  - **System tray mode** - Runs silently in background
- - **Auto-startup** - Starts automatically on login
+ - **Auto-startup** - Starts automatically on login (Windows)
+ - **Settings GUI** - `stttype --config` to change model, language, device, compute type, and hotkey
 
  ## Requirements
 
  - Python 3.9+
- - NVIDIA GPU with CUDA support (for GPU mode)
  - Microphone
+ - NVIDIA GPU with CUDA support (optional, for GPU mode)
 
  ## Installation
 
- ### Prerequisites
+ ### From PyPI (Recommended)
 
- Install PyTorch with CUDA support:
+ ```bash
+ pip install stttype
+ ```
+
+ ### Prerequisites (Optional GPU Support)
+
+ Install PyTorch with CUDA support for GPU acceleration:
 
  ```bash
- # Windows/Linux
+ # Windows/Linux with CUDA 11.8
  pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu118
 
  # macOS (CPU only, no CUDA)
  pip install torch torchaudio
  ```
 
- ### Windows
+ Without PyTorch, STT Type will still work — it automatically falls back to CPU.
+
+ ### Windows (Local Install)
 
  ```powershell
  cd "E:\Lucas\STT Type"
@@ -128,8 +138,9 @@ Once installed, `stttype` works from any terminal.
  | `stttype --shutdown` | Stop all STT Type processes |
  | `stttype --status` | Check if STT Type is running |
  | `stttype --restart` | Restart STT Type |
- | `stttype --addtostartup` | Add to startup |
- | `stttype --rmtostartup` | Remove from startup |
+ | `stttype --config` | Open settings GUI |
+ | `stttype --addtostartup` | Add to startup (Windows only) |
+ | `stttype --rmtostartup` | Remove from startup (Windows only) |
  | `stttype --model <size>` | Set Whisper model (tiny/base/small/medium/large-v3) |
  | `stttype --lang <code>` | Set language (en/zh/auto/etc) |
  | `stttype --help` | Show help |
@@ -148,14 +159,31 @@ stttype --start --lang zh
 
  # Start with small model and auto-detect language
  stttype --start --model small --lang auto
+
+ # Open settings GUI
+ stttype --config
  ```
 
+ ### Settings GUI
+
+ Run `stttype --config` to open a settings window where you can configure:
+
+ | Setting | Options |
+ |---------|---------|
+ | **Whisper Model** | tiny, base, small, medium, large-v3 |
+ | **Language** | Auto-detect, English, Chinese, Spanish, French, German, Japanese, Korean, Russian, Italian, Portuguese, Arabic, Hindi |
+ | **Device** | Auto (GPU if available), GPU (CUDA), CPU |
+ | **Compute Type** | Auto, Float16 (GPU), Int8 (CPU) |
+ | **Hotkey** | F1 - F12 |
+
+ Settings are saved to `~/.config/stttype/config.json` and persist across restarts.
+
  ## How It Works
 
- 1. **Hold F2** - A semi-transparent red dot with "F2" appears in the top-right corner, microphone starts recording
- 2. **Release F2** - Red dot disappears, recording stops
- 3. **GPU transcribes** - Whisper processes audio on your NVIDIA GPU
- 4. **Text is typed** - Result appears at your cursor position
+ 1. **Hold hotkey** (default F2) - A transparent red dot appears in the top-right corner, microphone starts recording
+ 2. **Release hotkey** - Red dot disappears, recording stops
+ 3. **GPU/CPU transcribes** - Whisper processes audio (GPU if available, else CPU)
+ 4. **Text is pasted** - Result is pasted at your cursor position via Ctrl+V
 
  ## Models
 
@@ -189,12 +217,13 @@ chmod +x uninstall.sh
  | Issue | Solution |
  |-------|----------|
  | `stttype` not found | Restart terminal after installation |
- | "CUDA not available" | Install NVIDIA drivers and CUDA toolkit |
+ | "CUDA not available" | Install NVIDIA drivers and CUDA toolkit, or STT Type will auto-fallback to CPU |
  | No sound on start/stop | Check system volume |
  | Text not typing | Make sure the target window is focused |
  | Model download fails | Check internet connection |
  | Hotkeys don't work (macOS) | Grant Accessibility permissions to your terminal |
  | Hotkeys don't work (Linux) | Make sure you're running under X11 (not Wayland) |
+ | Garbled text when typing | Fixed in v2.0.2+ — uses clipboard paste instead of key simulation |
 
  ## Publish to PyPI
 
@@ -217,5 +246,5 @@ When prompted:
  ---
 
  **Author**: LucasApps
- **Version**: 1.0.0
+ **Version**: 2.0.3
  **License**: MIT
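
The README changes above describe a Device setting (Auto, GPU, CPU), a Compute Type setting (Auto, Float16, Int8) and an automatic fallback to CPU when no GPU is available. The diff does not show how the package resolves these, so the following is only a minimal sketch under stated assumptions: the function name `resolve_device`, the setting strings (`"auto"`, `"cuda"`, `"cpu"`, `"float16"`, `"int8"`) and the rule that Float16 is swapped for Int8 on CPU are all hypothetical.

```python
from typing import Tuple


def resolve_device(device: str, compute_type: str, cuda_available: bool) -> Tuple[str, str]:
    """Pick a (device, compute_type) pair from saved settings.

    Hypothetical logic, not taken from the package: "auto" prefers CUDA
    when it is available, and an explicit "cuda" request falls back to
    CPU when no usable GPU is present.
    """
    if device in ("auto", "cuda") and cuda_available:
        resolved = "cuda"
    else:
        resolved = "cpu"

    # Assumption: float16 is only useful on GPU, so a CPU fallback uses int8.
    if compute_type == "auto" or (resolved == "cpu" and compute_type == "float16"):
        compute_type = "float16" if resolved == "cuda" else "int8"
    return resolved, compute_type


print(resolve_device("auto", "auto", cuda_available=True))      # ('cuda', 'float16')
print(resolve_device("cuda", "float16", cuda_available=False))  # ('cpu', 'int8')
```

Passing `cuda_available` in as a flag keeps the sketch free of any PyTorch import, which matches the README note that the tool still works without PyTorch installed.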
@@ -1,41 +1,51 @@
- # STT Type v1.0.0
+ # STT Type v2.0.3
 
- **Python based STT module running on GPU.**
+ **Cross-platform voice-to-text typing assistant with GPU acceleration and automatic CPU fallback.**
 
- Hold **F2** to record your voice, release to transcribe and type the text automatically at your cursor position.
+ Hold **F2** (configurable) to record your voice, release to transcribe and type the text automatically at your cursor position.
 
  ## Features
 
  - **Cross-platform** - Works on Windows, Linux, and macOS
- - **Hold F2 to record** - Audio captures while key is held
- - **Visual indicator** - Semi-transparent red dot with "F2" label appears in top-right corner while recording
+ - **Hold hotkey to record** - Audio captures while key is held (default: F2)
+ - **Visual indicator** - Transparent red dot with hotkey label appears in top-right corner while recording
  - **Bell sounds** - Audio feedback when recording starts/stops
  - **GPU-accelerated STT** - Uses faster-whisper on your NVIDIA GPU
- - **Auto-typing** - Transcribed text is typed at cursor position
+ - **Auto CPU fallback** - Automatically falls back to CPU if GPU is not available
+ - **Clipboard typing** - Types text via Ctrl+V paste for reliability (no key interference)
  - **System tray mode** - Runs silently in background
- - **Auto-startup** - Starts automatically on login
+ - **Auto-startup** - Starts automatically on login (Windows)
+ - **Settings GUI** - `stttype --config` to change model, language, device, compute type, and hotkey
 
  ## Requirements
 
  - Python 3.9+
- - NVIDIA GPU with CUDA support (for GPU mode)
  - Microphone
+ - NVIDIA GPU with CUDA support (optional, for GPU mode)
 
  ## Installation
 
- ### Prerequisites
+ ### From PyPI (Recommended)
 
- Install PyTorch with CUDA support:
+ ```bash
+ pip install stttype
+ ```
+
+ ### Prerequisites (Optional GPU Support)
+
+ Install PyTorch with CUDA support for GPU acceleration:
 
  ```bash
- # Windows/Linux
+ # Windows/Linux with CUDA 11.8
  pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu118
 
  # macOS (CPU only, no CUDA)
  pip install torch torchaudio
  ```
 
- ### Windows
+ Without PyTorch, STT Type will still work — it automatically falls back to CPU.
+
+ ### Windows (Local Install)
 
  ```powershell
  cd "E:\Lucas\STT Type"
@@ -89,8 +99,9 @@ Once installed, `stttype` works from any terminal.
  | `stttype --shutdown` | Stop all STT Type processes |
  | `stttype --status` | Check if STT Type is running |
  | `stttype --restart` | Restart STT Type |
- | `stttype --addtostartup` | Add to startup |
- | `stttype --rmtostartup` | Remove from startup |
+ | `stttype --config` | Open settings GUI |
+ | `stttype --addtostartup` | Add to startup (Windows only) |
+ | `stttype --rmtostartup` | Remove from startup (Windows only) |
  | `stttype --model <size>` | Set Whisper model (tiny/base/small/medium/large-v3) |
  | `stttype --lang <code>` | Set language (en/zh/auto/etc) |
  | `stttype --help` | Show help |
@@ -109,14 +120,31 @@ stttype --start --lang zh
 
  # Start with small model and auto-detect language
  stttype --start --model small --lang auto
+
+ # Open settings GUI
+ stttype --config
  ```
 
+ ### Settings GUI
+
+ Run `stttype --config` to open a settings window where you can configure:
+
+ | Setting | Options |
+ |---------|---------|
+ | **Whisper Model** | tiny, base, small, medium, large-v3 |
+ | **Language** | Auto-detect, English, Chinese, Spanish, French, German, Japanese, Korean, Russian, Italian, Portuguese, Arabic, Hindi |
+ | **Device** | Auto (GPU if available), GPU (CUDA), CPU |
+ | **Compute Type** | Auto, Float16 (GPU), Int8 (CPU) |
+ | **Hotkey** | F1 - F12 |
+
+ Settings are saved to `~/.config/stttype/config.json` and persist across restarts.
+
  ## How It Works
 
- 1. **Hold F2** - A semi-transparent red dot with "F2" appears in the top-right corner, microphone starts recording
- 2. **Release F2** - Red dot disappears, recording stops
- 3. **GPU transcribes** - Whisper processes audio on your NVIDIA GPU
- 4. **Text is typed** - Result appears at your cursor position
+ 1. **Hold hotkey** (default F2) - A transparent red dot appears in the top-right corner, microphone starts recording
+ 2. **Release hotkey** - Red dot disappears, recording stops
+ 3. **GPU/CPU transcribes** - Whisper processes audio (GPU if available, else CPU)
+ 4. **Text is pasted** - Result is pasted at your cursor position via Ctrl+V
 
  ## Models
 
@@ -150,12 +178,13 @@ chmod +x uninstall.sh
  | Issue | Solution |
  |-------|----------|
  | `stttype` not found | Restart terminal after installation |
- | "CUDA not available" | Install NVIDIA drivers and CUDA toolkit |
+ | "CUDA not available" | Install NVIDIA drivers and CUDA toolkit, or STT Type will auto-fallback to CPU |
  | No sound on start/stop | Check system volume |
  | Text not typing | Make sure the target window is focused |
  | Model download fails | Check internet connection |
  | Hotkeys don't work (macOS) | Grant Accessibility permissions to your terminal |
  | Hotkeys don't work (Linux) | Make sure you're running under X11 (not Wayland) |
+ | Garbled text when typing | Fixed in v2.0.2+ — uses clipboard paste instead of key simulation |
 
  ## Publish to PyPI
 
@@ -178,5 +207,5 @@ When prompted:
  ---
 
  **Author**: LucasApps
- **Version**: 1.0.0
+ **Version**: 2.0.3
  **License**: MIT
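
The Settings GUI section added to the README says settings are saved to `~/.config/stttype/config.json` and persist across restarts. The package's own load and save routines are not part of this diff, so the sketch below only illustrates one way such persistence can work. The key names match the `new_config` dictionary in the code hunk of this diff. The default values, the `load_config` name and the extra `path` parameter are assumptions made for illustration.

```python
import json
import tempfile
from pathlib import Path

# Hypothetical defaults; the real package may ship different values.
DEFAULTS = {
    "model": "base",
    "language": "auto",
    "device": "auto",
    "compute_type": "auto",
    "hotkey": "f2",
}


def load_config(path: Path) -> dict:
    """Return the defaults overlaid with whatever the JSON file contains."""
    config = dict(DEFAULTS)
    if path.exists():
        config.update(json.loads(path.read_text(encoding="utf-8")))
    return config


def save_config(config: dict, path: Path) -> None:
    """Write the settings as JSON, creating the parent directory if needed."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(config, indent=2), encoding="utf-8")


# Round trip in a temporary directory rather than the real ~/.config/stttype.
with tempfile.TemporaryDirectory() as tmp:
    cfg_path = Path(tmp) / "stttype" / "config.json"
    save_config({"model": "small", "hotkey": "f5"}, cfg_path)
    loaded = load_config(cfg_path)

print(loaded["model"], loaded["hotkey"], loaded["device"])  # small f5 auto
```

Overlaying the file on top of defaults means a config written by an older version, which may lack newer keys such as `hotkey`, still loads with usable values.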
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 
  [project]
  name = "stttype"
- version = "2.0.2"
+ version = "2.0.4"
  description = "Cross-platform voice-to-text typing assistant with GPU acceleration and automatic CPU fallback"
  readme = "README.md"
  requires-python = ">=3.9"
@@ -186,11 +186,22 @@ def run_config_ui():
      btn_frame.pack(pady=20)
 
      def on_save():
+         # Ensure we extract raw values (not display labels)
+         dev_val = device_var.get()
+         if " - " in dev_val:
+             dev_val = dev_val.split(" - ")[0]
+         comp_val = compute_var.get()
+         if " - " in comp_val:
+             comp_val = comp_val.split(" - ")[0]
+         lang_val = lang_var.get()
+         if " - " in lang_val:
+             lang_val = lang_val.split(" - ")[0]
+
          new_config = {
              "model": model_var.get(),
-             "language": lang_var.get(),
-             "device": device_var.get(),
-             "compute_type": compute_var.get(),
+             "language": lang_val,
+             "device": dev_val,
+             "compute_type": comp_val,
              "hotkey": hotkey_var.get().lower(),
          }
          save_config(new_config)
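
The three branches added to `on_save` apply one rule: when a selected value has the form `value - description`, keep only the part before the first ` - `. As a minimal sketch of that rule in isolation, the helper below factors it into one function. The name `raw_value` and the sample labels are hypothetical and do not appear in the package.

```python
def raw_value(selection: str, sep: str = " - ") -> str:
    """Return the part of a dropdown entry before the first separator.

    Hypothetical helper mirroring the split(" - ")[0] logic in the hunk
    above; entries without the separator are returned unchanged.
    """
    return selection.split(sep)[0] if sep in selection else selection


# Sample labels are made up for illustration.
print(raw_value("cuda - NVIDIA GPU"))  # cuda
print(raw_value("auto"))               # auto
```

With such a helper, the dictionary entries could read `"device": raw_value(device_var.get())`, and the same rule would apply to language and compute type without repeating the branch.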
@@ -1,6 +1,6 @@
  Metadata-Version: 2.4
  Name: stttype
- Version: 2.0.2
+ Version: 2.0.4
  Summary: Cross-platform voice-to-text typing assistant with GPU acceleration and automatic CPU fallback
  Author: LucasApps
  License: MIT
@@ -37,44 +37,54 @@ Provides-Extra: gpu
  Requires-Dist: torch>=2.0.0; extra == "gpu"
  Requires-Dist: torchaudio>=2.0.0; extra == "gpu"
 
- # STT Type v1.0.0
+ # STT Type v2.0.3
 
- **Python based STT module running on GPU.**
+ **Cross-platform voice-to-text typing assistant with GPU acceleration and automatic CPU fallback.**
 
- Hold **F2** to record your voice, release to transcribe and type the text automatically at your cursor position.
+ Hold **F2** (configurable) to record your voice, release to transcribe and type the text automatically at your cursor position.
 
  ## Features
 
  - **Cross-platform** - Works on Windows, Linux, and macOS
- - **Hold F2 to record** - Audio captures while key is held
- - **Visual indicator** - Semi-transparent red dot with "F2" label appears in top-right corner while recording
+ - **Hold hotkey to record** - Audio captures while key is held (default: F2)
+ - **Visual indicator** - Transparent red dot with hotkey label appears in top-right corner while recording
  - **Bell sounds** - Audio feedback when recording starts/stops
  - **GPU-accelerated STT** - Uses faster-whisper on your NVIDIA GPU
- - **Auto-typing** - Transcribed text is typed at cursor position
+ - **Auto CPU fallback** - Automatically falls back to CPU if GPU is not available
+ - **Clipboard typing** - Types text via Ctrl+V paste for reliability (no key interference)
  - **System tray mode** - Runs silently in background
- - **Auto-startup** - Starts automatically on login
+ - **Auto-startup** - Starts automatically on login (Windows)
+ - **Settings GUI** - `stttype --config` to change model, language, device, compute type, and hotkey
 
  ## Requirements
 
  - Python 3.9+
- - NVIDIA GPU with CUDA support (for GPU mode)
  - Microphone
+ - NVIDIA GPU with CUDA support (optional, for GPU mode)
 
  ## Installation
 
- ### Prerequisites
+ ### From PyPI (Recommended)
 
- Install PyTorch with CUDA support:
+ ```bash
+ pip install stttype
+ ```
+
+ ### Prerequisites (Optional GPU Support)
+
+ Install PyTorch with CUDA support for GPU acceleration:
 
  ```bash
- # Windows/Linux
+ # Windows/Linux with CUDA 11.8
  pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu118
 
  # macOS (CPU only, no CUDA)
  pip install torch torchaudio
  ```
 
- ### Windows
+ Without PyTorch, STT Type will still work — it automatically falls back to CPU.
+
+ ### Windows (Local Install)
 
  ```powershell
  cd "E:\Lucas\STT Type"
@@ -128,8 +138,9 @@ Once installed, `stttype` works from any terminal.
  | `stttype --shutdown` | Stop all STT Type processes |
  | `stttype --status` | Check if STT Type is running |
  | `stttype --restart` | Restart STT Type |
- | `stttype --addtostartup` | Add to startup |
- | `stttype --rmtostartup` | Remove from startup |
+ | `stttype --config` | Open settings GUI |
+ | `stttype --addtostartup` | Add to startup (Windows only) |
+ | `stttype --rmtostartup` | Remove from startup (Windows only) |
  | `stttype --model <size>` | Set Whisper model (tiny/base/small/medium/large-v3) |
  | `stttype --lang <code>` | Set language (en/zh/auto/etc) |
  | `stttype --help` | Show help |
@@ -148,14 +159,31 @@ stttype --start --lang zh
 
  # Start with small model and auto-detect language
  stttype --start --model small --lang auto
+
+ # Open settings GUI
+ stttype --config
  ```
 
+ ### Settings GUI
+
+ Run `stttype --config` to open a settings window where you can configure:
+
+ | Setting | Options |
+ |---------|---------|
+ | **Whisper Model** | tiny, base, small, medium, large-v3 |
+ | **Language** | Auto-detect, English, Chinese, Spanish, French, German, Japanese, Korean, Russian, Italian, Portuguese, Arabic, Hindi |
+ | **Device** | Auto (GPU if available), GPU (CUDA), CPU |
+ | **Compute Type** | Auto, Float16 (GPU), Int8 (CPU) |
+ | **Hotkey** | F1 - F12 |
+
+ Settings are saved to `~/.config/stttype/config.json` and persist across restarts.
+
  ## How It Works
 
- 1. **Hold F2** - A semi-transparent red dot with "F2" appears in the top-right corner, microphone starts recording
- 2. **Release F2** - Red dot disappears, recording stops
- 3. **GPU transcribes** - Whisper processes audio on your NVIDIA GPU
- 4. **Text is typed** - Result appears at your cursor position
+ 1. **Hold hotkey** (default F2) - A transparent red dot appears in the top-right corner, microphone starts recording
+ 2. **Release hotkey** - Red dot disappears, recording stops
+ 3. **GPU/CPU transcribes** - Whisper processes audio (GPU if available, else CPU)
+ 4. **Text is pasted** - Result is pasted at your cursor position via Ctrl+V
 
  ## Models
 
@@ -189,12 +217,13 @@ chmod +x uninstall.sh
  | Issue | Solution |
  |-------|----------|
  | `stttype` not found | Restart terminal after installation |
- | "CUDA not available" | Install NVIDIA drivers and CUDA toolkit |
+ | "CUDA not available" | Install NVIDIA drivers and CUDA toolkit, or STT Type will auto-fallback to CPU |
  | No sound on start/stop | Check system volume |
  | Text not typing | Make sure the target window is focused |
  | Model download fails | Check internet connection |
  | Hotkeys don't work (macOS) | Grant Accessibility permissions to your terminal |
  | Hotkeys don't work (Linux) | Make sure you're running under X11 (not Wayland) |
+ | Garbled text when typing | Fixed in v2.0.2+ — uses clipboard paste instead of key simulation |
 
  ## Publish to PyPI
 
@@ -217,5 +246,5 @@ When prompted:
  ---
 
  **Author**: LucasApps
- **Version**: 1.0.0
+ **Version**: 2.0.3
  **License**: MIT
File without changes (8 files)