pec-dss 0.1.0__py3-none-any.whl → 0.1.1__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
pec_dss/__init__.py CHANGED
@@ -5,4 +5,4 @@ An advanced audio analysis system that identifies paralinguistic vocal events
5
5
  and attributes them to specific speakers through sophisticated speaker diarization.
6
6
  """
7
7
 
8
- __version__ = "0.1.0"
8
+ __version__ = "0.1.1"
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: pec-dss
3
- Version: 0.1.0
3
+ Version: 0.1.1
4
4
  Summary: Paralinguistic Event Classification from Diarized Speaker Segments
5
5
  Author-email: Hyunwoo Kim <hwk06023@gmail.com>
6
6
  License: GPL-3.0-only
@@ -118,6 +118,56 @@ for speaker, events in results.items():
118
118
  print(f"Speaker {speaker} has {len(events)} attributed events")
119
119
  ```
120
120
 
121
+ ### CLI Usage
122
+
123
+ ```bash
124
+ pec-dss --speakers-dir ./speakers --unidentified-dir ./events --output-dir ./results
125
+ ```
126
+
127
+ ## 📁 Directory Structure
128
+
129
+ PEC-DSS expects a specific directory structure for processing audio files:
130
+
131
+ ### Speaker Reference Structure
132
+
133
+ ```
134
+ speakers_directory/
135
+ ├── speaker_A/ # Each speaker's name becomes their ID
136
+ │ ├── audio1.wav # Reference voice samples for this speaker
137
+ │ ├── audio2.wav
138
+ │ └── ...
139
+ ├── speaker_B/
140
+ │ ├── audio1.wav
141
+ │ └── ...
142
+ └── speaker_C/
143
+ ├── audio1.wav
144
+ └── ...
145
+ ```
146
+
147
+ ### Unidentified Audio Structure
148
+
149
+ ```
150
+ unidentified_directory/
151
+ ├── laugh1.wav # Non-linguistic vocal events to be classified
152
+ ├── giggle1.wav
153
+ └── ...
154
+ ```
155
+
156
+ ### Output Structure (After Processing)
157
+
158
+ ```
159
+ output_directory/
160
+ ├── results.json # JSON file with all results
161
+ ├── speaker_A/ # Files assigned to each speaker
162
+ │ ├── 0_laugh1.wav
163
+ │ └── ...
164
+ ├── speaker_B/
165
+ │ ├── 0_giggle1.wav
166
+ │ └── ...
167
+ └── unknown/ # Files below similarity threshold (if any)
168
+ └── ...
169
+ ```
170
+
121
171
  ## 🧩 System Architecture
122
172
 
123
173
  PEC-DSS consists of the following components:
@@ -138,6 +188,8 @@ The system can identify various paralinguistic events including:
138
188
  * Coughing
139
189
  * Other non-verbal vocal expressions
140
190
 
191
+ **Note:** PEC-DSS does not automatically classify these event types. It only determines which speaker produced the audio event.
192
+
141
193
  ## 🚀 Future Developments
142
194
 
143
195
  * 🧠 Integration with more audio encoder models
@@ -0,0 +1,9 @@
1
+ pec_dss/__init__.py,sha256=QyuqbTvRCIgEd7NbGi7I521W5y0wYNvEVM11_Q6N79A,271
2
+ pec_dss/__main__.py,sha256=Ul7rrlyhsspPqAsbsJul1kzckUqBlx1irsoWwPJBoz0,144
3
+ pec_dss/cli.py,sha256=bDopqySnvDeQH7ZhdDlkDg5sxzmFV5gYpx-tWm6Dcyk,3952
4
+ pec_dss-0.1.1.dist-info/licenses/LICENSE,sha256=ixuiBLtpoK3iv89l7ylKkg9rs2GzF9ukPH7ynZYzK5s,35148
5
+ pec_dss-0.1.1.dist-info/METADATA,sha256=Vz3gYZRTS9IhLBS2CgI51abLJfVn2R9OPXMsC_TRHfI,6448
6
+ pec_dss-0.1.1.dist-info/WHEEL,sha256=pxyMxgL8-pra_rKaQ4drOZAegBVuX-G_4nRHjjgWbmo,91
7
+ pec_dss-0.1.1.dist-info/entry_points.txt,sha256=birA2tqMJICs1CXYBcEpLKiK3HgKvH0Z494fgHgwjXQ,45
8
+ pec_dss-0.1.1.dist-info/top_level.txt,sha256=vdoLeQ6vlHAHGdWl9DrCP09WTySHt1mfiqji8gRvSq4,8
9
+ pec_dss-0.1.1.dist-info/RECORD,,
@@ -1,9 +0,0 @@
1
- pec_dss/__init__.py,sha256=jibqnYamtHCJ8C9VcoxSKpgTMc114TuHFSCpctj5ctk,271
2
- pec_dss/__main__.py,sha256=Ul7rrlyhsspPqAsbsJul1kzckUqBlx1irsoWwPJBoz0,144
3
- pec_dss/cli.py,sha256=bDopqySnvDeQH7ZhdDlkDg5sxzmFV5gYpx-tWm6Dcyk,3952
4
- pec_dss-0.1.0.dist-info/licenses/LICENSE,sha256=ixuiBLtpoK3iv89l7ylKkg9rs2GzF9ukPH7ynZYzK5s,35148
5
- pec_dss-0.1.0.dist-info/METADATA,sha256=jGLZdXkQwLuusf0VmV0wpmnp0vBykC7hAsUpFePFk4M,5069
6
- pec_dss-0.1.0.dist-info/WHEEL,sha256=pxyMxgL8-pra_rKaQ4drOZAegBVuX-G_4nRHjjgWbmo,91
7
- pec_dss-0.1.0.dist-info/entry_points.txt,sha256=birA2tqMJICs1CXYBcEpLKiK3HgKvH0Z494fgHgwjXQ,45
8
- pec_dss-0.1.0.dist-info/top_level.txt,sha256=vdoLeQ6vlHAHGdWl9DrCP09WTySHt1mfiqji8gRvSq4,8
9
- pec_dss-0.1.0.dist-info/RECORD,,