batchalign 0.7.5a1__tar.gz → 0.7.5a2__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (108)
  1. {batchalign-0.7.5a1/batchalign.egg-info → batchalign-0.7.5a2}/PKG-INFO +5 -5
  2. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/README.md +4 -4
  3. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/chat/utils.py +1 -1
  4. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/morphosyntax/ud.py +4 -2
  5. batchalign-0.7.5a2/batchalign/version +3 -0
  6. {batchalign-0.7.5a1 → batchalign-0.7.5a2/batchalign.egg-info}/PKG-INFO +5 -5
  7. batchalign-0.7.5a1/batchalign/version +0 -3
  8. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/LICENSE +0 -0
  9. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/MANIFEST.in +0 -0
  10. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/__init__.py +0 -0
  11. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/__main__.py +0 -0
  12. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/cli/__init__.py +0 -0
  13. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/cli/cli.py +0 -0
  14. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/cli/dispatch.py +0 -0
  15. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/constants.py +0 -0
  16. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/document.py +0 -0
  17. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/errors.py +0 -0
  18. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/__init__.py +0 -0
  19. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/base.py +0 -0
  20. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/chat/__init__.py +0 -0
  21. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/chat/file.py +0 -0
  22. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/chat/generator.py +0 -0
  23. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/chat/lexer.py +0 -0
  24. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/chat/parser.py +0 -0
  25. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/textgrid/__init__.py +0 -0
  26. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/textgrid/file.py +0 -0
  27. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/textgrid/generator.py +0 -0
  28. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/formats/textgrid/parser.py +0 -0
  29. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/__init__.py +0 -0
  30. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/resolve.py +0 -0
  31. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/speaker/__init__.py +0 -0
  32. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/speaker/config.yaml +0 -0
  33. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/speaker/infer.py +0 -0
  34. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/speaker/utils.py +0 -0
  35. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/training/__init__.py +0 -0
  36. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/training/run.py +0 -0
  37. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/training/utils.py +0 -0
  38. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/utils.py +0 -0
  39. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/utterance/__init__.py +0 -0
  40. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/utterance/dataset.py +0 -0
  41. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/utterance/execute.py +0 -0
  42. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/utterance/infer.py +0 -0
  43. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/utterance/prep.py +0 -0
  44. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/utterance/train.py +0 -0
  45. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/whisper/__init__.py +0 -0
  46. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/whisper/infer_asr.py +0 -0
  47. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/models/whisper/infer_fa.py +0 -0
  48. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/__init__.py +0 -0
  49. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/analysis/__init__.py +0 -0
  50. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/analysis/eval.py +0 -0
  51. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/asr/__init__.py +0 -0
  52. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/asr/rev.py +0 -0
  53. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/asr/utils.py +0 -0
  54. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/asr/whisper.py +0 -0
  55. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/asr/whisperx.py +0 -0
  56. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/base.py +0 -0
  57. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/cleanup/__init__.py +0 -0
  58. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/cleanup/cleanup.py +0 -0
  59. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/cleanup/disfluencies.py +0 -0
  60. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/cleanup/parse_support.py +0 -0
  61. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/cleanup/retrace.py +0 -0
  62. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/cleanup/support/filled_pauses.eng +0 -0
  63. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/cleanup/support/replacements.eng +0 -0
  64. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/cleanup/support/test.test +0 -0
  65. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/dispatch.py +0 -0
  66. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/fa/__init__.py +0 -0
  67. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/fa/whisper_fa.py +0 -0
  68. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/morphosyntax/__init__.py +0 -0
  69. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/morphosyntax/fr/case.py +0 -0
  70. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/morphosyntax/ja/verbforms.py +0 -0
  71. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/pipeline.py +0 -0
  72. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/speaker/__init__.py +0 -0
  73. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/speaker/nemo_speaker.py +0 -0
  74. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/utr/__init__.py +0 -0
  75. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/utr/rev_utr.py +0 -0
  76. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/utr/utils.py +0 -0
  77. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/utr/whisper_utr.py +0 -0
  78. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/utterance/__init__.py +0 -0
  79. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/pipelines/utterance/ud_utterance.py +0 -0
  80. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/__init__.py +0 -0
  81. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/conftest.py +0 -0
  82. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/formats/chat/test_chat_file.py +0 -0
  83. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/formats/chat/test_chat_generator.py +0 -0
  84. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/formats/chat/test_chat_lexer.py +0 -0
  85. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/formats/chat/test_chat_parser.py +0 -0
  86. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/formats/chat/test_chat_utils.py +0 -0
  87. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/formats/textgrid/test_textgrid.py +0 -0
  88. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/analysis/test_eval.py +0 -0
  89. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/asr/test_asr_pipeline.py +0 -0
  90. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/asr/test_asr_utils.py +0 -0
  91. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/cleanup/test_disfluency.py +0 -0
  92. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/cleanup/test_parse_support.py +0 -0
  93. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/fa/test_fa_pipeline.py +0 -0
  94. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/fixures.py +0 -0
  95. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/test_pipeline.py +0 -0
  96. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/pipelines/test_pipeline_models.py +0 -0
  97. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/tests/test_document.py +0 -0
  98. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/utils/__init__.py +0 -0
  99. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/utils/config.py +0 -0
  100. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/utils/dp.py +0 -0
  101. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign/utils/utils.py +0 -0
  102. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign.egg-info/SOURCES.txt +0 -0
  103. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign.egg-info/dependency_links.txt +0 -0
  104. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign.egg-info/entry_points.txt +0 -0
  105. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign.egg-info/requires.txt +0 -0
  106. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/batchalign.egg-info/top_level.txt +0 -0
  107. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/setup.cfg +0 -0
  108. {batchalign-0.7.5a1 → batchalign-0.7.5a2}/setup.py +0 -0
@@ -1,6 +1,6 @@
  Metadata-Version: 2.1
  Name: batchalign
- Version: 0.7.5a1
+ Version: 0.7.5a2
  Summary: Python Speech Language Sample Analysis
  Author: Brian MacWhinney, Houjun Liu
  Author-email: macw@cmu.edu, houjun@cmu.edu
@@ -85,15 +85,15 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
  The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
 
  ### Get Python
- - We support Python versions 3.9, 3.10, and 3.11.
- - **We do not support Python 3.12** (no PyTorch support)
+ - We support Python versions 3.9, 3.10, 3.11 and 3.12.
+ - First, check to see if you have Python by running `python`. If it reports any of the versions above, skip the following step.
  - To install Python, follow the instructions...
  - for macOS
  1. Install Brew: [visit this link](https://brew.sh/)
  2. Install Python: execute `brew install python@3.11`
  - for Windows
  1. Install Python 3.11: [via this link](https://www.python.org/ftp/python/3.11.7/python-3.11.7-amd64.exe)
- 2. If later commands report `pip module not found`, [this page may help](https://stackoverflow.com/a/15626784)
+ 2. If later commands report `pip module not found`, [this page may help](https://github.com/TalkBank/batchalign2/wiki/Troubleshooting-Tips#get-pip-on-windows)
  - your distribution's instructions for Linux
 
  ### Install and Update the Package
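The hunk above extends the supported interpreter list to Python 3.12 and asks users to check their installed version first. As a rough illustration only (this script is not part of the package, and the names `SUPPORTED_MINORS` and `is_supported` are hypothetical), the same check could be sketched as:

```python
import sys

# Minor versions listed as supported in the 0.7.5a2 README text (3.9 through 3.12).
SUPPORTED_MINORS = (9, 10, 11, 12)


def is_supported(version_info=sys.version_info):
    """Return True if a (major, minor, ...) tuple names a supported Python 3 release."""
    major, minor = version_info[0], version_info[1]
    return major == 3 and minor in SUPPORTED_MINORS


if __name__ == "__main__":
    # Report the running interpreter's version and whether it is in the list.
    status = "supported" if is_supported() else "not supported"
    print(sys.version.split()[0], status)
```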
@@ -156,7 +156,7 @@ batchalign morphotag ~/ba_input ~/ba_output
  #### forced alignment
 
  ```
- batchalign align --lang=eng ~/ba_input ~/ba_output
+ batchalign align ~/ba_input ~/ba_output
  ```
 
 
@@ -11,15 +11,15 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
  The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
 
  ### Get Python
- - We support Python versions 3.9, 3.10, and 3.11.
- - **We do not support Python 3.12** (no PyTorch support)
+ - We support Python versions 3.9, 3.10, 3.11 and 3.12.
+ - First, check to see if you have Python by running `python`. If it reports any of the versions above, skip the following step.
  - To install Python, follow the instructions...
  - for macOS
  1. Install Brew: [visit this link](https://brew.sh/)
  2. Install Python: execute `brew install python@3.11`
  - for Windows
  1. Install Python 3.11: [via this link](https://www.python.org/ftp/python/3.11.7/python-3.11.7-amd64.exe)
- 2. If later commands report `pip module not found`, [this page may help](https://stackoverflow.com/a/15626784)
+ 2. If later commands report `pip module not found`, [this page may help](https://github.com/TalkBank/batchalign2/wiki/Troubleshooting-Tips#get-pip-on-windows)
  - your distribution's instructions for Linux
 
  ### Install and Update the Package
@@ -82,7 +82,7 @@ batchalign morphotag ~/ba_input ~/ba_output
  #### forced alignment
 
  ```
- batchalign align --lang=eng ~/ba_input ~/ba_output
+ batchalign align ~/ba_input ~/ba_output
  ```
 
 
@@ -146,7 +146,7 @@ def annotation_clean(content, special=False):
  cleaned_word = cleaned_word.replace("~","").replace("&~","")
  cleaned_word = cleaned_word.replace(">","").replace("<","")
  cleaned_word = cleaned_word.replace("〕","").replace("//","").replace(";","")
- cleaned_word = re.sub(r"@.", '', cleaned_word)
+ cleaned_word = re.sub(r"@[^abcefpoqs]", '', cleaned_word)
  cleaned_word = re.sub(r"&.", '', cleaned_word)
 
  return cleaned_word
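The one-line change in this hunk narrows what `annotation_clean` strips after an `@`. A minimal sketch of the two patterns in isolation follows; the wrapper names `strip_at_old` and `strip_at_new` and the sample words are made up for illustration, and the rest of `annotation_clean` is not reproduced:

```python
import re


def strip_at_old(word):
    # 0.7.5a1 pattern: remove "@" together with whichever single character follows it.
    return re.sub(r"@.", "", word)


def strip_at_new(word):
    # 0.7.5a2 pattern: remove "@" plus the next character, unless that
    # character is one of a, b, c, e, f, p, o, q, s.
    return re.sub(r"@[^abcefpoqs]", "", word)


# Compare both patterns on a few illustrative inputs.
for w in ["woof@o", "xxx@l", "casa@s"]:
    print(w, "->", repr(strip_at_old(w)), "vs", repr(strip_at_new(w)))
```

With these inputs, the old pattern removes the marker in all three cases, while the new pattern leaves `woof@o` and `casa@s` untouched and still reduces `xxx@l` to `xxx`.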
@@ -212,7 +212,7 @@ def handler__NOUN(word, lang=None):
  def handler__PROPN(word, lang=None):
  # code as noun
  parsed = handler__NOUN(word)
- return parsed.replace("propn", "noun")
+ return parsed.replace("noun", "propn")
 
  def handler__VERB(word, lang=None):
  # get the features
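The `handler__PROPN` change swaps the arguments of `str.replace`. The sketch below shows why the direction matters, under a loud assumption: the output format of `handler__NOUN` is not visible in this diff, so `handler_noun_stub` is a hypothetical stand-in that simply returns a string containing `noun`.

```python
def handler_noun_stub(word):
    # Hypothetical stand-in: the real handler__NOUN output format is not shown in the diff.
    return f"noun|{word}"


def propn_old(word):
    # 0.7.5a1: looks for "propn" in a noun-coded string, so nothing is replaced.
    return handler_noun_stub(word).replace("propn", "noun")


def propn_new(word):
    # 0.7.5a2: relabels the noun-coded string as "propn".
    return handler_noun_stub(word).replace("noun", "propn")


print(propn_old("Paris"))  # noun|Paris
print(propn_new("Paris"))  # propn|Paris
```

Note that `str.replace` rewrites every occurrence of the substring, so a result that also contained `noun` elsewhere (for instance inside the word itself) would be altered there too; whether that can happen depends on the real handler's output.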
@@ -635,7 +635,9 @@ def tokenizer_processor(tokenized, lang, sent):
  before,after = conform(i).split("'")
  res.append((f'{before}\'', False))
  res.append((after, False))
- elif ("en" in lang) and matches_in(i, "'"):
+ elif (("en" in lang) and matches_in(i, "'") and
+ not (len(conform(i).split("'")) > 1 and
+ conform(i).split("'")[0].strip() == "o")):
  res.append((conform(i), True))
  elif ("nl" in lang) and conform(i).endswith("'s"):
  res.append((conform(i), False))
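The new `elif` condition in `tokenizer_processor` excludes English tokens whose text before the apostrophe is exactly `o` (for example `o'clock`). The following sketch isolates that predicate. It rests on assumptions: `conform` and `matches_in` are not shown in the diff, so `conform_stub` and `matches_in_stub` are hypothetical stand-ins (identity and substring test respectively), and `lang` is assumed to be a list of language codes.

```python
def conform_stub(token):
    # Hypothetical stand-in for conform(); assumed here to return the token text.
    return token


def matches_in_stub(token, chars):
    # Hypothetical stand-in for matches_in(); assumed to test substring membership.
    return chars in conform_stub(token)


def branch_old(token, lang):
    # 0.7.5a1 condition: any English token containing an apostrophe.
    return ("en" in lang) and matches_in_stub(token, "'")


def branch_new(token, lang):
    # 0.7.5a2 condition: as above, but skip tokens whose pre-apostrophe part is "o".
    parts = conform_stub(token).split("'")
    starts_with_o = len(parts) > 1 and parts[0].strip() == "o"
    return ("en" in lang) and matches_in_stub(token, "'") and not starts_with_o


for tok in ["don't", "o'clock"]:
    print(tok, branch_old(tok, ["en"]), branch_new(tok, ["en"]))
```

Under these assumptions `don't` takes the branch in both versions, while `o'clock` takes it only in 0.7.5a1. In 0.7.5a2 such a token falls through to the later `elif` branches, whose handling is outside this hunk.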
@@ -0,0 +1,3 @@
+ 0.7.5-alpha.2
+ September 3nd, 2024
+ Some tagging modification
@@ -1,6 +1,6 @@
  Metadata-Version: 2.1
  Name: batchalign
- Version: 0.7.5a1
+ Version: 0.7.5a2
  Summary: Python Speech Language Sample Analysis
  Author: Brian MacWhinney, Houjun Liu
  Author-email: macw@cmu.edu, houjun@cmu.edu
@@ -85,15 +85,15 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
  The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
 
  ### Get Python
- - We support Python versions 3.9, 3.10, and 3.11.
- - **We do not support Python 3.12** (no PyTorch support)
+ - We support Python versions 3.9, 3.10, 3.11 and 3.12.
+ - First, check to see if you have Python by running `python`. If it reports any of the versions above, skip the following step.
  - To install Python, follow the instructions...
  - for macOS
  1. Install Brew: [visit this link](https://brew.sh/)
  2. Install Python: execute `brew install python@3.11`
  - for Windows
  1. Install Python 3.11: [via this link](https://www.python.org/ftp/python/3.11.7/python-3.11.7-amd64.exe)
- 2. If later commands report `pip module not found`, [this page may help](https://stackoverflow.com/a/15626784)
+ 2. If later commands report `pip module not found`, [this page may help](https://github.com/TalkBank/batchalign2/wiki/Troubleshooting-Tips#get-pip-on-windows)
  - your distribution's instructions for Linux
 
  ### Install and Update the Package
@@ -156,7 +156,7 @@ batchalign morphotag ~/ba_input ~/ba_output
  #### forced alignment
 
  ```
- batchalign align --lang=eng ~/ba_input ~/ba_output
+ batchalign align ~/ba_input ~/ba_output
  ```
 
 
@@ -1,3 +0,0 @@
- 0.7.5-alpha.1
- September 3nd, 2024
- Removes unneeded options
File without changes