BatchalignHK 0.7.19.post17__tar.gz → 0.7.19.post19__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (126) hide show
  1. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/BatchalignHK.egg-info/PKG-INFO +2 -2
  2. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/PKG-INFO +2 -2
  3. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/README.md +1 -1
  4. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/asr/utils.py +6 -2
  5. batchalignhk-0.7.19.post19/batchalign/version +3 -0
  6. batchalignhk-0.7.19.post17/batchalign/version +0 -3
  7. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/BatchalignHK.egg-info/SOURCES.txt +0 -0
  8. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/BatchalignHK.egg-info/dependency_links.txt +0 -0
  9. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/BatchalignHK.egg-info/entry_points.txt +0 -0
  10. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/BatchalignHK.egg-info/requires.txt +0 -0
  11. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/BatchalignHK.egg-info/top_level.txt +0 -0
  12. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/LICENSE +0 -0
  13. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/MANIFEST.in +0 -0
  14. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/__init__.py +0 -0
  15. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/__main__.py +0 -0
  16. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/cli/__init__.py +0 -0
  17. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/cli/cli.py +0 -0
  18. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/cli/dispatch.py +0 -0
  19. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/constants.py +0 -0
  20. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/document.py +0 -0
  21. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/errors.py +0 -0
  22. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/__init__.py +0 -0
  23. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/base.py +0 -0
  24. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/chat/__init__.py +0 -0
  25. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/chat/file.py +0 -0
  26. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/chat/generator.py +0 -0
  27. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/chat/lexer.py +0 -0
  28. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/chat/parser.py +0 -0
  29. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/chat/utils.py +0 -0
  30. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/textgrid/__init__.py +0 -0
  31. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/textgrid/file.py +0 -0
  32. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/textgrid/generator.py +0 -0
  33. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/formats/textgrid/parser.py +0 -0
  34. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/__init__.py +0 -0
  35. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/resolve.py +0 -0
  36. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/speaker/__init__.py +0 -0
  37. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/speaker/config.yaml +0 -0
  38. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/speaker/infer.py +0 -0
  39. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/speaker/utils.py +0 -0
  40. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/training/__init__.py +0 -0
  41. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/training/run.py +0 -0
  42. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/training/utils.py +0 -0
  43. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/utils.py +0 -0
  44. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/utterance/__init__.py +0 -0
  45. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/utterance/cantonese_infer.py +0 -0
  46. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/utterance/dataset.py +0 -0
  47. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/utterance/execute.py +0 -0
  48. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/utterance/infer.py +0 -0
  49. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/utterance/prep.py +0 -0
  50. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/utterance/train.py +0 -0
  51. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/wave2vec/__init__.py +0 -0
  52. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/wave2vec/infer_fa.py +0 -0
  53. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/whisper/__init__.py +0 -0
  54. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/whisper/infer_asr.py +0 -0
  55. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/models/whisper/infer_fa.py +0 -0
  56. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/__init__.py +0 -0
  57. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/analysis/__init__.py +0 -0
  58. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/analysis/eval.py +0 -0
  59. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/asr/__init__.py +0 -0
  60. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/asr/num2chinese.py +0 -0
  61. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/asr/oai_whisper.py +0 -0
  62. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/asr/rev.py +0 -0
  63. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/asr/tencent.py +0 -0
  64. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/asr/whisper.py +0 -0
  65. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/asr/whisperx.py +0 -0
  66. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/base.py +0 -0
  67. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/cleanup/__init__.py +0 -0
  68. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/cleanup/cleanup.py +0 -0
  69. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/cleanup/disfluencies.py +0 -0
  70. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/cleanup/parse_support.py +0 -0
  71. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/cleanup/retrace.py +0 -0
  72. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/cleanup/support/filled_pauses.eng +0 -0
  73. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/cleanup/support/replacements.eng +0 -0
  74. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/cleanup/support/test.test +0 -0
  75. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/dispatch.py +0 -0
  76. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/fa/__init__.py +0 -0
  77. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/fa/wave2vec_fa.py +0 -0
  78. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/fa/whisper_fa.py +0 -0
  79. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/morphosyntax/__init__.py +0 -0
  80. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/morphosyntax/coref.py +0 -0
  81. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/morphosyntax/en/irr.py +0 -0
  82. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/morphosyntax/fr/apm.py +0 -0
  83. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/morphosyntax/fr/apmn.py +0 -0
  84. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/morphosyntax/fr/case.py +0 -0
  85. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/morphosyntax/ja/verbforms.py +0 -0
  86. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/morphosyntax/ud.py +0 -0
  87. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/pipeline.py +0 -0
  88. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/speaker/__init__.py +0 -0
  89. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/speaker/nemo_speaker.py +0 -0
  90. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/translate/__init__.py +0 -0
  91. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/translate/gtrans.py +0 -0
  92. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/translate/seamless.py +0 -0
  93. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/translate/utils.py +0 -0
  94. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/utr/__init__.py +0 -0
  95. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/utr/rev_utr.py +0 -0
  96. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/utr/tencent_utr.py +0 -0
  97. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/utr/utils.py +0 -0
  98. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/utr/whisper_utr.py +0 -0
  99. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/utterance/__init__.py +0 -0
  100. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/pipelines/utterance/ud_utterance.py +0 -0
  101. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/__init__.py +0 -0
  102. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/conftest.py +0 -0
  103. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/formats/chat/test_chat_file.py +0 -0
  104. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/formats/chat/test_chat_generator.py +0 -0
  105. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/formats/chat/test_chat_lexer.py +0 -0
  106. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/formats/chat/test_chat_parser.py +0 -0
  107. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/formats/chat/test_chat_utils.py +0 -0
  108. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/formats/textgrid/test_textgrid.py +0 -0
  109. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/analysis/test_eval.py +0 -0
  110. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/asr/test_asr_pipeline.py +0 -0
  111. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/asr/test_asr_utils.py +0 -0
  112. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/cleanup/test_disfluency.py +0 -0
  113. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/cleanup/test_parse_support.py +0 -0
  114. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/fa/test_fa_pipeline.py +0 -0
  115. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/fixures.py +0 -0
  116. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/test_pipeline.py +0 -0
  117. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/pipelines/test_pipeline_models.py +0 -0
  118. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/tests/test_document.py +0 -0
  119. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/utils/__init__.py +0 -0
  120. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/utils/abbrev.py +0 -0
  121. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/utils/config.py +0 -0
  122. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/utils/dp.py +0 -0
  123. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/utils/names.py +0 -0
  124. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/batchalign/utils/utils.py +0 -0
  125. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/setup.cfg +0 -0
  126. {batchalignhk-0.7.19.post17 → batchalignhk-0.7.19.post19}/setup.py +0 -0
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.2
2
2
  Name: BatchalignHK
3
- Version: 0.7.19.post17
3
+ Version: 0.7.19.post19
4
4
  Summary: Python Speech Language Sample Analysis
5
5
  Author: Brian MacWhinney, Houjun Liu
6
6
  Author-email: macw@cmu.edu, houjun@cmu.edu
@@ -64,7 +64,7 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
64
64
 
65
65
  ## Quick Start
66
66
 
67
- The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
67
+ The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/0info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/0info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
68
68
 
69
69
  ### Install and Update the Package
70
70
  Batchalign is on PyPi (as `batchalign`). We recommend the use of UV to install Batchalign:
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.2
2
2
  Name: BatchalignHK
3
- Version: 0.7.19.post17
3
+ Version: 0.7.19.post19
4
4
  Summary: Python Speech Language Sample Analysis
5
5
  Author: Brian MacWhinney, Houjun Liu
6
6
  Author-email: macw@cmu.edu, houjun@cmu.edu
@@ -64,7 +64,7 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
64
64
 
65
65
  ## Quick Start
66
66
 
67
- The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
67
+ The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/0info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/0info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
68
68
 
69
69
  ### Install and Update the Package
70
70
  Batchalign is on PyPi (as `batchalign`). We recommend the use of UV to install Batchalign:
@@ -8,7 +8,7 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
8
8
 
9
9
  ## Quick Start
10
10
 
11
- The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
11
+ The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/0info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/0info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
12
12
 
13
13
  ### Install and Update the Package
14
14
  Batchalign is on PyPi (as `batchalign`). We recommend the use of UV to install Batchalign:
@@ -237,11 +237,15 @@ def process_generation(output, lang="eng", utterance_engine=None):
237
237
  if word.strip() == "":
238
238
  continue
239
239
  if word not in ENDING_PUNCT+MOR_PUNCT:
240
+ word_replaced = word
241
+ if word_replaced.strip() == "i":
242
+ word_replaced = "I"
243
+
240
244
  if start == None or end == None:
241
- words.append(Form(text=word, time=None))
245
+ words.append(Form(text=word_replaced, time=None))
242
246
  else:
243
247
  seen_word = True
244
- words.append(Form(text=word, time=(int(start), int(end))))
248
+ words.append(Form(text=word_replaced, time=(int(start), int(end))))
245
249
  else:
246
250
  words.append(Form(text=word, time=None))
247
251
 
@@ -0,0 +1,3 @@
1
+ 0.7.19-post.19
2
+ July 1st, 2025
3
+ Whoops, fixed ASR regression.
@@ -1,3 +0,0 @@
1
- 0.7.19-post.17
2
- June 20th, 2025
3
- patch small bug