batchalign 0.7.19.post9__tar.gz → 0.7.19.post11__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (124) hide show
  1. {batchalign-0.7.19.post9/batchalign.egg-info → batchalign-0.7.19.post11}/PKG-INFO +4 -3
  2. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/README.md +1 -1
  3. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/analysis/eval.py +38 -0
  4. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/asr/utils.py +1 -1
  5. batchalign-0.7.19.post11/batchalign/version +3 -0
  6. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11/batchalign.egg-info}/PKG-INFO +4 -3
  7. batchalign-0.7.19.post9/batchalign/version +0 -3
  8. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/LICENSE +0 -0
  9. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/MANIFEST.in +0 -0
  10. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/__init__.py +0 -0
  11. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/__main__.py +0 -0
  12. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/cli/__init__.py +0 -0
  13. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/cli/cli.py +0 -0
  14. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/cli/dispatch.py +0 -0
  15. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/constants.py +0 -0
  16. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/document.py +0 -0
  17. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/errors.py +0 -0
  18. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/__init__.py +0 -0
  19. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/base.py +0 -0
  20. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/chat/__init__.py +0 -0
  21. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/chat/file.py +0 -0
  22. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/chat/generator.py +0 -0
  23. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/chat/lexer.py +0 -0
  24. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/chat/parser.py +0 -0
  25. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/chat/utils.py +0 -0
  26. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/textgrid/__init__.py +0 -0
  27. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/textgrid/file.py +0 -0
  28. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/textgrid/generator.py +0 -0
  29. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/formats/textgrid/parser.py +0 -0
  30. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/__init__.py +0 -0
  31. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/resolve.py +0 -0
  32. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/speaker/__init__.py +0 -0
  33. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/speaker/config.yaml +0 -0
  34. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/speaker/infer.py +0 -0
  35. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/speaker/utils.py +0 -0
  36. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/training/__init__.py +0 -0
  37. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/training/run.py +0 -0
  38. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/training/utils.py +0 -0
  39. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/utils.py +0 -0
  40. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/utterance/__init__.py +0 -0
  41. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/utterance/cantonese_infer.py +0 -0
  42. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/utterance/dataset.py +0 -0
  43. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/utterance/execute.py +0 -0
  44. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/utterance/infer.py +0 -0
  45. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/utterance/prep.py +0 -0
  46. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/utterance/train.py +0 -0
  47. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/wave2vec/__init__.py +0 -0
  48. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/wave2vec/infer_fa.py +0 -0
  49. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/whisper/__init__.py +0 -0
  50. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/whisper/infer_asr.py +0 -0
  51. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/models/whisper/infer_fa.py +0 -0
  52. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/__init__.py +0 -0
  53. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/analysis/__init__.py +0 -0
  54. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/asr/__init__.py +0 -0
  55. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/asr/num2chinese.py +0 -0
  56. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/asr/oai_whisper.py +0 -0
  57. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/asr/rev.py +0 -0
  58. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/asr/whisper.py +0 -0
  59. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/asr/whisperx.py +0 -0
  60. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/base.py +0 -0
  61. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/cleanup/__init__.py +0 -0
  62. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/cleanup/cleanup.py +0 -0
  63. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/cleanup/disfluencies.py +0 -0
  64. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/cleanup/parse_support.py +0 -0
  65. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/cleanup/retrace.py +0 -0
  66. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/cleanup/support/filled_pauses.eng +0 -0
  67. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/cleanup/support/replacements.eng +0 -0
  68. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/cleanup/support/test.test +0 -0
  69. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/dispatch.py +0 -0
  70. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/fa/__init__.py +0 -0
  71. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/fa/wave2vec_fa.py +0 -0
  72. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/fa/whisper_fa.py +0 -0
  73. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/morphosyntax/__init__.py +0 -0
  74. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/morphosyntax/coref.py +0 -0
  75. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/morphosyntax/en/irr.py +0 -0
  76. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/morphosyntax/fr/apm.py +0 -0
  77. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/morphosyntax/fr/apmn.py +0 -0
  78. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/morphosyntax/fr/case.py +0 -0
  79. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/morphosyntax/ja/verbforms.py +0 -0
  80. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/morphosyntax/ud.py +0 -0
  81. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/pipeline.py +0 -0
  82. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/speaker/__init__.py +0 -0
  83. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/speaker/nemo_speaker.py +0 -0
  84. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/translate/__init__.py +0 -0
  85. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/translate/gtrans.py +0 -0
  86. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/translate/seamless.py +0 -0
  87. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/translate/utils.py +0 -0
  88. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/utr/__init__.py +0 -0
  89. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/utr/rev_utr.py +0 -0
  90. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/utr/utils.py +0 -0
  91. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/utr/whisper_utr.py +0 -0
  92. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/utterance/__init__.py +0 -0
  93. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/pipelines/utterance/ud_utterance.py +0 -0
  94. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/__init__.py +0 -0
  95. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/conftest.py +0 -0
  96. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/formats/chat/test_chat_file.py +0 -0
  97. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/formats/chat/test_chat_generator.py +0 -0
  98. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/formats/chat/test_chat_lexer.py +0 -0
  99. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/formats/chat/test_chat_parser.py +0 -0
  100. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/formats/chat/test_chat_utils.py +0 -0
  101. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/formats/textgrid/test_textgrid.py +0 -0
  102. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/analysis/test_eval.py +0 -0
  103. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/asr/test_asr_pipeline.py +0 -0
  104. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/asr/test_asr_utils.py +0 -0
  105. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/cleanup/test_disfluency.py +0 -0
  106. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/cleanup/test_parse_support.py +0 -0
  107. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/fa/test_fa_pipeline.py +0 -0
  108. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/fixures.py +0 -0
  109. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/test_pipeline.py +0 -0
  110. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/pipelines/test_pipeline_models.py +0 -0
  111. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/tests/test_document.py +0 -0
  112. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/utils/__init__.py +0 -0
  113. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/utils/abbrev.py +0 -0
  114. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/utils/config.py +0 -0
  115. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/utils/dp.py +0 -0
  116. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/utils/names.py +0 -0
  117. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign/utils/utils.py +0 -0
  118. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign.egg-info/SOURCES.txt +0 -0
  119. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign.egg-info/dependency_links.txt +0 -0
  120. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign.egg-info/entry_points.txt +0 -0
  121. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign.egg-info/requires.txt +0 -0
  122. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/batchalign.egg-info/top_level.txt +0 -0
  123. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/setup.cfg +0 -0
  124. {batchalign-0.7.19.post9 → batchalign-0.7.19.post11}/setup.py +0 -0
@@ -1,6 +1,6 @@
1
- Metadata-Version: 2.2
1
+ Metadata-Version: 2.4
2
2
  Name: batchalign
3
- Version: 0.7.19.post9
3
+ Version: 0.7.19.post11
4
4
  Summary: Python Speech Language Sample Analysis
5
5
  Author: Brian MacWhinney, Houjun Liu
6
6
  Author-email: macw@cmu.edu, houjun@cmu.edu
@@ -47,6 +47,7 @@ Dynamic: author-email
47
47
  Dynamic: classifier
48
48
  Dynamic: description
49
49
  Dynamic: description-content-type
50
+ Dynamic: license-file
50
51
  Dynamic: provides-extra
51
52
  Dynamic: requires-dist
52
53
  Dynamic: summary
@@ -61,7 +62,7 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
61
62
 
62
63
  ## Quick Start
63
64
 
64
- The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
65
+ The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/0info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/0info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
65
66
 
66
67
  ### Install and Update the Package
67
68
  Batchalign is on PyPi (as `batchalign`). We recommend the use of UV to install Batchalign:
@@ -8,7 +8,7 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
8
8
 
9
9
  ## Quick Start
10
10
 
11
- The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
11
+ The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/0info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/0info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
12
12
 
13
13
  ### Install and Update the Package
14
14
  Batchalign is on PyPi (as `batchalign`). We recommend the use of UV to install Batchalign:
@@ -38,9 +38,47 @@ def conform(x):
38
38
  elif "wanna" == i.strip():
39
39
  result.append("want")
40
40
  result.append("to")
41
+ elif "ii" == i.strip():
42
+ result.append("i")
43
+ result.append("i")
44
+ elif "i'd" == i.strip():
45
+ result.append("i")
46
+ result.append("had")
47
+ elif "tshirts" == i.strip():
48
+ result.append("t")
49
+ result.append("shirts")
50
+ elif "tshirts" == i.strip():
51
+ result.append("t")
52
+ result.append("shirts")
53
+ elif "anytime" == i.strip():
54
+ result.append("any")
55
+ result.append("time")
56
+ elif "alright" == i.strip():
57
+ result.append("all")
58
+ result.append("right")
59
+ elif "sorta" == i.strip():
60
+ result.append("sort")
61
+ result.append("of")
62
+ elif "alrightie" == i.strip():
63
+ result.append("all")
64
+ result.append("right")
65
+ elif "mm" == i.strip():
66
+ result.append("hm")
67
+ elif "ai" == i.strip():
68
+ result.append("a")
69
+ result.append("i")
70
+ elif "this'll" == i.strip():
71
+ result.append("this")
72
+ result.append("will")
41
73
  elif "gotta" == i.strip():
42
74
  result.append("got")
43
75
  result.append("to")
76
+ elif "eh" == i.strip():
77
+ result.append("uh")
78
+ elif "kinda" == i.strip():
79
+ result.append("a")
80
+ result.append("kind")
81
+ result.append("of")
44
82
  elif "farmhouse" == i.strip():
45
83
  result.append("farm")
46
84
  result.append("house")
@@ -247,7 +247,7 @@ def process_generation(output, lang="eng", utterance_engine=None):
247
247
  seen_word = True
248
248
  words.append(Form(text=word_replaced, time=(int(start), int(end))))
249
249
  else:
250
- words.append(Form(text=word_replaced, time=None))
250
+ words.append(Form(text=word, time=None))
251
251
 
252
252
  final_utterances.append(Utterance(
253
253
  tier=participant,
@@ -0,0 +1,3 @@
1
+ 0.7.19-post.11
2
+ July 8st, 2025
3
+ benchmarking changes
@@ -1,6 +1,6 @@
1
- Metadata-Version: 2.2
1
+ Metadata-Version: 2.4
2
2
  Name: batchalign
3
- Version: 0.7.19.post9
3
+ Version: 0.7.19.post11
4
4
  Summary: Python Speech Language Sample Analysis
5
5
  Author: Brian MacWhinney, Houjun Liu
6
6
  Author-email: macw@cmu.edu, houjun@cmu.edu
@@ -47,6 +47,7 @@ Dynamic: author-email
47
47
  Dynamic: classifier
48
48
  Dynamic: description
49
49
  Dynamic: description-content-type
50
+ Dynamic: license-file
50
51
  Dynamic: provides-extra
51
52
  Dynamic: requires-dist
52
53
  Dynamic: summary
@@ -61,7 +62,7 @@ The TalkBank Project, of which Batchalign is a part, is supported by NIH grant H
61
62
 
62
63
  ## Quick Start
63
64
 
64
- The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
65
+ The following instructions provide a quick start to installing Batchalign. For most users aiming to process CHAT and audio with Batchalign, we recommend more detailed usage instructions: for [usage](https://talkbank.org/0info/BA2-usage.pdf) and [human transcript cleanup](https://talkbank.org/0info/BA2-cleanup.pdf). The following provides a quick start guide for the program.
65
66
 
66
67
  ### Install and Update the Package
67
68
  Batchalign is on PyPi (as `batchalign`). We recommend the use of UV to install Batchalign:
@@ -1,3 +0,0 @@
1
- 0.7.19-post.9
2
- June 24th, 2025
3
- "i" should be "I"