sm-transcript 0.0.3 → 0.0.4

Sign up to get free protection for your applications and to get access to all the features.
data/README.txt CHANGED
@@ -1,4 +1,4 @@
1
- $Id: README.txt 187 2010-03-22 19:32:07Z pwilkins $
1
+ $Id: README.txt 194 2010-03-28 00:09:23Z pwilkins $
2
2
 
3
3
  sm-transcript reads results of SLS processing and produces transcripts for
4
4
  the SpokenMedia browser. For each file in the source folder whose extension
@@ -17,13 +17,13 @@ Installation:
17
17
  You can get sm-transcript as either a RubyGem or as source from svn.
18
18
 
19
19
  The preferred way to install this package is as a Rubygem. You can
20
- download the gem from this page:
20
+ download and install the gem with this command:
21
21
 
22
- http://TBD
22
+ sudo gem install [--verbose] sm-transcript
23
23
 
24
- and install it with this command:
25
-
26
- sudo gem install [--verbose] <path to gem file>sm-transcript-n.n.n.gem
24
+ This command downloads the most recent version of the gem from rubygems.org
25
+ and makes it active. Previous versions of the gem remain installed, but
26
+ are deactivated.
27
27
 
28
28
  You must use "sudo" to properly install the gem. If you execute "gem
29
29
  install" (omitting the "sudo") the gem is installed in your home gem
@@ -75,14 +75,14 @@ Using the App:
75
75
  --srcdir PATH Read files from this folder (Default: ./results)
76
76
  --destdir PATH Write files to this folder (Default: ./transcripts)
77
77
  --srctype wrd | seg Kind of file to process (Default: wrd)
78
- --desttype html | ttml Kind of file to output (Default: html)
78
+ --desttype html | ttml | datajs Kind of file to output (Default: html)
79
79
  -h, --help Show this message
80
80
 
81
81
 
82
82
  Troubleshooting:
83
83
  sm-transcript requires additional gems to operate. The RubyGem
84
- installation instructions should install dependencies automatically, but
85
- they may not. If you get an error that includes
84
+ installation should install dependencies automatically, but when it
85
+ doesn't, you get an error that includes
86
86
 
87
87
  ... no such file to load -- builder (LoadError)
88
88
 
@@ -112,11 +112,11 @@ Required Gems:
112
112
  the command is already part of the core.)
113
113
  htmlentities - html parsing
114
114
  json - create JSON structured data
115
- optparse - option parsing
115
+ optparse - option parsing of command line
116
116
  ostruct - open data structures
117
117
  ppcommand - pp is a pretty printer. It is used only for debugging
118
118
  rake - make for Ruby
119
- rubygems - support for gems
119
+ rubygems - support for gems (shouldn't be needed for Ruby 1.9)
120
120
  shoulda - enhancement for Test::Unit
121
121
 
122
122
  This command installs gems on OSX and Linux:
data/Rakefile CHANGED
@@ -1,4 +1,4 @@
1
- # $Id: Rakefile 190 2010-03-26 22:00:32Z pwilkins $
1
+ # $Id: Rakefile 195 2010-04-15 17:29:55Z pwilkins $
2
2
 
3
3
  require 'rake/gempackagetask'
4
4
  require 'rake'
@@ -8,7 +8,7 @@ spec = Gem::Specification.new do |s|
8
8
  s.summary = "Convert word lists to transcripts"
9
9
  s.description= File.read(File.join(File.dirname(__FILE__), 'README.txt'))
10
10
  s.requirements = [ 'TBD' ]
11
- s.version = "0.0.3"
11
+ s.version = "0.0.4"
12
12
  s.author = "Peter Wilkins"
13
13
  s.email = "pwilkins@mit.edu"
14
14
  s.homepage = "http://spokenmedia.mit.edu"
@@ -1,5 +1,4 @@
1
- # $Id$
2
-
1
+ #
3
2
  Copyright (c) 2010 Massachusetts Institute of Technology
4
3
 
5
4
  Permission is hereby granted, free of charge, to any person obtaining a copy
@@ -32,7 +32,7 @@ module SmTranscript
32
32
  # line is expected to contain two integers separated by a space,
33
33
  # followed by a space and one or more words. The words may contain
34
34
  # characters or an apostrophe
35
- ln.scan(/^\d* \d* [\w']* *[\w']*$/) do |t|
35
+ ln.scan(/^\d* \d* [\w']* *[\w']*/) do |t|
36
36
  arr = t.split
37
37
  @words << SmTranscript::Word.new(arr[0], arr[1], arr[2])
38
38
  end
@@ -0,0 +1,540 @@
1
+ 1630 1815 this
2
+ 1815 1970 is
3
+ 1970 3055 kathie
4
+ 3055 3605 lee
5
+ 5400 5520 it's
6
+ 5520 5780 just
7
+ 5780 6685 such
8
+ 7094 7369 today
9
+ 7369 7589 i'm
10
+ 7589 7839 going
11
+ 7839 8944 to
12
+ 8944 9599 be
13
+ 9599 9805 some
14
+ 9805 9879 of
15
+ 9879 10044 you
16
+ 10044 10484 already
17
+ 10484 10634 know
18
+ 10634 11669 which
19
+ 11669 12154 teaches
20
+ 15324 15714 child
21
+ 18364 18494 my
22
+ 18494 18934 research
23
+ 18934 19424 focuses
24
+ 19424 19924 on
25
+ 20375 20590 most
26
+ 20590 21000 nineteen
27
+ 21000 22520 forty
28
+ 22520 23300 cheer
29
+ 23300 23760 larry
30
+ 23760 24178 theory
31
+ 24967 28012 particularly
32
+ 28012 28727 systems
33
+ 28727 29192 feminist
34
+ 29192 29897 theory
35
+ 29897 30082 to
36
+ 30082 31092 analyze
37
+ 31361 31559 how
38
+ 31559 32264 contemporary
39
+ 32264 32699 parisian
40
+ 32699 33139 two
41
+ 33139 33539 issues
42
+ 33539 33839 right
43
+ 35895 36125 okay
44
+ 36125 36250 and
45
+ 36250 36485 it's
46
+ 36485 36810 fairly
47
+ 36810 37085 clear
48
+ 37085 37270 to
49
+ 37270 37780 me
50
+ 37780 38315 why
51
+ 38315 38785 research
52
+ 38785 39020 is
53
+ 40425 40910 engineering
54
+ 42784 42953 i
55
+ 42953 43567 think
56
+ 43567 43763 it's
57
+ 43763 43923 fair
58
+ 43923 44003 to
59
+ 44003 44198 this
60
+ 44198 44608 for
61
+ 44608 44668 a
62
+ 44668 44948 couple
63
+ 44948 45023 of
64
+ 45023 45413 reasons
65
+ 45413 45618 one
66
+ 45618 45788 it's
67
+ 45788 46013 my
68
+ 46013 47223 approach
69
+ 47223 47353 to
70
+ 47353 47553 just
71
+ 48642 49222 trying
72
+ 49222 49957 to
73
+ 49957 50477 theoretical
74
+ 50477 51232 perspectives
75
+ 51232 51762 engineering
76
+ 51762 52552 disciplines
77
+ 52817 53407 that
78
+ 53407 54137 really
79
+ 54137 54652 elaborate
80
+ 54652 54987 leases
81
+ 54987 55953 together
82
+ 55953 56157 so
83
+ 56157 56352 for
84
+ 56352 56517 my
85
+ 56517 57808 analysis
86
+ 58590 58777 also
87
+ 58777 58892 i
88
+ 58892 59457 think
89
+ 60838 61068 it's
90
+ 61068 62288 very
91
+ 62288 62738 ways
92
+ 63400 63631 just
93
+ 63631 63745 from
94
+ 63745 63985 where
95
+ 63985 64455 start
96
+ 64455 64630 my
97
+ 64630 65030 research
98
+ 65030 65615 questions
99
+ 65615 65790 to
100
+ 65790 65952 the
101
+ 65952 66370 absolute
102
+ 66370 66610 large
103
+ 66610 67395 exactly
104
+ 67612 67797 no
105
+ 67797 68117 kidding
106
+ 68117 68687 example
107
+ 68687 68842 i
108
+ 68842 69387 just
109
+ 69387 70272 recently
110
+ 70602 70972 conversely
111
+ 70972 71097 an
112
+ 71097 72127 article
113
+ 72377 72767 professor
114
+ 72767 73352 season
115
+ 73352 73677 volt
116
+ 73677 73785 is
117
+ 73785 74433 also
118
+ 74433 75242 interested
119
+ 75242 75407 in
120
+ 75407 75822 hearing
121
+ 75822 76662 master
122
+ 76967 77085 as
123
+ 77085 77615 opposed
124
+ 77615 77704 to
125
+ 77704 78415 circumvent
126
+ 78415 78895 weren't
127
+ 78895 79246 use
128
+ 79246 79470 a
129
+ 79470 80350 visiting
130
+ 80350 80920 professor
131
+ 80920 81021 of
132
+ 81021 81410 us
133
+ 81410 81490 who
134
+ 81490 82545 studies
135
+ 82545 82690 did
136
+ 82690 82800 in
137
+ 82800 83035 our
138
+ 83035 83240 group
139
+ 83240 83900 together
140
+ 85368 85709 vision
141
+ 85709 85829 of
142
+ 85829 86119 social
143
+ 86119 87014 justice
144
+ 87014 87554 integrate
145
+ 87554 88269 picture
146
+ 88639 88812 so
147
+ 88812 88937 we
148
+ 88937 89268 came
149
+ 89268 89707 together
150
+ 89707 89792 in
151
+ 89792 89827 a
152
+ 89827 90042 way
153
+ 90042 90337 the
154
+ 90337 90797 system
155
+ 90797 91035 that
156
+ 91035 91322 has
157
+ 91322 91487 that
158
+ 91487 91787 kind
159
+ 91787 91872 of
160
+ 91872 92367 justice
161
+ 92367 93007 background
162
+ 93007 93226 and
163
+ 93226 93567 issues
164
+ 93567 93652 the
165
+ 93652 94082 justice
166
+ 94082 94787 studies
167
+ 95052 95512 kathleen
168
+ 95512 96333 brothers
169
+ 96333 96739 rhetoric
170
+ 96739 97002 can
171
+ 97002 97242 help
172
+ 97242 97697 language
173
+ 97697 98252 works
174
+ 98252 98358 we
175
+ 98358 99267 conceptualize
176
+ 99267 99767 justice
177
+ 99767 100027 to
178
+ 100027 100752 language
179
+ 101044 101268 i
180
+ 101268 101463 wrote
181
+ 101463 101955 feminist
182
+ 101955 102770 theory
183
+ 102770 102838 and
184
+ 102838 103143 talked
185
+ 103143 103368 about
186
+ 103368 103508 how
187
+ 103508 103658 we
188
+ 103658 103764 can
189
+ 103764 104208 start
190
+ 104208 104447 thinking
191
+ 104447 104558 of
192
+ 104558 104973 social
193
+ 104973 105923 justice
194
+ 105923 106183 to
195
+ 106183 106563 balance
196
+ 106563 107178 principles
197
+ 107178 108038 suggests
198
+ 108474 109432 equality
199
+ 109432 110162 generous
200
+ 110396 110712 thinking
201
+ 110712 110972 about
202
+ 110972 111062 the
203
+ 111062 112046 justice
204
+ 112046 112222 is
205
+ 112222 112522 less
206
+ 112522 113542 individualized
207
+ 113542 114127 rationals
208
+ 114127 114902 thirty
209
+ 114902 114987 or
210
+ 114987 115437 about
211
+ 116647 117237 care
212
+ 117719 117834 we
213
+ 117834 118935 mentioned
214
+ 118935 118989 the
215
+ 118989 119294 ways
216
+ 119294 119464 in
217
+ 119464 119714 which
218
+ 119714 119889 we
219
+ 119889 120669 entrance
220
+ 121162 121510 social
221
+ 121510 122760 justice
222
+ 122760 122880 we
223
+ 122880 123072 did
224
+ 123072 123305 so
225
+ 123305 123475 by
226
+ 123475 123720 looking
227
+ 123720 124420 at
228
+ 124420 124777 work
229
+ 124777 124845 i
230
+ 124845 125200 actually
231
+ 125200 125590 person
232
+ 125590 126025 in
233
+ 126025 126895 charge
234
+ 127142 127252 and
235
+ 127252 127842 markings
236
+ 127842 128232 sort of
237
+ 128232 128522 talk
238
+ 128522 128737 about
239
+ 128737 128947 what
240
+ 128947 129066 the
241
+ 129066 129587 messages
242
+ 129587 129747 to
243
+ 129747 129932 me
244
+ 129932 130250 see
245
+ 130250 130327 in
246
+ 130327 130557 this
247
+ 130557 131107 text
248
+ 131479 131765 about
249
+ 131765 132085 ways
250
+ 132085 132215 in
251
+ 132215 132615 which
252
+ 132615 133285 changes
253
+ 134614 134719 yeah
254
+ 134719 135013 that's
255
+ 135013 135945 interesting
256
+ 135945 136124 i
257
+ 136124 136371 hope
258
+ 136371 137109 everybody
259
+ 137109 137289 think
260
+ 137289 137832 there's
261
+ 137832 138069 no
262
+ 138069 139294 chance
263
+ 140973 141817 around
264
+ 141817 141964 the
265
+ 141964 142232 sort
266
+ 142232 142432 of
267
+ 142432 142797 sorry
268
+ 144677 144957 the
269
+ 144957 146297 patience
270
+ 146297 146952 to
271
+ 147407 148607 mainstream
272
+ 148607 149837 education
273
+ 158425 158663 is
274
+ 158663 159050 english
275
+ 159050 159210 so
276
+ 159210 159465 i
277
+ 159465 160074 necessarily
278
+ 160074 160474 star
279
+ 160474 161525 cluster
280
+ 161525 162085 making
281
+ 162085 162520 any
282
+ 162520 162715 do
283
+ 163310 164055 switcher
284
+ 164055 164565 necessarily
285
+ 164565 164661 to
286
+ 164661 166315 start
287
+ 166315 166720 reading
288
+ 167993 168097 and
289
+ 168097 168537 then
290
+ 168537 168617 he
291
+ 168617 168773 was
292
+ 168773 168912 a
293
+ 168912 170077 very
294
+ 170077 170822 various
295
+ 170822 171237 feminist
296
+ 171237 171627 theory
297
+ 171627 171807 that
298
+ 171807 172072 might
299
+ 172072 172317 help
300
+ 172317 172762 me
301
+ 173267 173887 developed
302
+ 173887 174152 fourier
303
+ 174152 174715 analysis
304
+ 174715 174827 of
305
+ 174827 175507 intense
306
+ 175507 176253 exactly
307
+ 178108 178303 and
308
+ 178303 178408 you
309
+ 178408 178711 know
310
+ 178711 179568 article
311
+ 179568 180446 in
312
+ 180446 180513 the
313
+ 180513 181223 summer
314
+ 181442 181641 they've
315
+ 181641 181891 looked
316
+ 181891 182216 at
317
+ 182216 182606 nature
318
+ 182606 182891 writing
319
+ 182891 183526 environmental
320
+ 183526 183926 times
321
+ 183926 184126 their
322
+ 184126 184546 readers
323
+ 184546 184681 it's
324
+ 184681 184936 a
325
+ 184936 185270 very
326
+ 185270 185436 all
327
+ 185436 185806 over
328
+ 185806 186026 the
329
+ 186026 186311 famous
330
+ 186311 186558 for
331
+ 187494 188089 terrigenous
332
+ 188089 188634 millions
333
+ 188634 189513 of
334
+ 189513 189854 fiction
335
+ 189854 190333 writers
336
+ 191527 191637 the
337
+ 191637 192046 united
338
+ 192046 193032 states
339
+ 193539 193673 and
340
+ 193673 193903 let's
341
+ 193903 194083 look
342
+ 194083 194253 at
343
+ 194253 194403 the
344
+ 194403 195228 particularly
345
+ 195228 195388 think
346
+ 195388 196433 about
347
+ 197037 197347 what's
348
+ 197347 197872 happening
349
+ 197872 198202 in
350
+ 198202 198627 nature
351
+ 198627 199442 reading
352
+ 199857 200402 because
353
+ 200402 200712 people
354
+ 200712 200932 to
355
+ 200932 201237 react
356
+ 201237 201437 to
357
+ 201437 201672 the
358
+ 201672 202267 genes
359
+ 202267 202827 gauguin
360
+ 203457 204172 are
361
+ 204172 204442 willing
362
+ 204442 204687 to
363
+ 204687 204977 was
364
+ 204977 205334 running
365
+ 205334 205667 you
366
+ 205667 205832 know
367
+ 205832 206027 it's
368
+ 206027 206672 very
369
+ 206932 207067 to
370
+ 207067 207366 talk
371
+ 207366 207673 about
372
+ 207673 207776 how
373
+ 207776 208017 these
374
+ 208017 208417 writers
375
+ 208417 208538 are
376
+ 208538 208917 actually
377
+ 208917 209427 modeling
378
+ 209427 209737 first
379
+ 209737 210438 principles
380
+ 210438 210557 of
381
+ 210557 211767 collaboration
382
+ 212280 212860 principles
383
+ 212860 213000 that
384
+ 213000 214160 diffusing
385
+ 215381 215831 our
386
+ 215831 216928 relationships
387
+ 216928 217032 in
388
+ 217032 217246 our
389
+ 217246 218083 world
390
+ 218083 218396 into
391
+ 218396 218691 just
392
+ 218691 218877 had
393
+ 218877 218961 a
394
+ 218961 219431 breakdown
395
+ 219431 219631 those
396
+ 219631 220680 boundaries
397
+ 220680 221156 actually
398
+ 221156 221581 came
399
+ 221581 222131 from
400
+ 222659 222853 so
401
+ 222853 223102 that
402
+ 223102 223436 focus
403
+ 223436 223636 on
404
+ 223636 224427 interdependence
405
+ 224427 225787 collaboration
406
+ 226185 226330 no
407
+ 226330 226645 longer
408
+ 226645 226825 have
409
+ 226825 226995 a
410
+ 226995 227640 higher
411
+ 228501 229291 relationships
412
+ 229291 229476 as
413
+ 229476 229523 a
414
+ 229523 230211 teenager
415
+ 230211 231866 culture
416
+ 231866 231971 is
417
+ 231971 232456 so
418
+ 232572 232996 it's
419
+ 232996 234035 going to
420
+ 234035 234371 argue
421
+ 234371 234602 is
422
+ 234602 234692 one
423
+ 234692 234742 of
424
+ 234742 235197 these
425
+ 235440 235995 contemporary
426
+ 235995 236505 parisian
427
+ 236505 236956 julius
428
+ 236956 237380 create
429
+ 237380 237415 a
430
+ 237415 237985 dipole
431
+ 237985 238239 kick
432
+ 238239 238375 up
433
+ 238375 238675 there
434
+ 239249 239378 a
435
+ 239378 239643 new
436
+ 239643 239903 way
437
+ 239903 240123 of
438
+ 240123 240538 thinking
439
+ 240538 240803 about
440
+ 241270 242225 nature
441
+ 242225 242430 was
442
+ 242430 242565 there
443
+ 242565 242770 was
444
+ 242770 243440 definitely
445
+ 243440 244260 for
446
+ 244260 245190 us
447
+ 245190 245505 here
448
+ 246376 246652 okay
449
+ 246652 247563 yeah
450
+ 247563 248148 it's
451
+ 248148 248333 it's
452
+ 248333 248593 very
453
+ 248593 249173 clear
454
+ 249173 249393 i
455
+ 249393 249524 mean
456
+ 249524 249683 how
457
+ 249683 249963 how
458
+ 249963 250798 james
459
+ 251968 252667 it's
460
+ 252667 253573 way
461
+ 261413 261552 oh
462
+ 261552 262502 jeez
463
+ 263969 264034 i
464
+ 264034 264495 think
465
+ 264695 266155 humanities
466
+ 266155 267095 perspective
467
+ 268395 268530 he
468
+ 268530 269260 anteks
469
+ 269260 270215 paralyzing
470
+ 270215 270360 where
471
+ 270360 270450 it
472
+ 270450 271345 says
473
+ 271345 272320 experiences
474
+ 272320 272735 to
475
+ 272735 272920 look
476
+ 272920 273070 at
477
+ 273070 273160 the
478
+ 273160 273515 issues
479
+ 273515 273893 of
480
+ 273893 274940 sexuality
481
+ 275102 275225 and
482
+ 275225 276310 difference
483
+ 276310 276555 of
484
+ 276555 276925 ways
485
+ 276925 277015 in
486
+ 277015 277215 which
487
+ 277215 277360 we
488
+ 277360 277520 think
489
+ 277520 277835 about
490
+ 277835 278805 identity
491
+ 278805 278885 in
492
+ 278885 279010 the
493
+ 279010 279403 united
494
+ 279403 280780 states
495
+ 280780 280990 how
496
+ 280990 281071 to
497
+ 281071 281975 change
498
+ 282365 282446 and
499
+ 282446 282850 its
500
+ 282850 283460 relationship
501
+ 283460 283595 to
502
+ 283595 283836 be
503
+ 283836 284890 our
504
+ 284890 285420 relationship
505
+ 285420 285555 to
506
+ 285555 285701 each
507
+ 285701 286269 other
508
+ 286269 287075 require
509
+ 287075 287630 individual
510
+ 287630 287800 or
511
+ 287800 288690 person
512
+ 288690 289355 rapes
513
+ 289923 290542 ethnicity
514
+ 290542 291257 sex
515
+ 291257 292142 nationality
516
+ 292142 292912 resonance
517
+ 293485 293975 challenge
518
+ 293975 294180 how
519
+ 294180 294385 we
520
+ 294385 294980 operate
521
+ 294980 295595 world
522
+ 296066 296226 as
523
+ 296226 296876 individuals
524
+ 296876 297011 and
525
+ 297011 297486 also
526
+ 297486 297669 we
527
+ 297669 298166 operate
528
+ 302660 302810 and
529
+ 302810 303380 later
530
+ 303380 303510 the
531
+ 303510 304185 semester
532
+ 312555 312800 it
533
+ 312800 313645 wasn't
534
+ 314017 314297 to
535
+ 314297 314767 see
536
+ 314767 315792 commonality
537
+ 315792 316232 between
538
+ 316232 317082 questions
539
+ 321525 321696 thank
540
+ 321696 321785 you