bio-vcf 0.9.2 → 0.9.4
Sign up to get free protection for your applications and to get access to all the features.
- checksums.yaml +5 -5
- data/.travis.yml +1 -21
- data/LICENSE.txt +1 -1
- data/README.md +107 -73
- data/RELEASE_NOTES.md +20 -0
- data/RELEASE_NOTES.md~ +11 -0
- data/VERSION +1 -1
- data/bin/bio-vcf +49 -30
- data/bio-vcf.gemspec +1 -1
- data/features/cli.feature +4 -1
- data/features/diff_count.feature +0 -1
- data/features/step_definitions/cli-feature.rb +13 -9
- data/features/step_definitions/diff_count.rb +1 -1
- data/features/step_definitions/somaticsniper.rb +1 -1
- data/lib/bio-vcf/pcows.rb +31 -25
- data/lib/bio-vcf/vcffile.rb +46 -0
- data/lib/bio-vcf/vcfgenotypefield.rb +20 -20
- data/lib/bio-vcf/vcfheader.rb +29 -0
- data/lib/bio-vcf/vcfrecord.rb +5 -3
- data/lib/bio-vcf/vcfsample.rb +3 -1
- data/test/data/input/empty.vcf +2 -0
- data/test/data/regression/empty-stderr.new +12 -0
- data/test/data/regression/empty.new +2 -0
- data/test/data/regression/empty.ref +2 -0
- data/test/data/regression/eval_once-stderr.new +2 -2
- data/test/data/regression/eval_r.info.dp-stderr.new +9 -7
- data/test/data/regression/ifilter_s.dp-stderr.new +9 -7
- data/test/data/regression/pass1-stderr.new +9 -7
- data/test/data/regression/r.info.dp-stderr.new +4 -8
- data/test/data/regression/r.info.dp.new +0 -33
- data/test/data/regression/rewrite.info.sample-stderr.new +9 -7
- data/test/data/regression/s.dp-stderr.new +9 -7
- data/test/data/regression/seval_s.dp-stderr.new +9 -7
- data/test/data/regression/sfilter_seval_s.dp-stderr.new +9 -7
- data/test/data/regression/thread4-stderr.new +9 -7
- data/test/data/regression/thread4_4-stderr.new +25 -44
- data/test/data/regression/thread4_4.new +0 -20
- data/test/data/regression/thread4_4_failed_filter-stderr.new +1 -1
- data/test/data/regression/thread4_4_failed_filter-stderr.ref +1 -1
- data/test/data/regression/vcf2json_full_header-stderr.new +9 -7
- data/test/data/regression/vcf2json_use_meta-stderr.new +9 -7
- metadata +11 -7
- data/features/#cli.feature# +0 -71
- data/features/filter.feature~ +0 -35
- data/test/stress/stress_test.sh~ +0 -8
@@ -1,8 +1,10 @@
|
|
1
|
-
bio-vcf 0.9.
|
2
|
-
Waiting up to 180 seconds for pid=
|
1
|
+
bio-vcf 0.9.4 (biogem Ruby 2.5.5 with pcows) by Pjotr Prins 2015-2020
|
2
|
+
Waiting up to 180 seconds for pid=30651 to complete /tmp/bio-vcf_20201222-30649-gol7rw/000001-bio-vcf
|
3
3
|
Processing remaining output...
|
4
|
-
Trying: [[
|
5
|
-
Set lock on [[
|
6
|
-
Processing output file /tmp/bio-
|
7
|
-
|
8
|
-
|
4
|
+
Trying: [[30651, 1, "/tmp/bio-vcf_20201222-30649-gol7rw/000001-bio-vcf"]]
|
5
|
+
Set lock on [[30651, 1, "/tmp/bio-vcf_20201222-30649-gol7rw/000001-bio-vcf"]]
|
6
|
+
Processing output file /tmp/bio-vcf_20201222-30649-gol7rw/000001-bio-vcf (blocking)
|
7
|
+
Checking for output_lock on existing /tmp/bio-vcf_20201222-30649-gol7rw/000001-bio-vcf
|
8
|
+
Trying to remove /tmp/bio-vcf_20201222-30649-gol7rw/000001-bio-vcf.keep
|
9
|
+
Removing /tmp/bio-vcf_20201222-30649-gol7rw/000001-bio-vcf.keep
|
10
|
+
Removing dir /tmp/bio-vcf_20201222-30649-gol7rw
|
@@ -1,5 +1,5 @@
|
|
1
|
-
bio-vcf 0.9.
|
2
|
-
Waiting up to 180 seconds for pid=
|
1
|
+
bio-vcf 0.9.4 (biogem Ruby 2.5.5 with pcows) by Pjotr Prins 2015-2020
|
2
|
+
Waiting up to 180 seconds for pid=30660 to complete /tmp/bio-vcf_20201222-30658-19yukj0/000001-bio-vcf
|
3
3
|
[:format, {"GT"=>0, "AD"=>1, "DP"=>2, "GQ"=>3, "PL"=>4}, :sample, ["./."]]
|
4
4
|
[:filter, "s.dp>10"]
|
5
5
|
[:format, {"GT"=>0, "AD"=>1, "DP"=>2, "GQ"=>3, "PL"=>4}, :sample, ["./."]]
|
@@ -9,8 +9,10 @@ Waiting up to 180 seconds for pid=9396 to complete /tmp/bio-vcf_20160122-9394-61
|
|
9
9
|
[:format, {"GT"=>0, "AD"=>1, "DP"=>2, "GQ"=>3, "PL"=>4}, :sample, ["./."]]
|
10
10
|
[:filter, "s.dp>10"]
|
11
11
|
Processing remaining output...
|
12
|
-
Trying: [[
|
13
|
-
Set lock on [[
|
14
|
-
Processing output file /tmp/bio-
|
15
|
-
|
16
|
-
|
12
|
+
Trying: [[30660, 1, "/tmp/bio-vcf_20201222-30658-19yukj0/000001-bio-vcf"]]
|
13
|
+
Set lock on [[30660, 1, "/tmp/bio-vcf_20201222-30658-19yukj0/000001-bio-vcf"]]
|
14
|
+
Processing output file /tmp/bio-vcf_20201222-30658-19yukj0/000001-bio-vcf (blocking)
|
15
|
+
Checking for output_lock on existing /tmp/bio-vcf_20201222-30658-19yukj0/000001-bio-vcf
|
16
|
+
Trying to remove /tmp/bio-vcf_20201222-30658-19yukj0/000001-bio-vcf.keep
|
17
|
+
Removing /tmp/bio-vcf_20201222-30658-19yukj0/000001-bio-vcf.keep
|
18
|
+
Removing dir /tmp/bio-vcf_20201222-30658-19yukj0
|
@@ -1,8 +1,10 @@
|
|
1
|
-
bio-vcf 0.9.
|
2
|
-
Waiting up to 180 seconds for pid=
|
1
|
+
bio-vcf 0.9.4 (biogem Ruby 2.5.5 with pcows) by Pjotr Prins 2015-2020
|
2
|
+
Waiting up to 180 seconds for pid=30973 to complete /tmp/bio-vcf_20201222-30971-lfyqh6/000001-bio-vcf
|
3
3
|
Processing remaining output...
|
4
|
-
Trying: [[
|
5
|
-
Set lock on [[
|
6
|
-
Processing output file /tmp/bio-
|
7
|
-
|
8
|
-
|
4
|
+
Trying: [[30973, 1, "/tmp/bio-vcf_20201222-30971-lfyqh6/000001-bio-vcf"]]
|
5
|
+
Set lock on [[30973, 1, "/tmp/bio-vcf_20201222-30971-lfyqh6/000001-bio-vcf"]]
|
6
|
+
Processing output file /tmp/bio-vcf_20201222-30971-lfyqh6/000001-bio-vcf (blocking)
|
7
|
+
Checking for output_lock on existing /tmp/bio-vcf_20201222-30971-lfyqh6/000001-bio-vcf
|
8
|
+
Trying to remove /tmp/bio-vcf_20201222-30971-lfyqh6/000001-bio-vcf.keep
|
9
|
+
Removing /tmp/bio-vcf_20201222-30971-lfyqh6/000001-bio-vcf.keep
|
10
|
+
Removing dir /tmp/bio-vcf_20201222-30971-lfyqh6
|
@@ -1,44 +1,25 @@
|
|
1
|
-
bio-vcf 0.9.
|
2
|
-
..Set lock on [[
|
3
|
-
Processing output file /tmp/bio-
|
4
|
-
.Checking for output_lock on existing /tmp/bio-
|
5
|
-
|
6
|
-
|
7
|
-
|
8
|
-
|
9
|
-
|
10
|
-
|
11
|
-
|
12
|
-
|
13
|
-
|
14
|
-
|
15
|
-
|
16
|
-
|
17
|
-
Removing /tmp/bio-
|
18
|
-
Set lock on [[
|
19
|
-
Processing output file /tmp/bio-
|
20
|
-
.Checking for output_lock on existing /tmp/bio-
|
21
|
-
|
22
|
-
|
23
|
-
|
24
|
-
|
25
|
-
Removing /tmp/bio-
|
26
|
-
Set lock on [[9317, 7, "/tmp/bio-vcf_20160122-9264-14nbd6g/000007-bio-vcf"]]
|
27
|
-
Processing output file /tmp/bio-vcf_20160122-9264-14nbd6g/000007-bio-vcf (non-blocking)
|
28
|
-
.Checking for output_lock on existing /tmp/bio-vcf_20160122-9264-14nbd6g/000007-bio-vcf
|
29
|
-
Removing /tmp/bio-vcf_20160122-9264-14nbd6g/000007-bio-vcf.keep
|
30
|
-
Set lock on [[9326, 8, "/tmp/bio-vcf_20160122-9264-14nbd6g/000008-bio-vcf"]]
|
31
|
-
Processing output file /tmp/bio-vcf_20160122-9264-14nbd6g/000008-bio-vcf (non-blocking)
|
32
|
-
Waiting up to 180 seconds for pid=9343 to complete /tmp/bio-vcf_20160122-9264-14nbd6g/000010-bio-vcf
|
33
|
-
Processing remaining output...
|
34
|
-
Checking for output_lock on existing /tmp/bio-vcf_20160122-9264-14nbd6g/000008-bio-vcf
|
35
|
-
Removing /tmp/bio-vcf_20160122-9264-14nbd6g/000008-bio-vcf.keep
|
36
|
-
Set lock on [[9335, 9, "/tmp/bio-vcf_20160122-9264-14nbd6g/000009-bio-vcf"]]
|
37
|
-
Processing output file /tmp/bio-vcf_20160122-9264-14nbd6g/000009-bio-vcf (non-blocking)
|
38
|
-
Checking for output_lock on existing /tmp/bio-vcf_20160122-9264-14nbd6g/000009-bio-vcf
|
39
|
-
Removing /tmp/bio-vcf_20160122-9264-14nbd6g/000009-bio-vcf.keep
|
40
|
-
Set lock on [[9343, 10, "/tmp/bio-vcf_20160122-9264-14nbd6g/000010-bio-vcf"]]
|
41
|
-
Processing output file /tmp/bio-vcf_20160122-9264-14nbd6g/000010-bio-vcf (non-blocking)
|
42
|
-
Checking for output_lock on existing /tmp/bio-vcf_20160122-9264-14nbd6g/000010-bio-vcf
|
43
|
-
Removing /tmp/bio-vcf_20160122-9264-14nbd6g/000010-bio-vcf.keep
|
44
|
-
Removing dir /tmp/bio-vcf_20160122-9264-14nbd6g
|
1
|
+
bio-vcf 0.9.4 (biogem Ruby 2.5.5 with pcows) by Pjotr Prins 2015-2020
|
2
|
+
..Set lock on [[30984, 1, "/tmp/bio-vcf_20201222-30981-rnkgf0/000001-bio-vcf"]]
|
3
|
+
Processing output file /tmp/bio-vcf_20201222-30981-rnkgf0/000001-bio-vcf (non-blocking)
|
4
|
+
.Checking for output_lock on existing /tmp/bio-vcf_20201222-30981-rnkgf0/000001-bio-vcf
|
5
|
+
Trying to remove /tmp/bio-vcf_20201222-30981-rnkgf0/000001-bio-vcf.keep
|
6
|
+
Removing /tmp/bio-vcf_20201222-30981-rnkgf0/000001-bio-vcf.keep
|
7
|
+
Set lock on [[30989, 2, "/tmp/bio-vcf_20201222-30981-rnkgf0/000002-bio-vcf"]]
|
8
|
+
Processing output file /tmp/bio-vcf_20201222-30981-rnkgf0/000002-bio-vcf (non-blocking)
|
9
|
+
.Checking for output_lock on existing /tmp/bio-vcf_20201222-30981-rnkgf0/000002-bio-vcf
|
10
|
+
Trying to remove /tmp/bio-vcf_20201222-30981-rnkgf0/000002-bio-vcf.keep
|
11
|
+
Removing /tmp/bio-vcf_20201222-30981-rnkgf0/000002-bio-vcf.keep
|
12
|
+
Set lock on [[30998, 3, "/tmp/bio-vcf_20201222-30981-rnkgf0/000003-bio-vcf"]]
|
13
|
+
Processing output file /tmp/bio-vcf_20201222-30981-rnkgf0/000003-bio-vcf (non-blocking)
|
14
|
+
.Checking for output_lock on existing /tmp/bio-vcf_20201222-30981-rnkgf0/000003-bio-vcf
|
15
|
+
.Checking for output_lock on existing /tmp/bio-vcf_20201222-30981-rnkgf0/000003-bio-vcf
|
16
|
+
Trying to remove /tmp/bio-vcf_20201222-30981-rnkgf0/000003-bio-vcf.keep
|
17
|
+
Removing /tmp/bio-vcf_20201222-30981-rnkgf0/000003-bio-vcf.keep
|
18
|
+
Set lock on [[31007, 4, "/tmp/bio-vcf_20201222-30981-rnkgf0/000004-bio-vcf"]]
|
19
|
+
Processing output file /tmp/bio-vcf_20201222-30981-rnkgf0/000004-bio-vcf (non-blocking)
|
20
|
+
.Checking for output_lock on existing /tmp/bio-vcf_20201222-30981-rnkgf0/000004-bio-vcf
|
21
|
+
.Waiting for slot (timeout=180)
|
22
|
+
ERROR:
|
23
|
+
Trying to remove /tmp/bio-vcf_20201222-30981-rnkgf0/000004-bio-vcf.keep
|
24
|
+
Removing /tmp/bio-vcf_20201222-30981-rnkgf0/000004-bio-vcf.keep
|
25
|
+
Removing dir /tmp/bio-vcf_20201222-30981-rnkgf0
|
@@ -128,23 +128,3 @@
|
|
128
128
|
1 13118 . A G 2317.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=-9.647;DP=703;Dels=0.00;FS=362.380;HaplotypeScore=2.3393;MLEAC=7;MLEAF=0.500;MQ=23.54;MQ0=228;MQRankSum=-14.592;QD=3.30;ReadPosRankSum=3.437 GT:AD:DP:GQ:PL 0/1:21,13:34:95:95,0,277 0/1:82,47:129:99:444,0,1445 0/1:65,36:101:99:412,0,1179 0/1:92,53:145:99:455,0,1989 0/1:58,40:99:99:312,0,1216 0/1:52,33:85:99:186,0,1000 0/1:61,47:108:99:453,0,1079
|
129
129
|
1 13178 . G A 72.69 . AC=2;AF=0.143;AN=14;BaseQRankSum=-2.194;DP=929;Dels=0.00;FS=8.154;HaplotypeScore=1.8585;MLEAC=2;MLEAF=0.143;MQ=13.49;MQ0=590;MQRankSum=-3.030;QD=0.38;ReadPosRankSum=-0.383 GT:AD:DP:GQ:PL 0/1:49,3:52:11:11,0,292 0/0:161,11:172:60:0,60,1328 0/1:122,15:137:99:99,0,936 0/0:164,8:172:99:0,112,1436 0/0:120,9:129:62:0,62,771 0/0:113,6:119:12:0,12,885 0/0:138,10:148:99:0,112,1596
|
130
130
|
1 13302 . C T 1067.42 . AC=6;AF=0.429;AN=14;BaseQRankSum=11.584;DP=692;Dels=0.00;FS=86.736;HaplotypeScore=2.8260;MLEAC=6;MLEAF=0.429;MQ=27.08;MQ0=150;MQRankSum=-14.794;QD=1.63;ReadPosRankSum=-9.193 GT:AD:DP:GQ:PL 0/0:26,10:36:41:0,41,616 0/1:91,45:136:99:183,0,2085 0/1:71,28:99:86:86,0,1557 0/1:96,50:146:99:333,0,2098 0/1:55,34:90:99:283,0,1321 0/1:47,18:65:99:157,0,943 0/1:89,28:117:67:67,0,1921
|
131
|
-
1 13757 . G A 271.89 . AC=2;AF=0.143;AN=14;BaseQRankSum=2.916;DP=1149;Dels=0.00;FS=11.172;HaplotypeScore=7.1466;MLEAC=2;MLEAF=0.143;MQ=18.17;MQ0=374;MQRankSum=3.990;QD=0.87;ReadPosRankSum=1.309 GT:AD:DP:GQ:PL 0/0:47,4:53:16:0,16,344 0/0:185,16:201:60:0,60,2690 0/1:157,24:181:99:162,0,2556 0/0:232,18:250:99:0,137,3600 0/0:139,13:152:15:0,15,2266 0/1:114,16:130:99:147,0,1525 0/0:168,14:182:2:0,2,2230
|
132
|
-
1 13868 . A G 1481.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=2.036;DP=1108;Dels=0.00;FS=4.601;HaplotypeScore=6.7136;MLEAC=7;MLEAF=0.500;MQ=13.23;MQ0=622;MQRankSum=1.659;QD=1.34;ReadPosRankSum=0.283 GT:AD:DP:GQ:PL 0/1:60,15:75:80:80,0,568 0/1:146,46:192:99:289,0,1030 0/1:146,36:182:99:203,0,1001 0/1:172,52:224:99:222,0,1150 0/1:106,36:142:99:181,0,621 0/1:79,32:111:99:126,0,467 0/1:129,38:167:99:420,0,767
|
133
|
-
1 13896 . C A 389.42 . AC=6;AF=0.429;AN=14;BaseQRankSum=-0.743;DP=830;Dels=0.00;FS=0.000;HaplotypeScore=1.6931;MLEAC=6;MLEAF=0.429;MQ=11.88;MQ0=518;MQRankSum=-1.292;QD=0.54;ReadPosRankSum=0.433 GT:AD:DP:GQ:PL 0/1:50,12:62:99:107,0,367 0/1:107,28:135:40:40,0,660 0/1:120,23:143:37:37,0,675 0/1:144,31:175:99:172,0,690 0/0:98,14:112:42:0,42,577 0/1:65,16:81:34:34,0,290 0/1:102,19:121:41:41,0,677
|
134
|
-
1 14354 . C A 109.72 . AC=3;AF=0.250;AN=12;BaseQRankSum=-2.527;DP=764;Dels=0.00;FS=0.000;HaplotypeScore=0.3231;MLEAC=3;MLEAF=0.250;MQ=5.02;MQ0=710;MQRankSum=1.079;QD=0.47;ReadPosRankSum=-0.178 GT:AD:DP:GQ:PL 0/1:36,7:43:68:68,0,75 0/0:141,17:158:9:0,9,86 0/1:103,12:115:42:42,0,72 0/0:132,13:145:21:0,21,201 ./. 0/1:62,10:72:38:38,0,117 0/0:108,11:119:9:0,9,86
|
135
|
-
1 14464 . A T 3945.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=0.104;DP=809;Dels=0.00;FS=38.588;HaplotypeScore=5.8354;MLEAC=7;MLEAF=0.500;MQ=23.21;MQ0=241;MQRankSum=6.859;QD=4.88;ReadPosRankSum=-0.294 GT:AD:DP:GQ:PL 0/1:41,10:51:99:183,0,569 0/1:107,48:155:99:829,0,1388 0/1:99,42:141:99:691,0,1103 0/1:114,36:150:99:454,0,1544 0/1:48,35:83:99:781,0,561 0/1:64,25:89:99:409,0,781 0/1:102,38:140:99:638,0,1310
|
136
|
-
1 14673 . G C 1012.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=3.674;DP=754;Dels=0.00;FS=115.496;HaplotypeScore=14.9167;MLEAC=7;MLEAF=0.500;MQ=23.97;MQ0=268;MQRankSum=-1.459;QD=1.34;ReadPosRankSum=-3.039 GT:AD:DP:GQ:PL 0/1:28,8:36:34:34,0,409 0/1:117,25:142:99:125,0,1964 0/1:87,30:117:99:214,0,1506 0/1:130,27:157:99:193,0,2076 0/1:77,18:95:99:114,0,1273 0/1:49,27:76:99:200,0,719 0/1:103,28:131:99:172,0,1620
|
137
|
-
1 14699 . C G 64.65 . AC=2;AF=0.143;AN=14;BaseQRankSum=6.064;DP=719;Dels=0.00;FS=29.480;HaplotypeScore=5.5900;MLEAC=2;MLEAF=0.143;MQ=24.79;MQ0=233;MQRankSum=-8.066;QD=0.27;ReadPosRankSum=-0.696 GT:AD:DP:GQ:PL 0/0:35,6:43:51:0,51,720 0/1:91,37:128:59:59,0,1859 0/0:80,29:109:99:0,101,1700 0/0:113,34:147:14:0,14,1987 0/0:72,26:98:37:0,37,1319 0/0:53,25:78:78:0,78,1035 0/1:84,30:114:44:44,0,1326
|
138
|
-
1 14907 . A G 9688.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=3.299;DP=1083;Dels=0.00;FS=20.983;HaplotypeScore=11.4413;MLEAC=7;MLEAF=0.500;MQ=33.27;MQ0=51;MQRankSum=1.076;QD=8.95;ReadPosRankSum=0.653 GT:AD:DP:GQ:PL 0/1:27,30:57:99:603,0,550 0/1:117,99:216:99:2024,0,2888 0/1:91,71:162:99:1489,0,1990 0/1:119,86:205:99:1876,0,2588 0/1:100,53:153:99:1001,0,2297 0/1:59,59:118:99:1240,0,1241 0/1:91,67:158:99:1495,0,2003
|
139
|
-
1 14930 . A G 8665.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=-13.760;DP=1092;Dels=0.00;FS=17.240;HaplotypeScore=20.0469;MLEAC=7;MLEAF=0.500;MQ=34.72;MQ0=22;MQRankSum=1.127;QD=7.94;ReadPosRankSum=0.785 GT:AD:DP:GQ:PL 0/1:32,36:68:99:609,0,659 0/1:123,92:216:99:1636,0,3225 0/1:92,78:170:99:1449,0,2202 0/1:123,86:210:99:1748,0,3028 0/1:85,50:136:99:802,0,2117 0/1:62,63:125:99:1072,0,1437 0/1:92,71:164:99:1389,0,2055
|
140
|
-
1 14933 . G A 248.54 . AC=3;AF=0.214;AN=14;BaseQRankSum=1.588;DP=1089;Dels=0.00;FS=14.333;HaplotypeScore=19.7793;MLEAC=3;MLEAF=0.214;MQ=34.91;MQ0=18;MQRankSum=1.439;QD=0.45;ReadPosRankSum=0.752 GT:AD:DP:GQ:PL 0/0:59,9:68:1:0,1,1361 0/1:192,24:216:99:125,0,4941 0/0:160,9:169:99:0,142,4118 0/1:191,21:212:39:39,0,4848 0/0:126,6:132:99:0,210,3367 0/1:113,15:128:99:124,0,2578 0/0:154,10:164:99:0,158,3934
|
141
|
-
1 14948 . G A 570.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=9.167;DP=1050;Dels=0.00;FS=11.131;HaplotypeScore=6.4398;MLEAC=7;MLEAF=0.500;MQ=32.69;MQ0=53;MQRankSum=0.017;QD=0.54;ReadPosRankSum=-1.181 GT:AD:DP:GQ:PL 0/1:56,7:63:79:79,0,1134 0/1:168,24:192:70:70,0,3900 0/1:160,21:181:99:147,0,3515 0/1:188,23:211:57:57,0,4113 0/1:115,14:129:32:32,0,2406 0/1:103,18:121:99:173,0,2016 0/1:132,21:153:52:52,0,2861
|
142
|
-
1 14976 . G A 195.07 . AC=3;AF=0.214;AN=14;BaseQRankSum=7.340;DP=939;Dels=0.00;FS=32.492;HaplotypeScore=4.6786;MLEAC=3;MLEAF=0.214;MQ=27.87;MQ0=167;MQRankSum=1.477;QD=0.53;ReadPosRankSum=-0.560 GT:AD:DP:GQ:PL 0/1:50,6:56:12:12,0,929 0/0:152,14:166:81:0,81,3159 0/0:152,9:161:99:0,113,2885 0/1:174,22:196:99:127,0,3211 0/0:100,9:109:97:0,97,1900 0/1:103,13:116:97:97,0,1548 0/0:122,13:135:55:0,55,2246
|
143
|
-
1 15118 . A G 2503.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=-5.645;DP=1004;Dels=0.00;FS=0.313;HaplotypeScore=4.1078;MLEAC=7;MLEAF=0.500;MQ=21.49;MQ0=207;MQRankSum=-0.007;QD=2.49;ReadPosRankSum=-0.291 GT:AD:DP:GQ:PL 0/1:38,8:46:50:50,0,575 0/1:148,50:198:99:671,0,2598 0/1:97,32:129:99:358,0,1911 0/1:176,53:230:99:700,0,2989 0/1:91,19:113:99:146,0,1741 0/1:92,31:126:99:347,0,1543 0/1:126,30:158:99:271,0,2193
|
144
|
-
1 15190 . G A 2564.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=11.094;DP=1090;Dels=0.00;FS=0.297;HaplotypeScore=4.3151;MLEAC=7;MLEAF=0.500;MQ=26.11;MQ0=187;MQRankSum=-0.724;QD=2.35;ReadPosRankSum=0.530 GT:AD:DP:GQ:PL 0/1:42,11:53:99:177,0,696 0/1:157,51:208:99:839,0,3145 0/1:142,28:170:91:91,0,2919 0/1:160,40:200:99:499,0,2815 0/1:109,17:126:99:167,0,2363 0/1:111,34:145:99:554,0,2268 0/1:141,38:179:99:277,0,2827
|
145
|
-
1 15211 . T G 14032.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=-11.292;DP=992;Dels=0.00;FS=0.281;HaplotypeScore=11.0809;MLEAC=7;MLEAF=0.500;MQ=27.20;MQ0=159;MQRankSum=3.859;QD=14.15;ReadPosRankSum=-1.972 GT:AD:DP:GQ:PL 0/1:19,35:54:99:858,0,183 0/1:63,120:183:99:2771,0,960 0/1:50,111:161:99:2213,0,740 0/1:58,113:171:99:2326,0,685 0/1:39,81:120:99:1707,0,678 0/1:38,96:134:99:2126,0,499 0/1:61,107:168:99:2071,0,895
|
146
|
-
1 15274 . A G,T 8909.81 . AC=7,7;AF=0.500,0.500;AN=14;BaseQRankSum=2.416;DP=636;Dels=0.00;FS=0.000;HaplotypeScore=5.1061;MLEAC=7,7;MLEAF=0.500,0.500;MQ=22.77;MQ0=195;MQRankSum=2.299;QD=14.01;ReadPosRankSum=1.707 GT:AD:DP:GQ:PL 1/2:1,14,22:37:99:717,494,467,224,0,170 1/2:1,45,75:121:99:1922,1334,1265,588,0,441 1/2:0,31,71:102:99:1960,1563,1515,396,0,222 1/2:1,59,77:137:99:2009,1376,1298,633,0,486 1/2:0,18,52:71:62:938,789,771,149,0,62 1/2:0,29,38:67:99:1141,883,853,258,0,162 1/2:0,38,60:98:99:1464,1064,1016,399,0,285
|
147
|
-
1 15447 . A G 109.56 . AC=3;AF=0.214;AN=14;BaseQRankSum=-9.704;DP=1213;Dels=0.00;FS=0.000;HaplotypeScore=16.4636;MLEAC=3;MLEAF=0.214;MQ=19.12;MQ0=346;MQRankSum=-0.997;QD=0.20;ReadPosRankSum=-1.088 GT:AD:DP:GQ:PL 0/0:42,4:46:34:0,34,690 0/0:220,22:242:99:0,99,3639 0/0:167,16:183:42:0,42,2582 0/1:201,25:226:78:78,0,2854 0/1:121,16:137:39:39,0,1877 0/0:160,13:173:99:0,139,2724 0/1:157,18:175:34:34,0,2472
|
148
|
-
1 15688 . C T 54.29 . AC=2;AF=0.143;AN=14;BaseQRankSum=-8.686;DP=904;Dels=0.00;FS=11.971;HaplotypeScore=3.1454;MLEAC=2;MLEAF=0.143;MQ=16.75;MQ0=361;MQRankSum=-0.069;QD=0.16;ReadPosRankSum=0.355 GT:AD:DP:GQ:PL 0/0:34,3:37:15:0,15,335 0/0:166,16:182:61:0,61,2190 0/1:131,16:147:69:69,0,1563 0/1:167,17:184:22:22,0,1894 0/0:88,12:100:15:0,15,1099 0/0:88,13:101:1:0,1,1126 0/0:132,16:148:18:0,18,1706
|
149
|
-
1 16068 . T C 377.22 . AC=4;AF=0.286;AN=14;BaseQRankSum=-7.618;DP=398;Dels=0.00;FS=12.349;HaplotypeScore=1.8069;MLEAC=4;MLEAF=0.286;MQ=26.64;MQ0=64;MQRankSum=-4.589;QD=1.80;ReadPosRankSum=-0.448 GT:AD:DP:GQ:PL 0/1:18,15:33:99:182,0,404 0/0:51,6:57:31:0,31,815 0/1:52,16:68:64:64,0,1101 0/0:68,13:81:99:0,99,1505 0/0:41,8:49:51:0,51,1010 0/1:40,9:49:21:21,0,702 0/1:38,19:58:99:153,0,760
|
150
|
-
1 16103 . T G 2348.85 . AC=7;AF=0.500;AN=14;BaseQRankSum=-1.646;DP=526;Dels=0.00;FS=2.577;HaplotypeScore=0.9645;MLEAC=7;MLEAF=0.500;MQ=26.64;MQ0=82;MQRankSum=-2.736;QD=4.47;ReadPosRankSum=-2.044 GT:AD:DP:GQ:PL 0/1:21,29:50:99:504,0,450 0/1:55,24:79:99:209,0,838 0/1:49,37:86:99:456,0,1002 0/1:74,32:106:99:193,0,1519 0/1:41,19:60:99:141,0,740 0/1:38,23:61:99:257,0,548 0/1:38,46:84:99:628,0,641
|
@@ -2,4 +2,4 @@ Unknown field name <t> in record, did you mean r.info.t?
|
|
2
2
|
Unknown field name <t> in record, did you mean r.info.t?
|
3
3
|
Unknown field name <t> in record, did you mean r.info.t?
|
4
4
|
Unknown field name <t> in record, did you mean r.info.t?
|
5
|
-
execution expired
|
5
|
+
ERROR: execution expired
|
@@ -2,4 +2,4 @@ Unknown field name <t> in record, did you mean r.info.t?
|
|
2
2
|
Unknown field name <t> in record, did you mean r.info.t?
|
3
3
|
Unknown field name <t> in record, did you mean r.info.t?
|
4
4
|
Unknown field name <t> in record, did you mean r.info.t?
|
5
|
-
execution expired
|
5
|
+
ERROR: execution expired
|
@@ -1,8 +1,10 @@
|
|
1
|
-
bio-vcf 0.9.
|
2
|
-
Waiting up to 180 seconds for pid=
|
1
|
+
bio-vcf 0.9.4 (biogem Ruby 2.5.5 with pcows) by Pjotr Prins 2015-2020
|
2
|
+
Waiting up to 180 seconds for pid=30682 to complete /tmp/bio-vcf_20201222-30680-iio5eo/000001-bio-vcf
|
3
3
|
Processing remaining output...
|
4
|
-
Trying: [[
|
5
|
-
Set lock on [[
|
6
|
-
Processing output file /tmp/bio-
|
7
|
-
|
8
|
-
|
4
|
+
Trying: [[30682, 1, "/tmp/bio-vcf_20201222-30680-iio5eo/000001-bio-vcf"]]
|
5
|
+
Set lock on [[30682, 1, "/tmp/bio-vcf_20201222-30680-iio5eo/000001-bio-vcf"]]
|
6
|
+
Processing output file /tmp/bio-vcf_20201222-30680-iio5eo/000001-bio-vcf (blocking)
|
7
|
+
Checking for output_lock on existing /tmp/bio-vcf_20201222-30680-iio5eo/000001-bio-vcf
|
8
|
+
Trying to remove /tmp/bio-vcf_20201222-30680-iio5eo/000001-bio-vcf.keep
|
9
|
+
Removing /tmp/bio-vcf_20201222-30680-iio5eo/000001-bio-vcf.keep
|
10
|
+
Removing dir /tmp/bio-vcf_20201222-30680-iio5eo
|
@@ -1,8 +1,10 @@
|
|
1
|
-
bio-vcf 0.9.
|
2
|
-
Waiting up to 180 seconds for pid=
|
1
|
+
bio-vcf 0.9.4 (biogem Ruby 2.5.5 with pcows) by Pjotr Prins 2015-2020
|
2
|
+
Waiting up to 180 seconds for pid=30691 to complete /tmp/bio-vcf_20201222-30689-1bguuj5/000001-bio-vcf
|
3
3
|
Processing remaining output...
|
4
|
-
Trying: [[
|
5
|
-
Set lock on [[
|
6
|
-
Processing output file /tmp/bio-
|
7
|
-
|
8
|
-
|
4
|
+
Trying: [[30691, 1, "/tmp/bio-vcf_20201222-30689-1bguuj5/000001-bio-vcf"]]
|
5
|
+
Set lock on [[30691, 1, "/tmp/bio-vcf_20201222-30689-1bguuj5/000001-bio-vcf"]]
|
6
|
+
Processing output file /tmp/bio-vcf_20201222-30689-1bguuj5/000001-bio-vcf (blocking)
|
7
|
+
Checking for output_lock on existing /tmp/bio-vcf_20201222-30689-1bguuj5/000001-bio-vcf
|
8
|
+
Trying to remove /tmp/bio-vcf_20201222-30689-1bguuj5/000001-bio-vcf.keep
|
9
|
+
Removing /tmp/bio-vcf_20201222-30689-1bguuj5/000001-bio-vcf.keep
|
10
|
+
Removing dir /tmp/bio-vcf_20201222-30689-1bguuj5
|
metadata
CHANGED
@@ -1,14 +1,14 @@
|
|
1
1
|
--- !ruby/object:Gem::Specification
|
2
2
|
name: bio-vcf
|
3
3
|
version: !ruby/object:Gem::Version
|
4
|
-
version: 0.9.
|
4
|
+
version: 0.9.4
|
5
5
|
platform: ruby
|
6
6
|
authors:
|
7
7
|
- Pjotr Prins
|
8
8
|
autorequire:
|
9
9
|
bindir: bin
|
10
10
|
cert_chain: []
|
11
|
-
date:
|
11
|
+
date: 2020-12-22 00:00:00.000000000 Z
|
12
12
|
dependencies: []
|
13
13
|
description: Smart lazy multi-threaded parser for VCF format with useful filtering
|
14
14
|
and output rewriting (JSON, RDF etc.)
|
@@ -25,16 +25,16 @@ files:
|
|
25
25
|
- Gemfile.lock
|
26
26
|
- LICENSE.txt
|
27
27
|
- README.md
|
28
|
+
- RELEASE_NOTES.md
|
29
|
+
- RELEASE_NOTES.md~
|
28
30
|
- Rakefile
|
29
31
|
- TAGS
|
30
32
|
- VERSION
|
31
33
|
- bin/bio-vcf
|
32
34
|
- bio-vcf.gemspec
|
33
|
-
- features/#cli.feature#
|
34
35
|
- features/cli.feature
|
35
36
|
- features/diff_count.feature
|
36
37
|
- features/filter.feature
|
37
|
-
- features/filter.feature~
|
38
38
|
- features/multisample.feature
|
39
39
|
- features/sfilter.feature
|
40
40
|
- features/somaticsniper.feature
|
@@ -54,6 +54,7 @@ files:
|
|
54
54
|
- lib/bio-vcf/utils.rb
|
55
55
|
- lib/bio-vcf/variant.rb
|
56
56
|
- lib/bio-vcf/vcf.rb
|
57
|
+
- lib/bio-vcf/vcffile.rb
|
57
58
|
- lib/bio-vcf/vcfgenotypefield.rb
|
58
59
|
- lib/bio-vcf/vcfheader.rb
|
59
60
|
- lib/bio-vcf/vcfheader_line.rb
|
@@ -71,11 +72,15 @@ files:
|
|
71
72
|
- template/vcf2rdf.erb
|
72
73
|
- template/vcf2rdf_header.erb
|
73
74
|
- test/data/input/dbsnp.vcf
|
75
|
+
- test/data/input/empty.vcf
|
74
76
|
- test/data/input/gatk_exome.vcf
|
75
77
|
- test/data/input/gatk_wgs.vcf
|
76
78
|
- test/data/input/multisample.vcf
|
77
79
|
- test/data/input/somaticsniper.vcf
|
78
80
|
- test/data/input/test.bed
|
81
|
+
- test/data/regression/empty-stderr.new
|
82
|
+
- test/data/regression/empty.new
|
83
|
+
- test/data/regression/empty.ref
|
79
84
|
- test/data/regression/eval_once-stderr.new
|
80
85
|
- test/data/regression/eval_once.new
|
81
86
|
- test/data/regression/eval_once.ref
|
@@ -120,9 +125,8 @@ files:
|
|
120
125
|
- test/data/regression/vcf2json_use_meta.ref
|
121
126
|
- test/performance/metrics.md
|
122
127
|
- test/stress/stress_test.sh
|
123
|
-
- test/stress/stress_test.sh~
|
124
128
|
- test/tmp/test.vcf
|
125
|
-
homepage: http://github.com/
|
129
|
+
homepage: http://github.com/vcflib/bio-vcf
|
126
130
|
licenses:
|
127
131
|
- MIT
|
128
132
|
metadata: {}
|
@@ -142,7 +146,7 @@ required_rubygems_version: !ruby/object:Gem::Requirement
|
|
142
146
|
version: '0'
|
143
147
|
requirements: []
|
144
148
|
rubyforge_project:
|
145
|
-
rubygems_version: 2.
|
149
|
+
rubygems_version: 2.7.6.2
|
146
150
|
signing_key:
|
147
151
|
specification_version: 4
|
148
152
|
summary: Fast multi-purpose multi-threaded VCF parser
|
data/features/#cli.feature#
DELETED
@@ -1,71 +0,0 @@
|
|
1
|
-
@cli
|
2
|
-
Feature: Command-line interface (CLI)
|
3
|
-
|
4
|
-
bio-vcf has a powerful command line interface. Here we regression test features.
|
5
|
-
|
6
|
-
Scenario: Test the info filter using dp
|
7
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
8
|
-
When I execute "./bin/bio-vcf -i --filter 'r.info.dp>100'"
|
9
|
-
Then I expect the named output to match the named output "r.info.dp"
|
10
|
-
|
11
|
-
Scenario: Test the info filter using dp and threads
|
12
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
13
|
-
When I execute "./bin/bio-vcf -i --num-threads 4 --filter 'r.info.dp>2'"
|
14
|
-
Then I expect the named output to match the named output "thread4"
|
15
|
-
|
16
|
-
Scenario: Test the info filter using dp and threads with lines
|
17
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
18
|
-
When I execute "./bin/bio-vcf -i --num-threads 4 --thread-lines 4 --filter 'r.info.dp>2'"
|
19
|
-
Then I expect the named output to match the named output "thread4_4"
|
20
|
-
|
21
|
-
Scenario: Test the sample filter using dp
|
22
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
23
|
-
When I execute "./bin/bio-vcf -i --sfilter 's.dp>20'"
|
24
|
-
Then I expect the named output to match the named output "s.dp"
|
25
|
-
|
26
|
-
Scenario: Test the include sample filter using dp
|
27
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
28
|
-
When I execute "./bin/bio-vcf -i --ifilter 's.dp>100' --seval s.dp"
|
29
|
-
Then I expect the named output to match the named output "ifilter_s.dp"
|
30
|
-
|
31
|
-
Scenario: Test the info eval using dp
|
32
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
33
|
-
When I execute "./bin/bio-vcf -i --eval 'r.info.dp'"
|
34
|
-
Then I expect the named output to match the named output "eval_r.info.dp"
|
35
|
-
|
36
|
-
Scenario: Test the sample eval using dp
|
37
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
38
|
-
When I execute "./bin/bio-vcf -i --seval 's.dp'"
|
39
|
-
Then I expect the named output to match the named output "seval_s.dp"
|
40
|
-
|
41
|
-
Scenario: Test the sample filter + eval using dp
|
42
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
43
|
-
When I execute "./bin/bio-vcf -i --sfilter 's.dp>10' --seval 's.dp'"
|
44
|
-
Then I expect the named output to match the named output "sfilter_seval_s.dp"
|
45
|
-
|
46
|
-
Scenario: Rewrite an info field
|
47
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
48
|
-
When I execute "./bin/bio-vcf --rewrite rec.info[\'sample\']=\'XXXXX\'"
|
49
|
-
Then I expect the named output to match the named output "rewrite.info.sample"
|
50
|
-
|
51
|
-
Scenario: Test eval-once
|
52
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
53
|
-
When I execute "./bin/bio-vcf --eval-once header.meta[\'GATKCommandLine\']"
|
54
|
-
Then I expect the named output to match the named output "eval_once"
|
55
|
-
|
56
|
-
Scenario: Test JSON output with header meta data
|
57
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
58
|
-
When I execute "./bin/bio-vcf --template template/vcf2json_full_header.erb"
|
59
|
-
Then I expect the named output to match the named output "vcf2json_full_header"
|
60
|
-
|
61
|
-
Scenario: Test JSON output with header meta data and query samples
|
62
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
63
|
-
When I execute "./bin/bio-vcf --template template/vcf2json_use_meta.erb"
|
64
|
-
Then I expect the named output to match the named output "vcf2json_use_meta"
|
65
|
-
|
66
|
-
Scenario: Test deadlock on failed filter with threads
|
67
|
-
Given I have input file(s) named "test/data/input/multisample.vcf"
|
68
|
-
When I execute "./bin/bio-vcf -q --timeout 4 --num-threads 4 --thread-lines 4 --filter 't.info.dp>2'"
|
69
|
-
Then I expect an error and the named output to match the named output "thread4_4_failed_filter" in under 30 seconds
|
70
|
-
|
71
|
-
|
data/features/filter.feature~
DELETED
@@ -1,35 +0,0 @@
|
|
1
|
-
@meta
|
2
|
-
Feature: Parsing VCF meta information from the header
|
3
|
-
|
4
|
-
Take a header and parse that information as defined by the VCF standard.
|
5
|
-
|
6
|
-
Scenario: When parsing a header line
|
7
|
-
|
8
|
-
Given the VCF header lines
|
9
|
-
"""
|
10
|
-
##fileformat=VCFv4.1
|
11
|
-
##fileDate=20140121
|
12
|
-
##phasing=none
|
13
|
-
##reference=file:///data/GENOMES/human_GATK_GRCh37/GRCh37_gatk.fasta
|
14
|
-
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">
|
15
|
-
##FORMAT=<ID=DP,Number=1,Type=Integer,Description="Total read depth">
|
16
|
-
##FORMAT=<ID=DP4,Number=4,Type=Integer,Description="# high-quality ref-forward bases, ref-reverse, alt-forward and alt-reverse bases">
|
17
|
-
##INFO=<ID=PM,Number=0,Type=Flag,Description="Variant is Precious(Clinical,Pubmed Cited)">
|
18
|
-
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NORMAL TUMOR
|
19
|
-
"""
|
20
|
-
When I parse the VCF header
|
21
|
-
Then I expect vcf.columns to be [CHROM','POS','ID','REF','ALT','QUAL','FILTER','INFO','FORMAT','NORMAL','TUMOR']
|
22
|
-
And I expect vcf.fileformat to be "VCFv4.1"
|
23
|
-
And I expect vcf.fileDate to be "20140121"
|
24
|
-
And I expect vcf.field['fileDate'] to be "20140121"
|
25
|
-
And I expect vcf.phasing to be "none"
|
26
|
-
And I expect vcf.reference to be "file:///data/GENOMES/human_GATK_GRCh37/GRCh37_gatk.fasta"
|
27
|
-
And I expect vcf.format['GT'] to be {"ID"=>"GT", "Number"=>"1", "Type"=>"String", "Description"=>"Genotype"}
|
28
|
-
And I expect vcf.format['DP'] to be {"ID"=>"DP", "Number"=>"1", "Type"=>"Integer", "Description"=>"Total read depth"}
|
29
|
-
And I expect vcf.format['DP4'] to be {"ID"=>"DP4", "Number"=>"4", "Type"=>"Integer", "Description"=>"# high-quality ref-forward bases, ref-reverse, alt-forward and alt-reverse bases"}
|
30
|
-
And I expect vcf.info['PM'] to be {"ID"=>"PM", "Number"=>"0", "Type"=>"Flag", "Description"=>"Variant is Precious(Clinical,Pubmed Cited)"}'
|
31
|
-
And I expect vcf.meta to contain all header meta information
|
32
|
-
|
33
|
-
Scenario: When parsing the header of somatic_sniper.vcf
|
34
|
-
|
35
|
-
Do something
|