This reports the protocol used to align the Rice_FST-TDNA features to Oryza_sativa_indica-chromosome-20070724. Tue Aug 7 15:03:35 2007 Source of Rice_FST-TDNA : Downloaded from Genbank with query "txid4530[orgn] AND GSS[PROP] AND T-DNA insertion lines" Alignment procedure details --------------------------- 14533 Rice_FST-TDNA are aligned to Oryza_sativa_indica-chromosome-20070724 using blat with blat parameters -minScore=160 followed by PslReps with -minAli=0.90 -nearTop=0.01 -singleHit. This was followed by a filtering procedure described below and applied in general to 'Same Species Genomic' data sets. Initial summary # alignments : 16335 # unique Features these alignments represent: 13248 % of total features these alignments represent : 91.16 % Following is the GAP distribution Gaps # alignments -------- -------- 0 12463 1 951 2 407 3 244 4 154 5 118 6 115 7 83 8 96 9 59 10 57 20 375 30 212 40 102 50 66 60 65 70 45 80 34 90 27 100 17 200 104 300 100 400 70 500 50 600 7 700 4 800 5 900 18 10000 104 Features with gaps > 40 bp are deleted # remaining Alignments : 15436 # unique Features these remaining alignments represent: 12536 % of total features these alignments represent : 86.26 % Following is the distribution of by feature coverage %coverage # alignments -------- -------- 9 0 19 0 29 1 39 16 49 68 59 182 69 187 79 283 89 734 90 145 91 180 92 220 93 275 94 320 95 465 96 570 97 883 98 1420 99 2251 100 7236 Features less than 90 % coverage are deleted. # remaining Alignments : 13828 # unique Features these remaining alignments represent: 11175 % of total features these alignments represent : 76.89 % % Identity distribution of the remaining features % Identity # alignments -------- -------- 90 37 91 28 92 57 93 89 94 152 95 258 96 575 97 1264 98 2439 99 3780 100 5149 Features less than 92 % identity are deleted. # remaining Alignments : 13763 # unique Features these remaining alignments represent: 11127 % of total features these alignments represent : 76.56 % Frequency distribution of the remaining features # hits # features -------- -------- 1 10537 2 413 3 62 4 29 5 16 6 4 8 19 9 3 10 6 20 18 30 8 40 5 50 1 100 3 Features that hit more than thrice are deleted. # remaining Alignments : 11549 # unique Features these remaining alignments represent: 11012 % of total features these alignments represent : 75.77 %
Last modified: Thu Sep 13 15:01:03 2007