This reports the protocol used to align the Maize_EST features to Oryza_sativa_indica-chromosome-20070724. Tue Aug 7 22:13:07 2007 Source of Maize_EST : Downloaded from genbank with query ' txid4577[orgn] AND gbdiv_est[PROP]' Alignment procedure details --------------------------- 1163689 Maize_EST are aligned to Oryza_sativa_indica-chromosome-20070724 using blat with blat parameters -minIdentity=50 followed by PslReps with -singleHit. This was followed by a filtering procedure described below and applied in general to 'CrossSpecies-Coding' data sets. Initial summary # alignments : 119628 # unique Features these alignments represent: 94651 % of total features these alignments represent : 8.13 % The length of the matches are distributed as follows Hit_Length # alignments -------- -------- 100 66934 150 19882 200 7750 250 4500 300 4980 350 3613 400 4150 450 2449 500 1459 550 1380 600 879 650 584 700 451 750 375 800 162 10000 80 Alignments with matches less than 150 bp are deleted # remaining Alignments : 33035 # unique Features these remaining alignments represent: 27691 % of total features these alignments represent : 2.38 % Frequency distribution of the remaining features # hits # features -------- -------- 1 23927 2 2645 3 779 4 266 5 53 6 13 8 7 9 0 10 0 20 0 30 1 40 0 50 0 100 0 Features that hit more than thrice are deleted. # remaining Alignments : 31554 # unique Features these remaining alignments represent: 27351 % of total features these alignments represent : 2.35 % % Identity distribution of the remaining features % Identity # features -------- -------- 10 0 20 0 30 0 40 0 50 0 60 0 70 0 80 0 90 0 95 23769 100 7785 Following is the distribution of gaps Gaps # features -------- -------- 1000 24658 2000 2930 3000 908 4000 174 5000 46 6000 325 7000 28 8000 186 9000 18 10000 2 Following is the final summary # alignments : 31554 # unique Features these alignments represent: 27351 % of total features these alignments represent : 2.35 %
Last modified: Thu Sep 13 15:01:03 2007