This reports the protocol used to align the Rice_CDS features to Oryza_sativa_indica-chromosome-20070724. Tue Aug 7 16:25:07 2007 Source of Rice_CDS : Downloaded from Genbank with query '(txid4530[ORGN] AND complete[TITL] AND cds[TITL]) NOT (Mitochondrion[ALL] OR Chloroplast[ALL] OR Mitochondrial[ALL]) )' Alignment procedure details --------------------------- 72229 Rice_CDS are aligned to Oryza_sativa_indica-chromosome-20070724 using blat with blat parameters -minScore=120 followed by PslReps with -singleHit. This was followed by a filtering procedure described below and applied in general to 'Coding-SameSpecies' data sets. Initial summary # alignments : 74502 # unique Features these alignments represent: 68217 % of total features these alignments represent : 94.45 % The following is the distribution of the feature coverage %coverage no of alignments -------- -------- 9 32 19 130 29 131 39 214 49 293 59 665 69 840 79 1207 89 2364 90 411 91 526 92 590 93 843 94 1103 95 1547 96 2331 97 3200 98 4735 99 9169 100 44171 Alignments less than 95 % coverage are deleted # remaining Alignments : 63618 # unique Features these remaining alignments represent: 58825 % of total features these alignments represent : 81.44 % GAP distribution of the remaining features Gaps # alignments -------- -------- 1000 32951 2000 11542 3000 7641 4000 3775 5000 2012 6000 1204 7000 761 8000 455 9000 385 10000 252 20000 992 Alignments with gaps > 4000 bp are deleted # remaining Alignments : 55909 # unique Features these remaining alignments represent: 52180 % of total features these alignments represent : 72.24 % % Identity distribution of the remaining features % Identity # alignments -------- -------- 90 0 91 0 92 0 93 0 94 44 95 615 96 1382 97 2748 98 6006 99 36295 100 8819 Frequency distribution of the remaining features # hits # features -------- -------- 1 50442 2 1064 3 272 4 171 5 86 6 45 8 40 9 17 10 8 20 30 30 3 40 0 50 1 100 1 Features that hit more than four times are deleted. # remaining Alignments : 54070 # unique Features these remaining alignments represent: 51949 % of total features these alignments represent : 71.92 %
Last modified: Thu Sep 13 15:01:03 2007