This report describes the protocol used to align the RiceIndica_ESTcluster_BGI features to Oryza_sativa_indica-chromosome-20070724.
Tue Aug  7 18:46:40 2007


Source of RiceIndica_ESTcluster_BGI : Oryza indica clusters downloaded from http://btn.genomics.org.cn/ 

Alignment procedure details 
--------------------------- 

The 23559 RiceIndica_ESTcluster_BGI features were aligned to Oryza_sativa_indica-chromosome-20070724 using blat with the parameter -minScore=120, followed by pslReps with -singleHit. The alignments were then filtered as described below; this filtering procedure is applied in general to 'Coding-SameSpecies' data sets.
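The alignment step can be sketched as two command invocations. This is a minimal sketch, not the exact commands used for this data set: the function name and all file names are hypothetical placeholders, and only the -minScore=120 and -singleHit options come from this report. The argument order follows the standard usage of blat (database, query, output) and pslReps (in.psl, out.psl, out.psr).

```python
def build_alignment_commands(genome, query, raw_psl, best_psl, best_psr):
    """Return the blat and pslReps command lines as argument lists.

    All file names are supplied by the caller; none are taken from the report.
    """
    # blat <database> <query> [options] <output.psl>
    blat_cmd = ["blat", genome, query, "-minScore=120", raw_psl]
    # pslReps [options] <in.psl> <out.psl> <out.psr>
    psl_reps_cmd = ["pslReps", "-singleHit", raw_psl, best_psl, best_psr]
    return [blat_cmd, psl_reps_cmd]


# Hypothetical file names, for illustration only.
for cmd in build_alignment_commands(
    "genome.fa", "est_clusters.fa", "raw.psl", "best.psl", "best.psr"
):
    print(" ".join(cmd))
```

The lists can be passed to subprocess.run(cmd, check=True) one after the other if both tools are installed and on the PATH.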

Initial summary
# alignments : 23012
# unique Features these alignments represent: 22060
% of total features these alignments represent : 93.64 %
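
The percentage in this summary appears to be the number of unique features with at least one alignment divided by the 23559 input features. A minimal sketch of that arithmetic follows; the function name is hypothetical, and rounding to two decimals is an assumption based on how the figures are printed.

```python
TOTAL_FEATURES = 23559  # features submitted for alignment (from the report)


def percent_represented(unique_features, total_features=TOTAL_FEATURES):
    """Percentage of all features that have at least one alignment."""
    return round(100.0 * unique_features / total_features, 2)


print(percent_represented(22060))  # 93.64
```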

Distribution of the feature coverage
% coverage	# alignments
--------	--------
9	 0
19	 2
29	 5
39	 23
49	 51
59	 138
69	 204
79	 254
89	 469
90	 75
91	 77
92	 122
93	 134
94	 178
95	 201
96	 314
97	 505
98	 957
99	 2368
100	 16935

Alignments with less than 95 % coverage are deleted.
# remaining Alignments : 21082
# unique Features these remaining alignments represent: 20223
% of total features these alignments represent : 85.84 %

Gap distribution of the remaining alignments
Gaps (bp)	# alignments
--------	--------
1000	 15910
2000	 3000
3000	 1210
4000	 442
5000	 179
6000	 101
7000	 46
8000	 28
9000	 13
10000	 12
20000	 63

Alignments with gaps > 4000 bp are deleted.
# remaining Alignments : 20562
# unique Features these remaining alignments represent: 19717
% of total features these alignments represent : 83.69 %
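
The count after the gap filter can be checked against the gap table above. The sketch below assumes each bin label is an upper bound in bp, which is an interpretation and is not stated in the report; under that reading, the bins up to 4000 sum to the 20562 remaining alignments. The names gap_bins and alignments_within are hypothetical.

```python
# Gap bins copied from the table above: upper bound (bp) -> number of alignments.
gap_bins = {
    1000: 15910, 2000: 3000, 3000: 1210, 4000: 442,
    5000: 179, 6000: 101, 7000: 46, 8000: 28,
    9000: 13, 10000: 12, 20000: 63,
}

MAX_GAP = 4000  # cutoff stated in the report


def alignments_within(bins, max_gap):
    """Sum the alignments in all bins whose upper bound is <= max_gap."""
    return sum(count for upper, count in bins.items() if upper <= max_gap)


print(alignments_within(gap_bins, MAX_GAP))  # 20562
```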

% Identity distribution of the remaining features
% Identity	# alignments
--------	--------
90	 0
91	 0
92	 0
93	 0
94	 5
95	 41
96	 94
97	 300
98	 1280
99	 8815
100	 10027

Frequency distribution of the remaining features
# hits	# features
--------	--------
1	 19278
2	 405
3	 22
4	 5
5	 1
6	 1
8	 1
9	 0
10	 0
20	 2
30	 1
40	 0
50	 0
100	 0

Features with more than four hits are deleted.
# remaining Alignments : 20174
# unique Features these remaining alignments represent: 19710
% of total features these alignments represent : 83.66 %
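
The final counts can be reproduced from the hit frequency table above. This is a small consistency check rather than the original filtering code; the names hit_bins and MAX_HITS are hypothetical. It relies only on the rows for 1 to 4 hits, where the label is taken to be the exact number of hits per feature.

```python
# Hit frequency rows copied from the table above: hits per feature -> number of features.
hit_bins = {
    1: 19278, 2: 405, 3: 22, 4: 5, 5: 1, 6: 1, 8: 1,
    9: 0, 10: 0, 20: 2, 30: 1, 40: 0, 50: 0, 100: 0,
}

MAX_HITS = 4            # features with more hits are deleted
TOTAL_FEATURES = 23559  # features submitted for alignment

# Keep only the rows at or below the cutoff.
kept = {hits: n for hits, n in hit_bins.items() if hits <= MAX_HITS}

remaining_features = sum(kept.values())
# Each kept feature contributes as many alignments as it has hits.
remaining_alignments = sum(hits * n for hits, n in kept.items())
percent = round(100.0 * remaining_features / TOTAL_FEATURES, 2)

print(remaining_features)    # 19710
print(remaining_alignments)  # 20174
print(percent)               # 83.66
```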


  

Last modified: Thu Sep 13 15:01:03 2007