This reports the protocol used to align the RiceJaponica_BACend_OMAP features to Oryza_sativa_indica-chromosome-20070724.
Wed Aug  8 10:04:35 2007


Source of RiceJaponica_BACend_OMAP : Downloaded from genbank using the query '(CUGI Rice BAC end) AND (oryza [ORGN])' and only high quality sequence is extracted based on the information from genbank records   

Alignment procedure details 
--------------------------- 

328959 RiceJaponica_BACend_OMAP are aligned to Oryza_sativa_indica-chromosome-20070724 using blat with blat parameters -minScore=160 followed by PslReps with -minAli=0.90 -nearTop=0.01 -singleHit. This was followed by a filtering procedure described below and applied in general to 'Same Species Genomic' data sets.

Initial summary
# alignments : 394178
# unique Features these alignments represent: 304416
% of total features these alignments represent : 92.54 %

Following is the GAP distribution 
Gaps	# alignments
--------	--------
0	 153408
1	 38854
2	 20617
3	 12771
4	 9301
5	 7595
6	 7069
7	 6581
8	 5888
9	 5101
10	 4719
20	 27781
30	 12868
40	 7725
50	 5222
60	 3671
70	 2779
80	 2178
90	 1990
100	 1668
200	 7123
300	 4015
400	 3233
500	 2411
600	 1470
700	 1306
800	 1253
900	 1082
10000	 17294

Features with gaps > 40 bp are deleted 
# remaining Alignments : 320278
# unique Features these remaining alignments represent: 249594
% of total features these alignments represent : 75.87 %

 Following is the distribution of by feature coverage 
%coverage	# alignments
--------	--------
9	 2
19	 187
29	 4442
39	 9326
49	 13854
59	 21989
69	 31426
79	 34238
89	 45885
90	 6087
91	 6787
92	 7122
93	 7848
94	 8328
95	 8438
96	 9090
97	 12067
98	 16790
99	 23638
100	 52734

 Features less than 90 % coverage are deleted. 
# remaining Alignments : 152985
# unique Features these remaining alignments represent: 122343
% of total features these alignments represent : 37.19 %

% Identity distribution of the remaining features
% Identity	# alignments
--------	--------
90	 185
91	 273
92	 547
93	 1045
94	 2122
95	 4021
96	 8392
97	 17221
98	 33965
99	 51506
100	 33708

 Features less than 92 % identity are deleted. 
# remaining Alignments : 152527
# unique Features these remaining alignments represent: 121994
% of total features these alignments represent : 37.08 %

Frequency distribution of the remaining features
# hits	# features
--------	--------
1	 111396
2	 5779
3	 1693
4	 967
5	 616
6	 382
8	 420
9	 128
10	 130
20	 367
30	 75
40	 23
50	 5
100	 9

 Features that hit more than thrice are deleted.  
# remaining Alignments : 128033
# unique Features these remaining alignments represent: 118868
% of total features these alignments represent : 36.13 %


  

Last modified: Thu Sep 13 15:01:03 2007