Gramene Release 22 August 2006
This release includes:.
Gramene website features:
- Newsletter : Starting in July, a Gramene Newsletter has been available online. The link to newsletters from the navigation menu (under "Resources") now provides links to many of the cereal industry, database, and genetic newsletters available online.
- Gramene Outreach :
- Workshop Resources : The presentation and exercise materials from the RiceCAP presentation in June have been added to our Workshop Materials resource page.
- Species pages have been completed for rice, maize, wheat, oats, barley, millet, rye, sorghum and wild rice (Zizania). Each organism has a Query summary page listing the information that is available from Gramene.
- Plant Biology Databases: A Needs Assessment - White Paper - Now available from Gramene, this is a summary of a collaborative review the anticipated needs of the plant genome research community for long-lived data collections. The authors offer guidelines for balancing the funding of data production projects with those aimed to manage and integrate the data. In particular, there is a pressing need to develop a trained cadre of skilled knowledge workers who are able to curate complex biological data, and to provide this cadre with a system of stable funding that enables data repositories to be established and maintained over extended periods of time. There is a positive current trend of species-specific databases to expand into comparative genomics-minded clade-oriented databases, but the authors caution that new technologies are needed to facilitate the transparent integration of data among these databases.
Gramene Database:
- Genomes: In an effort to improve navigation through the website, there are significant changes to the page layout at Gramene's Genomes module. (See: More info)
- Maps: Changes to the Maps search view simplify your map search, and additional maps are now available. For a list of all the changes see ... (More info)
- Markers: In addition to an increase in marker information, the new detail information layout uses tabbed navigation to minimize scrolling through long pages of information, putting information right at your fingertips. (More info)
- Proteins: In addition to additional data, the Proteins database now has an advanced search and links to the TIGR rice gene models for a best match. (More info)
- Ontologies: The Ontologies database has grown with additional information, and has a few minor changes in the display. (More info)
- Genes: Several hundred genes have map positions on commonly used genetic maps, as well as the Gramene annotated Nipponbare Sequence 2006. In addition, the detail view for a gene has been enhanced to provide more information.(More info)
- QTL: Additions include information from O. Sativa crosses with wild relatives, as well as Sorghum QTL(More info)
- Pathways: With addition pathways added since the first release, we now present RiceCyc version 1.1. (More info)
- Diversity: (More info)
Genomes Release Notes
New Genomes Data
- Arabidopsis TAIR6 assembly and annotation
- Maize FPC contigs from the AGI 19 Jul 2005 agarose map
- Updated Syntenic regions between the maize FPC map (AGI maize FPC map released on April 5, 2006) and the rice genome (TIGR v4)
- New contigview tracks added to Rice genome browser (TIGR v4)
- Sorghumn_WGS: Sorghumn WGS reads downloade from NCBI trace archive as of June 7, 2006 (mapping document)
- Rice_ArrayOligo_Yale58K: Rice Gene based array Oligs from Timothy Nelson's lab at Yale University (mapping document)
- Rice_MpssTag: The rice oligos from Blake Meyers' lab (mapping document)
- Rice_ArrayOligo_NSF20K: The rice array oligos from NSF Rice Oligonucleotide Array Project (mapping document)
New Genomes Features
The current Gramene Genome Bowsers are developed on top of ensembl-39. Compared with the last build based on ensembl-30, it has had a major redesign, both on the surface and 'under the bonnet'. Most of the dynamic pages (e.g. ContigView, GeneView) have only minor cosmetic changes - in addition to the new template and stylesheets, some links have been moved from the body of the view to the new side-bar.
- New navigation options in ContigView
ContigView navigation has been enhanced with new popup menus and navigation features:
- Click on any blank spot in Overview, Detailed View or Basepair View to get a zoom menu popup
- "Drag'n'zoom" navigation in Overview, Detailed View and Basepair View panels - click anywhere and drag your mouse diagonally to select an area, and then choose an option from the popup menu
- Click on the DNA sequence in Overview to show a popup menu allowing centering of the display and export of clones, contigs, etc.
- In Overview, click on any feature to get a popup of information with a link directly to GeneView
- All popup menus are persistent, so you can open several at once and compare the contents (click on the 'x' in the top right corner to close), and can be "minimised" to show only their title bars (by clicking on the arrow next to the 'x').
- New 'subviews'
- GeneSeqView - displays the sequence section from GeneView
- GeneSpliceView - shows alternative splicing for a gene
- GeneSeqalignView - This new view is based on GeneSeqView, with the added ability to align the given sequence with one or more species.
- Changes to DAS functionality
1. Entrez Gene has been added to the list of DAS mapping types, so it is now possible to display features from DAS sources based on Entrez Gene IDs.
2. Multiple type DAS sources are supported, i.e if a source has a mixture of features, some of which are based on UniProt IDs and some on EnsEMBL Gene IDs, it is now possible to select both mapping types for the same source. This will result in a multiple segment DAS request, e.g features?segment=UNIPROT_ID;segment=ENSEMBL_GENE
3. You can now "link" DAS features, by providing "Previous" and "Next" fields, so that corresponding items are added to the feature's popup menu.
4. Added functionality to add GeneDAS sources through URL. E.g. /geneview?gene=BRCA2;add_das_source=(url=http://myhost:9999/das+dsn=mouse_ko_allele+type=markersymbol+name=MySource+active=1) will attach the MySource DAS source to the Ensembl browser of whomever clicks on the link
5. Data can be displayed on ContigView as a signal map (i.e. as levels above and below a threshold) instead of as a histogram, by selecting this option when configuring the display.
- Karyoview updates
Karyoview has been reorganised to make it easier to create multiple images (e.g. different chromosomes or track colours) from the same set of data. All data is now cached temporarily on our server, allowing you to reconfigure all aspects of the image without re-inputting your data.
Maps Release Notes
The Map Module has been updated to the latest CMap 0.16 release. There have been numerous changes to both the code base and user interface, which should improve query response times. Among the user interface improvements are:
- Ability to filter feature types on thhe reference map before the map is drawn.
- Selection of comparative maps now takes place under the "Map Options" menu. Look for the "Add Map Left" and "Add Maps Right" links.
- The "Stack Vertically" option reduces clutter when comparing multiple maps.
- The list of appropriate comparative maps are loaded from the server only after the user has selected "Add Maps Left/Right". This allows for faster page rendering and allows for the user to select display and other options without waiting for a query to return from the server.
Updated Sequence Map
New and Updated Physical Maps
New QTL Maps
- Rice-INDIR IR58025/O. rufipogon BC SSR QTL 2005
- Rice-UGA BTx623/IS3620C RL SSR RFLP QTL 2006
- Rice-UGA BTx623/S. propinquum F2 RFLP SSR QTL 2006
Updated QTL Maps
- Rice-Cornell IR64/Azucena DH QTL
- Rice-KRGRP Milyang23/Gihobyeo AFLP-SSLP-RFLP QTL 1998
- Rice-TAMU Lemont/Teqing RI RFLP QTL 1995
- Rice-JRGP Nipponbare/Kasalath RFLP QTL 2000
Markers Release Notes
The marker database now contains a total of 11,932,099 markers from 217 species.
Marker breakdown by type
Marker Type Markers AFLP 1,230 Centromere 12 Clone 2,390,592 EST 3,675,648 EST Cluster 1,325,059 FPC 1,701 Gene 8,673 Gene Model 5,7501 Gene Primer 19 GSS 4,312,690 Maize Bin 100 mRNA 98,737 OVERGO 15,575 Primer 62 RAPD 133 RFLP 19,643 SSR 16,885 STS 2,213 Undefined 5,621 Marker breakdown by species
Species Markers Barley (Hordeum spp.) 638,042 Maize (Zea spp.) 4,044,148 Oat (Avena spp.) 8,232 Rice (Oryza spp.) 4,412,024 Rye (Secale spp.) 13,346 Sorghum (Sorghum spp.) 1,265,320 Sugarcane (Saccharum spp.) 339,661 Wheat (Triticum spp. + Aegilops spp.) 1,063,010 Other 148,316
Proteins Release Notes
The Gramene protein database provides curated information on SP-Trembl entries from family Poaceae (Grasses). The annotations include Pfam, Prosite, TMHMM (for transmembrane domains), TargetP and Predotar (plastid, mitochrondrial and secretory pathway targeting) and Interpro assignments. Various ontologies such as Gene Ontology (GO), Plant Ontologies (PO+GRO) and Environment Ontology (EO) are used to provide functional characteristics.
- Total number of proteins: 77804
- Proteins from SWISS-PROT: 2502
- Proteins from TrEMBL: 75302
- Proteins with Pfam annotations: 49620
- Proteins with Prosite annotations: 24544
- Proteins with Signal peptide predicted by TargetP: 40177
- Proteins with Transmembrane domains predicted by TMHMM: 14678
The improvements in release 22 include:
- Advanced search was added (http://www.gramene.org/db/protein/protein_search) to allow customized query using a name, accession, Protein id (PID) from NCBI/Uniprot/DDBJ, SPTrEMBL ID, GenBank ID, cultivar (germplasm) as well as a selection of the species. In the species list only the major cereal crops are listed. IF your species of interest is not in that list please select "All Poaceae family".
- The rice (Oryza) protein detail pages (e.g. http://www.gramene.org/db/protein/protein_search?acc=P93436) now provide specific link to the TIGR rice gene models for a best match. If you are interested in finding out more number/suitable hits please continue using the BLASTP query link.
Ontologies Release Notes
Various ontologies and their associations to genes, gene models, proteins, QTL, markers and maps were updated. Below is a summary of the ontologies data.
- Gene Ontology : 1674 terms with about 2,45,000 associations
- Plant Ontology : 778 terms with 1354 associations
- Gramene Plant Growth Stage Ontology [236 terms with 1893 associations
- Trait Ontology : 769 terms with about 11,700 associations
- Gramene Taxonomy Ontology : 2407 terms with 97,518 associations
- Environment Ontology : 490 terms with 348 associations
Improvements on a term's detail page include:
Anchor links for "Term" and "Annotations". These provide quick browsing to the appropriate section on the same page. Ordering. The list of species in annotations section is now pre-sorted alphabetically from left to right, then top to bottom.
To view these improvements see http://www.gramene.org/db/ontology/search_term?id=GO:0044425 as an example.
Genes and Alleles Release Notes
Continually growing, Gramene release 22 presents a further improvement based on the previously reorganized gene search interface.
Content: Number of genes
- Total -- 8542
- Rice -- 1863 (65 New genes)
- Maize -- 6676
- Sorghum -- 3 (all new)
In order to provide users with the ability to compare gene positions across various genetic and sequence maps, about 800 rice genes now have map positions on most commonly used genetic maps, the "Cornell BS125/2/BS125/WL02 RFLP 2001 Map", the "Cornell IR64/Azucena SSR 2001 Map", the "JRGP Nipponbare/Kasalath RFLP 2000 Map", and the "Hokkaido Morphological 2000 Map". Of these about 250 have mappings to the gene models found on sequence map "Gramene Annotated Nipponbare Sequence 2006". About 45% of the maize genes have been mapped to the "Maize Bins Map".
The Gene details now provide mappings to the "Plant Growth and Development Stages" ontology provided by the Plant Ontology project. This is in addition to the already displayed "Cereal Plant Growth Stages" developed by us. For example go to the gene 'd1'.
A more detailed genes database statistics report can be found here.
QTL Release Notes
The Gramene QTL database now contains a total of 10,791 QTL identified for numerous agronomic traits in rice, maize, barley, oat, wheat, sorghum, pearl millet, foxtail millet and wild rice.
Around 200 new rice QTLs related to yield and other important traits have been curated from recent publications and added to the database. These new QTLs have been mostly identified from genetic populations derived from inter-specific crosses between cultivated rice (Oryza sativa) and its wild rice relatives such as Oryza rufipogon and Oryza glaberrima. The identification of those QTLs will provide new avenues to increase the yield of rice.
Another improvement in this release is the inclusion of sorghum QTLs, almost 100 of which where detected from sorghum inter-specific and intra-specific populations. Those QTLs are associated with several traits with agronomic importance such as grain weight, leaf senescence and maturity.
A more detailed QTL database statistics report can be found here.
Pathways Release Notes
The Pathway tool is a web based tool for viewing gene annotations mapped to various biochemical pathways in plants, rice (Oryza sativa) and Arabidopsis thaliana and bacteria E. coli. This tool also allows you to draw comparisons among the data sets from the above three species.
This is version 1.1 of the rice pathways. These pathways (also called RiceCyc) are curated by Gramene. The rice genes and their annotation used in this analysis was based on release 4 of the TIGR's assembly of Oryza sativa japonica cv.Nipponbare genome sequenced by IRGSP. Arabidopsis thaliana and E. coli pathways were imported from the AraCyc and EcoCyc project sites. Neither AraCyc nor EcoCyc have been updated in the August release of Gramene.
RiceCyc version 1.1 has following contents.
- Pathways: 323
- Enzymatic Reactions: 1687
- Transport Reactions: 5
- Polypeptides: 43172
- Protein Complexes: 4
- Enzymes: 10387
- Transporters: 62
- Compounds: 1265
Since the 1.0 release, seven new pathways have been added to RiceCyc; these pathways were added based upon information available from AraCyc. These pathways mainly relate to Gibberellin and include GA12 biosynthesis; ent-kaurene biosynthesis; and gibberellin synthesis I, II and III. The NAD biosynthesis II from tryptophan has also been added; this is based on the fact that in eukaryotes this pathway starts with tryptophan. Additionally the super pathway of pantothenate and coenzyme A biosynthesis has also been added so as to take into account missing components of the pathway.
Based upon information made available from the Nelson lab at Yale University new information related to NADP-dependent malic genes has been added to the dataset. If any groups have knowledge of genes that they feel should be added to the RiceCyc dataset please feel free to contact the Gramene curators. A peptides link to the gene model pages in rice genome. Similarly if the gene model has been mapped to a biochemical pathway it links to particular reaction in pathway database from the genome browser.
This tool also allows you to upload your own gene expression data on the pathways to visualise an overview of the cellular level expression profile. For more info please visit http://www.gramene.org/pathway/.
Diversity Release Notes
The Gramene Genetic Diversity database contains SSR allelic data for rice, SNP data for Wheat, and phenotypic, SSR and SNP data for maize. Allelic variation on loci of multiple germplasms of a species and genome-wide allelic variation of germplasms can be viewed by searching for locus/marker name, germplasm name or accession number.
On this release,
- Rice: 419 germaplasms; 280 SSR markers.
- Wheat: 48 germplasms; 3802 SNP markers.
- Maize: 2793 germplasms; 520 SSR and 897 SNP markers.
Click here for database summary.