How to Effectively Search and Download Data in CottonGen

Presentation transcript:

How to Effectively Search and Download Data in CottonGen
Chun-Huai Cheng, Jing Yu, Sook Jung, Ping Zheng, Jodi Humann, Taein Lee, Morgan Frank, Deah McGaughey, Amita Mohan, Heidi Hough, Josh Udall, Don Jones and Dorrie Main
2018 International Cotton Genomics Initiative Conference, May 28 – June 1, 2018, Edinburgh, Scotland
Hi, my name is Chun-Huai. I’m one of the developers of the CottonGen website. Today, I’d like to introduce you to the search functionality on CottonGen, and I hope my presentation will help you find the data you are interested in quickly and effectively.

CottonGen Search Interface and Searchable Content
CottonGen is a curated and integrated web-based relational database. It hosts genomic, genetic and breeding data, including whole genome sequences, reference transcriptomes, gene sequences from NCBI, pathways, genetic maps, and trait, germplasm, marker, breeding, publication and community member data. Because of the vast amount of information, finding just the data you need can be challenging. On the website, all search interfaces can be accessed through the menu toolbar. Some of the more complex searches offer sub-menus or grouped tabs that return different results. The figure on the right shows the searchable content types on CottonGen.
For example, you can search germplasm by name, by collection, by pedigree, by country of origin, or even by the description of its image. The Genotype Search allows you to search genotype data for either SNP or SSR markers. The Trait Evaluation Search allows you to search on up to three traits and returns the germplasm that match the trait descriptions. Other searchable content includes Colleague, Publication, Map, QTL, etc. The results can be downloaded as a table in CSV format. For sequence-type searches such as the Gene and Transcript Search, Sequence Search, and Marker Search, a FASTA file can also be downloaded in addition to the result table.
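Once a result table (CSV) and a FASTA file have been downloaded, they can be combined locally. The following is a minimal sketch, not part of CottonGen itself: the column names `Name` and `Organism`, the record identifiers, and the sample data are made up for illustration, and a real download may use different headers.

```python
import csv
import io


def read_result_table(csv_text):
    """Parse CSV text (e.g. a downloaded result table) into a list of row dicts."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def read_fasta(fasta_text):
    """Parse FASTA text into a dict mapping record ID to sequence."""
    records = {}
    current_id = None
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token after '>' as the ID.
            tokens = line[1:].split()
            current_id = tokens[0] if tokens else ""
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    return {rid: "".join(parts) for rid, parts in records.items()}


# Hypothetical sample data; column names and IDs are assumptions.
sample_csv = "Name,Organism\ngene_A,Gossypium hirsutum\ngene_B,Gossypium raimondii\n"
sample_fasta = ">gene_A example record\nATGC\nGGTA\n>gene_B\nTTGACA\n"

rows = read_result_table(sample_csv)
seqs = read_fasta(sample_fasta)

# Join each table row to its sequence by name and report the sequence length.
for row in rows:
    print(row["Name"], row["Organism"], len(seqs[row["Name"]]))
```

To use it with real downloads, read each file with `open(path).read()` and pass the text to the two functions, adjusting the column names to match the actual table header.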

Initiate Search and Retrieve/Download Results
The search process usually involves four steps.
1. Select the desired search from the menu toolbar based on the content type. Here we show an example of the Gene and Transcript Search.
2. Apply the filters on the search form. For the Gene and Transcript Search, you can restrict the results by specifying an organism, or choose to return genes only from a specific genome assembly. You can also provide a list of gene names and retrieve the sequences for them, or use the Keyword field to return genes of a certain function.
3. Click the search button.
4. The results are returned in tabular format. From there, you can sort the results by clicking on a column header, follow a link to get detailed information on a result, or download the result table and/or a FASTA file.
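The same kind of filtering and sorting can be reproduced on a downloaded result table. The sketch below only illustrates the logic of the filters described above (organism, list of gene names, keyword) and is not how the website implements them; the field names `name`, `organism` and `description` and the sample rows are hypothetical.

```python
def filter_genes(rows, organism=None, names=None, keyword=None):
    """Return the rows that satisfy every filter that was supplied.

    Filters left as None are ignored, like empty fields on a search form.
    """
    wanted = {n.strip().lower() for n in names} if names else None
    matches = []
    for row in rows:
        if organism and row["organism"] != organism:
            continue
        if wanted is not None and row["name"].lower() not in wanted:
            continue
        # Keyword match is a case-insensitive substring test on the description.
        if keyword and keyword.lower() not in row["description"].lower():
            continue
        matches.append(row)
    return matches


# Hypothetical rows standing in for a downloaded result table.
genes = [
    {"name": "gene_A", "organism": "Gossypium hirsutum", "description": "cellulose synthase"},
    {"name": "gene_B", "organism": "Gossypium raimondii", "description": "cellulose synthase-like"},
    {"name": "gene_C", "organism": "Gossypium hirsutum", "description": "MYB transcription factor"},
]

# Restrict by organism and keyword at the same time.
hits = filter_genes(genes, organism="Gossypium hirsutum", keyword="cellulose")

# Sorting by a column, comparable to clicking a header in the result table.
hits_sorted = sorted(filter_genes(genes, keyword="cellulose"),
                     key=lambda r: r["name"], reverse=True)

print([r["name"] for r in hits])
print([r["name"] for r in hits_sorted])
```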

Example: Search Germplasm/Search Traits
In this example we show the Germplasm Searches. All search interfaces are grouped in tabs at the top of the form so that you can search by Name, Collection, Pedigree, Country, or Image. Here, if you type in a description of the image, for example "boll open", it will return a list of germplasm with that image description. From the result table, you can either click on the germplasm name to get its detailed information or click on the image to get a high-resolution image. On the right is another example. The Trait Evaluation Search allows you to find germplasm by specifying up to three traits. You can use the ‘AND’ or ‘OR’ operator to return the intersection or the union of the matching germplasm.
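The effect of the ‘AND’ and ‘OR’ operators can be illustrated with set operations. This is a sketch under assumptions: the germplasm names and trait matches are made up, and the function `combine_trait_matches` is a hypothetical helper, not a CottonGen function.

```python
def combine_trait_matches(matches_per_trait, operator="AND"):
    """Combine the germplasm matching each trait.

    'AND' returns germplasm matching every trait (intersection);
    'OR' returns germplasm matching at least one trait (union).
    """
    if not 1 <= len(matches_per_trait) <= 3:
        raise ValueError("specify between one and three traits")
    sets = [set(m) for m in matches_per_trait]
    if operator == "AND":
        return set.intersection(*sets)
    if operator == "OR":
        return set.union(*sets)
    raise ValueError("operator must be 'AND' or 'OR'")


# Hypothetical germplasm matching two trait descriptions.
long_fiber = ["germplasm_1", "germplasm_2", "germplasm_3"]
early_maturity = ["germplasm_2", "germplasm_3", "germplasm_4"]

both = combine_trait_matches([long_fiber, early_maturity], "AND")
either = combine_trait_matches([long_fiber, early_maturity], "OR")

print(sorted(both))
print(sorted(either))
```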

Example: Search Polymorphic SNP Markers
Download only polymorphic results
Lastly, I’d like to show you a search that was recently implemented. It is the SNP Genotype Search, which allows you to search for polymorphic SNP markers. You can filter the genotype data by Dataset, Species, Germplasm name, Marker name, etc. The results are presented as a table that lists the genotypes of each marker in different germplasm. Here, you can download the results as usual, or you have the option to download only the polymorphic results. Clicking on a marker name will take you to that marker’s details page.
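One way to think about the "polymorphic only" option is as a filter that keeps a marker when the selected germplasm show more than one distinct genotype for it. The sketch below assumes that definition, which the talk does not spell out; the marker names, genotype calls, and missing-data codes are also made up for illustration.

```python
# Assumed codes for missing genotype calls; real data may use others.
MISSING = {"", "-", "NA"}


def polymorphic_only(genotypes):
    """Keep markers with more than one distinct non-missing genotype call.

    genotypes maps marker name -> {germplasm name: genotype call}.
    """
    result = {}
    for marker, calls in genotypes.items():
        observed = {call for call in calls.values() if call not in MISSING}
        if len(observed) > 1:
            result[marker] = calls
    return result


# Hypothetical genotype table: rows are markers, columns are germplasm.
table = {
    "snp_1": {"germplasm_1": "AA", "germplasm_2": "AG", "germplasm_3": "AA"},
    "snp_2": {"germplasm_1": "CC", "germplasm_2": "CC", "germplasm_3": "CC"},
    "snp_3": {"germplasm_1": "TT", "germplasm_2": "-", "germplasm_3": "TT"},
}

polymorphic = polymorphic_only(table)
print(sorted(polymorphic))
```

Here `snp_2` is dropped because every germplasm has the same call, and `snp_3` is dropped because its only variation is a missing call.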

Acknowledged with thanks
Finally, I’d like to thank the organizations listed here for providing funding to support CottonGen. If you have any questions or suggestions, please don’t hesitate to use the contact form on the website to contact us, or catch us in person during the conference.
Any Questions?
Contact us: https://www.cottongen.org/contact
Cite us: Yu et al., 2014