Sook Jung, Taein Lee, Stephen Ficklin, Kate Evans, Cameron Peace and Dorrie Main.

Slides:



Advertisements
Similar presentations
Data overload – Breeding decision-support software to the rescue! S. Jung, Taein Lee, Kate Evans, Cameron Peace, Gennaro Fazio, Sushan Ru, Amy F. Iezzoni,
Advertisements

Cameron Peace, Washington State University
How to use GDR, the Genome Database for Rosaceae Sook Jung, Stephen Ficklin, Taein Lee, Chun-Huai Cheng, Anna Blenda, Jing Yu, Sushan Ru, Kate Evans, Cameron.
GDR, the Genome Database for Rosaceae, in Chado and Tripal Sook Jung, Stephen Ficklin, Taein Lee, Chun-Huai Cheng, Anna Blenda, Sushan Ru, Ping Zheng,
Lacey-Anne Sanderson A Toolkit for Construction of Genomic and Genetic Websites.
GDR/CottonGen: Converting legacy sites to Tripal Sook Jung, Jing Yu, Taein Lee, Chun-Huai Cheng, Stephen Ficklin, Dorrie Main.
Integrating Phenotypic Data With Genomic, Genetic and Genotypic Data Using Chado Sook Jung, Taein Lee, Stephen Ficklin, Jing Yu, Dorrie Main.
A Construction Toolkit For Online Biological Databases Lacey-Anne Sanderson.
Ten years of GDR Current Resources and Functionality S Jung, T Lee, S Ficklin, CH Cheng, P Zheng, A Blenda, S Ru, K Evans, C Peace, N Oraguzie, AG Abbott,
GDR What’s New and What’s Next Dorrie Main, Sook Jung, Stephen Ficklin, Taein Lee, Chun-Huai Cheng, Anna Blenda, Jing Yu, Ping Zheng, Sushan Ru, Julia.
New Data and Functionality of GDR, the Genome Database for Rosaceae Sook Jung, Taein Lee, Stephen Ficklin, Chun-Huai Cheng, Ping Zheng, Anna Blenda, Sushan.
GenSAS: Genome Sequence Annotation Server, a Tool for Online Annotation and Curation Dorrie Main, Taein Lee, Ping Zheng, Sook Jung, Stephen P. Ficklin,
Update in GDR, The Genome Database for Rosaceae S Jung, T Lee, S Ficklin, CH Cheng, I Cho, P Zheng, K Evans, C Peace, N Oraguzie, A Abbott, D Layne, M.
Dorrie Main, Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy and Don Jones.
Introduction to NRSP databases and other breeding databases.
Building Database Resources For Translational Research in Rosaceae Sook Jung, Taein Lee, Stephen Ficklin, Chun-Huai Cheng, Anna Blenda, Sushan Ru, Ping.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
Introducing NRSP10 Database Infrastructure for Specialty Crops Computer Applications in Horticulture/Teaching Methods Workshop ASHS Annual Conference 2015.
Lacey-Anne Sanderson A Toolkit for Construction of Genomic and Genetic Websites.
Jing Yu, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Ping Zheng, Taein Lee, Richard Percy, Don Jones, Dorrie Main.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
The GMOD Chado Natural Diversity Module Bob MacCallum Seth Redmond Imperial College London
GDR in Drupal facilitating community building and efficient maintenance.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
Digesting the Genome Glut Promoting the Use and Extension of GMOD To Emerging Model Organisms David Clements 1 Brian Osborne 2 Hilmar Lapp 1 Xianhua Liu.
NRSP10 Database Resources for Crop Genomics, Genetics and Breeding Research NRSP Crops Breeders Database Needs Focus Group Meeting July 30, 2015 Pullman,
Updates to the Cool Season Food Legume Genome Database Dorrie Main, Chun-Huai Cheng, Rebecca McGee, Clarice Coyne, Stephen Ficklin, Taein Lee, Sook Jung,
Development of a Cotton Marker Database (CMD) for Gossypium genome and genetic research CMD Main Goals Collect and integrate.
Jing Yu, Sook Jung, Chun-Huai Cheng, Taein Lee, Katheryn Buble, Ping Zheng, Jodi L. Humann, Deah McGaughey, Heidi Hough, Stephen P. Ficklin, B. Todd Campbell,
Progress on TripalBIMS Breeding Information Management System in Tripal Sook Jung, Taein Lee, Chun-Huai Chen, Jing Yu, Ksenija Gasic, Todd Campbell, Kate.
Jing Yu 1, Sook Jung 1, Chun-Huai Cheng 1, Stephen Ficklin 1, Taein Lee 1, Ping Zheng 1, Don Jones 2, Richard Percy 3, Dorrie Main 1 1. Washington State.
GDR Workshop Tuesday 21st, 2016 RGC8 2016
Using the Field Book App and BIMS in GDR for Peach Breeding
5/12/2018 Genome Database for Rosaceae New Data and New Functionality
“Big Data”, tree fruit and the Genome Database for Rosaceae
Resources Available for Fragaria Research through the Genome Database for Rosaceae Dorrie Main, Sook Jung, Chun-Huai Cheng, Stephen Ficklin, Taein Lee,
Behavior and Phenotype in GMOD Natural Diversity in GMOD
The is a Critical Resource for Developing and Refining Trait-Predictive DNA Tests Cameron Peace, Daniel Edge-Garza, Terry Rowland, Paul Sandefur.
Breeding Information Management System
9/11/2018 Genome Database for Rosaceae Since RGC7
CottonGen: An Up-to-Date Resource Enabling Genetics, Genomics and Breeding Research for Crop Improvement Plant and Animal Genome Conference XXV Jing Yu1,
The Cool Season Food Legume Database: An Integrated Resource for Basic, Translational and Applied Research Dorrie Main, Chun-Huai Cheng, Stephen Ficklin,
A Breeders Perspective on using the Breeding Information Management System for Cotton Breeding Todd Campbell, Taein Lee, Sook Jung, Jing Yu, Don Jones.
Genome Database for Rosaceae
the Genome Database for Rosaceae: New Data and Functionality
CottonGen An Online Resource for the Cotton Community
Plant and Animal Genome Conference XXIV
Updates to the CSFL Genome Database:
for the Cotton Community
Updates and Future Direction
CottonGen BIMS for Effective and Efficient Management of Breeding Data
Membership Login/sign in
Using CottonGen for Crop Improvement
Genome Database for Rosaceae:
Membership Login/sign in
Membership Login/sign in
Jing Yu1, Taein Lee1, Sook Jung1, Don C. Jones2, B
Jing Yu, Taein Lee, Sook Jung, Don Jones, Todd Campbell,
BIMS (Breeding Information Management System)
How to Effectively Search and Download Data in CottonGen
CottonGen: Enabling Cotton Research through Big-Data Analysis and Integration Jing Yu, Sook Jung, Chun-Huai Cheng, Taein Lee, Katheryn Buble, Ping Zheng,
New Data and Functionality in NRSP10 Databases
Key CottonGen Features For more information contact:
2016 Beltwide Cotton Conference
Resources for HLB and Citrus Genomics, Genetics and Breeding Research
Germplasm Overview Page Trait Descriptor Standard Images
Membership Login/sign in
Visualization of Conserved Syntenic Blocks Among Six Cotton Genomes in CottonGen Ping Zheng, Sook Jung, Chun-Huai Cheng, Jing Yu, Heidi Hough, Josh Udall,
For more information contact:
Presentation transcript:

Sook Jung, Taein Lee, Stephen Ficklin, Kate Evans, Cameron Peace and Dorrie Main

GDR: Genome database for Rosaceae Genomic, Genetic and Breeding data Other databases: CottonGen, Citrus Genome Database, Cool Season Food Legume Database, Genome Database for Vaccinium  Using open source tools for an efficient and flexible database construction (Chado, Tripal, Drupal)  Chado, with the recent Natural Diversity Module, allows integration of complex biological data from widely different projects and species Introduction

Part I: How to store data using Chado Part II: Demo of GDR Breeding Database Outline

Chado: Modular, Generic and Ontology-driven schema Feature Feature_id Name Uniquename Type_id Organism_id residues Feature_relationship Feature_relationship_id Subject_id Object_id Type_id Featureprop Featureprop_id Feature_id Type_id Value rank cvterm cvterm_id Name definition cv_id Dbxref_id gene, mRNA, marker, QTL, etc Abc- mRNA part_of Abc-gene Repeat_motif Product_size Subject_id object_id cv cv_id Name definition Sequence Ontology, Gene Ontology, etc

Storing Stock (from samples to population; pedigree) stock stock_id Name Uniquename Type_id Organism_id residues stock_relationship Feature_relationship_id Subject_id Object_id Type_id stockprop stockprop_id stock_id Type_id value cvterm cvterm_id Name definition cv_id Dbxref_id Population, cultivar, breeding line, clone, sample, etc Gala-001 sample_of Gala Description, population_size Subject_id object_id Gala Maternal_parent_of Sonya pedigree

Storing phenotype data (from measurements to projects) stock Feature_id Name Uniquename Type_id Organism_id residues nd_experiment Nd_experiment_id Nd_geolocation_id Type_id phenotype phenotype_id Uniquename value attr_id cvterm cvterm_id Name definition cv_id Dbxref_id Phenotyping Genotyping Cross_experiment project Featureprop_id Feature_id Type_id value NE_stock NE_phenoty pe project_relationsh ip NE_project

Genotypic data integrated with genomic/genetic data nd_experiment Nd_experiment_id Nd_geolocation_id Type_id genotype genotype_id name Uniquename description NE_genotype feature_genotype Feature Feature_id Name Uniquename Type_id Organism_id residues project stock uniquename: CPSCT038_190|192 description: 190:192 Uniquename:CPSCT038 Type:microsatellite map Explore sequences around marker in GBrowse

Relationship between genotype and phenotype (haplotype and haplotype effect) nd_experiment Nd_experiment_id Nd_geolocation_id Type_id genotype genotype_id name Uniquename description NE_genotype feature_genotype Feature Feature_id Name Uniquename Type_id Organism_id residues project stock uniquename: MA_H3|H4b description: H3|H4b Uniquename:Ma Type:MTL map phenotype phenotype_id Uniquename value attr_id NE_phenotype phenstatement phenstatement_id Type_id Genotype_id phenotype_id Environment pub attr_id: crisp value: 2.2 Germplasm with H3|H4b alleles of MA locus has value of 2.2 for crisp

Data Management (Browse, Search and Download) Data Conversion (Generate Input files for Pedimap) Decision Support Cross Assist Trait Locus Warehouse Marker Converter GDR Breeding Database Demo

10 Phenotypic Data Search

11

12 Genotypic Data Search

o A web interface to generate a list of parents and the number of seedlings to get the progeny with desired traits o Methods  “Phenotype” (uses only phenotypic information of individuals in the dataset),  “+Pedigree” (uses both phenotypic and pedigree information)  “+Ped+DNA” (uses phenotypic, pedigree information and information provided by DNA- based functional genotypes). Cross Assist

Step 1: Select Method

Step 2: Select target number and trait thresholds

Step 3: Filter results by data completeness, required number of seedlings, and parentage

Future Development o Data  RosBreed QTLs and their genome positions  More breeding data and DNA based functional genotypes  More re-sequencing data o Functionality  Data management: online data submission and editing  Viewing data on screen and generating report pages  Decision support tools o Cross Assist: o to accommodate more complex situations (selfing, cross compatibility, etc) o To upload users’ own data o Further develop more tools

Natural diversity module working group Naama Menda, Seth Redmond, Robert M. Buels, Maren Friesen, Yuri Bendana, Lacey-Anne Sanderson, Hilmar Lapp, Taein Lee, Bob MacCallum, Kirstin E. Bett, Scott Cain, Dave Clements, Lukas A. Mueller and Dorrie Main Main Lab team All Project CoPIs (tfGDR, RosBreed and CottonGen) Funding Sources USDA NIFA SCRI, NSF Plant Genome Program, USDA-ARS, Washington Tree Fruit Research Commission, Cotton Incorporated, Washington State University, Clemson University, University of Florida, Boyce Thompson Institute, North Carolina State University Taein Lee Stephen Ficklin Chun-Huai Cheng Ping Zheng Anna Blenda Sushan Ru Dorrie Main Jing Yu Acknowledgement

Any Questions?