Presentation is loading. Please wait.

Presentation is loading. Please wait.

Creating An Allele Index For NPGS: Bioinformatic Issues Edward Buckler USDA-ARS at Cornell University, Ithaca, NY.

Similar presentations


Presentation on theme: "Creating An Allele Index For NPGS: Bioinformatic Issues Edward Buckler USDA-ARS at Cornell University, Ithaca, NY."— Presentation transcript:

1 Creating An Allele Index For NPGS: Bioinformatic Issues Edward Buckler USDA-ARS at Cornell University, Ithaca, NY

2 AIM: Make more useful plants by conserving, finding and combining better alleles. NEED: The National Germplasm conserves 464,000 accessions and may contain 100,000,000 distinct alleles, but there is no index.

3 Population structure Familial relatedness Genetic mapping is the basis of the index, and QTL mapping approaches now exist for virtually all types of populations. Near gene level resolution achieved in multiple species Identification of genes controlling flowering, starch, nutrients, wood quality Positive Results in: Maize Rice Arabidopsis Conifers

4 What needs to happen? Genotyping (0.5Mdp per accession) Phenotyping (500dp per accession) Bioinformatics (GRIN) Mapping Tools Breeder Decision Tools

5 What data is currently available outside NPGS? Several large NSF Plant Genome projects on diversity with NPGS germplasm at the heart of these projects Numerous smaller projects (however, most data gets lost over time from these) Millions of genotypic and phenotypic data points in just maize, wheat, and rice projects. Database aware analysis tools (eg. TASSEL)

6 GDPDM Gramene Panzea (Maize) Rice Evol. TASSEL Other Analysis Tools Alignment & SNP Display DBs Middleware Analysis Panzea Web Data Access Display Upload Tools GDPC Germinate GDPC Data Browser GRIN GRIN?

7 GDPDM Germplasm Genotype Phenotype Environment Used by maize, wheat, and rice diversity projects.

8 GDPDM Gramene Panzea (Maize) Rice Evol. TASSEL Other Analysis Tools Alignment & SNP Display DBs Middleware Analysis Panzea Web Data Access Display Upload Tools GDPC Germinate GDPC Data Browser GRIN

9 Purpose The purpose of GDPC is to simplify access to the large genomic and phenotypic datasets that are becoming available in plant biology. www.maizegenetics.net/gdpc

10 GDPC Data Flow Diagram www.maizegenetics.net/gdpc

11 GDPC Data Flow Diagram www.maizegenetics.net/gdpc

12 Databases Where has GDPC been mapped? Panzea (GDPDM schema) Gramene (GDPDM) Germinate (generic schema) GRIN (passport data) www.maizegenetics.net/gdpc

13 GDPC: Select Data Service

14 GDPC: Select Taxa

15 GDPC: Nucleotide Data

16 GDPC: Trait Data

17 GDPC Browser Demo GDPC: Marker Data

18 Current GDPC Limitations XML is not efficient for large datasets –Several avenues are possible for improving efficiency More visualization and analysis tools need to be developed –Linkage Mapping –Breeder Decision Tools –Geographic interfaces –Pedigree Interfaces (in progress)

19 What should GRIN consider? Becoming the lead repository for genotypic and phenotypic diversity data Lead efforts for the consolidation of community diversity data Implement several middleware or web services standards (eg. GDPC and perhaps others IRRI) Collaborate on the development of data visualization tools

20 All of the software can be accessed through www.maizegenetics.net www.sourceforge.net www.panzea.org www.gramene.org www.maizegenetics.net www.sourceforge.net www.panzea.org www.gramene.org


Download ppt "Creating An Allele Index For NPGS: Bioinformatic Issues Edward Buckler USDA-ARS at Cornell University, Ithaca, NY."

Similar presentations


Ads by Google