Plants.ensembl.org / www.transplantdb.eu The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic.

Slides:



Advertisements
Similar presentations
Genomics for Triticeae improvement FP7 European Project.
Advertisements

Advancing Science with DNA Sequence Maize Missouri 17 chromosome 10 project update Dan Rokhsar 3 October 2006.
Sequencing the Maize Genome Maize Genome Sequencing Consortium
Maize Genetics, Genomics, Bioinformatics workshop
Dan Bolser, EMBL-EBI transPLANT portal: Overview and search Versailles, 12th-13th November 2012 trans-National Infrastructure for Plant Genomic Science.
Analysis of the bread wheat genome using whole- genome shotgun sequencing Manuel Spannagl MIPS, Helmholtz Center Munich Analysis of the bread wheat genome.
Development of COS markers in grasses Isabelle Bertin, Pauline Stephenson and Michelle Leverington-Waite John Innes Centre.
Mission statement Barley (Hordeum vulgare L.) was one of the first domesticated cereal grains, originating in the Fertile Crescent.
Some Jolly Fun with Barley ESTs David Marshall & All the Folks in Computational Biology.
Whole Genome Sequencing &Crop Genetic Breeding Presentation: Wenhui Gao
The IWGSC: Building the sequence-based foundation for accelerated wheat breeding Kellye A. Eversole IWGSC Executive Director & The IWGSC Cereals for Food,
Expanding the Tool Kit for BAC Extension Summary of completion criteria developed for NSF Tomato Sequencing Workshop January 14, 2007.
16 and 20 February, 2004 Chapter 9 Genomics Mapping and characterizing whole genomes.
How to access genomic information using Ensembl August 2005.
Evaluation of PacBio sequencing to improve the sunflower genome assembly Stéphane Muños & Jérôme Gouzy Presented by Nicolas Langlade Sunflower Genome Consortium.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
The IWGSC: Strategies & Activities to Sequence the Bread Wheat Genome Kellye A. Eversole IWGSC Executive Director & The IWGSC Wheat Breeding 2014: Tools,
Puccinia graminis genome project Les J Szabo USDA ARS Cereal Disease Lab Department of Plant Pathology University of Minnesota.
Plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic.
RICE GENOMICS: Progress and prospects. What is genomics?  The genome of a plant, animal or microbe is the totality of its genetic information including.
GeVab: Genome Variation Analysis Browsing Server Korean BioInformation Center, KRIBB InCoB2009 KRIBB
Mouse Genome Sequencing
Tomato genome annotation pipeline in Cyrille2
Maps and Markers Gramene SAB Report Jan CMap Improvements Expanded, reorganized and hidden menus New map glyphs –Number of features –Crop map –Magnify.
Kerstin Howe, Mario Caccamo, Ian Sealy The Zebrafish Genome Sequencing Project Bioinformatics resources.
Rice Sequence and Map Analysis Leonid Teytelman. Rice Genome Annotation Sequence Alignments Automation Comparative Maps Genetic Marker Correspondences.
Genome Annotation and Databases Genomic DNA sequence Genomic annotation BIO520 BioinformaticsJim Lund Reading Ch 9, Ch10.
CUGI Pilot Sequencing/Assembly Projects Christopher Saski.
The New Zealand Institute for Plant & Food Research Limited Potato Genome Sequencing Consortium, notes from the edge Dr Susan Thomson, Dr Mark Fiers, Dr.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Whole genome scans to localise QTL X. Likely positionQTL Chromosome with mapped markers BAC Contig Spanning QTL region New MarkersCandidate Genes Fine.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
Sequence and Analysis of the Maize B73 Genome Doreen Ware 1,2, Joshua Stein 1, Apurva Narechania 1, Shiran Pasternak 1, Linda McMahan 1, Chengzhi Liang.
Genome Sequencing in the Legumes Le et al Phylogeny Major sequencing efforts Minor sequencing efforts ~14 MY ~45 MY.
APPLICATION OF MOLECULAR MARKERS FOR CHARACTERIZATION OF LATVIAN CROP PLANTS Nils Rostoks University of Latvia Vienošanās Nr. 2009/0218/1DP/ /09/APIA/VIAA/099.
SIZE SELECT SHEAR Shotgun DNA Sequencing (Technology) DNA target sample LIGATE & CLONE Vector End Reads (Mates) SEQUENCE Primer.
DAY 1c: Accessing Completed Genomes 1. UCSC Genome Bioinformatics 2. Ensembl 3. NCBI Genomic Biology.
Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library.
I. Introduction and Red Line Education for Data-unlimited Science.
Theobroma cacao Integrated Physical and Genetic Map 2 BAC Libraries 250 Genetic Markers.
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Comparative analyses of the potato and tomato transcriptomes
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
The Genome Assemblies of Tasmanian Devil Zemin Ning The Wellcome Trust Sanger Institute.
Maize Genome Project Shiran Pasternak January 13, 2006 Gramene SAB Meeting San Diego, CA Shiran Pasternak January 13, 2006 Gramene SAB Meeting San Diego,
CASE7——RAD-seq for Grape genetic map construction
US Sequencing Project Funded by NSF Two-year project Start date: Sept 1, 2004 Follow-up project for full sequencing of chromosomes 1, 10 and 11.
The Wellcome Trust Sanger Institute
A guided tour of Ensembl This quick tour will give you an outline view of what Ensembl is all about. You will learn: –Why we need Ensembl –What is in the.
BLAST Sequences queried against the nr or grass databases. GO ANALYSIS Contigs classified based on homology to known plant or fungal genes Next.
Accessing and visualizing genomics data
BIOL 433 Plant Genetics Term 2, Instructors: Dr. George Haughn Dr. Ljerka Kunst BioSciences 2239BioSciences Tel
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
26 th July 2006 Christine Nicholson, Mapping Core Group Karen McLaren, Finishing Group Leader Wellcome Trust Sanger Institute Sequencing the Gene Space.
454 Genome Sequence Assembly and Analysis HC70AL S Brandon Le & Min Chen.
Welcome to the combined BLAST and Genome Browser Tutorial.
1 Comparative analyses of the potato and tomato transcriptomes David Francis, AllenVan Deynze, John Hamilton, Walter De Jong, David Douches, Sanwen Huang,
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
Sequencing and Assembly of the WheatD Genome using BAC Pools A Preliminary Study Daniela Puiu Sept 23rd 2013.
Risheng Chen et al BMC Genomics
Denise Carvalho-Silva Ensembl Outreach
Figure 1. Phylogenetic tree of PDI gene promoter sequence of Triticum urartu (TU AA), Aegilops speltoides (AS BB) and Aegilops taushcii (TT DD) with three.
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite.
Summary of Current Assembly
Pre-genomic era: finding your own clones
Volume 8, Issue 6, Pages (June 2015)
Cereal Genome Evolution: Grasses, line up and form a circle
Sequence the 3 billion base pairs of human
The Potato Genome Sequencing Consortium: An Update
Presentation transcript:

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Dan Bolser, EMBL-EBI Triticeae data in Ensembl Plants Versailles, 12th-13th November 2012 trans-National Infrastructure for Plant Genomic Science

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number INTRODUCTION

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Triticeae crops Wheat Bread wheat (Triticum aestivum) accounts for 20% of human consumption of calories and protein. Hexaploid (AA/BB/DD) – 7 chromosomes – 17Gb genome – ~80% repeats Currently only a fragmented assembly is available. Barley Barley (Hordeum vulgare) an important cereal and model for ecological adaption. Diploid – 7 chromosomes – 5.3Gb Genome – ~80% repeats Integrated gene-space and physical map.

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Triticeae crops WheatBarley

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number WHEAT

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat – Sequence data Gene-space ‘sub- assemblies’ – 1,394,281 sub- assemblies – contigs and singletons Data provided: “in the syntenic context of Brachypodium distachyon” 117,411 (89%) mapped 6

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat Wheat sub-assemblies, classified into A, B, D (and X) genomes, aligned to Brachypodium distachyon in Ensembl Genomes 7

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat sub-assemblies and homoeologous SNPs Wheat sub-assemblies, classified into A, B, D (and X) genomes, aligned to Brachypodium distachyon in Ensembl Genomes, showing homoeologous SNPs (variations between the A, B and D genomes). 8

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number BARLEY

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Barley NOTES Gene-space assembly Integrated physical map View of chromosomes and genes in EG – All the ‘features’ of Ensembl, Trees, Functional annotation

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Barley – Sequence data cv. Morex 5x Illumina GAII – 300b PE – 2.5kb PE 376k contigs > 1kb – 100k directly integrated into PM – + a hierarchical approach for other sequence data

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Barley – Gene & physical map data Gene calls Genes – 167Gb of RNA-Seq – 29k fl-cDNAs – 79k 'transcript clusters' – 26k 'High Confidence' genes (by homology) – 95% anchored on WGS contigs Physical map data Fingerprinted BACs – 600k BACs (14x) in six different BAC libraries – 10k FPC contigs with estimated n50 of 900kb – 500k x2 BES, 6k WGS Markers – 3000 gene-based – 500k sequence tags

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number SUMMARY

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Wheat Too fragmented for a genomic assembly Shown in the syntenic context of Brachypodium distachyon – Small, model grass Diploid 270 Mbp Relatively low repeat density 21 Sub-assemblies classified into homoeologous chromosomes Homoeologous SNPs (SNPs between A, B, and D genomes) mapped onto brachypodium.

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Barley 26,000 high confidence genes called More than 90% anchored into a chromosome-scale physical map Standard Ensembl Genomes analysis pipelines can be run – Comparative genomics – Functional annotation InterProScan

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Acknowledgements

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Questions?

plants.ensembl.org / The transPLANT project is funded by the European Commission within its 7 th Framework Programme under the thematic area “Infrastructures”. Contract number Alignment stats for wheat sub- assemblies on brachypodium Sub-Assemblies (88% singletons) Aligned to brachy. Full length alignment? A 123,383 (13%) 115,804 (94%) 114,375 (99%) B 158,440 (17%) 141,278 (89%) 138,438 (98%) D 156,976 (17%) 144,810 (92%) 142,635 (98%) X 510,480 (54%) 412,385 (81%) 402,049 (97%) Total949, ,277 (86%) 797,497 (98%)