The International Tomato Sequencing Project: The first Cornerstone of the SOL Project Lukas Mueller on behalf of International SOL Tomato Sequencing Project.

Slides:



Advertisements
Similar presentations
Sequencing the Maize Genome Maize Genome Sequencing Consortium
Advertisements

Genome Structure/Mapping Lisa Malm 05/April/2006 VCR 221 Lisa Malm 05/April/2006 VCR 221.
Chr9 A ntonio Granell IBMCP-Valencia Spain Tomato Sequencing, Madison July 2006.
The Role of Fluorescence in situ hybridization (FISH) in Sequencing the Tomato Genome.
Progress on the sequencing of the euchromatic gene rich space of chromosome 6 of Solanum lycopersicum cv. Heinz 1706 Sander Peters Cologne Oct 2008.
US Tomato sequencing project update January 14, 2007.
Structural and Functional Genomics of Tomato Barone et al Tomato (Solanum Lycopersicon) – economically important crop worldwide, – intensively investigated.
PAA / Solanaceae July 23-27, Madison, Wisconsin, USA Sequencing the gene-rich space of tomato chromosome 7 Current status of the French effort.
Sequencing Status of the Chromosome 8 and New Marker Development toward a Genetic Map Construction between Micro-Tom and Ailsa Craig SOL Genomics Workshop.
Genes. Outline  Genes: definitions  Molecular genetics - methodology  Genome Content  Molecular structure of mRNA-coding genes  Genetics  Gene regulation.
Sequencing Tomato Chr9 Tomato Sequencing Meeting PAG XV 2007 San Diego, 14 January 2007 Antonio Granell IBMCP, Valencia.
Expanding the Tool Kit for BAC Extension Summary of completion criteria developed for NSF Tomato Sequencing Workshop January 14, 2007.
Integrated Physical and Genetic Mapping of Upland Cotton Lei E Presented by Mingxiong PANG.
Use of FISH in sequencing tomato chromosome 6 René Klein Lankhorst Hans de Jong Korea meeting 2007.
Sequencing activities in EU-SOL René Klein Lankhorst PAG 2007.
Tomato Chromosome 8 sequencing at Kazusa DNA Research Institute Erika Asamizu.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
EU-SOL 2008 November 13-16, Toulouse, FRANCE CHROMOSOME 7 SEQUENCING Current status and perspective TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497.
Mouse Genome Sequencing
SOL Genomics Network Formed in 2003 to answer two questions: – How can a common set of genes give rise to such a wide range of morphologically and ecologically.
Plant and Animal Genome Conference January 11, 2009.
Chromosome 8 Sequencing: Current Status and Future Prospects toward Finishing Shusei Sato, Erika Asamizu, Takakazu Kaneko, Hiroyuki Fukuoka, Satoshi Tabata.
What is SGN? S GN is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such.
Solanum lycopersicum Chromosome 4 Sequencing Update SOL Germany– October 2008 Wellcome Trust Medical Photographic Library.
The New Zealand Institute for Plant & Food Research Limited Potato Genome Sequencing Consortium, notes from the edge Dr Susan Thomson, Dr Mark Fiers, Dr.
Tomato Chromosome 4: A Mapping & Sequencing Update 28 th September 2005 Christine Nicholson Mapping Core Group Welcome Trust Sanger Institute, UK.
Update tomato chr. 6 Roeland van Ham Centre for BioSystems Genomics The Netherlands.
SOL 2008 October 12-16, Cologne, Germany CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966.
Tomato Overgo Project and Seed BAC Selection Cornell Team Ying Eileen Wang, 2005 PAG.
Genome Sequencing in the Legumes Le et al Phylogeny Major sequencing efforts Minor sequencing efforts ~14 MY ~45 MY.
Progress on sequencing tomato chromosome 12 Mara Ercolano.
WGP Tomato EU-SOL meeting July 15, 2009 Antoine Janssen.
Mapping and sequencing chromosome 6 of Solanum lycopersicum cv
The Changing Face of Sequencing
Solanum lycopersicum Chromosome 4 Sequencing Update UK-SOL– Dec 2008 Wellcome Trust Medical Photographic Library.
Current Sequencing Status of Tomato Chromosome 2 PLANT GENOME RESEARCH CENTER, KRIBB, KOREA Sanghyeob Lee Sung-Hwan Jo Dal-Hoe Koo Chul-Goo Hur Hong-Seok.
4th Solanaceae Genome Workshop 2007, September 09th- 13th, Jeju Island, Korea THE FRENCH CONTRIBUTION TO THE INTERNATIONAL TOMATO GENOME SEQUENCING PROGRAM.
FINISHING WORKSHOP APRIL 2008 CHROMOSOME 7 THE FRENCH CONTRIBUTION TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15.
Finishing tomato chromosomes #6 and #12 using a Next Generation whole genome shotgun approach Roeland van Ham, CBSG, NL René Klein Lankhorst, EUSOL Giovanni.
Chromosome 2 Doil Choi, Sunghwan Jo KOREA. Cytological architecture of chromosome kb/µm DAPI (4’-6-diamidino-2-phenylindole) stained pachytene chromosome.
Progress tomato chromosome 6 René Klein Lankhorst.
INDIAN INITIATIVE FOR TOMATO GENOME SEQUENCING Nagendra Singh National Research Centre on Plant Biotechnology Indian Agricultural Research Institute New.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, C. Perla 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino 2, A.
Wageningen, April 24-25, 2008 II Tomato Finishing Workshop Chromosome 12 Update ENEA, Rome University of Naples ‘Federico II’ CRIBI and Univ. of Padua.
1.Data production 2.General outline of assembly strategy.
Human Genome.
Italy: tomato chr. 12 Country Representative: Dr. Giovanni Giuliano Maria Luisa Chiusano Maria Raffaella Ercolano University.
Solanum lycopersicum Chromosome 4 Mapping and Finishing Update SRC-UK and Wellcome Trust Sanger Institute SOL Korea – September 2007 Wellcome Trust Medical.
Lindsay A. Shearer1, Lorinda K
13 th January 2008 Plant & Animal Genome Conference Progress with Sequencing Tomato Chromosome 4 Clare Riddle Tomato Project Group Wellcome Trust Sanger.
16 th April 2007 Christine Nicholson, Mapping Core Group Wellcome Trust Sanger Institute Tomato Chromosome 4 Mapping & Use of FPC Copyright Wellcome Trust.
26 th July 2006 Christine Nicholson, Mapping Core Group Karen McLaren, Finishing Group Leader Wellcome Trust Sanger Institute Sequencing the Gene Space.
US Contribution to the International Tomato Genome Sequencing Effort Current structure of contributions Ongoing activity summary Funding issues.
CURRENT STATUS ON SEQUENCING OF CHROMOSOME 12 Mara Ercolano Ischia, 2005.
Tomato Sequencing Project Meeting at SOL 2008, Oct. 15, 2008
Plant & Animal Genome Conference
Development of genome sequencing infrastructure and progress toward sequencing of chromosomes 1, 10 and 11 Steve Tanksley, Cornell U Steve Stack, Colorado.
Sequencing Chromosome 2
Status of the US contribution to the international
Progress on sequencing tomato chromosome 12
Progress in sequencing chromosome 6
International Tomato Genome Sequencing Project
Progress in sequencing chromosome 6
TG216 TG438 T1112 T1355 T1328 T1428 T1962 T1414 T1497 T0676 TM18 CT54 T0966 T0731 TM15 T1347 T1257 T0848 THE FRENCH CONTRIBUTION TO THE INTERNATIONAL.
Sequencing update of tomato chromosome 3 Chinese Academy of Sciences
Tomato FISH Song-Bin Chang Suzanne Royer Lorrie Anderson Steve Stack.
Sequence the 3 billion base pairs of human
The Role of Fluorescence in situ hybridization (FISH)
The Potato Genome Sequencing Consortium: An Update
Presentation transcript:

The International Tomato Sequencing Project: The first Cornerstone of the SOL Project Lukas Mueller on behalf of International SOL Tomato Sequencing Project

Overview Aims Why sequence the tomato genome? How to sequence the tomato genome? Who is sequencing the tomato genome? Resources for Sequencing the Tomato Genome –Genetic Map –BAC libraries –Overgo mapping –BAC End Sequences –Minimal Tiling Path –Bioinformatics Summary

Steven D. Tanksley Jim J. Giovannoni Stephen Stack, Joyce van Eck Doil Choi Byung Dong Kim Mingsheng Chen Zhukuan Cheng Chuanyou Li Hongqing Ling Yongbiao Xue Graham Seymour Gerard Bishop Ramesh Sharma Jiten Khurana Akhilesh Tyagi Willem Stiekema P. Lindhout Taco Jesse Rene Klein Lankhorst Mondher Bouzayen Mathilde Causse Daisuke Shibata Satoshi Tabata Antonio Granell Miguel A. Botella Giovanni Giuliano Luigi Fruciante

Aims Provide a high quality reference sequence for the Solanaceae genomes Using mapping of other Solanaceae sequences onto the tomato sequence, and comparative genetic maps to derive “virtual” genomes for other Solanaceae Prerequisite for studying natural diversity and linking genotype to phenotype Build a Solanaceae bioinformatics platform to integrate, analyze and distribute the information

asterid I asterid II asterid III asterid IV asterid V rosid I rosid II rosid III caryophyllids hamamelid I hamamelid II ranunculids paleoherb II Magnoliales monocots Laurales Rubiaceae (coffee) Compositeae (sunflower, safflower, lettuce) Leguminosae (soybean, Medicago Rosaceae (apple, peach, cherry);Salicaceae (poplar) Malvaceae (cotton); Sterculiaceae (cocoa) ; Rutaceae (citrus) Brassicaceae Gramineae (maize, wheat) ; Musaceae (banana) Liliaceae (onion) Chenopodiaceae (sugarbeet, spinach) Solanaceae Arabidopsis Rice WHY SOLANACEAE? Solanaceae is part of unique clade of flowering plants. Genome research in Solanaceae will provide a reference anchor and enable comparative genomics and systematic throughout this clade

Why sequence tomato? Tomato is the most intensively researched Solanaceae genome encoding approx. 35,000 genes euchromatic regions corresponding to less than a 25% of the total DNA in the tomato nucleus (220~250 Mb). Tomato provides the smallest diploid genome for which homozygous inbreds are available. Its sequence will facilitate positional cloning in tomato and other Solanaceae genomes (via synteny maps).

How to sequence the tomato genome? Whole Genome Shotgun –Advantages: Fast, cheaper, ok with reference genome –Disadvantages: Unordered contigs Methylation Filtering (Tobacco) –Advantages: Selects for expressed genome, cheaper –Disadvantages: unordered contigs Tiling Path (Arabidopsis, Drosophila, Rice) –Advantages: Sequence and gene order; select gene rich regions; easy to divide work –Disadvantages: Relatively expensive, time consuming ORDER IMPORTANT FOR COMPARING GENOMES

Tomato Genome Structure 12 chromosomes 950MB of total DNA 220MB contiguous, gene rich euchromatin Sequence only gene-rich euchromatin (>90% all genes) Tiling path method preferred Drosophila used and Medicago is using similar strategy pericentric heterochromatin 162 bp sub- telomeric repeat centromere telomere euchromatin pericentric heterochromatin 7 bp telomeric repeat telomere structure

BAC libraries All libraries derived from Solanum lycopersicum Heinz HindIII library (Rod Wing, Clemson U) –~120,000 clones, 120kB average size –~15x coverage –FPC contigged –Overgo analysis –75,000 clones BAC end sequenced MboI library –50,000 clones, 140kb average size –Will be BAC end sequenced EcoRI library (being prepared) –Will be BAC end sequenced

F Genetic Map Parents: –Solanum lycopersicum x Solanum pennellii Mapping population of 80 F2 individuals # Markers: 1579 Total cM: 1453 Density: 1 marker/0.92cM SGN rflp345 ssr149 tm43 p-mrkr39 cos576 est-by-clone265 unknown8 caps21 cosii98 kfg35 Total 1579 Marker-Types:

Tying the Genetic Map to the Physical Map: Overgos Overgos are “overlapping oligos”, short, very hot probes, developed from genetic markers of the F map Overgos are organized in 96 well plates, analyses are carried out with row and column pools Pools are hybridized to BAC filters, raw pool results are deconvoluted A total of 1536 overgos developed (16 plates) Analyses of all plates is complete

Overgo Anchoring Results Anchors: 652 anchor markers are involved in plausible non-conflicted associations with BACs good marker--BAC associations FPC contigs: 1880 BACs in 705 plausible contigs 2166 BAC singletons 652 seed BACs ==> 1/3 of euchromatic genome sequence

# anchors cM chr length cM per anchor Distribution of Anchor Markers on Chromosomes markers from Keygene AFLP map

Verification of overgo mappings Fluorescence In-Situe Hybridization (FISH) –BAC probe on pachytene chromosomes IL lines (Zamir lab) –Map BACs to IL lines –CAPS assays

(Hans de Jong)

Summary of FISH verification Song-Bin Cheng, Hans de Jong (Holland, chromosome 6): –9 BACs analyzed –8 mapped to chromsome 8 in right order –1 BAC gave signals on centromere of chromosome 1 Sangheob Lee, Doil Choi (Korea, chromosome 2): –27 BACs analyzed with FISH –25 confirmed to specific location, same order as F map –2 match to other chromosomes Chuanyou Li (China, chromosome 3) –>30 BACs being analyzed Steven Stack (USA): –Telomere and heterochromatic boundary determination –FISH service for countries without FISH capability

BAC end sequences Total of 400,000 reads (200,000 BACs from both ends) selected from the 3 BAC libraries Batch of 75,000 BACs in process (HindIII library) ~45,000 BAC end sequences already obtained (ftp://ftp.sgn.cornell.edu/tomato_genome/)ftp://ftp.sgn.cornell.edu/tomato_genome/ Average read length 655bp Annotation in progress SeqWright Inc, Houston, TX SeqWright is sponsoring a happy hour after this session.

anchored bacs Obtaining the Tiling Path A B C genetic map overgos “seed BAC”

20 14 US Korea China UK India NL France Japan Spain US US Italy BACs finished: in process: Overview: sgn.cornell.edu -> About -> tomato sequencing

Building a Bioinformatics Platform for the Solanaceae Project-wide standards for quality, gene naming, annotation ( Create a unified web presence for the entire project Develop distributed model for annotation, web presentation, involving different centers in SOL countries All data and programs developed in the project are shared in an open source format Integrate all data into the SOL bioinformatics platform, facilitating a systems approach to explore diversity and adaptation and the complex interactions that occur on all levels of biological organization

SGN Agronanotech Kazusa CASGenome India VIB Ghent

Annotation Phases 1.First pass annotations of sequences and gene models on BAC basis, available immediately 2.BAC based, common, distributed platform, stable BAC-based identifiers 3.Chromosome based, stable identifiers

Summary Sequencing of tomato is under way by a consortium of 10 countries High quality, ordered sequence using BAC tiling path BAC ends available, overgo results verified by FISH analyses Sequence will be tied to other Solanaceae and closely related species (coffee and beyond) Provide a foundation for shared biology for this economically important clade of plants

Acknowledgments SOL community Tomato Sequencing Project Funding National Science Foundation Other National Funding Sources Keygene NV Seqwright Inc. (Happy Hour) Colleagues Steven Tanksley, Jim Giovannoni, Joyce van Eck, Steven Stack SGN: Teri Solow, Beth Skwarecky, Nick Taylor, Robert Buels, John Binns, Chenwei Lin