
The New Zealand Institute for Plant & Food Research Limited
Potato Genome Sequencing Consortium, notes from the edge
Dr Susan Thomson, Dr Mark Fiers, Dr Jeanne Jacobs

Potato Genome Sequencing – why?
The Solanaceae are an important family that also includes tomato, eggplant, petunia, tobacco, and capsicum.
Potato is now the third-largest global food crop.

Potato Genome Sequencing – the beginning
The Potato Genome Sequencing Consortium (PGSC) is an initiative of Wageningen University & Research Center.
The PGSC brings together a global community to complete the project.
Individual partners were assigned different chromosomes.

PGSC – member countries

PGSC – the beginning
1995 – Genetic map of potato, from the diploid mapping population SH (SH ) x RH (RH )

PGSC – the beginning
Timeline: Genetic map 1995.
2001 – Ultra-high-density (UHD) genetic map generated with ~10,000 AFLP markers (genome ~840 Mb, about 12 markers/Mb).
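The marker density quoted on this slide is simple arithmetic, and a short check makes the scale easier to picture. The following is a minimal sketch; the function name `marker_density` and the derived average spacing are illustrative additions, not part of the original slides, and the inputs are the rounded figures given above.

```python
def marker_density(n_markers: int, genome_size_mb: float) -> float:
    """Average number of markers per megabase of genome."""
    return n_markers / genome_size_mb


# Rounded figures from the slide: ~10,000 AFLP markers, ~840 Mb genome.
density = marker_density(10_000, 840)

# Average distance between neighbouring markers, in kilobases,
# assuming (unrealistically) that markers are evenly spread.
spacing_kb = 1000 / density

print(f"{density:.1f} markers/Mb, one marker every {spacing_kb:.0f} kb on average")
```

Real markers cluster rather than spreading evenly, so an average spacing of this kind says nothing about how large the biggest gaps in the map are.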

PGSC – the beginning
Timeline: Genetic map 1995; Ultra high-density genetic map 2001.
2002 – BAC library, using RH ,000 BACs with an average insert of 120 kb. 73,000 fingerprinted by AFLP.

PGSC – the beginning
Timeline: Genetic map 1995; Ultra high-density genetic map 2001; BAC library 2002.
Physical map – AFLP analysis of BACs was used to build up contigs of overlapping BACs. Selective AFLPs were used to anchor certain BACs (and contigs) to the physical map.

PGSC – the beginning
Timeline: Genetic map 1995; Ultra high-density genetic map 2001; BAC library 2002; Physical map.
2005/6 – Initiate genome sequencing: BAC-by-BAC Sanger sequencing, starting with anchored seed BACs. 6x coverage, BACs/chromosome.

PGSC – the beginning
Timeline: Genetic map 1995; Ultra high-density genetic map 2001; BAC library 2002; Physical map 2006; Sequencing start 2005/6.
Dec 2009 – end date for the fully annotated potato genome sequence.

PGSC – the beginning
Timeline: Sequencing start 2005/6; Annotation and sequence Dec 2009.
Early 2008 – BAC sequencing status: chromosome 7 not started; very few BACs done for the other chromosomes.

PGSC – the worries
- Sanger BAC-by-BAC sequencing is slow.
- Despite the UHD map of 10,000 markers, there are still large gaps in the physical map, which reduces the number of seed BACs.
- This is made more problematic by 'stops' caused by repeat elements and a lack of overlapping BACs.

PGSC – the solutions
- Bigger and better machines! Next Generation Sequencing (NGS) technologies are making Whole Genome Shotgun (WGS) sequencing more financially feasible (data/$).
- RH is highly heterozygous, leading to assembly issues.
- Continue RH sequencing using mainly NGS methods.

PGSC – the solutions
Introducing a new line, DM: DM R44, a doubled monohaploid, homozygous line. (Ref: Lightbourn GJ, Jelesko JG, Veilleux RE. Genome 50(5):492–501.)
DM flowers well and can be used as the female parent in crosses with most diploid potato germplasm.

PGSC – mapping
No genetic knowledge exists for DM R44.
Diploid mapping population: DM x DI (China Runtush) gives the F1; F1 x DI gives the mapping population.
2 x 96-well plates with DNA of the mapping population, along with the parents. Generated by the International Potato Center (CIP), Peru.

PGSC – mapping
Preliminary scaffold assembly of DM derived from Illumina data (generated by Beijing Genome Institute, BGI), as at 20 August 2009:

No. of sequences:          57681
max scaffold length:
min scaffold length:       100
total assembly length:
average scaffold length:   12180
median scaffold length:    179
n50*:

* n50: order the scaffolds largest first and lay them end to end along the length of the genome. n50 is the size of the scaffold reached at 50% of the genome.
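The n50 footnote describes a procedure that is easy to state in code. The sketch below assumes that "50% of the genome" means half of the total assembled length, which is the common N50 convention; some groups use the estimated genome size instead (often called NG50). The function name `n50` and the toy scaffold lengths are illustrative and are not PGSC data.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    Scaffolds are sorted largest first and summed cumulatively;
    N50 is the length of the scaffold at which the running total
    first reaches half of the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no scaffold lengths given")


# Toy example: total length 1500, half is 750.
# 500 + 400 = 900 >= 750, so the N50 is 400.
toy_scaffolds = [100, 200, 300, 400, 500]
print(n50(toy_scaffolds))
```

An assembly with many very short scaffolds can have a low median and mean yet a much larger N50, because N50 weights each scaffold by the sequence it contains.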

PGSC – mapping
550 newly generated SSR markers*:

SSRs  Institute                                                      Country
100   Plant and Food Research                                        New Zealand
100   Universidad Nacional Agraria La Molina                         Peru
100   International Potato Centre                                    Peru
100   Scottish Crop Research Institute                               Scotland
50    Instituto Nacional de Tecnologia Agropecuaria                  Argentina
50    The Irish Agriculture and Food Development Authority, Teagasc  Eire
50    Institute of Bioengineering                                    Russian Federation

* SSRs generated by BGI, China

Preliminary results, of 44 tested: 14 were monomorphic, 15 show polymorphism in DI, and 15 show polymorphism between DM/DI.

PGSC – mapping
Sequence Tagged Markers (STM), known to map to regions spanning all 12 chromosomes.
- ~60 Ste markers, currently being mapped in an SHxRH population. Generated by large-scale in silico design of SSRs from ESTs in public databases. (Ref: Tang J, Baldwin SJ, Jacobs JM, Linden CG, Voorrips RE, Leunissen JA, van Eck H, Vosman B. BMC Bioinformatics Sep 15;9:374)
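To give a feel for what in silico SSR discovery involves, here is a minimal sketch that scans a DNA sequence for perfect tandem repeats with a regular expression. It is not the pipeline from the cited reference: the function name `find_ssrs`, the motif-length range (2 to 6 bases), and the minimum repeat count are all assumptions chosen for illustration, and real marker design would also need primer design and redundancy filtering.

```python
import re


def find_ssrs(sequence, min_motif=2, max_motif=6, min_repeats=4):
    """Find perfect simple sequence repeats (SSRs) in a DNA string.

    Returns a list of (start, motif, repeat_count) tuples, where
    start is the 0-based position of the repeat in the sequence.
    """
    # A short motif (captured lazily so the shortest unit is preferred)
    # followed by at least (min_repeats - 1) further copies of itself.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_motif, max_motif, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(sequence.upper()):
        motif = match.group(1)
        # Skip runs of a single base such as AAAAAAAA.
        if len(set(motif)) == 1:
            continue
        repeats = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, repeats))
    return hits


# Toy EST fragment containing an (AT)5 repeat starting at position 2.
print(find_ssrs("GGATATATATATCC"))
```

Applied to every EST in a database, a scan of this kind yields candidate repeat loci; which of them turn out to be polymorphic still has to be tested in the mapping population.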

PGSC – mapping
Mapping data will be combined with results from:
- SNP data: EST data aligned to the DM scaffold (Robin Buell, courtesy of the SolCAP USDA project). Design ~2000 markers for use with BeadXpress (Illumina) (Glenn Bryan, Scottish Crop Research Institute). Aiming for > 1000 mapped.
- DArT data*: two discovery arrays with over 30,000 probes to begin. Discovered 3000 candidate markers. It is hoped that 1000 to 1500 unique DM markers will segregate in the mapping population. Sequencing of 7000 DArT markers will also be carried out.
* Diversity Arrays

PGSC – assembly
Plans for an in silico* pipeline to improve the scaffold, bringing together data from:
- SOL Genomics Network
- Tomato genome
- Markers: SSR, SNP and DArT
- RH UHD/physical map information
* Dan Bolser, University of Dundee, Scotland

PGSC – the present & future
In progress, by line (Sanger sequencing, Illumina runs and Roche/454 runs):

RH
- WGS + long jump libraries: 10x coverage
- WGS: 60x coverage
- BAC library: 150,000 BAC end sequences + 2,000 BAC clones
- Random sheared BAC library (~100 kb): 120,000 BAC end sequences

DM
- WGS + long jump libraries: 10x coverage
- WGS + 500 bp to 10 kb libraries: 65x coverage
- Fosmid library (~35 kb): 100,000 end sequences
- BAC library: 200,000 BAC end sequences

Data from transcriptome sequencing will also be added into the assembly pipeline: 16 runs, covering a combination of different tissues and conditions for DM and also RH.

Acknowledgements
Plant & Food Research is part of the international Potato Genome Sequencing Consortium (PGSC). For more information, visit the consortium website, going live as of 1st September.
PFR – Lincoln: Jeanne Jacobs, Mark Fiers, Samantha Baldwin