Biodiversity initiative: Integrating Taxonomy, Genomics and Biodiversity ++ = ????? Speaker: Benjamin Linard Alfried Vogler Team.

Slides:



Advertisements
Similar presentations
The Past, Present, and Future of DNA Sequencing
Advertisements

 Sequencing technology › Roche/454 GS-FLX (‘454’) › Illumina  Prokaryotic profiling › De novo genome sequencing › Metagenomics › SNP profiling › Species.
Current methods for high-throughput resequencing of custom targets Adam Gordon Nickerson Lab, UW Genome Sciences WHI Genetics SIG call 3/26/14.
CSU IDRC Next Generation Sequencing Core Genomic Sequencing Services.
Click to edit Master title style Irys data analysis January 10 th, 2014.
Pamela Ferretti Laboratory of Computational Metagenomics Centre for Integrative Biology University of Trento Italy Microbial Genome Assembly 1.
Next Generation Sequencing, Assembly, and Alignment Methods
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Dawei Lin, Ph.D. Director, Bioinformatics Core UC Davis Genome Center July 20, 2008, SLIMS (Solexa sequencing.
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities By Kevin Chen, Lior Pachter PLoS Computational Biology, 2005 David Kelley.
Workshop in Bioinformatics 2010 Class # Class 8 March 2010.
Titus Brown Qingpeng Zhang John Blischak Welcome!.
Metagenomics Binning and Machine Learning
Consortium for the Barcode of Life A rapid, cost-effective system for species identification David E. Schindel, Executive Secretary National Museum of.
Report : Zhen Ming Wu 2008 IEEE 9th Grid Computing Conference.
EDIT BoD meeting 22 & 23 June 2010, Paris ATBI+M sustainability WP7 ”Applying Taxonomy to Conservation”
Bioinformatics Core Facility Ernesto Lowy February 2012.
Todd J. Treangen, Steven L. Salzberg
A Pervasive Technology Institute Center What is The National Center for Genome Analysis Support? NCGAS is a national center dedicated to providing scientists.
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
Roadmap for Soil Community Metagenomics of DOE’s FACE & OTC Sites
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
Advancing Science with DNA Sequence Metagenome definitions: a refresher course Natalia Ivanova MGM Workshop September 12, 2012.
The iPlant Collaborative
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
Madagascar: need and opportunity Brian Fisher, California Academy of Sciences Madagascar Biodiversity Center.
The metagenomics sequencing service CD Genomics. Metagenomics: Metagenomics is the study of metagenomes, genetic material recovered directly from environmental.
UK NGS Sequencing Update July 2009 Dr Gerard Bishop - Division of Biology Dr Sarah Butcher – Centre for Bioinformatics.
2009 IADR, MIAMI, FL, USA Hands-on Experience for using the Human Oral Microbiome Database (HOMD) 2009 IADR Workshop, Miami, FL, USA Tsute (George) Chen.
EB3233 Bioinformatics Introduction to Bioinformatics.
Anna Shcherbina Bioinformatics Challenge Day 01/10/2013 De novo assembly from clinical sample This work is sponsored by the Defense Threat Reduction Agency.
Computational Research in the Battelle Center for Mathmatical medicine.
__________________________________________________________________________________________________ Fall 2015GCBA 815 __________________________________________________________________________________________________.
SRB Genome Assembly and Analysis From 454 Sequences HC70AL S Brandon Le & Min Chen.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
NCGAS provides A specific goal is to provide dedicated access to memory rich supercomputers customized for genomics studies, including Mason and other.
Chapter 5 Sequence Assembly: Assembling the Human Genome.
454 Genome Sequence Assembly and Analysis HC70AL S Brandon Le & Min Chen.
Metagenomic dataset preprocessing – data reduction
CyVerse Workshop Transcriptome Assembly. Overview of work RNA-Seq without a reference genome Generate Sequence QC and Processing Transcriptome Assembly.
Canadian Bioinformatics Workshops
Genome sequencing and annotation Week 2 reading assignment - pages 63-78, 93-98, Boxes 2.1 and don’t worry about details of similarity scoring.
Culturable Bacterial Communities Analyzer DIANA VANESSA SARRIA-ZUNIGA ELIANA TORRES-ZELADA April 29, 2016.
Date of download: 6/23/2016 Copyright © 2016 McGraw-Hill Education. All rights reserved. Pipeline for culture-independent studies of a microbiota. (A)
De Novo Assembly of Mitochondrial Genomes from Low Coverage Whole-Genome Sequencing Reads Fahad Alqahtani and Ion Mandoiu University of Connecticut Computer.
Will 10x technology make us rethink genome assemblies?
Canadian Bioinformatics Workshops
Presented By: Emily Lamoureux
Metagenomic Species Diversity.
Considerations for metagenomics data analysis and summary of workflows
A Fast Hybrid Short Read Fragment Assembly Algorithm
Metafast High-throughput tool for metagenome comparison
Transcriptomics II De novo assembly
Research in Computational Molecular Biology , Vol (2008)
Toward Next Generation Biodiversity Research
The African Soil Microbiology project
Teagasc/APC Sequencing Facility
Transcriptome Assembly
Metagenomics Image: Iverson et al. 2012, Science.
H = -Σpi log2 pi.
Metagenomics Microbial community DNA extraction
The ability of the SOP to sequence and identify unknown samples.
Screenshot of JCVI's Advanced Reference Viewer ( jcvi
Joshua Antony1, Elizabeth Argiro2, and Benjamin tenOever3
Schematic representation of a transcriptomic evaluation approach.
Genome resolved metagenomics
Comparison of species and function profiles with ultradeep sequencing data. Comparison of species and function profiles with ultradeep sequencing data.
General overview of the bioinformatic pipelines for the 16S rRNA gene microbial profiling and shotgun metagenomics. General overview of the bioinformatic.
Presentation transcript:

Biodiversity initiative: Integrating Taxonomy, Genomics and Biodiversity ++ = ????? Speaker: Benjamin Linard Alfried Vogler Team

Arthropods metagenomics 1 / 8 DNA extraction PCR barcodes Mixed sample All 480 beetles Est. 288 species 1x Illumina MiSeq run (8.5Gb) Mitochondrial contigs De novo assembly into contigs Pool DNA by volume 1 SAMPLE: 480 beetle specimens captured in Borneo

Mitochondrial DNA 2 / 8 Harpalinae Chrysomelidae Coccinelidae Curculionidae Tenebrionidae Buprestidae Log(Biomass) = (log(No. Reads)) P<0.001; F 1,84 =73.32; R 2 =0.47 ~5% beetle mitochondrial DNA Shotgun output No. reads33,796,432 Est. proportion mitochondrial reads 4.94% Complete mitogenomes35 Partial mitogenomes >10kb85 Partial mitogenomes 2-10kb 420 Results of Alex Crampton Plat

Genomic information ? 3 / 8 ~95% genomic information, ~45 % is Coleoptera DNA ~5% beetle mitochondria Taxomomy Abundance Genomic analyses Functional information ? Tribolium castaneum, chromosome 3 # homologous contigs position Homologous contig% GC Chromosome region with known sequence NNNN region (unresolved sequence) Homologous DNA between 4 beetle metagenomic samples

Arthropods metagenomics Computational requirements to analyse 1 arthropod soup : Server:~128Gb RAM, 24 cores Xeon 2.4 GHz Assemblies TypeRAM (Gb)Time (6 cores)Disk (Go) Mitochondrial< 10< 12 hours< 30 Total DNA~ 100~ 5 days~ ( in the best case... when data complexity is manageable by current algorithms ) Our last DNA assembly One assembly at a time, unpredictible risk of memory overload Several assemblies (~1.5 per week) Successful Aborted 4 / 8

Arthropods metagenomics Computational requirements to analyse 1 arthropod soup : Server:~128Gb RAM, 24 cores Xeon 2.4 GHz Assemblies TypeRAM (Gb)Time (6 cores)Disk (Go) Mitochondrial< 10< 12 hours< 30 Total DNA~ 100~ 5 days~ ( in the best case... when data complexity is manageable by current algorithms ) 5 / 8 Genomic analyses : TypeRAM (Gb)Time (6 cores)Disk (Go) Homology/ alignments< 2~ 5 days< 10 Statistics / graphs< 2~ 1 day< 2 Need support of SQL database. Currenlty ~ 300 Gb

5 / 8 Future ? for the analysis of 1 arthropod soup ~1 000 arthropods trancriptomes/genomes ~50 beetle species transcriptomes ~50 beetle draft/complete genomes Long term perspective: Disk space consuming...# more reference genomes # larger/more complex databases Growth of analysis pipeline ! More CPU to perform a complete metagenomic analyses standard MIGS (D Field & al, 2008) standards MINIMESS (J Raes & al, 2007) Biodiversity & functional analysis

Arthropods biodiversity n traps per site: (soil, canopy, Ground…) N plots Mitochondrial analysis, not a problem Full DNA analysis...  Which computational power could we access ?  More computations to answer more interesting questions Pooling DNA?We will loose metagenomic resolution... More complex assemblies Many soup analyses... 6 / 7

A source of DNA collection 7 / 8 Metadata (1 arthropod soup) General:Sampling localisation, date, methods... Mitochondrial information:Identified species/taxons Abundance Other identified species (plants, fungi...) Genomic information:Identified genes Identified functions (sugar degradation) Soup metadata 125 species, French Guyana Soup metadata 24 unidentified tenebrionidae from Madagascar Soup metadata 53 species + abundance data Soup metadata 24 species, Congo, Scientific community in NHM access NHM data portal Kemu Integration, links, queries... “I want all NHM collection data concerning the species X” Data storage ? NHM databases integration ? 8 specimens, 4 images, 2 metagenomic samples...

Thank you for your attention. BEETLE SOUP, Your daily source of DNA! BEETLE SOUP