16S sequencing for microbiome studies Nicola Segata and Nick Loman

Slides:



Advertisements
Similar presentations
Clostridium difficile Colitis or Dysbiosis. Symbiostasis/Dysbiosis.
Advertisements

The Past, Present, and Future of DNA Sequencing
OLEH SUDRAJAT FMIPA UNMUL Klasifikasi Bakteri Somewhat different: a clinical rapid ID is often important when trying to find causative agent of.
Use of the genomic data o Reconstruction of metabolic properties o Nature’s Microbiome o NGS in Population Genetics.
Metabarcoding 16S RNA targeted sequencing
Metagenomics. What is metagenomics? Term first used in 1998 by Jo Handelsman "the application of modern genomics techniques to the study of communities.
Determination of host-associated bacterial communities In the rhizospheres of maize, acorn squash, and pinto beans.
Transcriptomics Breakout. Topics Discussed Transcriptomics Applications and Challenges For Each Systems Biology Project –Host and Pathogen Bacteria Viruses.
Genome organization Lesk, Ch 2 (Lesk, 2008). Genomes and proteomes Genome of a typical bacterium comes as a single DNA molecule of about 5 million characters.
Practical Bioinformatics Community structure measures for meta-genomics István Albert Bioinformatics Consulting Center Penn State.
Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities By Kevin Chen, Lior Pachter PLoS Computational Biology, 2005 David Kelley.
Central Dogma Information storage in biological molecules DNA RNA Protein transcription translation replication.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
The Microbiome and Metagenomics
PCR Primer Design
Discussion on Metagenomic Data for ANGUS Course Adina Howe.
Molecular Microbial Ecology
Discovery of new biomarkers as indicators of watershed health and water quality Anamaria Crisan & Mike Peabody.
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
Species  OTUs  OPUs  Species  OTUs  OPUs. Rosselló-Mora & Amann 2001, FEMS Rev. 25:39-67 Taxa circumscription depends on the observable characters.
Genomics – Next-Gen sequencing and Microarrays
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Accurate estimation of microbial communities using 16S tags Julien Tremblay, PhD
 16S rRNA gene marker  intra-gene variability  primer selection  size & information content Primer selection, information content, alignment and length.
The Human Microbiome.
Christian Rinke Microbial Genomics DOE, Joint Genome Institute Introduction to ARB (From A User's Perspective)
Roadmap for Soil Community Metagenomics of DOE’s FACE & OTC Sites
Supporting Scientific Collaboration Online SCOPE Workshop at San Diego Supercomputer Center March 19-22, 2008.
Abstract Our current understanding of the taxonomic and phylogenetic diversity of cellular organisms, especially the bacteria and archaea, is mostly based.
Elucidating factors behind pair wise distances discrepancies between short and near full-length sequences. We hypothesized that since the 16S rRNA molecule.
Introduction to RNAseq
The Microbiome and Metagenomics
Compositionality and Sparseness in 16S rRNA data Anthony Fodor Associate Professor Bioinformatics and Genomics UNC Charlotte.
Accurate estimation of microbial communities using 16S tags
Motif Search and RNA Structure Prediction Lesson 9.
Higher Human Biology Unit 1 Human Cells KEY AREA 5: Human Genomics.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
tracking microbes at the strain level
MEGAN analysis of metagenomic data Daniel H. Huson, Alexander F. Auch, Ji Qi, et al. Genome Res
Canadian Bioinformatics Workshops
Date of download: 6/23/2016 Copyright © 2016 McGraw-Hill Education. All rights reserved. Pipeline for culture-independent studies of a microbiota. (A)
General Microbiology (Micr300)
Discussion on Genomic/Metagenomic Data for ANGUS Course Adina Howe.
Tools for microbial community analysis. What I am not going to talk  Culture dependent analysis  Isolate all possible colonies  Infer community  Test.
Robert Edgar Independent scientist
16S rRNA Experimental Design
16S RNA sequencing analysis
Canadian Bioinformatics Workshops
Presented By: Emily Lamoureux
Metagenomic Species Diversity.
Metagenomics: From Bench to Data Analysis 19-23rd September S rRNA-based surveys for Community Analysis: How Quantitative are they? Dr.
Molecular analyses of the interaction of microbes and marsh grasses  Spartina alterniflora and Phragmites australis Lathadevi K. Chintapenta1; Venkateswara.
Metagenomic assembly Cedric Notredame
Unraveling the microbial profile of the rhizosphere of SDS-suppressive soils in Soybean fields Ali Y. Srour1, Jason Bond1, Leonor Leandro2, Dean Malvick3.
From: Computational analysis of bacterial RNA-Seq data
Workshop on the analysis of microbial sequence data using ARB
Human Cells Human genomics
Taxonomic profiling with MetaPhlAn2
Microbiome: 16S rRNA Sequencing
H = -Σpi log2 pi.
Microbiome studies for microbial disease pathogenesis research
(a) PCoA of the abundance of unique OTUs per sample from the 16S marker gene sequencing data from the AGP data repository (small spheres) and the San Diego.
Skin Microbiome Surveys Are Strongly Influenced by Experimental Design
Volume 10, Issue 4, Pages (October 2011)
A typical current computational meta'omic pipeline to analyze and contrast microbial communities. A typical current computational meta'omic pipeline to.
Research Techniques Made Simple: Profiling the Skin Microbiota
Bacterial composition of olive fermentations is affected by microbial inoculation. Bacterial composition of olive fermentations is affected by microbial.
Toward Accurate and Quantitative Comparative Metagenomics
General overview of the bioinformatic pipelines for the 16S rRNA gene microbial profiling and shotgun metagenomics. General overview of the bioinformatic.
Presentation transcript:

16S sequencing for microbiome studies Nicola Segata and Nick Loman Web Valley 2014 16S sequencing for microbiome studies Nicola Segata and Nick Loman Principal Investigator Laboratory of Computational Metagenomics Centre for Integrative Biology University of Trento Italy

The human microbiome Who’s there? What are they doing? Metagenomics: 10x more microbial than human cells 1M times as many microbes inside each of us than humans on earth 100x more microbial than human genes Nature 486(7402) Who’s there? What are they doing? Scientific American, May 2012 Metagenomics: Study of uncultured microorganisms from the environment, which can include humans or other living hosts Focus on taxonomic and functional characteristics of the total collection of microorganisms within a community Main experimental tool is high-throughput sequencing: ~10M short (~100nt) reads per dataset

16S sequencing Liu, Bo, et al. "Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences." BMC genomics 12.Suppl 2 (2011): S4. PROS: Cost-effective Avoids non-bacterial contamination The resulting dataset is reasonable in size and complexity Mature analysis software available Can potentially catch low abundance bacteria CONS: Not genome-wide (so no metabolic potential) Limited taxonomic resolution Not effective for pathogen profiling Cannot catch viruses and eukaryotes Several (usually underestimated) biases Almost impossible cross-study comparisons

16S-based “metagenomics” V6 Samples Microbes Counts George Rice, Montana State University PCR to amplify the single 16S rRNA marker gene Classify sequence  microbe V2

The ribosome Ribosomes are the universal machinery that translate the genetic code into proteins. The ribosomal machinery is composed by: Two subunits several proteins mRNAs tRNAs rRNA (5S, 16S, 23S)

The ribosome

The ribosome

The 16S rRNA  Center for Molecular Biology of RNA, University of California

The 16S rRNA gene 1/3 This annotation has been performed on a representative E. coli 16S sequence Baker, G. C., J. J. Smith, and Donald A. Cowan. JMMs 55.3 (2003): 541-555.

The 16S rRNA gene 2/3

The 16S rRNA gene 3/3

The 16S rRNA V6 V7 V7 V6 V4 V8 V4 V5 V8 V5 V3 V3 V1 V1 V9 V9 V2 V2  Center for Molecular Biology of RNA, University of California

16S: The 530 loop structure of six species

The 16S gene: statistical view of the variable regions Andersson, Anders F., et al. " PloS one 3.7 (2008) Variability within the 16S rRNA gene V6 V3 Which HTM would you choose? 454 historically well suited (~400nt reads  3 regions), good cost/throughput trade-off Illumina (HiSeq) is not optimal (shorter reads, unnecessary high throughput) Illumina MiSeq and IonTorrent can be a nice compromise. V2 V5 V4 V8 V9 V1 V7 Claesson, Marcus J., et al. Nucleic acids research 38.22 (2010) Multiple variable regions can be targeted simultaneously (if you have long enough reads!)

Which HTM would you choose? Throughput Very low (~1 seqs / sample) Medium (~3k seqs / sample) High (~50k seqs / sample)

The data revolution is now

One of the challenges: which technology? http://flxlexblog.files.wordpress.com/

One of the challenges: which technology? Mol Ecol Resour. 2011 Sep;11(5):759-69

One of the challenges: which technology? Mol Ecol Resour. 2011 Sep;11(5):759-69

In silico primer validation/testing The idea: use the available (taxonomically labeled) 16S sequences to check which organisms are targeted by the primers http://www.arb-silva.de/search/testprobe (to test single probes) http://www.arb-silva.de/search/testprime (to test pairs of probes, below)

An example on “universal” primers Fw: CCTACGGGRSGCAGCAG Rev: ATTACCGCGGCTGCT (our primers)

An example on “universal” primers Archaea, 49.2% matches Bacteria, 94.7% matches Proteobacteria, 97.1 % matches WS6 candidate division, 2.9 % matches BE AWARE: universal primers do not exists, and the choice of the primers is going to bias your study no matter what!

Validation of hypervariable regions using a mock community Ward, Doyle V., et al. PloS one 7.6 (2011): e39315-e39315.

Variability within hyper variable regions

A high level 16S analysis workflow Hamady, Micah, and Rob Knight. Genome research 19.7 (2009): 1141-1152.

Schematic 16S analysis workflow Input dataset (one sample) Multiple-sequence alignment Operational taxonomic unit (OTUs) definition CAAGCCGAAUGCAGCUAUUC CAAGCCUGAUGCAGCCAUGC CAUGCCUGAGACAGCCUUGC CAAGCCGAAUGCAGCUAUCC CAAGGCUGAGACAGCCUUGC CAAGCCUGAUGCUGCCAUGC CAAGCCGAAUGCAGCUAUGC CAAGCCGGAGACAGCCUUGC CAAGCCGAAUGCAGCUAUUC CAAGCCUGAUGCAGCCAUGC CAUGCCUGAGACAGCCUUGC CAAGCCGAAUGCAGCUAUCC CAAGGCUGAGACAGCCUUGC CAAGCCUGAUGCUGCCAUGC CAAGCCGAAUGCAGCUAUGC CAAGCCGGAGACAGCCUUGC AAAGCCUGAUGCAGCCAUGC CAAGCCGAAUGCAGCUAUUC CAAGCCGAAUGCAGCUAUCC CAAGCCGAAUGCAGCUAUGC CAUGCCUGAGACAGCCUUGC CAAGGCUGAGACAGCCUUGC CAAGCCGGAGACAGCCUUGC CAAGCCUGAUGCAGCCAUGC CAAGCCUGAUGCUGCCAUGC AAAGCCUGAUGCAGCCAUGC OTU_1 OTU_2 OTU_3 OTU_1 OTU_2 OTU_3 OTU_1  30% OTU_2  30% OTU_3  40% 16S DB with taxonomic information OTU_1  E. coli OTU_2  S. aureus OTU_3  S. pneumoniae

Intro into diversity analysis Alpha-diversity A measure of how diverse (complex) a microbial community is “within sample” diversity Species richness (i.e. number) is a widely use alpha diversity index Beta-diversity A measure of how different two microbial communities are “between sample” diversity Inverse of number of shared species is one possibility to estimate beta-diversity Jurasinski, G., Retzer, V., & Beierkuhnlein, C. (2009). Oecologia, 159(1), 15-26.

Practical tutorial time http://nickloman.github.io