Genomic island analysis: Improved web-based software and insights into an apparent gene pool associated with genomic islands William Hsiao Brinkman Laboratory.

Slides:



Advertisements
Similar presentations
Codon Bias and Regulation of Translation among Bacteria and Phages
Advertisements

Genomics – The Language of DNA Honors Genetics 2006.
Journal Club Jenny Gu October 24, Introduction Defining the subset of Superfamilies in LUCA Examine adaptability and expansion of particular superfamilies.
Recombinant DNA technology
1 3. genome analysis. 2 The first DNA-based genome to be sequenced in its entirety was that of bacteriophage Φ-X174; (5,368 bp), sequenced by Frederick.
Viral & Prokaryotic Genetics “Simple” Model Systems.
Microbial genetics.
Pathogenomics: Focusing studies of bacterial pathogenicity through evolutionary analysis of genomes.
RNA and Protein Synthesis
Conjugative DNA transfer, antibiotic resistance and MDR bacteria.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Conjugative DNA transfer, antibiotic resistance and MDR bacteria With thanks to Steve Matson Who first created this lecture.
Changes in bacterial traits Caused by: Changes in environmental conditions (only phenotypic changes) Changes in the genetic codes 1- Intermicrobial exchange.
DNA to Proteins to Evolution and Natural Selection Biology 155 Krilowicz Spring 2010.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Aynaz Taheri 1 C. Gyles and P. Boerlin. * Transfer of foreign DNA * Mechanisms of transfer of DNA * Mobile genetic elements (MGE) * MGEs in the virulence.
Copyright © 2009 Pearson Education Inc., publishing as Pearson Benjamin Cummings Lecture prepared by Mindy Miller-Kittrell, University of Tennessee, Knoxville.
The Microbiome and Metagenomics
Chapter 9 Genetics of Bacteria and Their Viruses Jones and Bartlett Publishers © 2005.
Bacterial Genetics Xiao-Kui GUO PhD.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
Genetic transfer and recombination
Genetic exchange Mutations Genetic exchange: three mechanisms
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Mastering Microbes with Microchips Fiona Brinkman Fiona Brinkman Department of Molecular Biology and Biochemistry Department of Molecular Biology and Biochemistry.
DNA microarrays Each spot contains a picomole of a DNA ( moles) sequence.
Changes in bacterial traits Caused by: Changes in environmental conditions (only phenotypic changes) Changes in the genetic codes 1- Intermicrobial exchange.
By Prof. Dr. Asem Shehabi and Dr. Suzan Matar
Finish up array applications Move on to proteomics Protein microarrays.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
The Organization of Cellular Genome Asmarinah Department of Medical Biology.
Shatha Khalil Ismael. Transformation Certain species of Gram- negative, gram- positive bacteria and some species of Archaea are transformable. The uptake.
Identification of Compositionally Similar Cis-element Clusters in Coordinately Regulated Genes Anil G Jegga, Ashima Gupta, Andrew T Pinski, James W Carman,
Staphylococcus bacteria in nose
Analyzing the Increased and Decreased Expression of Microarray Data for Vibrio cholerae Merrell DS, Butler SM, Qadri F, Dolganov NA, Alam A, Cohen MB,
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
In brief Vertical vs. Horizontal Homologous vs. Unequal
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
TOX680 Unveiling the Transcriptome using RNA-seq Jinze Liu.
Exchange of Genetic Information
Microbial Genetics. DNA is the Genetic Material Griffiths Avery et al.
Computational Biology Group. Class prediction of tumor samples Supervised Clustering Detection of Subgroups in a Class.
Finding genes in the genome
Starter What do you know about DNA and gene expression?
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used for.
Bacterial Genetics Prof. Dr. Asem Shehabi Faculty of Medicine University of Jordan.
Evolution: rate of evolution CfE Advanced Higher Biology Unit 2: Organisms and Evolution.
Bacteria CHAPTER 27. You must know  The key ways in which prokaryotes differ from eukaryotes with respect to genome, membrane- bound organelles, size,
De novo creation of new genes 1.Retrotransposition (+/- cooption of other sequences) AAAAA Pre-mRNA AAAAA Splicing to remove intron Reverse transcription.
MICROBIOLOGIA GENERALE
Metagenomic Species Diversity.
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used.
A Quest for Genes What’s a gene? gene (jēn) n.
3. genome analysis.
TRANSFERIMIENTO LATERAL DE GENES
Chapter 7 Microbial Genetics
Hunting down the Alien (gene)!
Volume 5, Issue 3, Pages (March 2016)
Reciprocal Crossovers and a Positional Preference for Strand Exchange in Recombination Events Resulting in Deletion or Duplication of Chromosome 17p11.2 
Discovery and Characterization of piRNAs in the Human Fetal Ovary
Pathogenicity Islands: Bacterial Evolution in Quantum Leaps
Pangenomes and core genomes of 13 M. florum strains.
Transposable Elements
Reciprocal Crossovers and a Positional Preference for Strand Exchange in Recombination Events Resulting in Deletion or Duplication of Chromosome 17p11.2 
Genome Annotation and the Human Genome
Volume 10, Issue 2, Pages (August 2011)
Deep Learning in Bioinformatics
AS Level Paper 1 and 2. A2 Level Paper 1 and 3 - Topics 1-4
Fig. 4 Identification of C
Distribution of the diversity numbers of antibiotic resistance and virulence factor protein families by environmental metagenomes’ protein family richness.
Presentation transcript:

Genomic island analysis: Improved web-based software and insights into an apparent gene pool associated with genomic islands William Hsiao Brinkman Laboratory Simon Fraser University Burnaby, BC, Canada William Hsiao Brinkman Laboratory Simon Fraser University Burnaby, BC, Canada

Prokaryotic Genomic Islands (GIs) Definition: Genomic DNA segments with particular characteristics that indicate horizontal origins Definition: Genomic DNA segments with particular characteristics that indicate horizontal origins GI A bacterium

Genomic Island Characteristics  Often contain genes encoding adaptive functions of medical and environmental importance  Pathogenicity Islands: virulence factors (genes contribute to diseases)  Resistance Islands: antibiotic resistance  Metabolic Islands: secondary metabolism (e.g. sucrose)  Often contain genes encoding adaptive functions of medical and environmental importance  Pathogenicity Islands: virulence factors (genes contribute to diseases)  Resistance Islands: antibiotic resistance  Metabolic Islands: secondary metabolism (e.g. sucrose) tRNA gene mob Direct Repeats Genomic Island (e.g. PAI) (%G+C, sequence composition bias) mob: mobility genes chromosome  Exhibit sequence and annotation features VF

A yellow circle: %G+C above high cutoff A green circle: % G+C between cutoffs A pink circle: %G+C below low cutoff A black bar: transfer RNA A purple bar: ribosomal RNA A deep blue bar: both tRNA and rRNA A black square: transposase A black triangle: integrase A strike-line: regions with dinucleotide bias (Hsiao et al 2003 Bioinformatics p418-20) IslandPath: Aiding identification of GIs Vibrio cholerae N16961 Chr1 TCP island TCP = toxin co-regulated pili

IslandPath V.2

Which Features Best Identify GIs Examined prevalence of features in 95 published islands 85% of islands with >25% dinucleotide bias coverage (62% have > 50% dinucleotide bias coverage) 85% of islands with >25% dinucleotide bias coverage (62% have > 50% dinucleotide bias coverage) Mobility genes identified in >75% of the islands Mobility genes identified in >75% of the islands tRNA genes observed in <50% of known islands tRNA genes observed in <50% of known islands Only 20% of the islands show atypical %G+C Only 20% of the islands show atypical %G+C Examined prevalence of features in 95 published islands 85% of islands with >25% dinucleotide bias coverage (62% have > 50% dinucleotide bias coverage) 85% of islands with >25% dinucleotide bias coverage (62% have > 50% dinucleotide bias coverage) Mobility genes identified in >75% of the islands Mobility genes identified in >75% of the islands tRNA genes observed in <50% of known islands tRNA genes observed in <50% of known islands Only 20% of the islands show atypical %G+C Only 20% of the islands show atypical %G+C

Properties of genes in GIs? Defined a “putative island” as 8 or more genes in a row with dinucleotide bias 8 or more genes in a row with dinucleotide bias 8 or more genes in a row with dinucleotide bias + an associated mobility gene 8 or more genes in a row with dinucleotide bias + an associated mobility gene Any difference for genes in islands versus outside of islands in terms of their protein Functional categories? 63 genomes (67 chromosomes) analyzed 63 genomes (67 chromosomes) analyzed COG: cluster of orthologous groups of proteins COG: cluster of orthologous groups of proteins Defined a “putative island” as 8 or more genes in a row with dinucleotide bias 8 or more genes in a row with dinucleotide bias 8 or more genes in a row with dinucleotide bias + an associated mobility gene 8 or more genes in a row with dinucleotide bias + an associated mobility gene Any difference for genes in islands versus outside of islands in terms of their protein Functional categories? 63 genomes (67 chromosomes) analyzed 63 genomes (67 chromosomes) analyzed COG: cluster of orthologous groups of proteins COG: cluster of orthologous groups of proteins

Paired-t-test P value: 1.27E-18 More novel genes inside of islands Hsiao et al. PLOS Genetics e62, Nov. 2005

Control for Analysis Biases Control for mis-prediction of genes in sequence composition biased regions Control for mis-prediction of genes in sequence composition biased regions Excluded genes < 300bps Excluded genes < 300bps Control for bias of COG Protein Classification Control for bias of COG Protein Classification Used SUPERFAMILY classification which is better at detecting distant homologs Used SUPERFAMILY classification which is better at detecting distant homologs Control for compositional bias due to other factors Control for compositional bias due to other factors Used the dinucleotide bias plus mobility gene dataset Used the dinucleotide bias plus mobility gene dataset Control for mis-prediction of genes in sequence composition biased regions Control for mis-prediction of genes in sequence composition biased regions Excluded genes < 300bps Excluded genes < 300bps Control for bias of COG Protein Classification Control for bias of COG Protein Classification Used SUPERFAMILY classification which is better at detecting distant homologs Used SUPERFAMILY classification which is better at detecting distant homologs Control for compositional bias due to other factors Control for compositional bias due to other factors Used the dinucleotide bias plus mobility gene dataset Used the dinucleotide bias plus mobility gene dataset

Island DatasetClassification Method Paired t-test p- value DINUC (all genes)COG1.27E-18 DINUC+MOB (all Genes)COG1.20E-18 DINUC (all genes)SUPERFAMILY1.13E-18 DINUC+Mob (all genes)SUPERFAMILY4.43E-14 DINUC (>300bps)COG1.05E-17 DINUC+MOB (>300bps)COG7.65E-16 DINUC (>300bps)SUPERFAMILY3.01E-16 DINUC+MOB (>300bps)SUPERFAMILY2.04E-10 Hsiao et al. PLOS Genetics e62, Nov More novel genes in islands in all experiments

Phage may be the predominant donors of GIs Some GIs are clearly of bacteriophage origin, but more may be from phage as well Some GIs are clearly of bacteriophage origin, but more may be from phage as well Predicted subcellular localizations of proteins encoded in our GIs similar to phage genomes (lower proportion of cytoplasmic membrane proteins) Predicted subcellular localizations of proteins encoded in our GIs similar to phage genomes (lower proportion of cytoplasmic membrane proteins) Hsiao et al. PLOS Genetics e62, Nov Hsiao et al. PLOS Genetics e62, Nov Many GI encoded genes have sequence characteristics similar to phage genes (A+T rich and short) Many GI encoded genes have sequence characteristics similar to phage genes (A+T rich and short) Daubin et al. Genome Biol. 4(9): R57 Daubin et al. Genome Biol. 4(9): R57 Some GIs are clearly of bacteriophage origin, but more may be from phage as well Some GIs are clearly of bacteriophage origin, but more may be from phage as well Predicted subcellular localizations of proteins encoded in our GIs similar to phage genomes (lower proportion of cytoplasmic membrane proteins) Predicted subcellular localizations of proteins encoded in our GIs similar to phage genomes (lower proportion of cytoplasmic membrane proteins) Hsiao et al. PLOS Genetics e62, Nov Hsiao et al. PLOS Genetics e62, Nov Many GI encoded genes have sequence characteristics similar to phage genes (A+T rich and short) Many GI encoded genes have sequence characteristics similar to phage genes (A+T rich and short) Daubin et al. Genome Biol. 4(9): R57 Daubin et al. Genome Biol. 4(9): R57

Higher proportions of genes in Islands are VFs P value: < 2.2E-16 Fedynak, Hsiao, and Brinkman (unpublished)

Certain classes of VFs over- represented in GIs Most of these are “offensive” virulence factors Fedynak, Hsiao, and Brinkman (unpublished)

Conclusions Genomic islands contain disproportionately higher number of novel genes, suggesting a large and understudied gene pool contributing to horizontal gene transfer Genomic islands contain disproportionately higher number of novel genes, suggesting a large and understudied gene pool contributing to horizontal gene transfer These novel genes appear to be drawn from a large pool of phage - metagenomics studies useful These novel genes appear to be drawn from a large pool of phage - metagenomics studies useful These novel genes may contribute to microbial adaptation and may play a role in pathogenesis and in antibiotic resistance These novel genes may contribute to microbial adaptation and may play a role in pathogenesis and in antibiotic resistance Genomic islands contain disproportionately higher number of novel genes, suggesting a large and understudied gene pool contributing to horizontal gene transfer Genomic islands contain disproportionately higher number of novel genes, suggesting a large and understudied gene pool contributing to horizontal gene transfer These novel genes appear to be drawn from a large pool of phage - metagenomics studies useful These novel genes appear to be drawn from a large pool of phage - metagenomics studies useful These novel genes may contribute to microbial adaptation and may play a role in pathogenesis and in antibiotic resistance These novel genes may contribute to microbial adaptation and may play a role in pathogenesis and in antibiotic resistance

Acknowledgements Fiona Brinkman Fiona Brinkman Amber Fedynak -VF studies Amber Fedynak -VF studies Brian Coombes, Michael Lowden, and Brett Finlay (UBC) - Microarray data Brian Coombes, Michael Lowden, and Brett Finlay (UBC) - Microarray data Jenny Bryan (UBC) -Stats analysis Jenny Bryan (UBC) -Stats analysis Brinkman Laboratory Brinkman Laboratory Fiona Brinkman Fiona Brinkman Amber Fedynak -VF studies Amber Fedynak -VF studies Brian Coombes, Michael Lowden, and Brett Finlay (UBC) - Microarray data Brian Coombes, Michael Lowden, and Brett Finlay (UBC) - Microarray data Jenny Bryan (UBC) -Stats analysis Jenny Bryan (UBC) -Stats analysis Brinkman Laboratory Brinkman Laboratory

Other categories more common in islands Category In putative islands: Paired t-test p-value In putative islands + mobility genes: Paired t-test p-value Cell motility 7.73E (may be a sampling size issue) Intracellular trafficking, secretion, and vesicular transport 8.124E (may be a sampling size issue) * Novel genes not included in analysis due to potential skew of other category results Several metabolism-associated categories are under-represented in islands

P value 3.0E-16

IslandPath V.2

Experiment: S. typhimurium LT2 ssrB gene KO Track 1: IslandPath Track 2: Microarray expression (overexp & underexp )