Tool 2: “TransBAE” - Identifying Cross- Domain Lateral Transfer Rationale: Pathogen proteins have been identified that manipulate host cells by interacting.

Slides:



Advertisements
Similar presentations
After 13 years of scientist work predominatly in USA & UK the DNA sequence of the human genome was completed in 2003 Any ideas how they did it? What would.
Advertisements

Integration of Prokaryotic Genomics into the Unknown Microbe ID Lab Bert Eardley – Penn State, Berks & Dan Golemboski – Bellarmine University.
Chapter 26: Bacteria and Archaea: the Prokaryotic Domains CHAPTER 26 Bacteria and Archaea: The Prokaryotic Domains.
Genomic island analysis: Improved web-based software and insights into an apparent gene pool associated with genomic islands William Hsiao Brinkman Laboratory.
First Gene Identified - Evidence of gene transfer from bacterial to protozoan pathogen This represents the strongest evidence known to date of horizontal.
Tree of Life Chapter 26.
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
BIOINFORMATICS Ency Lee.
Pathogenomics: Focusing studies of bacterial pathogenicity through evolutionary analysis of genomes.
A Novel Multigene Family May Encode Odorant Receptors: A Molecular Basis for Odor Recognition Linda Buck and Richard Axel Published in Cell, Volume 65,
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Identifying Unusual Bacterial- Eukaryotic Homologs Rationale: Pathogen proteins have been identified that manipulate host cells by interacting with, or.
Bioinformatics and Phylogenetic Analysis
Pathogenomics Goal: Identify previously unrecognized mechanisms of microbial pathogenicity using a unique combination of informatics, evolutionary biology,
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Influenza A Virus Pandemic Prediction and Simulation Through the Modeling of Reassortment Matthew Ingham Integrated Sciences Program University of British.
Taxonomy Definition Binomial Nomenclature Strains Genus
AN INTRODUCTION TO TAXONOMY: THE BACTERIA
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
The diversity of genomes and the tree of life
Genome Evolution: Duplication (Paralogs) & Degradation (Pseudogenes)
Comparative Genomics of the Eukaryotes
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Screening and genome mining of polyether-producing strains in actinomycetes Minghao Liu, Hao Wang, Ning Liu, Jisheng Ruan and Ying Huang* State Key Laboratory.
Unit 1: The Language of Science  communicate and apply scientific information extracted from various sources (3.B)  evaluate models according to their.
Microbial taxonomy and phylogeny
Pathogenomics Using bioinformatics to focus studies of bacterial pathogenicity.
Compare and contrast prokaryotic and eukaryotic cells.[BIO.4A] October 2014Secondary Science - Biology.
T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Ch 10 Classification of Microorganisms.
Cross-Domain and Within-Domain Horizontal Gene Transfer: Implications for Bacterial Pathogenicity 1.Pathogenomics Project 2.Cross-Domain Horizontal Gene.
Fluidity of the 16S rRNA Gene Sequence within Aeromonas Strains Alessia Morandi Institute for Infectious Diseases University of Berne.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
BIOINFORMATICS IN BIOCHEMISTRY Bioinformatics– a field at the interface of molecular biology, computer science, and mathematics Bioinformatics focuses.
Pathogenomics Goal: Identify previously unrecognized mechanisms by which bacteria cause disease, using a unique combination of informatics, evolutionary.
Chapter 13 Table of Contents Section 1 DNA Technology
Sequence-based Similarity Module (BLAST & CDD only ) & Horizontal Gene Transfer Module (Ortholog Neighborhood & GC content only)
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Whole Genome Sequence of Staphylococcus saprophyticus reveals the pathogenesis of uncomplicated urinary tract infection Comparative Microbial Genomics:
Genome Analysis II Comparative Genomics Jiangbo Miao Apr. 25, 2002 CISC889-02S: Bioinformatics.
Using blast to study gene evolution – an example.
Pathogenomics How this project began: Ann Rose - take advantage of DNA sequence information - genomics Julian Davies - use the information to understand.
Functional and Evolutionary Attributes through Analysis of Metabolism Sophia Tsoka European Bioinformatics Institute Cambridge UK.
Copyright OpenHelix. No use or reproduction without express written consent1.
Bioinformatics and Computational Biology
From your notebook and/or glossary select 5 key terms from KA 1.7 and write each one on a study card. On the reverse write a meaning or explanation of.
Finding genes in the genome
Major characteristics used in taxonomy
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
Molecular Clocks and Continued Research
Taxonomy & Phylogeny. B-5.6 Summarize ways that scientists use data from a variety of sources to investigate and critically analyze aspects of evolutionary.
General Microbiology (Micr300)
Bioinformatics What is a genome? How are databases used? What is a phylogentic tree?
Metagenomic Species Diversity.
Basics of Comparative Genomics
Pipelines for Computational Analysis (Bioinformatics)
Genomes and Their Evolution
Scientists use several techniques to manipulate DNA.
Genomes and Their Evolution
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Chapter 26 Phylogeny and the Tree of Life
5 kingdoms.
EST Analysis of the Cnidarian Acropora millepora Reveals Extensive Gene Loss and Rapid Sequence Divergence in the Model Invertebrates  R.Daniel Kortschak,
Unit Genomic sequencing
Basics of Comparative Genomics
Phylogeny and the Tree of Life
Presentation transcript:

Tool 2: “TransBAE” - Identifying Cross- Domain Lateral Transfer Rationale: Pathogen proteins have been identified that manipulate host cells by interacting with, or mimicking, host proteins. We wondered whether we could identify selected novel virulence factors by identifying bacterial pathogen genes more similar to host genes than you would expect based on phylogeny. The tool we developed investigates this, and is also useful for identifying cross-domain lateral gene transfer events (i.e. Trans - Bacteria, Archaea and Eukarya). Description: Proteins in a given pathogen genome that are more similar to eukaryote proteins than other proteins (and vice versa) are identified through BLAST analysis, followed by a scoring system we developed. Various taxonomic levels of organisms are filtered from the BLAST results to identify putative lateral transfers that occurred before or after species, genus, family etc… divergence. This analysis has also been expanded to analyze all bacterial genomes, and to make all cross-domain comparisons between Bacteria, Archaea and Eukarya. Tool 1: “IslandPath” – aiding identification of pathogenicity islands Rationale: Pathogenicity islands in genomes tend to have atypical %G+C, contain mobility genes (i.e. transposases, integrases), and are associated with tRNA sequences. Combined identification of such features could facilitate the identification of genes in new genomes sequences that are involved in virulence, or have horizontal origins. Description: Each dot in a graphic corresponds to a predicted protein- coding ORF in the genome. Dot colours indicate if an ORF has a higher or lower %G+C than cutoffs you set (default settings are plus or minus 1 Std. Dev. from the mean %G+C for all genes in the genome). You may click on a dot to view a portion of an annotation table presented below the graphic. Example: (Below) Portion of the graphic and (edited) table for the Neisseria meningitidis MC58 genome is shown, illustrating the location of a cluster of genes that may be involved in pathogenicity (1). Using the Tools: Examples of interesting genes identified A Streptomyces gene may be the “missing link” to explain the occurrence of some sensor histidine kinases (NIK and FIK) in Candida sp. and Fusarium sp. pathogenic fungi (Brinkman et al., submitted). Histidine kinases are common in bacteria but relatively uncommon in eukaryotes, and phylogenetic analysis suggests that these virulence-associated histidine kinases in the fungi were obtained by lateral gene transfer from bacteria. All orthologs of this gene (LemA, GacS, etc…) examined to date have a role in virulence. In most cases, the “plant-like” genes reported previously in the Chlamydia sp. genomes (6) may have plastid origins, as Synechocystis sp., a relative of the ancestor of the plastid, also shares notable similarity to these genes. Other Genes: New, potential, islands of horizontally transferred genes, containing “hypothetical genes”, were identified in almost all microbial genomes examined to date. “Odd” bacterial genes with notable similarity to animal (metazoan) genes were identified, however, in most cases more sampling of sequences from other organisms is needed to identify whether the genes are a case of horizontal gene transfer, selective gene maintenance and gene loss, or of organellar origin. Promising genes are in the process of being investigated further. Computational tools to aid identification of potential horizontally transferred genes involved in pathogenicity Fiona S. L. Brinkman 1,2, Hans Greberg 1,3, Ivan Wan 1,3, Yossef Av-Gay 4, David L. Baillie 5, Robert Brunham 6, Rachel C. Fernandez 2, B. Brett Finlay 2,8, Robert E.W. Hancock 2, Audrey de Koning 9, Patrick Keeling 10, Emma Macfarlane 2, Don G. Moerman 9,11, Sarah P. Otto 9, B. Francis Ouellette 7, Hong Yan 2, Ann M. Rose 1, and Steven J. Jones 3. 1 Dept of Medical Genetics, 2 Dept of Microbiology and Immunology, 4 Dept of Medicine, 8 Biotechnology Laboratory, 9 Dept of Zoology, 11 C. elegans Reverse Genetics Facility, 10 Dept of Botany, University of British Columbia, 5 Dept of Biological Sciences, Simon Fraser University, 7 Centre for Molecular Medicine and Therapeutics, 6 UBC Centre for Disease Control and 3 Genome Sequence Centre, BC Cancer Agency, Vancouver, British Columbia, Canada. Abstract Evidence is increasing that pathogens often develop virulence though the acquisition of sequences encoding virulence factors that are horizontally transferred. The Pathogenomics Project funded by the Peter Wall Institute for Advanced Studies is developing software to aid identification of horizontally transferred sequences of relevance to pathogenicity. Candidate virulence genes identified are being targeted for further functional study as part of this interdisciplinary project. Our approach has enabled us to not only identify new potential virulence factors, but also gain insight into the frequency of horizontal gene transfer within the Bacteria, and between the three domains of life of Bacteria, Eukarya, and Archaea. Tool 3: “PhyloBLAST” – aiding phylogenetic analysis PhyloBLAST compares your protein sequence to a SWISSPROT/ TREMBL database using BLAST2 and then allows you to perform user-defined phylogenetic analyses based on user-selected proteins listed in the BLAST output (4). PhyloBLAST was initially developed to aid analysis of lateral gene transfer events detected by “TransBAE”, but is now available on the internet as its own web-based application at: Some Features - Organism information and phylogenetic distance measures are added to the BLAST output and subsequent phylogenetic trees - You may select BLAST hits for further phylogenetic analysis, or you may input your own sequences or alignments. Analyses vary from obtaining a FASTA file of the sequences, a ClustalW alignment, or user-defined phylogenetic trees (based on PHYLIP). Acknowledgements This project is funded by the Peter Wall Institute for Advanced Studies. References 1.Tettelin H, et al., Science 287: Read TD, et al Nucleic Acids Res. 28: Doolittle WF Trends Genet. 14: Brinkman FSL, et al Bioinformatics. In Press. 5.de Koning A, et al., Mol. Biol. Evol. 17: Stephens RS, et al Science. 282: The protozoan pathogen Trichomonas vaginalis appears to have obtained the gene for N-acetylneuraminate lyase (NanA) from an ancestor of pathogenic Pasterellaceae bacteria (based on phylogenetic analysis and 92-95% sequence similarity; 5). NanA is involved in sialic acid metabolism and is used by some bacteria to parasitize the mucous membranes of animals for nutritional purposes. It is possible that T. vaginalis acquired this gene to aid its parasitization of animal/human tissues. Using the Tools: Some Trends 1. Correlation between variance of ORF G+C in a genome and clonality of the pathogen. %G+C analysis of genome ORFs, used to identify pathogenicity islands, revealed the following trend: Low variance of the mean G+C of ORFs for a given genome correlates with an intracellular lifestyle for the bacterium and a clonal nature (Two-tailed P value of 0.004, for a nonparametric correlation). Variance is similar within a given species. Variance of %G+C for ORFs may therefore be a useful marker for investigating the clonality of bacteria. Its relationship with intracellular lifestyle may reflect the ecological isolation of intracellular bacteria, as was previously proposed to explain the lack of chromosome rearrangement for Chlamydia species (2). 2. Detecting lateral gene transfer between Bacteria and Eukarya. While our primary focus is to identify new genes or pathways involved in virulence, our approach has also identified the strongest cases of lateral gene transfer between bacteria and eukaryotes identified to date, and facilitates the identification of organellar genes that have moved to the nucleus (due to the bacterial origin of organellar genes). We have found that most cases of probable recent cross-domain gene transfer involve movement of a bacterial gene to a unicellular eukaryote. It has been proposed that such eukaryotes may obtain bacterial genes through ingestion of bacteria (the “you are what you eat” hypothesis; 3). We have found no cases to date of recent (since the divergence of humans from mammals) lateral gene exchange between multicellular eukaryotes and bacteria, suggesting that such occurrences are rare. This has significance for both the evolution of mechanisms of pathogen host mimicry, and also for movement of genes of relevant to the use of genetically modified foods. Neisseria meningitidis serogroup B strain MC58 Mean %G+C: STD DEV: 7.57 %G+C SD Location Strand Product hypothetical tspB protein, putative conserved hypothetical conserved hypothetical pilin gene inverting PivNM virulence assoc. pro. homolog cryptic plasmid A-related hypothetical hypothetical hypothetical hypothetical conserved hypothetical conserved hypothetical conserved hypothetical put. hemolysin activ. HecB put. toxin-activating hypothetical hypothetical hypothetical hypothetical hemagglutinin/hemolysin-rel transposase, IS30 family A C B Example: (Above) Screenshot A: Overview of no. of proteins identified from each pathogen genome that are most similar to eukaryotic proteins. Screen B: List of further information about a subset of proteins (H. influenzae proteins in this case) and the eukaryotic proteins they are similar to. Screen C: Colourized summary of BLAST analysis for a M. tuberculosis protein of interest.