Create 3 different phylogenetic trees from the alignments

Slides:



Advertisements
Similar presentations
Admixture in Horse Breeds Illustrated from Single Nucleotide Polymorphism Data César Torres, Yaniv Brandvain University of Minnesota, Department of Plant.
Advertisements

Evolutionary divergence of Plasmids and Phages detected within Clostridium taeniosporum. Thompson, Jose §, Blinkova, A. ¥, Hunicke-Smith, S. ¥, Satterwhite,
Identification of the Origin of Replication of Gram-Positive Anaerobic Bacterium Clostridium taeniosporum Thompson, Jose §, Bode, Addys §, Shrenker, Natalie.
College Student Identity and Emotional Intelligence Abstract This research examines the longitudinal relationship between identity and emotional intelligence.
An Introduction to “Bioinformatics to Predict Bacterial Phenotypes” Jerry H. Kavouras, Ph.D. Lewis University Romeoville, IL.
DNA BLAST Lab.
NCBI data, sliding window programs and dot plots Sept. 25, 2012 Learning objectives-Become familiar with OMIM and PubMed. Understand the difference between.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Unit 1: DNA and the Genome Key area 8: Genomic sequencing.
Introduction to Bioinformatics
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Whole Genome Alignment using Multithreaded Parallel Implementation Hyma S Murthy CMSC 838 Presentation.
Bioinformatics and Phylogenetic Analysis
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Topic : Phylogenetic Reconstruction I. Systematics = Science of biological diversity. Systematics uses taxonomy to reflect phylogeny (evolutionary history).
Materials and Methods Abstract Conclusions Introduction 1. Korber B, et al. Br Med Bull 2001; 58: Rambaut A, et al. Nat. Rev. Genet. 2004; 5:
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Is it E. coli O157:H7? Using Bioinformatics to Develop and Test Hypotheses I’d like to thank Shellie Kieke, Concordia University – St. Paul, and Ruth Gyure,
A MOLECULAR APPROACH TO INVESTIGATE TUBERCULOSIS CASES IN A GOTHIC POPULATION FROM GHERĂSENI NECROPOLIS, BUZĂU COUNTY 1 Molecular Biology Center, Interdisciplinary.
Multiple Sequence Alignment May 12, 2009 Announcements Quiz #2 return (average 30) Hand in homework #7 Learning objectives-Understand ClustalW Homework#8-Due.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Small protein modules with similar 3D structure but different amino acid sequence Institute of Evolution, University of Haifa, ISRAEL Genome Diversity.
Applied Bioinformatics Week 8 Jens Allmer. Practice I.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Phylogenetic Prediction Lecture II by Clarke S. Arnold March 19, 2002.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Introduction to Phylogenetics
Figure 2: over-representation of neighbors in the fushi-tarazu region of Drosophila melanogaster. Annotated enhancers are marked grey. The CDS is marked.
Introduction Results and Conclusions ATR Results: On demographic and social characteristics, ATR completers were more likely to be Hispanic, employed,
Patterns of selection for or against amino acid change among different CD4 T-cell count progressor groups Michael Pina, Salomon Garcia Journal Club Presentation.
Background Gregory Fischer Julie Anderson Daniel Herman  Department of Biology  University of Wisconsin-Eau Claire Heterologous expression of MBP1 from.
PREETI MISRA Advisor: Dr. HAIXU TANG SCHOOL OF INFORMATICS - INDIANA UNIVERSITY Computational method to analyze tandem repeats in eukaryote genomes.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Bioinformatics and Computational Biology
Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.
By Chris Paine Genes Essential idea: Every living organism inherits a blueprint for life from its parents. Genes and.
Northern Star Coral (Astrangia poculata) Populations from the New Jersey Coast. Abstract- This project investigated the distribution and molecular evolution.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Examining the Genetic Similarity and Difference of the Three Progressor Groups at the First and Middle Visits Nicole Anguiano BIOL398: Bioinformatics Laboratory.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
Welcome to the combined BLAST and Genome Browser Tutorial.
Examining Genetic Similarity and Difference of the Three Progressor Groups at the First and Middle Visits Nicole Anguiano BIOL398: Bioinformatics Laboratory.
Visualizing Biosciences Genomics & Proteomics. “Scientists Complete Rough Draft of Human Genome” - New York Times, June 26, 2000 The problem: –3 billion.
2. Centers for Disease Control and Prevention (CDC), Atlanta, GA, USA
Introduction to Bioinformatics Resources for DNA Barcoding
Unrooted phylogenetic tree showing the relationship between the human SLC2A gene family for all 14 members created using PHYLIP 3.6 softwareDistance between.
Pipelines for Computational Analysis (Bioinformatics)
Figure A. Molecular phylogenetic tree of β-catenin and related proteins. The human E-cadherin and α-catenin were used for root tree. Phylogenetic analyses.
A Hybrid Algorithm for Multiple DNA Sequence Alignment
Novel PMS2 Pseudogenes Can Conceal Recessive Mutations Causing a Distinctive Childhood Cancer Syndrome  Michel De Vos, Bruce E. Hayward, Susan Picton,
Overview Bioinformatics: Analyzing biological data using statistics, math modeling, and computer science BLAST = Basic Local Alignment Search Tool Input.
L. Dubourg  Clinical Microbiology and Infection 
Chapter 19 Molecular Phylogenetics
Comparative phylogenetic analysis of sapoviruses based on complete RdRp and VP1 nucleotide sequences. Comparative phylogenetic analysis of sapoviruses.
Maximum likelihood (ML) unrooted tree based on the full-length 16S rRNA genes (A) and 31 conserved single-copy genes (B) showing the phylogenetic position.
Phylogenetic tree of 38 Pseudomonas type strains, based on the V3-V5 region sequence of the 16S rRNA gene (V3 primer, positions 442 to 492; and V5 primer,
Unit Genomic sequencing
Novel PMS2 Pseudogenes Can Conceal Recessive Mutations Causing a Distinctive Childhood Cancer Syndrome  Michel De Vos, Bruce E. Hayward, Susan Picton,
Phylogenetic analyses of alphacoronaviruses based on complete genome and ORF1ab protein sequence. Phylogenetic analyses of alphacoronaviruses based on.
Phylogenetic analysis of AquK2P.
Phylogenetic tree representation of a neighbor-joining analysis of several species of piroplasms. Phylogenetic tree representation of a neighbor-joining.
(A) Bayesian phylogenetic tree of the H gene nucleotide alignment from tigers Pt2004 and Pt and representative CDV sequences obtained from GenBank.
Phylogenetic tree based on predominant 16S rRNA gene sequences obtained by C4–V8 Sutterella PCR from AUT-GI patients, Sutterella species isolates, and.
(A and B) Maximum-likelihood trees of 28 strains of Pantoea agglomerans and closely related species, constructed using concatenated sequences of six protein-coding.
ADAGE extracted features that represented sequence differences between strains. ADAGE extracted features that represented sequence differences between.
Presentation transcript:

Create 3 different phylogenetic trees from the alignments Evolutionary Divergence of the Origin of Replication of Various Group II Clostridium Vidal, Kevin§, Hunicke-Smith, S.¥, Blinkova A.¥, Walker J.R.¥, León A.J.§, and Ginés-Candelaria E.§ §Miami Dade College-Wolfson Campus, Department of Natural Sciences, Health & Wellness, Miami, FL 33132 ¥University of Texas at Austin, School of Biological Sciences, Section of Molecular Genetics and Microbiology, Austin TX 78712 Discussion Based on results gathered from the phylogenetic trees constructed from multi-wise alignments from full replication origin regions, we observed a relative high homology between Clostridium taeniosporum and the Clostridium botulinum strains E3 Alaska and B strain Eklund, namely 85.5% and 89.5%, respectively. This result contrasts with the relative relatedness at the 70% level with other Group II strains (see Table 1). Also interesting is the homology score for strain BKT015925, when the orientation of the origin was reversed to correspond to the orientation of the remaining origins. Figure 1 illustrates the actual phylogenetic tree produced by Geneious Pro v6.1.5 for the full replication origin regions the displays the substitutions per site of each genome. We could preliminary infer that the lesser the score value the lesser the substitutions detected. As revealed from the phylogenetic tree, Clostridium taeniosporum had only 0.0405 substitutions per site with regards to the node connecting it to the strains Clostridium botulinum E3 Alaska and Clostridium botulinum B strain Eklund. This means that the origin of replication of Clostridium taeniosporum has only diverged from those strains by .0405 substitutions per site. This result, along with percent identity values shown in table 1, seem to indicate that Clostridium taeniosporum is most closely related to Group II Clostridium botulinum strains E3 Alaska and B strain Eklund. The latter result seems to be supported when we observed the percent identities between the short-sequence dnaA box region 3 detected by OriFinder of the various Group II C. botulinum strains with that same region 1 of Clostridium taeniosporum. A similar outcome results when we analyzed the long-sequence dnaA box regions 2. Region 3 was also run in each alignment to verify our hypothesis that it may not contribute to the activation of the origin of replication in Clostridium taeniosporum. Thus suspicion arose when looking at the annotated genome of Clostridium taeniosporum and seeing that while region 1 and 2 are located in the vicinity of the dnaA gene operon that includes dnaN, recF and gyrB, region 3 is located about 20kbp upstream near the annotation for the gene encoding the ammonium transporter. Whether this represents a factual possibility will require further testing. As seen from table 2 the strains with the highest percent homology to region 1 is Clostridium botulinum E3 Alaska and Clostridium botulinum B strain Eklund, with a percent homology of 71.4 and 70.6, respectively, while the homology between these same two strains with region 3 is only 14.9 and 16.1, respectively. A similar result was obtained when exploring the percent homology for the long-sequence dnaA box region 2, which exhibits higher homology with the long regions than region 1. The latter result, along with the others observed by this study supports our hypothesis that region 1 perhaps may not contribute to the function of the origin in addition of confirming results obtained by rDNA reported by Iver, et al., 2008, which is also confirmed with the reduction in substitutions per site (0.0443) for the branch connecting C. taeniosporum origin to that of the node that links C. botulinum E3 str. Alaska and C. botulinum B str. Eklund 17B. References /Acknowledgements Feng Gao and Chun-Ting Zhang (2008) Ori-Finder: a web-based system for finding oriCs in unannotated bacterial genomes. BMC Bioinformatics.9:79. Geneious v6.5.1 created by Biomatters. Available from http://www.geneious.com/ Iyer A.V. et al. 2008. Clostridium taeniosporum is a close relative of the Clostridium botulinum Group II. Anaerobe 14:318-324. We would like to thank the National Science Foundation Advanced Technological Education Program NSF ATE DUE 0802508 “The Biotechnbology Research Learning Collaborative” (BRLC) and the US Department of Education, HIS-STEM Program P031C110190 STEM-TRACK for their support of this research project. We would like to thank Dr. Edwin Ginés-Candelaria and Professor Alfredo León for their guidance during this project. Introduction Clostridium taeniosporum is an anaerobic, Gram positive bacterium that is nonpathogenic, which has been found to be very closely related to Clostrodium botulinum Group II strains, some of which produce very potent neurotoxins. Despite this close relationship, C. taeniosporum produces endospores with distinctive such as the spores that are formed by Clostridium taeniosporum large, flat, ribbon-like appendages (Iyer, A.V. et al., 2009). Another key distinction is the absence of the toxin produced by some members of their group. The purpose of this study is to determine the evolutionary divergence of clostridial replication origins from the various strains in Group II to reveal if there were any changes in replication strategy from those observed in the Firmicutes. The strategy employed required the localization of the origins of replication for various clostridial strains in Group II directly from their annotated genomes. These sequences were extracted from the National Center for Biotechnology Information (NCBI) depository GenBank. These origins were then aligned compared to the replication origin of Clostridium taeniosporum (Thompson, J., Bode, A., Shrenker, N., et al., personal communication) using CLUSTALW Multi-wise nucleotide alignment software. The resulting alignment was to generate a phylogenetic tree for all the origins. We employed Escherichia coli as the Enterobacteriaceae out-group control. From this preliminary phylogenetic tree, we could observe that Clostridium taeniosporum is most closely related to the Clostridium botulinum group II strains E3 Alaska and B Eklund, thus confirming results obtained by Iyer, et al., using rDNA analyses. Using this information may provide us greater insight into the evolutionary divergence of replication strategies in Clostridium taeniosporum and ultimately, the Firmicutes. Results Figure 1. Phylogenetic tree obtained from CLUSTALW alignments of the complete origins of replication for various Group II clostridia. Materials & Methods The project began by extracting the sequences of the dnaA box regions for the various Clostridium strains along with the Escherichia coli out-group control using the Bioinformatics software OriFinder. The extracted long (ranging from 378-1074 nucleotides in length) and short (260-432 nucleotide range) dnaA box regions from selected Group II clostridia and E. coli were then entered into Bioinformatics software Geneious Pro v6.1.5 to perform a CLUSTAL W multi-wise sequence alignment. The alignment was used to generate a phylogenetic tree for the dnaA box region of origin activation. A similar approach was employed for the complete replication origin where each region was extracted by using the region of approximately 10-15 Kbps encompassing two indicator genes: From gidA (encoding the glucose-inhibited division protein A) to the gyrA (encoding the gyrase subunit A), with the exception of C. taeniosporum where this region consisted of approximately more than 35 Kbps. Extract long and short dnaA box regions on OriFinder for Group II Clostridium botulinum, Clostridium taeniosporum, and Escheria coli Find complete origin of replication for each group of strains using origin indicator genes Run a multiple sequence alignment for the complete origin, and dnaA box long and short regions Create 3 different phylogenetic trees from the alignments Examine the distances and the percent homology of each strain to determine preliminary evolutionary relatedness in Group II clostridia