Identification of an informative and accurate region of the HCV genome for phylogenetic analyses Patrik Medstrand Clinical Virology.

Slides:



Advertisements
Similar presentations
Gene tree analyses of Aboriginal Australians Rosalind Harding University of Oxford.
Advertisements

Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
HCV Evolution: Diversification and Convergence Yury Khudyakov Division of Viral Hepatitis Centers for Disease Control and Prevention, Atlanta, GA.
Analysis of Genetic Linkage of HIV from Couples Enrolled in the HPTN 052 Trial Susan H. Eshleman, MD/PhD HPTN Network Laboratory Johns Hopkins Univ. School.
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Phylogenetic reconstruction
D. Paraskevis 1*, S, Paraschiv 2, V. Sypsa 1, G. Nikolopoulos 3, C. Tsiara 3, G. Magiorkinis 4, M. Psichogiou 5, I. Niculescu 2, I. Batan 2, M. Malliori.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
Bell Work Dogs of a certain breed can have black fur or white fur. Black fur is dominant, but the breeder only wants puppies with white fur. Cross two.
Lecture 13 – Performance of Methods Folks often use the term “reliability” without a very clear definition of what it is. Methods of assessing performance.
Materials and Methods Abstract Conclusions Introduction 1. Korber B, et al. Br Med Bull 2001; 58: Rambaut A, et al. Nat. Rev. Genet. 2004; 5:
OECD Short-Term Economic Statistics Working PartyJune Analysis of revisions for short-term economic statistics Richard McKenzie OECD OECD Short.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Using Phylogenetic Analysis to Identify HIV Transmission Channels among Persons Newly Diagnosed with HIV-1 Infection in Los Angeles County, Kwa.
HIV-1 subtype C now accounts for approximately 50% of the estimated 33 million people living with HIV/AIDS and half of the 1-2 million new infections annually.
A MOLECULAR APPROACH TO INVESTIGATE TUBERCULOSIS CASES IN A GOTHIC POPULATION FROM GHERĂSENI NECROPOLIS, BUZĂU COUNTY 1 Molecular Biology Center, Interdisciplinary.
Molecular phylogenetics
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Phylogenetic Insight into HIV Transmission Networks in a Southeastern US Cohort Ann Dennis 1, Stéphane Hué 2, Christopher Hurt 1, Sonia Napravnik 1, Deenan.
From Genomic Sequence Data to Genotype: A Proposed Machine Learning Approach for Genotyping Hepatitis C Virus Genaro Hernandez Jr CMSC 601 Spring 2011.
Molecular phylogenetics 1 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
Tree Confidence Have we got the true tree? Use known phylogenies Unfortunately, very rare Hillis et al. (1992) created experimental phylogenies using phage.
Data Collection and Sampling
The use of short-read next generation sequences to recover the evolutionary histories in multi-individual samples Systematic biology presentation Yuantong.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Phylodynamically Estimated HIV Diversification Rates Reveal Prevention of HIV-1 by Antiretroviral Therapy Jeffrey B. Joy, Richard H. Liang, Rosemary M.
We obtained breast cancer tissues from the Breast Cancer Biospecimen Repository of Fred Hutchinson Cancer Research Center. We performed two rounds of next-gen.
THE DEVELOPMENT OF INFLUENZA SURVEILLANCE NETWORK IN THE PHILIPPINES (June 2005 – March 2008)‏ Remigio M. Olveda 1, Enrique A. Tayag 2, Analisa N. Bautista.
Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.
PHYLOGENY AND THE TREE OF LIFE CH 26. I. Phylogenies show evolutionary relationships A. Binomial nomenclature: – Genus + species name Homo sapiens.
Northern Star Coral (Astrangia poculata) Populations from the New Jersey Coast. Abstract- This project investigated the distribution and molecular evolution.
Selecting Genomes for Reconstruction of Ancestral Genomes Louxin Zhang Department of Mathematics National University of Singapore.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Lessons Learned and Novel Investigation Techniques in Response to a Large Community Outbreak of HIV-1 infection Philip J. Peters MD HIV Testing and Biomedical.
By Dr Hidayathulla Shaikh. Objectives At the end of the lecture student should be able to –  Explain types of examination  Discuss different types of.
Jelena Prpić, B.Sc., PhD Croatian Veterinary Institute.
Taxonomy & Phylogeny. B-5.6 Summarize ways that scientists use data from a variety of sources to investigate and critically analyze aspects of evolutionary.
Academic Communication in English Introduction. Purposes Be familiar with the 3 moves in an introduction and learn how to write an introduction for the.
The impact of treatment duration on defective and expanded identical HIV genomes in T cell subsets from peripheral blood and tissues Eunok Lee.
2. Centers for Disease Control and Prevention (CDC), Atlanta, GA, USA
Introduction to Bioinformatics Resources for DNA Barcoding
Vienna, 2016 Towards developing evidence-based GIS-driven infrastructure for cancer surveillance in Greece. Dimitra Sifaki-Pistolla Epidemiologist, MPH,
Dr.MUSTAQUE AHMED MBBS,MD(COMMUNITY MEDICINE), FELLOWSHIP IN HIV/AIDS
Figure 1. Recent estimates of Neoaves phylogeny. a) The Jarvis et al
Pipelines for Computational Analysis (Bioinformatics)
A Hybrid Algorithm for Multiple DNA Sequence Alignment
Accurate genotyping of hepatitis C virus through nucleotide sequencing and identification of new HCV subtypes in China population  Y.-Q. Tong, B. Liu,
Evolution and Spread of Ebola Virus in Liberia, 2014–2015
Increased Risk of Transmission of Hepatitis C in Open Heart Surgery Compared With Vascular and Pulmonary Surgery  Karina Olsen, MD, Per Erling Dahl, MD,
Summary and Recommendations
Volume 130, Issue 3, Pages (March 2006)
The Role of NICs in Influenza Surveillance
Volume 26, Issue 2, Pages (January 2016)
Accurate genotyping of hepatitis C virus through nucleotide sequencing and identification of new HCV subtypes in China population  Y.-Q. Tong, B. Liu,
A Web-based Interactive Genome Library for Surveillance, Detection, Characterization and Drug-Resistance Monitoring of Influenza Virus Infection in the.
Genotyping and origin of the emergent U. S
Tracing the Evolution of Hepatitis C Virus in the United States, Japan, and Egypt By Using the Molecular Clock  Masashi Mizokami, Yasuhito Tanaka  Clinical.
Maximum likelihood (ML) unrooted tree based on the full-length 16S rRNA genes (A) and 31 conserved single-copy genes (B) showing the phylogenetic position.
Justin R. Bailey, Eleanor Barnes, Andrea L. Cox  Gastroenterology 
Phylogenetic Trees Jasmin sutkovic.
PHYLOGENETIC TREES.
TRACE INITIATIVE: Data Use
Summary and Recommendations
Perspectives from Los Angeles County Tuberculosis Control Program
Phylogenetic analyses of alphacoronaviruses based on complete genome and ORF1ab protein sequence. Phylogenetic analyses of alphacoronaviruses based on.
Neighbor-joining tree of the 262 S
Rapid Detection of HIV-1 subtype C Integrase resistance mutations by the Use of High-Resolution Melting Analysis Tendai Washaya BSc, Msc. Pre-PhD Student.
Presentation transcript:

Identification of an informative and accurate region of the HCV genome for phylogenetic analyses Patrik Medstrand Clinical Virology Department of Laboratory Medicine Malmö Lund University, Sweden 11 th annual conference of new Visby network Vilnius, April 25-27, 2014

To investigate the dynamics of the HCV epidemic in the Baltic region using genetic and epidemiological data (date and site of sample collection, information of potential ”risk group”) To initiate a joint project among participants in the network (part of the proposal to SI) AIMS

Phylogenetics define genetic relationships We will study the genetic relationship among HCV strains that are circulating in the Baltic region. We could address: Relationship among HCV strains in the region (cities, countries, risk groups, general population) this can be done by “classical” phylogenetic studies, where groups (“clusters”) are defined

Cluster definitions = need statistics To define relationships (clusters; epidemiological links defined by a common ancestral HCV strain) we need to use statistics. Limited information will obscure the identification of true relationships. Information or power in phylogenetic inference is represented by genetic information in terms of nucleotide sequences. This is similar to any statistical comparison - to small groups or limited information does not allow us to draw any robust conclusions.

More might be better 1 kb 9 kb Subset Complete

From phylogenetics to phylodynamics Unrooted Genetic relationshipRooted and with time estimates Colors: Geographic locations Risk groups etc NEW INSIGHTS

Using phylodynamic analysis, we investigate the extent the HCV epidemic in three metropolitan areas of Sweden were linked or separate. We found evidence for one early introduction (Western Europe to Gothenburg in 1958; panel A) and rapid dissemination (from Gothenburg to Stockholm and Malmö ; panels B-C), whereas the later epidemic (after 1975) were characterized by HCV strains that were introduced from regions outside Sweden (Western Europe and USA; panel D), indicating limited epidemic links within Sweden during this later time period (Jerkeman et al, manuscript in preparation). Panel E: exponential growth from ~ A. B. C. D. Phylodynamics can inform about migrations and growth of the epidemic historical and more recent Similar studies can be performed on other “traits”, such as risk groups E

Goal of present study 1. Identify phylogenetically informative genome regions that Allow identification of a reasonable number and correct clusters Allow reconstruction of the “true” phylogeny in comparison to the phylogeny reconstructed from near full-length HCV genomes 2. Establish a convenient PCR and sequencing protocol

Genome regions Sequence similarity Number of sequences (country and year info) E1-E2 P7 P7-NS2 NS5A NS5B NS5A-NS5B NS5Bsh

Data set and Methods 143 near full length HCV 1a genomes (polyprotein region) were obtained from the Los Alamos HCV database The data set was used to create 7 subsets representing 7 subgenomic regions ML trees were constructed using Garli v2.0 using GTR+I+G subst model Branch support was estimated using the Shimodaira-Hasegawa (SH) test as implemented in PhyML False positive branches were defined as branches with statistical support (SH > 0.9) in ML-trees of subgenomic regions, that were absent in the ML-tree obtained from the polyprotein region (“true” tree) Accuracy (topology-testing) of phylogenies obtained from subgenomic regions were inferred by statistical comparison to the “true” tree using the SH-test implemented in TreePuzzle and Consel

Branch support Polyprotein E1-E2 P7 P7-NS2 NS5A-NS5B NS5A NS5B NS5B-sh (9036 bp) (1236 bp) (933 bp) (1455 bp) (2934 bp) (1272 bp) (1688 bp) (640 bp) Supported Branches (N) FP (%) True supported Branches (N)

Topology support E1-E2 P7 P7-NS2 NS5A-NS5B NS5A NS5B NS5B-sh (1236 bp) (933 bp) (1455 bp) (2934 bp) (1272 bp) (1688 bp) (640 bp) Branches in subgenomic tree supported in true tree (N) Topology difference of sub- genomic and true tree (p-value) - <0.01 <0.01 < <0.01 <0.01

Conclusions The 1272-bp region of NS5A displayed the lowest FP-rate compared to other subgenomic regions analyzed The NS5A and NS5A-NS5B trees conformed topologies of the true tree. In total, 39 NS5A branches of a total 75 branches were shared with the true tree. Among those, 22 branches had statistical support. The NS5A region represents a trade-off between phylogenetic accuracy/information in comparison to full-length genome sequencing, and may be suitable for phylogenetic and phylodynamics studies of HCV The preliminary findings shown here will be confirmed using other HCV subgenotypes and methods PCR protocols will be established and shared to network members

Lund University, Sweden Anders Widell Per Björkman Anna Jerkeman Marianne Alanko Vilma Molnegren Joakim Esbjörnsson HCV study ACKNOWLEDGEMENT Thanks to Anders and Joakim for presenting this! To bad that I couldn’t come to Vilnius but are looking forwards to see you all soon!

Alternative regions Sequence similarity Number of sequences (country and year info) E1-E2 P7 P7-NS2 NS5A NS5B NS5A-NS5B NS5Bsh