Compositional Assemblies Behave Similarly to Quasispecies Model

Slides:



Advertisements
Similar presentations
Modeling Butterfly Populations Richard Gejji Justin Skoff.
Advertisements

Introduction to Haplotype Estimation Stat/Biostat 550.
Chapter 14 Chemical Kinetics.
On the Genetic Evolution of a Perfect Tic-Tac-Toe Strategy
What is a Population A population is a group of organisms of the same species that live in a specific geographical area and interbreed. A population is.
Quasispecies Theory and the Behavior of RNA Viruses Sumeeta Singh, Steve Bowers, Greg Rice, Tom McCarty BINF /19/13.
Alexa Curcio. Original Problem : Would a restriction on height, such as prohibiting males from marrying taller females, affect the height of the entire.
Sampling distributions of alleles under models of neutral evolution.
Amplicon-Based Quasipecies Assembly Using Next Generation Sequencing Nick Mancuso Bassam Tork Computer Science Department Georgia State University.
Unit B 4-4 Animal Science and The Industry. Problem Area 4 Understanding Animal Reproduction and Biotechnology.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
Lipidia : An Artificial Chemistry of Self-Replicating Assemblies of Lipid-like Molecules ”Lipid World” Modeling a ”Lipid World” scenario Origin of Life.
Forward Genealogical Simulations Assumptions:1) Fixed population size 2) Fixed mating time Step #1:The mating process: For a fixed population size N, there.
TEMPLATE DESIGN © Distribution of Passenger Mutations in Exponentially Growing Wave 0 Cancer Population Yifei Chen 1 ;
Population Genetics: An introduction Change in Populations & Communities: Population Genetics.
Dynamics of RNA-based replicator networks Camille STEPHAN-OTTO ATTOLINI, Christof FLAMM, Peter STADLER Institut for Theoretical Chemistry and Structural.
Introduction to the Cell Cycle and Inheritance
Unit Animal Science. Problem Area Animal Genetics and Biotechnology.
Hilton’s Game of Life (HGL) A theoretical explanation of the phenomenon “life” in real nature. Hilton Tamanaha Goi Ph.D. 1st Year, KAIST, Dept. of EECS.
Case Study. DNA Deoxyribonucleic acid (DNA) is a nucleic acid that contains the genetic instructions used in the development and functioning of all known.
Projection exercises Stable age distribution and population growth rate Reproductive value of different ages Not all matrices yield a stable age distribution.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Introduction to Statistical Inferences
Genetic Algorithm.
A REVIEW OF OCCUPANCY PROBLEMS AND THEIR APPLICATIONS WITH A MATLAB DEMO Samuel Khuvis, Undergraduate Nagaraj Neerchal, Professor of Statistics Department.
Rates and reaction mechanism ► The reaction mechanism is the sequence of individual reaction steps that together complete the transformation of reactants.
Lotte Ramekers. Research questions Introduction Models Methods Experiments and results Conclusions Questions.
Evolvability of Autocatalytic Reaction Networks
Modeling and Simulation CS 313
1 Prebiotic Evolution of Molecular Assemblies: From Molecules to Ecology Omer Markovitch and Doron Lancet Department of Molecular Genetics, Weizmann Institute.
Researchers: Preet Bola Mike Earnest Kevin Varela-O’Hara Han Zou Advisor: Walter Rusin Data Storage Networks.
Lecture 3: population genetics I: mutation and recombination
Dynamical Systems Model of the Simple Genetic Algorithm Introduction to Michael Vose’s Theory Rafal Kicinger Summer Lecture Series 2002.
Virus Evolution. Lecture 6. Chapter 20, pp. 759 – 777.
Functional Integrals for the Parallel and Eigen Models of Virus Evolution Jeong-Man Park The Catholic University of Korea.
Finding Mathematics in Genes and Diseases Ming-Ying Leung Department of Mathematical Sciences University of Texas at El Paso (UTEP)
The Importance of DNA to Biology Nathan Money 2 nd period August 3, 2011 Watson & Crick with their DNA model in 1953.
The error threshold or ribo-organisms Eörs Szathmáry Collegium Budapest AND Eötvös University.
What is a QUASI-SPECIES By Ye Dan U062281A USC3002 Picturing the World through Mathematics.
Numerical simulations of thermal counterflow in the presence of solid boundaries Andrew Baggaley Jason Laurie Weizmann Institute Sylvain Laizet Imperial.
By Rosalind Allen Regulatory networks Biochemical noise.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Estimating Recombination Rates. LRH selection test, and recombination Recall that LRH/EHH tests for selection by looking at frequencies of specific haplotypes.
1 Network Motifs in Prebiotic Metabolic Networks Omer Markovitch and Doron Lancet, Department of Molecular Genetics, Weizmann Institute of Science.
Analysis of the Life-Cycle Graph: The Transition Matrix Modeling Approach.
1 1 Slide Simulation Professor Ahmadi. 2 2 Slide Simulation Chapter Outline n Computer Simulation n Simulation Modeling n Random Variables and Pseudo-Random.
DNA replication ?  DNA replication- the basis for biological inheritance, is a fundamental process occurring in all living organisms to copy their DNA.
Sensitivity of HOM Frequency in the ESS Medium Beta Cavity Aaron Farricker.
NEW TOPIC: MOLECULAR EVOLUTION.
1.Stream A and Stream B are located on two isolated islands with similar characteristics. How do these two stream beds differ? 2.Suppose a fish that varies.
GENETIC ALGORITHM Basic Algorithm begin set time t = 0;
To be presented by Maral Hudaybergenova IENG 513 FALL 2015.
BSAA CD UNIT C Animal Science. Problem Area 1 Animal Genetics and Biotechnology.
Genetic algorithms: A Stochastic Approach for Improving the Current Cadastre Accuracies Anna Shnaidman Uri Shoshani Yerach Doytsher Mapping and Geo-Information.
Application of the GA-PSO with the Fuzzy controller to the robot soccer Department of Electrical Engineering, Southern Taiwan University, Tainan, R.O.C.
Mechanisms of Evolution  Lesson goals:  1. Define evolution in terms of genetics.  2. Using mathematics show how evolution cannot occur unless there.
Chapter 16 Darwin’s Theory of Evolution Evolution What is evolution? A change in a population over time These changes is caused by many factors and are.
An unsupervised conditional random fields approach for clustering gene expression time series Chang-Tsun Li, Yinyin Yuan and Roland Wilson Bioinformatics,
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
Abstract Our research mainly applies Maximum Likelihood Method (MLE), Dynamic Programming, and Neighbor Joining Method in an attempt of shortening the.
Unit 11 Review Created by Mrs. Martin. 1. List the 6 factors that affect the rate of a reaction.
Lecture 6 Genetic drift & Mutation Sonja Kujala
Bulgarian Academy of Sciences
The coalescent with recombination (Chapter 5, Part 1)
Dr. Kenneth Stanley February 6, 2006
CHEM 3310 Chemical Kinetics Collision Theory & Transition State Theory.
Unit Genomic sequencing
First, let’s talk about the word THEORY…
Evolution of Replicator Systems Bratus A. S. 1,2 , Yakushkina T
The two fundamental equations of quasispecies theory (243).
Presentation transcript:

Compositional Assemblies Behave Similarly to Quasispecies Model Renan Gross, Omer Markovitch and Doron Lancet Department of Molecular Genetics, Weizmann Institute of Science, Israel ILASOL, 01/12/2013

An overview of Quasispecies An overview of GARD Results Outline Main result: populations of the compositional assemblies of the GARD model behave in a way that is similar to that predicted by quasispecies theory. Our plan today: An overview of Quasispecies An overview of GARD Results

Introduction Quasispecies are a cloud of genotypes that appear in a population at mutation-selection balance. (J.J Bull et al, PLoS 2005) - Theoretical model (equations and assumptions), with experimental support by RNA viruses. Such dynamics are needed when mutation rates are high. Early life is reasoned to have lived near error-catastrophe

Introduction Why is this important? Further evidence that compositional inheritance can describe known population dynamics. As early life probably had little error correction, quasispecies applies to models for the origin of life. If complete correspondence between GARD and quasispecies model is achieved, we can move to a higher level of abstraction.

Basically a population model n different genotypes / identities Quasispecies model Reactor Basically a population model n different genotypes / identities Their relative concentration in the environment is denoted by the fraction xi. Goal: To find out how xi behave as a function of time. 𝑖=1 𝑛 𝑥 𝑖 =1

Quasispecies model Each one replicates at a certain rate – how many offspring it has per unit time. Some replicate faster than others. This is called the replication rate, denoted Ai.

Quasispecies model However, replication is not exact. Sometimes, the offspring is of another genotype. The chance that a genotype j replicates into genotype i is denoted Qij. Q11 = 0.5 Q21 = 0.2 Q31 = 0.2 Q41 = 0.1

We can put everything in a matrix, called the transition matrix, Q. Quasispecies model We can put everything in a matrix, called the transition matrix, Q. 𝟎.𝟓 0.1 0 0.7 0.2 𝟎.𝟓 0 0.2 0.2 0.2 𝟏 0 0.1 0.2 0 𝟎.𝟏 The main diagonal is faithful self replication. From To

The Eigen equation (after Manfred Eigen): Quasispecies model Under constant population assumptions, the transition matrix Q and the replication rates A are all that are needed in order to find out the concentration dynamics. The Eigen equation (after Manfred Eigen): Where E is the “average excess rate” 𝑑 𝑥 𝑖 𝑑𝑡 = 𝐴 𝑖 𝑄 𝑖𝑖 − 𝐸 𝑡 𝑥 𝑖 + 𝑗≠𝑖 𝐴 𝑖 𝑄 𝑖𝑗 𝑥 𝑗 𝐸 𝑡 = 𝑖=1 𝑛 𝐴 𝑖 𝑥 𝑖

Initial conditions: x1 = 1, all others are 0. Examples: 𝑾= 1 0.001 0.001 0.01 0.1 2 0.01 0.01 0.001 0.1 3 0.01 0.001 0.001 0.001 4 Initial conditions: x1 = 1, all others are 0.

Initial conditions: x1 = 1, all others are 0. Examples: 𝑾= 1 0.001 0.001 1 0.1 2 0.01 1 0.001 0.1 3 1 0.001 0.001 0.001 4 Initial conditions: x1 = 1, all others are 0.

The steady state population distribution is called the quasispecies.

This is all theory. What happens for RNA based life forms? Quasispecies model This is all theory. What happens for RNA based life forms? The basic model deals with RNA strands of constant length v : There is a single genotype with highest replication rate: the master sequence Single digit replication: 0 ≤ q ≤ 1 All mutations of the master sequence replicate slower according to the Hamming distance. Effectively, many genotypes are grouped together. Q and A are built only from q and v. q 1 1-q

GARD model (Graded Autocatalysis Replication Domain) Lipid world RNA world Assemblies / Clusters / Vesicles / Membranes  Composition non-covalent bonds DNA / RNA / Polymers  Sequence covalent bonds Segre and Lancet, EMBO Reports 1 (2000)

GARD model (Graded Autocatalysis Replication Domain) Fission / Split Homeostatic growth b

GARD model (Graded Autocatalysis Replication Domain) Following a single lineage. Compositional Similarity Similarity ‘carpet’ Composome (compositional genome): a faithfully replicating composition/assembly. Compotype (composome type): a collection of similar composomes.

Do GARD population dynamics behave like the quasispecies model? GARD and Quasispecies Do GARD population dynamics behave like the quasispecies model? What we would like to do: For each assembly, experimentally find out the transition frequencies and replication rates. In other words: find Q and A.

But, wait, isn’t it trivial? GARD and Quasispecies But, wait, isn’t it trivial? Shouldn’t every replicating-with-errors entity follow the quasispecies dynamics? As we will be seen, the compotype is a “master sequence” candidate for GARD This is emergent from GARD dynamics, and is hence non-trivial An effect similar to error catastrophe also occurs

Back to finding Q and A: Problem: GARD and Quasispecies With 100 different molecule types and maximum assembly size of 100, there are ~4∙1058 different assemblies.

We decided to group together by distances from the compotype. GARD and Quasispecies Solution: group some assemblies together and treat them as one genotype. We decided to group together by distances from the compotype. NG space (simplified to 2d) 3 Molecule 2 2 1 Compotype Molecule 1

Each assembly is split and its offspring grown. GARD and Quasispecies Each assembly is split and its offspring grown. Q = to where did the assembly split? A = how long did it take the offspring to grow? Offspring 3 Molecule 2 2 Random assembly 1 Molecule 1

Example of a transition matrix: GARD and Quasispecies Example of a transition matrix: <<Data removed from published version>>

There is a constant number of assemblies in a population. GARD and Quasispecies There is a constant number of assemblies in a population. Each turn, a single molecule is added. Upon split, the parent as well as a random assembly are replaced with the two children.

GARD and Quasispecies The population simulation goes into steady state, concerning the frequencies of compotypes (as shown by O. Markovitch) The distribution of distances from the eigenvector was calculated for the population steady state, and compared with prediction.

GARD and Quasispecies <<Data removed from published version>>

GARD and Quasispecies <<Data removed from published version>>

Correlations About 0.7 are above 0.8 GARD and Quasispecies <<Data removed from published version>>

There is a dependence on the number of compotypes GARD and Quasispecies There is a dependence on the number of compotypes <<Data removed from published version>>

Dynamics GARD and Quasispecies <<Data removed from published version>>

Error Catastrophe in GARD The GARD equation: Includes two parameters How do kf and kb affect the population dynamics?

Error Catastrophe in GARD The GARD equation contains two parameters which dictate how fast molecules flow in and out How do kf and kb affect the population dynamics?

Error Catastrophe in GARD What happens if you lower kf ? Run single lineage simulation with different kf values. <<Data removed from published version>>

Error Catastrophe in GARD Since the most common compotype frequency decreases drastically, there is a high increase in drift  no composomes. Conclusion: kf and kb affect replication fidelity.

Error Catastrophe in GARD What happens in quasispecies model? We obtained Q and A for lower kf values. Two types of results: Seed = 12 Seed = 15 <<Data removed from published version>>

Error Catastrophe in GARD The difference seems to relate to the size of the compotype. <<Data removed from published version>>

∴ Compotypes/composomes behave similarly to quasispecies. Conclusions GARD constant population models give distance distributions that are similar to those generated by the quasispecies model. GARD replication fidelity shows sensitivity to kf and kb. Low kf results in loss of compotype dominance, just like high single digit error results in loss of master sequence. ∴ Compotypes/composomes behave similarly to quasispecies.