Introduction to Non-Archimedean Physics of Proteins.

Slides:



Advertisements
Similar presentations
Finding what happens in a space of ever so many dimensions R. Stephen Berry The University of Chicago Brijuni, 28 August – 1 September 2006.
Advertisements

报告人: 林 苑 指导老师:章忠志 副教授 复旦大学  Introduction about random walks  Concepts  Applications  Our works  Fixed-trap problem  Multi-trap problem.
Potential Energy Surface. The Potential Energy Surface Captures the idea that each structure— that is, geometry—has associated with it a unique energy.
Introduction to Non-Archimedean Physics of Proteins. Vladik Avetisov Semenov Institute of Chemical Physics, Russian Academy of Sciences, Moscow.
Small Coupled Oscillations. Types of motion Each multi-particle body has different types of degrees of freedom: translational, rotational and oscillatory.
The Wales Group in Context: Exploring Energy Landscapes Research Review by Ryan Babbush Applied Computation 298r February 8, 2013.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Introduction to Molecular Orbitals
Molecular Modeling: Molecular Mechanics C372 Introduction to Cheminformatics II Kelsey Forsythe.
Dynamics of Learning VQ and Neural Gas Aree Witoelar, Michael Biehl Mathematics and Computing Science University of Groningen, Netherlands in collaboration.
Computational Solid State Physics 計算物性学特論 第2回 2.Interaction between atoms and the lattice properties of crystals.
Conical Intersections Spiridoula Matsika. The study of chemical systems is based on the separation of nuclear and electronic motion The potential energy.
Optical Tweezers F scatt F grad 1. Velocity autocorrelation function from the Langevin model kinetic property property of equilibrium fluctuations For.
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
Energy landscapes and folding dynamics Lorenzo Bongini Dipartimento di Fisica Universita’di Firenze
Protein folding kinetics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
Lattice regularized diffusion Monte Carlo
Finding the Tools--and the Questions--to Understand Dynamics in Many Dimensions R. Stephen Berry The University of Chicago TELLURIDE, APRIL 2007.
AppxA_01fig_PChem.jpg Complex Numbers i. AppxA_02fig_PChem.jpg Complex Conjugate.
Potential Energy Surfaces
Basic Concepts and Definitions Vector and Function Space. A finite or an infinite dimensional linear vector/function space described with set of non-unique.
Anders Eriksson Complex Systems Group Dept. Energy and Environmental Research Chalmers EMBIO Cambridge July 2005 Complex Systems at Chalmers Information.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
A semiclassical, quantitative approach to the Anderson transition Antonio M. García-García Princeton University We study analytically.
MULTISCALE COMPUTATIONAL METHODS Achi Brandt The Weizmann Institute of Science UCLA
MANE 4240 & CIVL 4240 Introduction to Finite Elements
Presented by Johanna Lind and Anna Schurba Facility Location Planning using the Analytic Hierarchy Process Specialisation Seminar „Facility Location Planning“
Vibrational Spectroscopy
Do we need Number Theo ry? Václav Snášel. What is a number? VŠB-TUO, Ostrava
Geometry Optimisation Modelling OH + C 2 H 4 *CH 2 -CH 2 -OH CH 3 -CH 2 -O* 3D PES.
Molecular Information Content
Shai Carmi Bar-Ilan, BU Together with: Shlomo Havlin, Chaoming Song, Kun Wang, and Hernan Makse.
ChE 452 Lecture 24 Reactions As Collisions 1. According To Collision Theory 2 (Equation 7.10)
WHAT CAN KINETICS LEARN FROM NONSTATIONARY THERMODYNAMICS Miloslav Pekař Faculty of Chemistry Institute of Physical and Applied Chemistry Brno University.
Kinetics II Lecture 15. Rates and Concentrations Knowing the rate constant (usually empirically determined), we can compute rate. Integrating rate of.
CZ5225 Methods in Computational Biology Lecture 4-5: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Dynamical Systems Model of the Simple Genetic Algorithm Introduction to Michael Vose’s Theory Rafal Kicinger Summer Lecture Series 2002.
Semenov Institute of Chemical Physics, RAS New results in applications of p-adic pseudo- differential equations to the protein dynamics Vladik Avetisov.
Conformational Space.  Conformation of a molecule: specification of the relative positions of all atoms in 3D-space,  Typical parameterizations:  List.
Ch 24 pages Lecture 9 – Flexible macromolecules.
10/3/2003 Molecular and Cellular Modeling 10/3/2003 Introduction Objective: to construct a comprehensive simulation software system for the computational.
Illustration of FE algorithm on the example of 1D problem Problem: Stress and displacement analysis of a one-dimensional bar, loaded only by its own weight,
University of Texas at Austin CS384G - Computer Graphics Fall 2008 Don Fussell Affine Transformations.
Protein folding dynamics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
Austin Howard & Chris Wohlgamuth April 28, 2009 This presentation is available at
ChE 452 Lecture 25 Non-linear Collisions 1. Background: Collision Theory Key equation Method Use molecular dynamics to simulate the collisions Integrate.
Simplistic Molecular Mechanics Force Field Van der WaalsCharge - Charge Bond Angle Improper Dihedral  
Collective diffusion of the interacting surface gas Magdalena Załuska-Kotur Institute of Physics, Polish Academy of Sciences.
LSM3241: Bioinformatics and Biocomputing Lecture 6: Fundamentals of Molecular Modeling Prof. Chen Yu Zong Tel:
KPS 2007 (April 19, 2007) On spectral density of scale-free networks Doochul Kim (Department of Physics and Astronomy, Seoul National University) Collaborators:
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling I Prof. Corey O’Hern Department of Mechanical Engineering Department.
Assignment for the course: “Introduction to Statistical Thermodynamics of Soft and Biological Matter” Dima Lukatsky In the first.
Graphs, Vectors, and Matrices Daniel A. Spielman Yale University AMS Josiah Willard Gibbs Lecture January 6, 2016.
Simple Rate Equations Consider a chemical reaction: There are two common trends for the time rate of change of the concentration, C A. first order reaction.
Singular Value Decomposition and Numerical Rank. The SVD was established for real square matrices in the 1870’s by Beltrami & Jordan for complex square.
ECE-7000: Nonlinear Dynamical Systems 3. Phase Space Methods 3.1 Determinism: Uniqueness in phase space We Assume that the system is linear stochastic.
Geometric diffusions as a tool for harmonic analysis and structure definition of data By R. R. Coifman et al. The second-round discussion* on * The first-round.
Computational Biology BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
1 Xin Zhou Asia Pacific Center for Theoretical Physics, Dep. of Phys., POSTECH, Pohang, Korea Structuring and Sampling in Complex Conformational.
1924: de Broglie suggests particles are waves Mid-1925: Werner Heisenberg introduces Matrix Mechanics In 1927 he derives uncertainty principles Late 1925:
Tutorial 6. Eigenvalues & Eigenvectors Reminder: Eigenvectors A vector x invariant up to a scaling by λ to a multiplication by matrix A is called.
Intrinsic Data Geometry from a Training Set
Ultrametricity in the context of biology
Master Equation Formalism
Driven Adiabatic Dynamics Approach to the Generation of Multidimensional Free-Energy Surfaces. Mark E. Tuckerman, Dept. of Chemistry, New York University,
Large Time Scale Molecular Paths Using Least Action.
CZ5225 Methods in Computational Biology Lecture 7: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Linear Algebra Lecture 32.
Physics 319 Classical Mechanics
Hydrogen Oxidation by MD Simulations
Presentation transcript:

Introduction to Non-Archimedean Physics of Proteins. Lecture II: p-Adic description of multi-scale protein dynamics. Tree-like presentation of high-dimensional rugged energy landscapes Basin-to-basin kinetics Ultrametric random walk Eigenvalues and eigenvectors of block- hierarchical transition matrices p-Adic equation of ultrametric diffusion p-Adic wavelets

How to define protein dynamics Protein is a macromolecule protein states protein dynamics Protein states are defined by means of conformations of a protein macromolecule. A conformation is understood as the spatial arrangement of all “elementary parts” of a macromolecule. Atoms, units of a polymer chain, or even larger molecular fragments of a chain can be considered as its “elementary parts”. Particular representation depends on the question under the study. Protein dynamics is defined by means of conformational rearrangements of a protein macromolecule. Conformational rearrangements involve fluctuation induced movements of atoms, atomic groups, and even large macromolecular fragments.

Protein molecule consists of ~10 3 atoms. To study protein motions on the subtle scales, say, from ~10-9 sec, it is necessary to use the atomic representation of a protein molecule. Protein molecule consists of ~10 3 atoms. Protein conformational states: number of degrees of freedom : ~ 103 dimensionality of (Euclidian) space of states : ~ 103 In fine-scale presentation, dimensionality of a space of protein states is very high.

Protein energy landscape: dimensionality: ~ 103; Protein dynamics over high dimensional conformational space is governed by complex energy landscape. protein energy landscape Given the interatomic interactions, one can specify the potential energy of each protein conformation, and thereby define an energy surface over the space of protein conformational states. Such a surface is called the protein energy landscape. As far as the protein polymeric chain is folded into a condensed globular state, high dimensionality and ruggedness are assumed to be characteristic to the protein energy landscapes Protein energy landscape: dimensionality: ~ 103; number of local minima ~10100

How such model can be constructed? While modeling the protein motions on many time scales (from ~10-9 sec up to ~100 sec), we need the simplified description of protein energy landscape that keeps its multi-scale complexity. How such model can be constructed? Computer reconstructions of energy landscapes of complex molecular structures suggest some ideas.

Computer reconstruction of complex energy landscapes O.M.Becker, M.Karplus J.Chem.Phys. 106, 1495 (1997) potential energy U(x) conformational space Method Computation of local energy minima and saddle points on the energy landscape using molecular dynamic simulation; Specification a topography of the landscape by the energy sections; Clustering the local minima into hierarchically nested basins of minima. Specification of activation barriers between the basins. B1 B2 B3

Presentation of energy landscapes by tree-like graphs O.M.Becker, M.Karplus J.Chem.Phys. 106, 1495 (1997) The relations between the basins embedded one into another are presented by a tree-like graph. Such a tee is interpreted as a “skeleton” of complex energy landscape. The nodes on the border of the tree ( the “leaves”) are associated with local energy minima (quasi-steady conformational states). The branching vertexes are associated with the energy barriers between the basins of local minima. potential energy U(x) local energy minima

Complex energy landscapes: a fullerene molecule Many deep local minima form the basins of comparable scales. Ground state: attracting basin with a few deep local minima. C60 D.J.Wales et al. Nature 394, 758 (1998)

Complex energy landscapes : Lenard-Jones cluster LJ38 Many local minima form basins of different scales. Ground state: large attracting basin with many local minima of different depths. D.J.Wales et al. Nature 394, 758 (1998)

Complex energy landscapes : tetra-peptide Many local minima form basins of relatively small scales. Ground state is not well defined: there are many small attracting basins. O.M.Becker, M.Karplus J.Chem.Phys. 106, 1495 (1997)

Complex energy landscapes : 58-peptide-chain in a globular state This is a small part of the energy landscape of a crambin Garcia A.E. et al. Physica D, 107, 225 (1997) (reproduced from Frauenfelder H., Leeson D. T. Nature Struct. Biol. 5, 757 (1998)) Tremendous number of local minima grouped into many basins of different scales. Ground state is strongly degenerated.

Complex energy landscapes : a protein The total number of minima on the protein energy landscape is expected to be of the order of ~10100. This value exceeds any real scale in the Universe. Complete reconstruction of protein energy landscape is impossible for any computational resources.

25 years ago, Hans Frauenfelder suggested a tree-like structure of the energy landscape of myoglobin (and this is all what he sad) Hans Frauenfelder, in Protein Structure (N-Y.:Springer Verlag, 1987) p.258.

This is exactly the physical meaning of protein ultrameticity ! 10 years later, Martin Karplus suggested the same idea “In <…> proteins, for example, where individual states are usually clustered in “basins”, the interesting kinetics involves basin-to-basin transitions. The internal distribution within a basin is expected to approach equilibrium on a relatively short time scale, while the slower basin-to-basin kinetics, which involves the crossing of higher barriers, governs the intermediate and long time behavior of the system.” Becker O. M., Karplus M. J. Chem. Phys., 1997, 106, 1495 This is exactly the physical meaning of protein ultrameticity !

That is, the conformational dynamics of a protein molecule is approximated by a random process on the boundary of tree-like graph that represents the protein energy landscape.

Random walk on the boundary of a Cayley tree The branching vertexes are associated with the activation barriers for passes between the basins of local minima. Cayley tree is understood as a hierarchical skeleton of protein energy landscape. The leaves are the local energy minima, and each subtree of the Cayley tree is a basin of local minima. w3 w2 w1 Master equation 𝒇𝒊 is the transition probability, i.e. the probability to find a walker in a state 𝒊 at instant 𝒕, and 𝒘𝒋𝒊 is the rate of transition from 𝒋 to 𝒊. The energy landscape is represented by the transition rates 𝒘𝒋𝒊

Matrix description Master equation 1 2 3 4 5 6 7 8 Due to the basin-to-basin transitions, transition matrix W has a block-hierarchical structure. For regularly branching tree, any matrix element 𝒘𝒋𝒊, is indexed by the hierarchy level  of that vertex over which the transition occurs 𝒘𝒋𝒊 =𝒘𝒊𝒋=𝒘

Translation-non-invariant transition matrix Indexation of the transition matrix elements: non-regular hierarchies with branching index p=2 1 2 3 4 5 6 7 8 A (1,1) C (2,2) 𝜸=𝟐, 𝒏=𝟐 2-adic (2-branching) Cayley tree: each branching vertex is indexed by a pair of integers (𝜸, 𝒏 𝜸 ), where 𝜸 specifies the level at which the vertex lies, and 𝒏 specifies the particular vertex over which the transition occurs. For example: A=(1,1), B=(1,2), C=(2,2). B (1,2) Translation-non-invariant transition matrix The elements of the transition matrix W can be indexed by the pairs of integers (𝜸, 𝒏 𝜸 ). 𝒘 𝒊𝒋 = 𝒘 𝒋𝒊 = 𝒘 𝜸 𝒏 𝜸

Indexation of the transition matrix elements: random walk on 𝒑-branching Cayley tree, 𝒑>𝟐: 𝒑=𝟑 𝑖 𝑗 𝑛 𝛾 𝛾 Given the transition 𝒊→𝒋 we, first, find a minimal subgraph to which both sites 𝒊 and 𝒋 belong. In other words, we find a minimal basin in which the transition 𝒊→𝒋 takes place. This basin is presented by the particular vertex (𝜸, 𝒏𝜸) lying on level 𝜸 of the tree. Then, we go down to the lower lying 𝒑 subbasins 𝜸−𝟏, 𝒏 𝜸−𝟏 𝟏 ,…, 𝜸−𝟏, 𝒏 𝜸−𝟏 𝒑 and find a particular pair of maximal subbasins between which the transition 𝒊→𝒋 occurs. Thus, the elements 𝒘 𝒊𝒋 of the transition matrix 𝐖 can be indexed by three integers, e. g., by a pair (𝜸, 𝒏 𝜸 ) that indicates the smallest basin in which the transition occurs, and an additional index 𝒌, 1≤𝒌≤𝒑−1, that fixes a pair of the largest subbasins between which the transition takes place. 𝒌=𝟏 𝒌=𝟐 𝒘 𝜸𝒏𝒌 𝜸−𝟏 𝒌=2 𝒘 𝜸𝒏𝒌 𝑖 𝑗 𝒌=𝟏 The pair of subbasins that specifies the transition from site 𝒊 to site 𝒋 over the vertex 𝜸, 𝒏 𝜸 minimal basin 𝜸, 𝒏 𝜸 in which the transition takes place

Eigen vectors and eigenvalues of symmetric block-hierarchical transition matrices

Eigenvectors (ultrametric wavelets) An eigenvector of a symmetric block-hierarchical transition matrix specifying a random walk on 𝒑-adic Cayley tree with 𝜞 levels, is a column vector that consist of blocks of components according to the hierarchy of basins. For each level 𝜸, there are (𝒑−𝟏) 𝒑 𝜸 eigenvectors 𝒆 𝒑 (𝜸, 𝒏 𝜸 ,𝒌). Each eigenvector consists of 𝒑 𝚪−𝜸 blocks with 𝒑 𝜸 elements, and only one block has nonzero components. The non-zero block consists of 𝒑 sub-blocks with 𝒑 𝜸−𝟏 identical components in each. These components are the complex numbers 𝒆𝒙𝒑 𝟐𝝅𝒊𝝓(𝜸, 𝒏 𝜸 ,𝒌) such that the sum of all components in non-zero block is equal to 0. Thus, each eigenvector is indexed by a triple (𝜸, 𝒏𝜸, 𝒌). The triple specifies the scale of nonzero block in the column vector ( 𝒑 𝜸 ) , the position of non-zero block in the column vector (𝒏 𝜸 ), and the values of non-zero components, 𝒆𝒙𝒑 𝟐𝝅𝒊𝝓(𝜸, 𝒏 𝜸 ,𝒌) . р=3: one of the 1st-level eigenvectors

Examples: Eigenvectors and eigenvalues of symmetric block-hierarchical 2-adic transition matrix 3,1 1 2 3 4 5 6 7 8 energy barriers 2,2 2,1 1,2 1,3 1,1 1,4

p=2 four 1st-level eigenvectors four 1st-level eigenvalues (3,1) (2,2) 1 2 3 4 5 6 7 8 (1,1) (1,2) (1,3) (1,4) (2,1) (2,2) (3,1)

p=2 two 2nd -level eigenvectors two 2nd-level eigenvalues (3,1) (2,2) 1 2 3 4 5 6 7 8 (1,1) (2,2) (2,1) (1,2) (1,3) (1,4)

p=2 one 3rd -level eigenvector one 3rd -level eigenvalue (3,1) (2,2) 1 2 3 4 5 6 7 8 (1,1) one 3rd -level eigenvalue (2,2) (2,1) (1,2) (1,3) (1,4)

p=2 eigenvector of the equilibrium state eigenvalue of the equilibrium state

Simple rule: eigenvalue is the total rate to exit particular basin formula for non-zero eigenvalues: (p=2)

p-Adic description of ultrametric random walk The basic idea: In the basin-to-basin approximation, the distances between the protein states are ultrametric, so they can be specified by the p- adic numerical norm, and transition rates can be indexed by the p-adic numbers.

Parameterization of ultrametric lattice by p-adic numbers V.A.Avetisov, A.Kh.Bikulov, S.V.Kozyrev J.Phys.A:Math.Gen. 32, 8785 (1999) ultrametric lattice Cayley tree is a graph of ultrametric distances between the sites. At the same time, this tree represents a hierarchy of basins of local minima on the energy landscape. 1 2 3 4 5 6 7 8 0 1 1/2 3/2 1/4 5/4 3/4 7/4 0 1 1\2 3\2 1/4 5/4 3/4 7/4 20 21 22 ultrametric distances between the sites

р-adic equation of ultrametric diffusion Avetisov V A, Bikulov A Kh , Kozyrev S V . Phys.A:Math.Gen. 32, 8785 (1999); Arrhenius law connects mathematics and physics: 𝒘( 𝒙−𝒚 𝒑 )~𝐞𝐱𝐩⁡ − 𝑬( |𝒙−𝒚| 𝒑 ) 𝒌𝑻 master equation of random walk on ultrametric lattice p0 p1 p3 ultrametric distance energy landscape parameterization of the lattice states {𝒊} by rational numbers 𝒙∈𝑿; specification of the transition rates 𝒘 𝒊𝒋 as a function on ultrametric distance, 𝒘( |𝒙−𝒚| 𝒑 ) continuous limit 𝑿⇒ ℚ 𝒑

Thus, we can consider the p-adic equation of ultrametric random walk as a model of macromolecular dynamics on particular energy landscape In fact, this p-adic equation describes very well the complicated protein dynamics on many time scales

Eigenvectors of block-hierarchical transition matrixes is described by p-adic wavelets

p=2 1st-level eigenvector 1st-level wavelet (3,0) (2,1/8) (2,0) (1,0) (1,1/4) (1,1/8) (1,3/8) (2,0) (2,1/8) (3,0) 0 1/2 1\4 3\4 1/8 5/8 3/8 7/8

p=2 2nd -level eigenvector 2nd -level wavelet (3,0) (2,1/8) (2,0) 0 1/2 1/4 3/4 1/8 5/8 3/8 7/8 (1,0) (2,1/8) (2,0) (1,1/4) (1,1/8) (1,3/8)

p=2 3rd -level eigenvector 3rd -level wavelet (3,0) (2,1/8) (2,0) (1,1/4) (1,1/8) (1,0) (1,3/8) 0 1/2 1/4 3/4 1/8 5/8 3/8 7/8

Given the transition rates 𝒘( |𝒙−𝒚| 𝒑 ), i. e Given the transition rates 𝒘( |𝒙−𝒚| 𝒑 ), i.e. a hierarchical skeleton of the energy landscape, one can solve a Cauchy problem for the p-adic equation of ultrametric diffusion: and then calculate some observables using the solution 𝒇 𝒙,𝒕 . In many experiments, the dynamics is observed as a relaxation process (survival probability)

Characteristic relaxations in complex molecular systems V.A.Avetisov, A.Kh.Bikulov, V.Al.Osipov. J.Phys.A:Math.Gen. 36 (2003) 4239 “soft” (logarithmic) landscape stretched exponent decay self-similar (linear) landscape: power decay “robust” (exponential) landscape: logarithmic decay

A type of relaxation suggests particular tree for tree-like presentation of energy landscape Power kinetics of CO rebinding to myoglobin and power broadening of the spectral diffusion suggest that the activation barriers between the basins of local minima linearly grow with hierarchical level 𝜸. Thus, the power-law relaxation typical for proteins suggests the particular form of p-adic equation of protein dynamics:

Summary: p-Adic description of multi-scale protein dynamics is based on: Tree-like presentation of high-dimensional rugged energy landscapes and basin-to-basin-kinetics. p-Adic description of ultrametric random walk on the boundary of a p-branching Cayley tree. Particular form of the p-adic equation of ultrametric diffusion given by the Vladimirov operator.

protein conformational space With the p-adic equation in hands, we can describe all features of CO rebinding and spectral diffusion in proteins protein conformational space X Mb1 binding CO Mb* ?