Two cases of chemometrics application in protein crystallography European Molecular Biology Laboratory (EMBL), Hamburg, Germany Andrey Bogomolov.

Slides:



Advertisements
Similar presentations
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Advertisements

Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Lecture 3 – 4. October 2010 Molecular force field 1.
(X-Ray Crystallography) X-RAY DIFFRACTION. I. X-Ray Diffraction  Uses X-Rays to identify the arrangement of atoms, molecules, or ions within a crystalline.
Hanging Drop Sitting Drop Microdialysis Crystallization Screening.
Macromolecular structure refinement Garib N Murshudov York Structural Biology Laboratory Chemistry Department University of York.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Mark J. van der Woerd1, Donald Estep2, Simon Tavener2, F
Bridging the solution divide: comprehensive structural analyses of dynamic RNA, DNA, and protein assemblies by small-angle X-ray scattering By Rambo and.
Proteins are made by linking amino acids Protein Structure Review and Refinement Introduction Brian Bahnson Dept of Chemistry & Biochemistry, University.
The Crystallographic Refinement of TM1389- A methyl-transferase from Thermotoga maritima Rosanne Joseph SLAC Summer Intern Joint Center for Structural.
Using Structure to Elucidate Function/Mechanism An experimental strategy for – determining the arrangement of atoms/molecules in space to obtain structural.
Spectroscopy of Proteins. Proteins The final product of the genes, translated form genes (mutation in gene leads to a mutated protein) Made of a verity.
Computing and Chemistry 3-41 Athabasca Hall Sept. 16, 2013.
From T. MADHAVAN, & K.Chandrasekaran Lecturers in Zoology.. EXIT.
Common types of spectroscopy
Number of released entries Year. Growth of Molecular Complexity Number of Chains Year Number of Structures Containing that Number of Chains.
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
23 May June May 2002 From genes to drugs via crystallography 19 May 1996 Experimental and computational approaches to structure based.
Introduction to Macromolecular X-ray Crystallography Biochem 300 Borden Lacy Print and online resources: Introduction to Macromolecular X-ray Crystallography,
Interactive Series Baseline Correction Algorithm
Threeway analysis Batch organic synthesis. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
De novo Protein Design Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
The Strategy of Atomic Resolution Structural Biology Break down complexity so that the system can be understood at a fundamental level Build up a picture.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Single-crystal X-ray Crystallography ● The most common experimental means of obtaining a detailed picture of a large molecule like a protein. ● Allows.
Structure to Elucidate Mechanism
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
Overview of MR in CCP4 II. Roadmap
1. Diffraction intensity 2. Patterson map Lecture
Structural Biomedicine
STRUCTURAL BIOLOGY Martina Mijušković ETH Zürich, Switzerland.
~ Crystallography ~ Prof. Yu Wang Office: A507 Telephone: ~
X RAY CRYSTALLOGRAPHY. WHY X-RAY? IN ORDER TO BE OBSERVED THE DIMENTIONS OF AN OBJECT MUST BE HALF OF THE LIGHT WAVELENGHT USED TO OBSERVE IT.
Structure of a Membrane Proteins in Situ F. Jamilidinan, P. Schwander,D. K. Saldin.
COMPUTERS IN BIOLOGY Elizabeth Muros INTRO TO PERSONAL COMPUTING.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
EBI is an Outstation of the European Molecular Biology Laboratory. Sanchayita Sen, Ph.D. PDB Depositions Validation & Structure Quality.
ELECTRON CRYSTALLOGRAPHY:
3. Spot Finding 7(i). 2D Integration 2. Image Handling 7(ii). 3D Integration 4. Indexing 8. Results 1. Introduction5. Refinement Background mask and plane.
Ligand Building with ARP/wARP. Automated Model Building Given the native X-ray diffraction data and a phase-set To rapidly deliver a complete, accurate.
Atomic structure model
X-ray crystallography – an overview (based on Bernie Brown’s talk, Dept. of Chemistry, WFU) Protein is crystallized (sometimes low-gravity atmosphere is.
Ch 10 Pages ; Lecture 24 – Introduction to Spectroscopy.
Membrane protein session Erice Introduction Werner Kühlbrandt5 min Aquaporin 0Tom Walz30+5 min Na + /H + antiporterCarola Hunte30+5 min EM and.
X-Ray Diffraction Spring 2011.
Using Technology to Study Cellular and Molecular Biology.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
Małopolska Centre of Biotechnology (MCB) X-Ray Crystallography Laboratory Looking into the deep – structural investigations of biological macromolecules.
Bringing structural biology services through collaboration.
Cont. Proteomics Structural Genomics Describes the experimental and analytical techniques that are used to determine the structure of proteins and RNA.
Page 1 Phase Determination by Creative BiostructureCreative Biostructure.
Page 1 Protein Crystallization —by Creative Biostructure.
Stony Brook Integrative Structural Biology Organization
5th European Immunology & Innate Immunity Conference
What is cryo EM? EM = (Transmission) Electron Microscopy
Organic Chemistry Lesson 21 X-ray crystallography.
How do we determine these structures?
Data interpretation Creative Biostructure now can provide a collection of tools and analysis package for biomolecular structure determination, refinement.
Protein structure Our understanding of life at the molecular level is highly dependent on the ability to map the molecular details of individual proteins.
X-ray crystallography Creative Biostructure offers high-throughput UniCrys™ crystallization services with our state-of-the-art facilities and has developed.
Protein structure analysis Our understanding of life at the molecular level is highly dependent on the ability to map the molecular details of individual.
Data processing Creative Biostructure now can provide a collection of tools and analysis package for biomolecular structure determination, refinement and.
Introduction to Bioinformatics II
Reduce the need for human intervention in protein model building
Nobel Laureates of X Ray Crystallography
Experimental phasing in Crank2 Pavol Skubak and Navraj Pannu Biophysical Structural Chemistry, Leiden University, The Netherlands
Presentation transcript:

Two cases of chemometrics application in protein crystallography European Molecular Biology Laboratory (EMBL), Hamburg, Germany Andrey Bogomolov

Outline Protein crystallography: a brief introduction Case I: determination of protein secondary structure from the raw diffraction data using PLS-R Case II: modeling of crystal radiation damage Potential applications of chemometric techniques to crystallography (of biological macromolecules)

Protein crystallography: introduction Protein (macromolecular) crystallography is a scientific discipline that studies… biological objects: proteins, DNA, RNA etc. … by physical means: X-ray diffraction, synchrotron radiation … on the chemical level: 3D-structure, complexes, interactions … with the extensive use of mathematics: data analysis, modeling The main objectives: solve 3D-structure of a molecule explain its biological function at the atomic level Today’s hot topic: drug design part of the global “-omics” project (genomics/proteomics)

Protein crystallography workflow protein (DNA, RNA) solution structure solution data collection crystallization phasing expression& purification

Protein crystallography workflow protein crystal structure solution data collection expression& purification phasing crystallization

Protein crystallography workflow diffraction pattern structure solution crystallization expression& purification phasing data collection

Protein crystallography workflow electron density map structure solution crystallization expression& purification data collection phasing

Protein crystallography workflow 3D structure structure solution crystallization expression& purification phasing data collection

Protein Data Bank (PDB) Global data collection (>30000 records) 3D structures experimental data biological and chemical information

Crystallographic data collection: Wilson plot X-ray beam experimental theoretical control optimization

Case I: Determination of protein secondary structure Problem: determine the contents (fractions of the polypeptide chain) of secondary structure elements in a protein molecule from the raw diffraction data (Wilson plot) well established method for CD and IR spectra of protein solutions PLS regression – one of the best methods Wilson plot: only qualitative data on existing correlation for “theoretical” data α-helix β-sheet

Secondary structure determination: data Data Preprocessing: averaging with an optimal bin size* special scaling (correction for anisotropic B-factor)* taking the natural logarithm conversion into the matrix (Wilson plots in rows)* auto-scaling outliers detection and removal* theoretical experimental *) experimental data only

Secondary structure determination: data (2) theoretical experimental 1d5t (α+β) 1at0 (β) 1hq3 (α)

Secondary structure determination: calibration results 1.S. Navea, R. Tauler, A. de Juan, Elucidation of protein secondary structure, Anal. Biochem. 336 (2005) 231–242 2.K.A. Oberg, J.-M. Ruysschaert, and E. Goormaghtigh, The optimization of protein secondary structure determination with infrared and circular dichroism spectra, Eur. J. Biochem. 271 (2004) α-helix (theoretical) Element -helix-sheet Theoretical0.062 (0.96)0.060 (0.92) Experimental * (0.84)0.081 (0.84) IR/PLS [1]0.078 (0.93)0.075 (0.93) CD/PLS [2]0.077 (0.94)0.092 (0.89) μ: α=0.31, β= (0.00)0.22 (0.00) RMSEP & correlation coefficients for different methods *) Resolution (1/d) = 0.52 Å -1 (~1.9 Å)

Case II: Modeling radiation damage Biological crystal exposed to X-rays undergoes radiation damage: Modeling of radiation damage is important understanding of the effect on the protein optimization of data collection Problem present state no comprehensive theory of RD specific effects are well-known, but it the main changes are non- specific Suggestion by Gleb Bourenkov: radiation dose has linear effect on atom’s B-factors Task check for linearity, find reason(s) of deviation

Radiation damage modeling: data (trypsin)

Radiation damage modeling: results r=0.999 RMSEP=9.4×10 -3

Conclusions Multivariate data analysis has a great potential for protein crystallography currently it is application is episodic rarely goes beyond PCA Method-centric approach would be beneficial: “I have a method, I am looking for problems”

X-files PCA, Factor Analysis Multivariate Regression MSPC, Design Of Experiment Curve Resolution Multivariate Image Analysis Target Factor Analysis PARAFAC, 3(multi)-way Wavelet Transform SIMCA, PLSD crystallization, HTPC crystal screening crystal auto-mounting data collection data reduction radiation damage phasing structure solution structure refinement

Challenge Critical re-assessment of the entire protein crystallographic workflow with multivariate approach in mind – an ambitious project for chemometricians?

Acknowledgements Alexander Popov Gleb Bourenkov Victor Lamzin