CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Structure Thomas Blicher, Center for Biological Sequence Analysis.

Slides:



Advertisements
Similar presentations
The Chemical Nature of Enzyme Catalysis
Advertisements

Protein Structure C483 Spring 2013.
Review of Basic Principles of Chemistry, Amino Acids and Proteins Brian Kuhlman: The material presented here is available on the.
©CMBI 2001 The amino acids in their natural habitat.
Proteins Function and Structure.
Review: Amino Acid Side Chains Aliphatic- Ala, Val, Leu, Ile, Gly Polar- Ser, Thr, Cys, Met, [Tyr, Trp] Acidic (and conjugate amide)- Asp, Asn, Glu, Gln.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Programme Introduction to protein structure Break Homology.
Amino Acids and Proteins 1.What is an amino acid / protein 2.Where are they found 3.Properties of the amino acids 4.How are proteins synthesized 1.Transcription.
Protein structure Anne Mølgaard, Center for Biological Sequence Analysis.
Protein Structure, Databases and Structural Alignment
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Sequence analysis June 18, 2008 Learning objectives-Understand the concept of sliding window programs. Understand difference between identity, similarity.
Introduction to Structural Bioinformatics Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University of Missouri-Columbia.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Thomas Blicher, Center for Biological Sequence Analysis Details of Protein Structure.
Protein Secondary Structure : Kendrew Solves the Structure of Myoglobin “Perhaps the most remarkable features of the molecule are its complexity.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Structure Thomas Blicher, Center for Biological Sequence Analysis.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein structure Anne Mølgaard, Center for Biological Sequence Analysis.
Thomas Blicher Center for Biological Sequence Analysis
ProteinStructuralDatabases. Proteins are built from amino-acids. Introduction H | NH2-c-CO2H | R.
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
Proteins, Pair HMMs, and Alignment. CS262 Lecture 8, Win06, Batzoglou A state model for alignment -AGGCTATCACCTGACCTCCAGGCCGA--TGCCC--- TAG-CTATCAC--GACCGC-GGTCGATTTGCCCGACC.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Protein structure Classification Ole Lund, Associate professor, CBS, DTU.
Protein Structure Analysis - I
The relative orientation observed for  helices packed on ß sheets.
Protein Structure Elements Primary to Quaternary Structure.
Protein Structure FDSC400. Protein Functions Biological?Food?
Proteins and Enzymes Nestor T. Hilvano, M.D., M.P.H. (Images Copyright Discover Biology, 5 th ed., Singh-Cundy and Cain, Textbook, 2012.)
Proteins are polymers of amino acids.
Principles of Protein Structure
Protein Structural Prediction. Protein Structure is Hierarchical.
Chapter 12 Protein Structure Basics. 20 naturally occurring amino acids Free amino group (-NH2) Free carboxyl group (-COOH) Both groups linked to a central.
The structural organization within proteins Kevin Slep June 13 th, 2012.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Lecture 1: Fundamentals of Protein Structure
Amino Acids & Side Groups Polar Charged ◦ ACIDIC negatively charged amino acids  ASP & GLU R group with a 2nd COOH that ionizes* above pH 7.02nd COOH.
Secondary structure prediction
1 10/26/2015 MOLECULES. 2 10/26/2015 H 2 N-CH-C-OH O R Monomer E.g. protein Monomer vs polymer amino acid monomer R is a side group.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
©CMBI 2001 Step 5: The amino acids in their natural habitat.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
Proteins.
Proteins Structure of proteins Proteins are made of C, H, O and nitrogen and may have sulfur. The monomers of proteins are amino acids An amino acid.
Chapter 3 Proteins.
X-ray detection xray/facilities.html.
Hyperthermophile subtilases
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein Classification
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
Marlou Snelleman 2012 Protein structure. Overview Sequence to structure Hydrogen bonds Helices Sheets Turns Hydrophobicity Helices Sheets Structure and.
Doug Raiford Lesson 14.  Reminder  Involved in virtually every chemical reaction ▪ Enzymes catalyze reactions  Structure ▪ muscle, keratins (skin,
Fibrous Proteins Examples 1. a-keratins 2. Silk Fibroin 3. Collagen
Introduction to Protein Structure
Protein Structure and Properties
Protein Structure September 7,
Protein Structure FDSC400. Protein Functions Biological?Food?
Haixu Tang School of Inforamtics
Amino Acids Amine group -NH2 Carboxylic group -COOH
Proteins Genetic information in DNA codes specifically for the production of proteins Cells have thousands of different proteins, each with a specific.
Levels of Protein Structure
“When you understand the amino acids,
Structures of osmosensory transporters.
Presentation transcript:

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Structure Thomas Blicher, Center for Biological Sequence Analysis

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU “Could the search for ultimate truth really have revealed so hideous and visceral-looking an object?” Max Perutz, 1964, on protein structure John Kendrew, 1959, with myoglobin model Once Upon a Time…

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Holdings of the Protein Data Bank (PDB): The PDB also contains nucleotide and nucleotide analogue structures. PDB Sep May 2006 Oct X-ray NMR Other Total

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU  X-ray crystallography  Nuclear Magnetic Resonance (NMR)  Modelling techniques  More exotic techniques  Cryo electron microscopy (Cryo EM)  Small angle X-ray scattering (SAXS)  Neutron scattering Methods for Structure Determination

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU X-ray Crystallography  No size limitation.  Protein molecules are ”stuck” in a crystal lattice.  Some proteins seem to be uncrystallizable.  Slow.  Especially suited for studying structural details.

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU X-rays Fourier transform

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU NMR Spectroscopy  Upper limit for structure determination currently ~50 kDa.  Protein molecules are in solution.  Dynamics, protein folding.  Slow.  Especially suited for studies of protein dynamics of small to medium size proteins.

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU NMR Spectroscopy

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Modelling  Need structure of a >30% id homolog.  Only applicable to ~50% of sequences.  Fast.   Accuracy poor for low sequence id.  There is still need for experimental structure determination!

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Amino Acids  Amino group and acid group  Side chain at C   Chiral, only one enantiomer found in proteins (L-amino acids) N O C Ca

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Amino Acids Livingstone & Barton, CABIOS, 9, , 1993 A – Ala C – Cys D – Asp E – Glu F – Phe G – Gly H – His I – Ile K – Lys L – Leu M – Met N – Asn P – Pro Q – Gln R – Arg S – Ser T – Thr V – Val W – Trp Y - Tyr

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Levels of Protein Structure  Primary structure = Sequence  Secondary Structure = Helix, sheets/strands, loops & turns  Tertiary structure = Arrangement of Secondary structure elements  Structural Motif = Small, recurrent arrangement of secondary structure, e.g.  Helix-loop-helix  Beta hairpins  EF hand (calcium binding motif)  Etc.

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU  Protein sequence: MKTAALAPLFFLPSALATTVYLAGDSTMAK NGGGSGTNGWGEYLASYLSATVVNDAVA GRSAR…(etc) Primary Structure 2 × + H 2 O

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Ramachandran Plot  Allowed backbone torsion angles in proteins N H

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Hydrophobic Core  Hydrophobic side chains go into the core of the molecule – but the main chain is highly polar.  The polar groups (C=O and NH) are neutralized through formation of H-bonds.

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU  Two main types:  Helices (mainly  -helix)  Sheets (consisting of individual strands) Secondary Structure  -helix  -sheet

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Secondary Structure  Other types of secondary structure:  3 10 helices (C=O (n) … HN (n+3) )   -helices (C=O (n) … HN (n+5) )   -turns and loops (in old textbooks sometimes referred to as random coil)

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Characteristics of Helices  Aligned peptide units  Dipolar moment  Ion/ligand binding  Secondary and quaternary structure packing/arrangement  Capping residues  The  helix (i→i+4)  Other helix types! (3 10,  ) N C

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Two Types of  -Sheet

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Two Types of  -Sheet  Anti-parallel  Parallel

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Rhamnogalacturonan lyase (1nkg) Rhamnogalacturonan acetylesterase (1k7c) Tertiary Structure  Domains and modules

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU B. caldolyticus UPRTase (1i5e) B. subtilis PRPP synthase (1dkr) Quaternary Structure

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Classification Schemes  SCOP  Manual classification (A. Murzin)  CATH  Semi manual classification (C. Orengo)  FSSP  Automatic classification (L. Holm)

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Levels in SCOP Class# Folds# Superfamilies # Families All alpha proteins All beta proteins Alpha and beta proteins (a/b) Alpha and beta proteins (a+b) Multi-domain proteins Membrane and cell surface proteins Small proteins Total

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Major Classes in SCOP  Classes  All alpha proteins  Alpha and beta proteins (  /  )  Alpha and beta proteins (  +  )  Multi-domain proteins  Membrane and cell surface proteins  Small proteins

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU All   Hemoglobin (1BAB)

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU All   Immunoglobulin (8FAB)

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU   Triose phosphate isomerase (1HTI)

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU   Lysozyme (1JSF)

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Folds*  Proteins which have >~50% of their secondary structure elements arranged the in the same order in the protein chain and in three dimensions are classified as having the same fold.  No evolutionary relation between proteins. *confusingly also called fold classes.

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Superfamilies  Proteins with (remote) evolutionary relations  Sequence similarity low  Share function  Share special structural features  Relationships between members of a superfamily may not be readily recognizable from the sequence alone.

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Families  Proteins whose evolutionarily relationship is readily recognizable from the sequence (>~25% sequence identity).  Families are further subdivided into Proteins.  Proteins are divided into Species  The same protein may be found in several species.

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Links  PDB (protein structure database)   SCOP (protein classification database)  scop.berkeley.edu scop.berkeley.edu  CATH (protein classification database)   FSSP (protein classification database) 

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU  They provide a detailed picture of interesting biological features, such as active site, substrate specificity, allosteric regulation etc.  They aid in rational drug design and protein engineering.  They can elucidate evolutionary relationships undetectable by sequence comparisons. Why are Protein Structures so Interesting?

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU COOH NH 2 Asp His Ser Topological switchpoint Inferring biological features from the structure 1DEO Structure to Function

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Active site Triose phosephate isomerase (1AG1) (Verlinde et al. (1991) Eur.J.Biochem. 198, 53) Structure to Function  Inferring biological features from the structure

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Im, Ryu & Yu (2004) Engineering thermostability in serine protease inhibitors PEDS, 17, Engineering Thermostability Example: Serpin (serine protease inhibitor)  Overpacking  Buried polar groups  Cavities

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU  In evolution structure is conserved longer than both function and sequence. Structure > Function > Sequence Structure & Evolution

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Rhamnogalacturonan acetylesterase (A. aculeatus) (1k7c) Platelet activating factor acetylhydrolase (B. Taurus) (1WAB) Serine esterase (S. scabies) (1ESC) Structure & Evolution

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Platelet activating factor acetylhydrolase Serine esterase Rhamnogalacturonan acetylesterase Mølgaard, Kauppinen & Larsen (2000) Structure, 8, Structure & Evolution

CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Quote of the Day  Don C. Wiley ( )  Has solved the structures of many immunologically important proteins:  Class I and II Major Histocompatibility Complex proteins.  Has studied viruses including HIV, Herpes and Influenza virus and many more.  Special focus on molecules involved in viral entry. “I’m sorry, but I just don’t understand anything in biology unless I know what it looks like.”