CS 177 Proteins I (Structure-function relationships) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory.

Slides:



Advertisements
Similar presentations
Amino Acids and Proteins B.2. there are about 20 amino acids that occur naturally they are the basic “building blocks” of life/proteins.
Advertisements

Proteins Review. Learning outcomes (e) Describe the structure of an amino acid and the formation and breakage of a peptide bond. (f) Explain the meaning.
Protein Structure and Function Review: Fibrous vs. Globular Proteins.
Pp 50 – 51 & Pp 15 & Proteins Proteins are polymers of amino acids Each has a unique 3D shape Amino acid sequences vary Proteins are.
Protein Structure C483 Spring 2013.
Protein Structure Prediction
Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
The amino acids in their natural habitat. Topics: Hydrogen bonds Secondary Structure Alpha helix Beta strands & beta sheets Turns Loop Tertiary & Quarternary.
Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific sequence 2.A protein’s function depends.
Protein Structure & Function
Structural bioinformatics
Mossbauer Spectroscopy in Biological Systems: Proceedings of a meeting held at Allerton House, Monticello, Illinois. Editors: J. T. P. DeBrunner and E.
Computing for Bioinformatics Lecture 8: protein folding.
(Foundation Block) Dr. Ahmed Mujamammi Dr. Sumbul Fatma
Proteins: Levels of Protein Structure Conformation of Peptide Group
Proteins Dr. Sumbul Fatma Clinical Chemistry Unit
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Protein Tertiary Structure Prediction
Chapter 12 Protein Structure Basics. 20 naturally occurring amino acids Free amino group (-NH2) Free carboxyl group (-COOH) Both groups linked to a central.
CS 177 Proteins part 1: Structure-function relationships Review of protein structures Need for analyses of protein structures Sources of protein structure.
© 2012 Pearson Education, Inc. Lectures by Kathleen Fitzpatrick Simon Fraser University Chapter 3 The Macromolecules of the Cell- Modified by Dr. Jordan-
Housekeeping Your performance on the exam has caused me to re-evaluate how homework will be handled I will now be picking up every problem assigned on.
Macromolecular structure
Supersecondary structures. Supersecondary structures motifs motifs or folds, are particularly stable arrangements of several elements of the secondary.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Introduction to Protein Structure
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
Chapter 4 The Three-Dimensional Structure of Proteins.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
A protein’s function depends on its specific conformation (shape) A functional proteins consists of one or more polypeptides that have been precisely twisted,
CS 177 Proteins, part 2 (Computational modeling) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory.
Protein Folding & Biospectroscopy F14PFB David Robinson Mark Searle Jon McMaster
Operone lac Principles of protein structure and function Function is derived from structure Structure is derived from amino acid sequence Different.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein Structure (Foundation Block) What are proteins? Four levels of structure (primary, secondary, tertiary, quaternary) Protein folding and stability.
Protein structure and function Part - I
Proteins are instrumental in about everything that an organism does. These functions include structural support, storage, transport of other substances,
THE STRUCTURE AND FUNCTION OF MACROMOLECULES Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific.
Structural proteomics
Biochemistry - as science; biomolecules; metabolic ways. Structure of proteins, methods of its determination.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Proteins Dr. Sumbul Fatma Clinical Chemistry Unit Department of Pathology Tel
Structural proteomics Handouts. Proteomics section from book already assigned.
PROTEIN FUNCTIONS. PROTEIN FUNCTIONS (continued)
Objective 7: TSWBAT recognize and give examples of four levels of protein conformation and relate them to denaturation.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
PROTEIN STRUCTURE (Donaldson, March 10,2003) What are we trying to learn about genes and their proteins: Predict function for unknown protein by comparison.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
PROTEINS Characteristics of Proteins Contain carbon, hydrogen, oxygen, nitrogen, and sulfur Serve as structural components of animals Serve as control.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section D: Proteins -
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
Polypeptide Chains Can Change Direction by Making Reverse Turns and Loops.
Structure and Function
Proteins Structures and Functions. What? A series of amino acids in a polypeptide chain Produced from the coding in the DNA of the nucleus Makes up.
CHM 708: MEDICINAL CHEMISTRY
Protein Structure BL
Protein Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form in a biologically functional.
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES
The Peptide Bond Amino acids are joined together in a condensation reaction that forms an amide known as a peptide bond.
The Peptide Bond Amino acids are joined together in a condensation reaction that forms an amide known as a peptide bond.
Protein Structure Prediction
CS 177 Proteins, part 2 (Computational modeling)
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES
The Three-Dimensional Structure of Proteins
Presentation transcript:

CS 177 Proteins I (Structure-function relationships) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory Review of protein structures Computational modeling Three-dimensional structural analysis in laboratory

Structure-function relationships A science primer: Molecular modeling Brown, S.M. (2000) Bioinformatics, Eaton Publishing, pp Veeramalai, M. & Gilbert, D.: Bioinformatics Tools for protein structure visualisation and analysis Mount, D.W. (2001) Bioinformatics, Cold Spring Harbor Lab Press, pp Recommended readings (very) basic advanced Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Proteins are chains of amino acids joined by peptide bonds Primary structure Review of protein structure Polypeptide chain The structure of two amid acids The N-C-C sequence is repeated throughout the protein, forming the backbone The bonds on each side of the C atom are free to rotate within spatial constrains, the angles of these bonds determine the conformation of the protein backbone The R side chains also play an important structural role Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Interactions that occur between the C=O and N-H groups on amino acids Much of the protein core comprises  helices and  sheets, folded into a three- dimensional configuration: - regular patterns of H bonds are formed between neighboring amino acids - the amino acids have similar angles - the formation of these structures neutralizes the polar groups on each amino acid - the secondary structures are tightly packed in a hydrophobic environment - Each R side group has a limited volume to occupy and a limited number of interactions with other R side groups Secondary structure: Review of protein structure  helix  sheet Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Other Secondary structure elements (no standardized classification) Secondary structure - loop - random coil - others (e.g helix, -hairpin, paperclip) Super-secondary structure - In addition to secondary structure elements that apply to all proteins (e.g. helix, sheet) there are some simple structural motifs in some proteins - These super-secondary structures (e.g. transmembrane domains, coiled coils, helix-turn-helix, signal peptides) can give important hints about protein function Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Review of protein structure Q: If we have all the Psi and Phi angles in a protein, do we then have enough information to describe the 3-D structure? Tertiary structure A: No, because the detailed packing of the amino acid side chains is not revealed from this information. However, the Psi and Phi angles do determine the entire secondary structure of a protein Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Tertiary structure The tertiary structure describes the organization in three dimensions of all the atoms in the polypeptide The tertiary structure is determined by a combination of different types of bonding: - Ionic interactions between oppositely charged residues can pull them together, - Hydrogen Bonds - Hydrogens are partially positively charged, are attracted to partially negative oxygens. (weaker) - van Der Waals - hydrophobic residues become attractive to each other when forced together by exclusion from the aqueous surroundings. (weakest) Many of these bonds are very week and easy to break, but hundreds or thousands working together give the protein structure great stability If a protein consists of only one polypeptide chain, this level then describes the complete structure Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Tertiary structure Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Tertiary structure Proteins can be divided into two general classes based on their tertiary structure: - Fibrous proteins have elongated structure with the polypeptide chains arranged in long strands. This class of proteins serves as major structural component of cells Examples: silk, keratin, collagen - Globular proteins have more compact, often irregular structures. This class of proteins includes most enzymes and most proteins involved in gene expression and regulation Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Quaternary structure The quaternary structure defines the conformation assumed by a multimeric protein. The individual polypeptide chains that make up a multimeric protein are often referred to as protein subunits. Subunits are joined by ionic, H and hydrophobic interactions Example: Haemoglobin (4 subunits) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Summary protein structure Secondary structure: Primary structure: Tertiary structure: Quaternary structure: Sequence of amino acids Interactions that occur between the C=O and N-H groups on amino acids Organization in three dimensions of all the atoms in the polypeptide Conformation assumed by a multimeric protein The four levels of protein structure are hierarchical: each level of the build process is dependent upon the one below it Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Summary protein structure Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Structure displays Common displays are (among others) cartoon, spacefill, and backbone cartoon spacefill backbone Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Need for analyses of protein structures A protein performs metabolic, structural, or regulatory functions in a cell. Cellular biochemistry works based on interactions between 3-D molecular structures The 3-D structure of a protein determines its function Therefore, the relationship of sequence to function is primarily concerned with understanding the 3-D folding of proteins and inferring protein functions from these 3-D structures (e.g. binding sites, catalytic activities, interactions with other molecules) The study of protein structure is not only of fundamental scientific interest in terms of understanding biochemical processes, but also produces very valuable practical benefits Medicine The understanding of enzyme function allows the design of new and improved drugs Agriculture Therapeutic proteins and drugs for veterinary purposes and for treatment of plant diseases Industry Protein engineering has potential for the synthesis of enzymes to carry out various industrial processes on a mass scale Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Sources of protein structure information 3-D macromolecular structures stored in databases The most important database: the Protein Data Bank (PDB) The PDB is maintained by the Research Collaboratory for Structural Bioinformatics (RCSB) and can be accessed at three different sites (plus a number of mirror sites outside the USA): - (Rutgers University) - (San Diego Supercomputer Center) - (National Institute for Standards and Technology) It is the very first “bioinformatics” database ever build Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Sources of protein structure information In practice, most biomolecular structures (>99% of structures in PDB) are determined using three techniques: - X-ray crystallography (low to very high resolution) Problem: requires crystals; difficult to crystallize proteins by maintaining their native conformation; not all protein can be crystallized; Experimental structure determination Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

X-ray crystallography Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Sources of protein structure information In practice, most biomolecular structures (>99% of structures in PDB) are determined using three techniques: - X-ray crystallography (low to very high resolution) Problem: requires crystals; difficult to crystallize proteins by maintaining their native conformation; not all protein can be crystallized; - Nuclear magnetic resonance (NMR) spectroscopy of proteins in solution (medium to high resolution) Problem: Works only with small and medium size proteins (~50% of proteins cannot be studied with this method); requires high solubility - Electron microscopy and crystallography (low to medium resolution) Problem: (still) relatively low resolution Experimental structure determination Experimental methods are still very time consuming and expensive; in most cases the experimental data will contain errors and/or are incomplete. Thus the initial model needs to be refined and rebuild Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Sources of protein structure information Researches have been working for decades to develop procedures for predicting protein structure that are not so time consuming and not hindered by size and solubility constrains. As protein sequences are encoded in DNA, in principle, it should therefore be possible to translate a gene sequence into an amino acid sequence, and to predict the three-dimensional structure of the resulting chain from this amino acid sequence Computational Modeling Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling How to predict the protein structure? Ab initio prediction of protein structure from sequence: not yet. Problem: the information contained in protein structures lies essentially in the conformational torsion angles. Even if we only assume that every amino-acid residue has three such torsion angles, and that each of these three can only assume one of three "ideal" values (e.g., 60, 180 and -60 degrees), this still leaves us with 27 possible conformations per residue. For a typical 200-amino acid protein, this would give (roughly 1.87 x ) possible conformations! If we were able to evaluate 10 9 conformations per second, this would still keep us busy 4 x times the current age of the universe There are optimized ab initio prediction algorithms available as well as fold recognition algorithms that use threading (compares protein folds with know fold structures from databases), but the results are still very poor Q: Can’t we just generate all these conformations, calculate their energy and see which conformation has the lowest energy? Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Solution: homology modeling Homology (comparative) modeling attempts to predict structure on the strength of a protein’s sequence similarity to another protein of known structure Basic idea: a significant alignment of the query sequence with a target sequence from PDB is evidence that the query sequence has a similar 3-D structure (current threshold ~ 40% sequence identity). Then multiple sequence alignment and pattern analysis can be used to predict the structure of the protein Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Flow chart for protein structure prediction (from Mount, 2001) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Protein sequence - partial or full sequences; predicted through gene finding Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Database similarity search - sequence is used as a query in a database similarity search against proteins in PDB Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Does the sequence align with a protein of known structure? - Yes: if the database similarity search reveals a significant alignment between the query sequence and a PDB target sequence, the alignment can be used to position the amino acids of the query sequence in the same approximate 3-D structure - No: proceed to protein family analysis Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Protein family analysis/relationship to known structure - Family (structural context): structures that have a significant level of structural similarity but not necessarily significant sequence similarity - the goal is to exploit these structure sequence relationships; two questions: 1) is the new protein a member of a family, 2) does the family have a predicted structural fold? - analyze sequence for family specific profiles and patterns. Available databases: 3D-Ali, 3D-PSSM, BLOCKS, eMOTIF, INTERPRO, Pfam …) - if the family analysis reveals that the query protein is a member of a family with a predicted structural fold, multiple alignment can be used for structural modeling Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Protein family analysis/relationship to known structure - if the family analysis is unsuccessful, proceed to structural analyses Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Structural analysis - several different types of analyses to infer structural information - presence of small amino acid motifs in a protein can be indicator of a biochemical function associated with a particular structure. Motifs are available from the Prosite catalog - spacing and arrangement of amino acids (e.g. hydrophobic amino acids) provide important structural clues that can be used for modeling - certain amino acid combinations can occur in certain types of secondary structure - These structural analyses can provide clues as to the presence of active sites and regions of secondary structure. These information can help to identify a new protein as a member of a known structural class Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling 3-D structural analysis in lab - proteins that fail to show any relationship to proteins of known structure are candidates for structural analyses (X-ray crystallography, NMR). There are about 600 known fold families and new structures are frequently found to have already known structural fold. Accordingly, protein families with no relatives of known structure may represent a novel fold Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling: summary Partial or full sequences predicted through gene finding Similarity search against proteins in PDB Alignment can be used to position the amino acids of the query sequence in the same approximate 3-D structure Find structures that have a significant level of structural similarity (but not necessarily significant sequence similarity) If member of a family with a predicted structural fold, multiple alignment can be used for structural modeling Infer structural information (e.g. presence of small amino acid motifs; spacing and arrangement of amino acids; certain typical amino acid combinations associated with certain types of secondary structure) can provide clues as to the presence of active sites and regions of secondary structure Structural analyses in the lab (X-ray crystallography, NMR) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling: summary How to predict the protein structure? Ab initio prediction of protein structure from sequence Homology (comparative) modeling attempts to predict structure on the strength of a protein’s sequence similarity to another protein of known structure Experimental structure determination Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory

Computational modeling Viewing protein structures A number of molecular viewers are freely available and run on most computer platforms and operating systems Examples: Cn3D 4.1 (stand-alone) Rasmol (stand-alone) Chime (Web browser based on Rasmol) Swiss 3D viewer Spdbv (stand-alone) All these viewers can use the PDB identification code or the structural file from PDB Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory