1 11/07/05 D Dobbs ISU - BCB 444/544X: Protein Structure: Classification, Databases, Visualization 1 11/7/05 Protein Structure: Classification, Databases, Visualization

2 Announcements
  BCB 544 Projects - Important Dates:
  Nov 2 Wed noon - Project proposals due to David/Drena
  Nov 4 Fri PM - Approvals/responses & tentative presentation schedule to students
  Dec 2 Fri noon - Written project reports due
  Dec 5,7,8,9 class/lab - Oral Presentations (20')
  (Dec 15 Thurs = Final Exam)

3 Bioinformatics Seminars
  Nov 7 Mon 12:10 - IG Faculty Seminar in 101 Ind Ed II: Inborn Errors of Metabolism in Humans & Animal Models, Matt Ellinwood, Animal Science, ISU
  Nov 10 Thurs 3:40 - Com S Seminar in 223 Atanasoff: Computational Epidemiology, Armin R. Mikler, Univ. North Texas
  http://www.cs.iastate.edu/~colloq/#t3

4 Bioinformatics Seminars
  CORRECTION: Next week - Baker Center/BCB Seminars (seminar abstracts available at above link)
  Nov 14 Mon 1:10 PM - Doug Brutlag, Stanford: Discovering transcription factor binding sites
  Nov 15 Tues 1:10 PM - Ilya Vakser, Univ Kansas: Modeling protein-protein interactions
  Both seminars will be in Howe Hall Auditorium

5 Protein Structure & Function: Analysis & Prediction
  Mon - Protein structure: classification, databases, visualization
  Wed - Protein structure: prediction & modeling
  Thurs Lab - Protein structure prediction
  Fri - Protein-nucleic acid interactions; protein-ligand docking

6 Reading Assignment (for Mon-Fri)
  Mount Bioinformatics Chp 10: Protein classification & structure prediction, pp. 409-491
  http://www.bioinformaticsonline.org/ch/ch10/index.html
  Ck Errata: http://www.bioinformaticsonline.org/help/errata2.html
  Other? Additional reading assignments for BCB 544

7 Review last lecture: RNA Structure Prediction Algorithms

8 RNA structure prediction strategies
  Secondary structure prediction
  1) Energy minimization (thermodynamics)
  2) Comparative sequence analysis (co-variation)
  3) Combined experimental & computational

9 1) Energy minimization method
  What are the assumptions?
  Native tertiary structure or "fold" of an RNA molecule is (one of) its lowest free energy configuration(s)
  Gibbs free energy = ΔG in kcal/mol at 37°C = equilibrium stability of structure; lower (more negative) values are more favorable
  Is this assumption valid in vivo? This may not hold, but we don't really know

10 Gibbs free energy: ΔG
  Gibbs free energy (G) is formally defined in terms of the state functions enthalpy & entropy and the state variable temperature
  G = H - TS
  ΔG = ΔH - TΔS (for constant temp)
  Enthalpy (H) = amount of heat absorbed by a system at constant pressure
  Entropy (S) = measure of the amount of disorder or randomness in a system
  Note: this is not the same as "entropy" in information theory, but is related, see: http://en.wikipedia.org/wiki/Information_theory

11 Gibbs free energy: ΔG
  Gibbs free energy for formation of an RNA or protein structure = ΔG° = equilibrium stability of that structure at a specific temperature (kcal/mol at 37°C)
  ΔG° = -RT ln K_eq
  R = gas constant
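
To make the relation between ΔG° and K_eq concrete, here is a minimal Python sketch. The function names are hypothetical, and the value of R in kcal/(mol·K) is a standard constant that the slide does not give.

```python
import math

# Gas constant in kcal/(mol*K); standard value, not stated on the slide.
R_KCAL_PER_MOL_K = 1.987e-3


def keq_from_delta_g(delta_g_kcal_per_mol, temp_celsius=37.0):
    """Equilibrium constant implied by dG = -RT ln K_eq."""
    temp_kelvin = temp_celsius + 273.15
    return math.exp(-delta_g_kcal_per_mol / (R_KCAL_PER_MOL_K * temp_kelvin))


def delta_g_from_keq(keq, temp_celsius=37.0):
    """Standard free energy change (kcal/mol) for a given K_eq."""
    temp_kelvin = temp_celsius + 273.15
    return -R_KCAL_PER_MOL_K * temp_kelvin * math.log(keq)
```

Under these assumptions, a ΔG° of -1 kcal/mol at 37°C corresponds to a K_eq of roughly 5, so each additional kcal/mol of stability shifts the equilibrium about fivefold toward the folded structure.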

12 Nearest-neighbor parameters
  Most methods for free energy minimization use nearest-neighbor parameters (derived from experiment) to predict the stability of an RNA secondary structure (in terms of ΔG at 37°C)
  Most available software packages use the same set of parameters: Mathews, Sabina, Zuker & Turner, 1999

13 Energy minimization - calculations
  Total free energy of a specific conformation for a specific RNA molecule = sum of incremental energy terms for:
    helical stacking (sequence dependent)
    loop initiation
    unpaired stacking
  (favorable "increments" are < 0)
  Fig 6.3 Baxevanis & Ouellette 2005
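
The additive bookkeeping described above can be sketched as follows. The increment values are made up for illustration only; they are NOT the published nearest-neighbor parameters, and the names are hypothetical.

```python
# Illustrative increments in kcal/mol for one hypothetical hairpin.
# These numbers are placeholders, not real nearest-neighbor parameters.
HAIRPIN_INCREMENTS = [
    ("helical stacking, pair 1 on pair 2", -2.1),
    ("helical stacking, pair 2 on pair 3", -3.3),
    ("unpaired stacking", -0.8),
    ("loop initiation", +4.5),
]


def total_free_energy(increments):
    """Sum the incremental energy terms for one conformation."""
    return sum(delta_g for _label, delta_g in increments)


def is_favorable(increments):
    """A conformation is favorable when its total dG is negative."""
    return total_free_energy(increments) < 0
```

The point of the sketch is only that stacking terms are negative (stabilizing) while loop initiation is positive (destabilizing), so a helix must be long enough to pay for the loop that closes it.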

14 But how many possible conformations for a single RNA molecule?
  Huge number: Zuker estimates 1.8^N possible secondary structures for a sequence of N nucleotides
  For 100 nts (a small RNA) = 3 × 10^25 structures!
  Solution? Not exhaustive enumeration → dynamic programming
  O(N^3) in time and O(N^2) in space/storage iff pseudoknots are excluded; otherwise O(N^6) in time and O(N^4) in space
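
As a rough illustration of why dynamic programming helps, the sketch below checks the 1.8^N estimate and then fills an N × N table in O(N^3) time. Note the substitution: it uses the simpler Nussinov-style recursion, which maximizes the number of base pairs rather than minimizing free energy as Mfold does. Pseudoknots are excluded, and the minimum loop size of 3 and the allowed pairs (including G-U) are assumptions.

```python
def estimated_structure_count(n, base=1.8):
    """Estimate of the number of secondary structures for n nucleotides."""
    return base ** n


def max_base_pairs(seq, min_loop=3):
    """Nussinov-style DP: maximum number of nested base pairs in seq."""
    allowed = {("A", "U"), ("U", "A"), ("G", "C"),
               ("C", "G"), ("G", "U"), ("U", "G")}
    n = len(seq)
    if n == 0:
        return 0
    # dp[i][j] = best pair count for the subsequence seq[i..j]; O(N^2) space.
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: position j is unpaired
            # case 2: j pairs with some k, leaving >= min_loop bases between
            for k in range(i, j - min_loop):
                if (seq[k], seq[j]) in allowed:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]
```

The three nested loops over span, i, and k give the O(N^3) running time, and each subproblem is solved once and reused, which is what avoids enumerating the roughly 3 × 10^25 candidate structures of a 100-nt RNA.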

15 Algorithms based on energy minimization
  For an outline of the algorithm used in Mfold, including a description of the dynamic programming recursion, please visit Michael Zuker's lecture: http://www.bioinfo.rpi.edu/~zukerm/lectures/RNAfold-html
  From this site, you may also download Zuker's lecture as either a PDF or PS file.

16 2) Comparative sequence analysis (co-variation)
  Two basic approaches:
  Algorithms constrained by initial alignment
    Much faster, but not as robust as unconstrained
    Base-pairing probabilities determined by a partition function
  Algorithms not constrained by initial alignment
    Genetic algorithms often used for finding an alignment & set of structures
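
One common way to quantify co-variation between two alignment columns is mutual information. The sketch below is an illustration of that idea, not the specific method of any program named in this lecture. It assumes a gap-free alignment given as a list of equal-length strings, and the function name is hypothetical.

```python
import math
from collections import Counter


def column_mutual_information(alignment, i, j):
    """Mutual information (bits) between alignment columns i and j."""
    n = len(alignment)
    freq_i = Counter(seq[i] for seq in alignment)
    freq_j = Counter(seq[j] for seq in alignment)
    freq_ij = Counter((seq[i], seq[j]) for seq in alignment)
    mi = 0.0
    for (a, b), count in freq_ij.items():
        p_ab = count / n
        p_a = freq_i[a] / n
        p_b = freq_j[b] / n
        mi += p_ab * math.log2(p_ab / (p_a * p_b))
    return mi
```

Columns that change together while preserving base pairing (G-C in one sequence, A-U in another) score high, whereas a fully conserved column scores zero against every other column because it carries no variation to correlate.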

17 RNA structure prediction strategies
  Tertiary structure prediction
  Requires "craft" & significant user input & insight
  1) Extensive comparative sequence analysis to predict tertiary contacts (co-variation), e.g., MANIP - Westhof
  2) Use experimental data to constrain model building, e.g., MC-SYM - Major
  3) Homology modeling using sequence alignment & reference tertiary structure (not many of these!)
  4) Low resolution molecular mechanics, e.g., yammp - Harvey

18 New Last Time: Protein Structure & Function

19 Protein Structure & Function
  Protein structure - primarily determined by sequence
  Protein function - primarily determined by structure
  Globular proteins: compact hydrophobic core & hydrophilic surface
  Membrane proteins: special hydrophobic surfaces
  Folded proteins are only marginally stable
  Some proteins do not assume a stable "fold" until they bind to something = intrinsically disordered
  → Predicting protein structure and function can be very hard -- & fun!

20 4 Basic Levels of Protein Structure

21 Primary & Secondary Structure
  Primary
    Linear sequence of amino acids
    Description of covalent bonds linking aa's
  Secondary
    Local spatial arrangement of amino acids
    Description of short-range non-covalent interactions
    Periodic structural patterns: α-helix, β-sheet

22 Tertiary & Quaternary Structure
  Tertiary
    Overall 3-D "fold" of a single polypeptide chain
    Spatial arrangement of 2° structural elements; packing of these into compact "domains"
    Description of long-range non-covalent interactions (plus disulfide bonds)
  Quaternary
    In proteins with > 1 polypeptide chain, spatial arrangement of subunits

23 "Additional" Structural Levels
  Super-secondary elements
  Motifs
  Domains
  Foldons

24 New Today: Protein Structure & Function
  Amino acid characteristics
  Structural classes & motifs
  Protein functions & functional families (not much - more on this later)
  Classification
  Databases
  Visualization

25 Amino Acids
  Each of the 20 different amino acids has a different "R-group" (side chain) attached to Cα

26 Peptide bond is rigid and planar

27 Hydrophobic Amino Acids

28 Charged Amino Acids

29 Polar Amino Acids

30 Certain side-chain configurations are energetically favored (rotamers)
  Ramachandran plot: "allowable" psi & phi angles
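
The phi and psi angles plotted on a Ramachandran plot are backbone dihedral angles: phi is defined by the atoms C(i-1), N(i), Cα(i), C(i) and psi by N(i), Cα(i), C(i), N(i+1). A minimal sketch of computing a signed dihedral from four 3-D points follows. The function name is hypothetical, and coordinates are assumed to be (x, y, z) tuples.

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral_degrees(p0, p1, p2, p3):
    """Signed dihedral angle (degrees) about the p1-p2 bond."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    norm = math.sqrt(_dot(b1, b1))
    b1 = tuple(x / norm for x in b1)
    # Project b0 and b2 onto the plane perpendicular to the central bond.
    v = _sub(b0, tuple(_dot(b0, b1) * x for x in b1))
    w = _sub(b2, tuple(_dot(b2, b1) * x for x in b1))
    x = _dot(v, w)
    y = _dot(_cross(b1, v), w)
    return math.degrees(math.atan2(y, x))
```

Computing this pair of angles for every residue of a structure and plotting psi against phi gives the scatter that a Ramachandran plot summarizes; sterically disallowed combinations show up as empty regions.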

31 Glycine is smallest amino acid
  R group = H atom
  Glycine residues increase backbone flexibility because their side chain is only an H atom

32 Proline is cyclic
  Proline residues reduce flexibility of polypeptide chain
  Proline cis-trans isomerization is often a rate-limiting step in protein folding
  Recent work suggests it may also regulate ligand binding in native proteins - Andreotti

33 Cysteines can form disulfide bonds
  Disulfide bonds (covalent) stabilize 3-D structures
  In eukaryotes, disulfide bonds are found only in secreted proteins or extracellular domains

34 Globular proteins have a compact hydrophobic core
  Packing of hydrophobic side chains into interior is main driving force for folding
  Problem? Polypeptide backbone is highly polar (hydrophilic) due to polar -NH and C=O in each peptide unit; these polar groups must be neutralized
  Solution? Form regular secondary structures, e.g., α-helix, β-sheet, stabilized by H-bonds

35 Exterior surface of globular proteins is generally hydrophilic
  Hydrophobic core formed by packed secondary structural elements provides compact, stable core
  "Functional groups" of protein are attached to this framework; exterior has more flexible regions (loops) and polar/charged residues
  Hydrophobic "patches" on protein surface are often involved in protein-protein interactions

36 Protein Secondary Structures
  α Helix
  β Sheets
  Loops
  Coils

37 α-Helix
  Most abundant 2° structure in proteins
  Average length = 10 aa's (~15 Angstroms)
  Length varies from 5-40 aa's
  Alignment of H-bonds creates dipole moment (positive charge at NH end)
  Often at surface of core, with hydrophobic residues on inner-facing side, hydrophilic on other side

38 α helix is stabilized by H-bonds between ~ every 4th residue
  C = black, O = red, N = blue

39 R-groups are on outside of α helix

40 Types of α helices
  "Standard" α helix: 3.6 residues per turn
  H-bonds between C=O of residue n and NH of residue n + 4
  Helix ends are polar; almost always on surface of protein
  Other types of helices?
    n + 5 = π helix
    n + 3 = 3₁₀ helix
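
The 3.6 residues per turn figure implies a rotation of 100° per residue, which is the basis of the "helical wheel" view of which side chains share a face of the helix. A minimal sketch follows. The function names are hypothetical, the 1.5 Å rise per residue is a standard textbook value that is not given on the slides, and the 60° tolerance for "same face" is an arbitrary illustrative choice.

```python
RESIDUES_PER_TURN = 3.6
DEGREES_PER_RESIDUE = 360.0 / RESIDUES_PER_TURN  # about 100 degrees
RISE_PER_RESIDUE_ANGSTROM = 1.5  # assumption: textbook value, not from the slides


def wheel_angle(index):
    """Angular position (degrees) of residue `index` on a helical wheel."""
    return (index * DEGREES_PER_RESIDUE) % 360.0


def helix_length_angstrom(n_residues):
    """Approximate length of an ideal alpha helix along its axis."""
    return n_residues * RISE_PER_RESIDUE_ANGSTROM


def same_face(i, j, tolerance_deg=60.0):
    """True if residues i and j point to roughly the same side of the helix."""
    diff = abs(wheel_angle(i) - wheel_angle(j))
    diff = min(diff, 360.0 - diff)
    return diff <= tolerance_deg
```

With these assumptions, residues i, i+4, and i+7 fall on the same face, which is why hydrophobic residues spaced this way can form the inner-facing side of a surface helix while the residues in between face the solvent.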

41 Certain amino acids are "preferred" & others are rare in α helices
  Ala, Glu, Leu, Met = good helix formers
  Pro, Gly, Tyr, Ser = very poor
  Amino acid composition & distribution varies, depending on location of helix in 3-D structure

42 β-Strands & Sheets
  H-bonds formed between 5-10 consecutive residues in one portion of chain with another set of 5-10 residues farther down chain
  Interacting regions may be adjacent (with short loop between) or far apart
  β-sheets usually have all strands either parallel or antiparallel

43 Antiparallel β-sheet

44 Antiparallel β-sheet

45 Parallel β-sheet

46 Mixed β-sheets also occur

47 Loops
  Connect helices and sheets
  Vary in length and 3-D configurations
  Are located on surface of structure
  Are more "tolerant" of mutations
  Are more flexible and can adopt multiple conformations
  Tend to have charged and polar amino acids
  Are frequently components of active sites
  Some fall into distinct structural families (e.g., hairpin loops, reverse turns)

48 Coils
  Regions of 2° structure that are not helices, sheets, or recognizable turns
  Intrinsically disordered regions appear to play important functional roles

49 Globular proteins are built from recurring structural patterns
  Motifs or supersecondary structures = combinations of 2° structural elements
  Domains = combinations of motifs
    Independently folding unit (foldon)
    Functional unit

50 A few common structural motifs
  Helix-turn-helix: e.g., DNA binding
  Helix-loop-helix: e.g., calcium binding
  β-hairpin: 2 adjacent antiparallel strands connected by short loop
  Greek key: 4 adjacent antiparallel strands
  β-α-β: 2 parallel strands connected by helix

51 H-T-H; H-L-H

52 β-hairpin

53 Greek key

54 Beta-alpha-beta

55 Simple motifs combine to form domains

56 Large polypeptide chains fold into several domains

57 6 main classes of protein structure
  1) α Domains: Bundles of helices connected by loops
  2) β Domains: Mainly antiparallel sheets, usually with 2 sheets forming sandwich
  3) α/β Domains: Mainly parallel sheets with intervening helices, also mixed sheets
  4) α+β Domains: Mainly segregated helices and sheets
  5) Multidomain (α & β): Containing domains from more than one class
  6) Membrane & cell-surface proteins

58 α-domain structures: coiled-coils

59 α-domain structures: 4-helix bundles

60 All-α proteins: Globins

61 β-domain structures
  Anti-parallel β structures
  Functionally most diverse
  Includes:
    Up-and-down sheets or barrels
    Propeller-like structures
    Jelly roll barrels (from Greek key motifs)

62 Up-and-down sheets and barrel

63 Up-and-down sheets can form propeller-like structures

64 Greek key motifs can form jelly roll barrels

65 α/β-domain structures
  3 main classes
  TIM barrel = core of twisted parallel strands close together
  Rossmann fold = open twisted sheet surrounded by helices on both sides
  Leucine-rich motif = specific pattern of Leu residues; strands form a curved sheet with helices on outside

66 TIM barrel; Rossmann fold

67 Leucine-rich motifs can form α/β horseshoes

68 Protein structure databases, structural classification & visualization
  PDB = Protein Data Bank
    http://www.rcsb.org/pdb/ (RCSB) - several different structure viewers
  MMDB = Molecular Modeling Database
    http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Structure (NCBI Entrez) - Cn3D viewer
  SCOP = Structural Classification of Proteins
    Levels reflect both evolutionary and structural relationships
  CATH = Classification by Class, Architecture, Topology and Homology
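
Structures downloaded from the PDB in the classic fixed-column text format can be read with a few string slices. The sketch below is a minimal illustration that handles only the coordinate fields of ATOM/HETATM records. The function names are hypothetical, and a real project would more likely use an established parser library.

```python
def parse_atom_line(line):
    """Parse one ATOM/HETATM record of a PDB-format file; None otherwise."""
    record = line[:6]
    if record not in ("ATOM  ", "HETATM"):
        return None
    return {
        "record": record.strip(),
        "name": line[12:16].strip(),      # atom name, columns 13-16
        "res_name": line[17:20].strip(),  # residue name, columns 18-20
        "chain": line[21],                # chain identifier, column 22
        "res_seq": int(line[22:26]),      # residue number, columns 23-26
        "x": float(line[30:38]),          # coordinates in Angstroms
        "y": float(line[38:46]),
        "z": float(line[46:54]),
    }


def ca_coordinates(lines):
    """Collect (x, y, z) for every alpha-carbon ATOM record."""
    coords = []
    for line in lines:
        atom = parse_atom_line(line)
        if atom and atom["record"] == "ATOM" and atom["name"] == "CA":
            coords.append((atom["x"], atom["y"], atom["z"]))
    return coords
```

Passing the lines of a downloaded entry to `ca_coordinates` yields the Cα trace that many structure viewers draw as a backbone ribbon. The fixed column positions are the reason the slices use hard-coded offsets rather than splitting on whitespace.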

