Presentation is loading. Please wait.

Presentation is loading. Please wait.

Biochemistry 301 Principles of Protein Structure Walter Chazin 5140 BIOSCI/MRBIII Walter.Chazin Jan. 8-10,

Similar presentations


Presentation on theme: "Biochemistry 301 Principles of Protein Structure Walter Chazin 5140 BIOSCI/MRBIII Walter.Chazin Jan. 8-10,"— Presentation transcript:

1 Biochemistry 301 Principles of Protein Structure Walter Chazin 5140 BIOSCI/MRBIII E-mail: Walter.Chazin http://structbio.vanderbilt.edu/chazin Jan. 8-10, 2003

2 Text Books Branden and Tooze Introduction to Protein Structure Voet, Voet and Pratt Fundamentals of Biochemistry Stryer Biochemistry

3 Proteins: Polymers of Amino Acids 20 different amino acids: many combinations Proteins are made in the RIBOSOME

4 Amino Acid Chemistry NH 2 CC R1R1 CO H NHCC R2R2 COOH H NH 2 CC R COOH H amino acid 20 different types Amino acid PolypeptideProtein NH 2 CC R1R1 COOH H NH 2 CC R2R2 COOH H

5 Amino Acid Chemistry NH 2 CC R COOH H amino acid The free amino and carboxylic acid groups have pKa’s COOH COO - pKa ~ 2.2 NH 2 NH 3 + pKa ~ 9.4 At physiological pH, amino acids are zwitterions + NH 3 CC R COO - H

6 Amino Acid Chemistry Note the axes Also titratable groups in side chain

7 GlycineGly - G 2.49.8 AlanineAla - A 2.49.9 ValineVal - V 2.29.7 LeucineLeu - L 2.39.7 IsoleucineIle - I 2.39.8 Amino Acids with Aliphatic R-Groups pKa’s

8 Amino Acids with Polar R-Groups Non-Aromatic Amino Acids with Hydroxyl R-Groups SerineSer - S 2.29.2 ~13 ThreonineThr - T 2.19.1 ~13 Amino Acids with Sulfur-Containing R-Groups CysteineCys - C 1.910.8 8.3 MethionineMet-M 2.19.3

9 Aspartic AcidAsp - D 2.09.9 3.9 AsparagineAsn - N 2.18.8 Glutamic AcidGlu - E 2.19.5 4.1 GlutamineGln - Q 2.29.1 Acidic Amino Acids and Amide Conjugates

10 Basic Amino Acids ArginineArg - R 1.89.0 12.5 LysineLys - K 2.29.2 10.8 HistidineHis - H 1.89.2 6.0

11 Aromatic Amino Acids and Proline PhenylalaninePhe - F 2.29.2 TyrosineTyr - Y 2.29.1 10.1 TryptophanTrp-W 2.49.4 ProlinePro - P 2.010.6

12 Hierarchy of Protein Structure 20 different amino acids: many combinations The order of amino acids: Protein sequence Primary Structure Local conformation, depends on sequence Secondary Structure Overall structure of the chain(s) in full 3D Tertiary/Quaternary Structure

13 Beyond Primary Structure: The Peptide Bond -C - N- O = - H -C = N- O-O- - - H Resonance structures  Peptide plane is flat  angle ~180º  Partial double-bond: Peptide bond

14 Implications of Peptide Planes   angle varies little,  and  angles vary alot  Many  /  combinations cause atoms to collide  Each residue is sandwiched between two planes CC HR   Peptide planes CC H R CC

15 Polypeptide Backbone  Backbone restricted  limited conformations  Collisions with side chain groups further limit  /  combinations CC HR   CC HR CC HR

16 Secondary Structure Local Conformation of Consecutive Residues Three low energy backbone  combinations 1. Right-hand helix:  -helix (-40°, -60°) 2. Extended: antiparallel  -sheet (140°, -140°) rare 3. Left-hand helix (rare):  -helix (45°, 45°)  Glycine: special it has no side chain! Hydrogen bonds between backbone atoms provides stability to secondary structures Amino acids have specific preferences

17 Secondary Structure-  Helix H-bond

18 Secondary Structure-  Sheet OxygenNitrogen R Group Hydrogen Carbon  Carbonyl C H Bond

19 Secondary Structure-  Turn 1 43 2  Reverses direction of the chain

20 Ribbon and Topology Diagrams Representations of Secondary Structures  Sheets (arrows), Helices (cylinders) B/T- Figure 2.17

21 Ribbon and Topology Diagrams Organization of Secondary Structures helix B/T- Figure 2.11

22 Beyond Secondary Structure Supersecondary structure (motifs): small, discrete, commonly observed aggregates of secondary structures   sheet   helix-loop-helix   Domains: independent units of structure   barrel  four-helix bundle *Domains and motifs sometimes interchanged*

23 Protein Motifs V/V/P- Figure 6.28

24 Hairpin Motif B/T- Figure 2.14

25 Helix-Loop-Helix (H-L-H) Motif B/T- Figure 2.12

26 EF-Hand H-L-H Motif B/T- Figure 2.13

27 Greek Key Motif B/T- Figure 2.15

28 Multi-Domain (Modular) Proteins EGF Protease Kringle Ca-binding Protein Domain

29 Tertiary Structure Definition: Overall 3D form of a molecule  Organization of the secondary structures/ motifs/domains  Optimization of interactions between residues  A specific 3D structure is formed All proteins have multiple secondary structures, almost always multiple motifs, and in some cases multiple domains

30 Tertiary Structure Specific structures result from long-range interactions  Electrostatic (charged) interactions  Hydrogen bonds (O  H, N  H, S  H)   Hydrophobic interactions Soluble proteins have an inside (core) and outside  Folding driven by water- hydrophilic/phobic  Side chain properties specify core/exterior   Some interactions inside, others outside

31 Tertiary Structure I. Ionic Interactions (exterior) Forms between 2 charged side chains: 1 Negative – Glu,Asp 1 Positive – Lys,Arg,His  Also called “salt bridges”.  Ionic interactions are pH-dependent (pKa).  Occurs at the exterior  NOTE: pKs for in the interior of a protein may be very different from free amino acid.

32 Tertiary Structure II. Hydrogen bonds (interior and exterior) Forms between side chains/backbone/water: Charged side chains: Glu,Asp,His,Lys,Arg Polar chains: Ser,Thr,Cys,Asn,Gln,[Tyr,Trp]  Not a specific covalent bond – lower energy.  Occurs inside, at the exterior, and with water.

33 Tertiary Structure III. Hydrophobic Interactions (interior) Forms between side chains of non-polar residues: Aliphatic (Ala,Val,Leu,Ile,Pro,Met) Aromatic (Phe,Trp,[Tyr])  Clusters of side chains- but no requirement for a specific orientation like an H-bond  In the protein interior, away from water  Not pH dependent

34 Tertiary Structure IV. Disulfide Bonds (interior and exterior) Forms between Cys residues: Cys-SH + HS-Cys  Cys-S-S-Cys  Catalyzed by specific enzymes, oxidizing agents  Restricts flexibility of the protein  Usually within a protein, less for linking proteins

35 Disulfide Bonding V/V/P- Figure 16.6

36 Quaternary Structure Definition: Organization of multiple chain associations  Oligomerization- Homo (self), Hetero (different)   Used in organizing single proteins and protein machines Specific structures result from long-range interactions  Electrostatic (charged) interactions  Hydrogen bonds (O  H, N  H, S  H)   Hydrophobic interactions  Disulfides only VERY infrequently

37 Quaternary Structure The classic example- hemoglobin  2 -  2 B/T- Figure 3.7 END OF PART 1

38 Protein Structure from Sequence The pattern of amino acid side chains determines the local conformation and the global structure *Pattern is more important than exact sequence* A T V R L L E W E D L Reporting/Comparing Protein Sequences A T V R L L E Y K D L 5 10 h-CaM b-CaM conservativenon-conservative

39 Proteins Fold To Their Native Structure Folded proteins are only marginally stable!!  ~0.4 kJmol -1 required to unfold (cf. ~20/H-bond)  Balance loss of entropy vs. stabilizing forces Protein fold is specified by sequence  Reversible reaction- denature (fold)/renature   Even single mutations can cause changes  Recent discovery that amyloid diseases (eg. CJD, Alzheimer) are due to unstable protein folding

40 How Does a Protein Find It’s Fold? A protein of n residues: 20 n possible sequences! 100 residue protein has 100 20 possibilities 1.3 X 10 130 !  The latest estimates indicate < 40,000 sequences in the human genome  THERE MUST BE RULES! 20 different amino acids: many combinations NC 1234 Amino terminusCarboxyl terminus Residue number

41 Limitations on Protein Sequence  Minimum length based on ability to perform a biochemical function: ~40 residues (e.g. inhibitors)  Maximum length based on complexity of assembly: Conversion of DNA code and production of proteins is carried out by molecular machines that are not perfect. If the sequence gets too long, too many errors will build up. *Length is generally 100-1000 residues*

42 Protein Folding The hydrophobic effect is the major driving force  Hydrophobic side chains cluster/exclude water  Release of water cages in unfolded state Other forces providing stability to the folded state  Hydrogen bonds   Electrostatic interactions  Chemical cross links- Disulfides, metal ions

43 Protein Folding Random folding has too many possibilities Backbone restricted but side chains not A 100 residue protein would require 10 87 s to search all conformations (age of universe < 10 18 s) Most proteins fold in less than 10 s!!  Proteins must fold along specific pathways!!

44 Protein Folding Pathways Usual order of folding events  Secondary structures formed quickly (local)  Secondary structures aggregate to form motifs  Hydrophobic collapse to form domains  Coalescence of domains Molecular chaperones assist folding in-vivo  Complexity of large chains/multi-domains  Cellular environment is rich in interacting molecules  Chaperones sequester proteins and allow time to fold

45 Progressive Folding of Proteins From Disordered to Native State Protein Folding Funnel V/V/P- Figures 6.37/38

46 Functional Classes of Proteins Receptors- sense stimuli, e.g. in neurons Channels- control cell contents Transport- e.g. hemoglobin in blood Storage- e.g. ferritin in liver Enzyme- catalyze biochemical reactions Cell function- multi-protein machines Structural- collagen in skin Immune response- antibodies

47 Structural Classes of Proteins 1. Globular proteins (enzymes, molecular machines)  Variety of secondary structures  Approximately spherical shape  Water soluble  Function in dynamic roles (e.g. catalysis, regulation, transport, gene processing)

48 Globular Proteins V/V/P- Figure 6.27 Hemoglobin  Conconavalin A Triose Phosphate isomerase

49 Structural Classes of Proteins 2. Fibrous Proteins (fibrils, structural proteins)  One dominating secondary structure  Typically narrow, rod-like shape  Poor water solubility  Function in structural roles (e.g. cytoskeleton, bone, skin)

50 Collagen: A Fibrous Protein V/V/P- Figures 6.17/18 Triple Helix Gly-Pro-Pro Repeat Stabilizing Inter-strand H-bonds

51 Structural Classes of Proteins 3. Membrane Proteins (receptors, channels)  Inserted into (through) membranes  Multi-domain- membrane spanning, cytoplasmic, and extra-cellular domains  Poor water solubility  Function in cell communication (e.g. cell signaling, transport)

52 Photosynthetic Reaction Center B/T Figure 13.6 Extracellular Intracellular (cytoplasmic) Membrane- spanning

53 In the physical sense, the progression of living organisms results from the communication between molecules. Interaction between molecules is determined by binding affinities.

54 Binding Classification of Proteins Structural- other structural proteins Receptors- regulatory proteins, transmitters Toxins- receptors Transport- O 2 /CO 2, cholesterol, metals, sugars Storage- metals, amino acids, Enzymes- substrates, inhibitors, co-factors Cell function- proteins, RNA, DNA, metals, ions Immune response- foreign matter (antigens)

55 Surface Determines What Binds  Steric access  Shape  Hydrophobic accessible surface  Electrostatic surface Sequence and structure optimized to generate surface properties for requisite binding event(s)

56 Determinants of Protein Surface Function requires specific amino acid properties  Not all amino acids are equally useful  Abundant: Leu, Ala, Gly, Ser, Val, Glu  Rare: Trp, Cys, Met, His Post-translational modifications  Addition of co-factors- metals, hemes, etc.  Chemical modification- phosphorylation, glycosylation, acetylation, ubiquination, sumoylation

57 Binding Alters Protein Structure Mechanisms of Achieving Functional Properties  Allosteric Control- binding at one site effects changes in conformation or chemistry at a point distant in space  Stimulation/inhibition by control factors- proteins, ions, metals control progression of a biochemical process (e.g. controlling access to active site) 3. Reversible covalent modification- chemical bonding, e.g. phosphorylation (kinase/phosphatase) 4. Proteolytic activation/inactivation- irreversible, involves cleavage of one or more peptide bonds

58 Calcium Signal Transduction Allostery & Stimulation by Control Factor Target Ca 2+ Calmodulin

59 Sequence  Structure  Function Many sequences can give same structure  Side chain pattern more important than sequence When homology is high (>50%), likely to have same structure and function (Structural Genomics)  Cores conserved  Surfaces and loops more variable *3-D shape more conserved than sequence* *There are a limited number of structural frameworks*

60 I. Homologous: similar sequence (cytochrome c)  Same structure  Same function  Modeling structure from homology Varied Relationships Between Sequence, Structure and Function

61 V/V/P Figure 6.31 C-Type Cytochromes Same structure/function- Different Sequence Heme  Constant structural elements and basic architecture

62 Varied Relationships Between Sequence, Structure and Function I. Homologous: very similar sequence (cytochrome c)  Same structure  Same function  Modeling structure from homology II. Similar function- different sequence (dehydrogenases)  One domain same structure  One domain different

63 B/T Figure 10.8 NAD-Binding Domains Conserved Domains/Functional Elements Lactate DehydrogenaseAlcohol Dehydrogenase

64 Varied Relationships Between Sequence, Structure and Function I. Homologous: very similar sequence (cytochrome c)  Same structure  Same function  Modeling structure from homology II. Similar function- different sequence (dehydrogenases)  One domain same structure  One domain different III. Similar structure- different function (cf. thioredoxin)  Same 3-D structure  Not same function

65 B/T Figures 10.8/2.7 NADH-Binding and Redox Same structure- Different Function Alcohol DehydrogenaseLactate Dehydrogenase Thioredoxin


Download ppt "Biochemistry 301 Principles of Protein Structure Walter Chazin 5140 BIOSCI/MRBIII Walter.Chazin Jan. 8-10,"

Similar presentations


Ads by Google