Presentation is loading. Please wait.

Presentation is loading. Please wait.

Protein Structure Nimrod Rubinstein Bioinformatics Seminar.

Similar presentations


Presentation on theme: "Protein Structure Nimrod Rubinstein Bioinformatics Seminar."— Presentation transcript:

1 Protein Structure Nimrod Rubinstein Bioinformatics Seminar

2 Protein Synthesis 1. Attachment of correct amino acids (AAs) to their corresponding tRNAs. 2. Initiation: forming the initiation complex. 3. Elongation: sequentially forming peptide bonds. 4. Termination: synthesis is terminated and the polypeptide is released.

3 From Sequence to Structure Structure Hierarchies: Primary structure: the sequence of AAs covalently bound along the backbone of the polypeptide chain. Cα C C N N O O O ф ф ψ ψ C N ф ψ Ala Gly Cys -180 0 ≤ ф ≤ 180 0 -180 0 ≤ ψ ≤ 180 0

4 From Sequence to Structure Structure Hierarchies: Secondary structure: local conformation of some part of the polypeptide. β Sheet α Helix ParallelAnti Parallel

5 From Sequence to Structure Structure Hierarchies: Tertiary structure: the overall 3-dimensional arrangement of all the atoms in the protein.

6 From Sequence to Structure Structure Hierarchies: Quaternary structure: some proteins contain two or more separate polypeptide chains, which may be identical or different. GlobularFibrous

7 From Sequence to Structure Additional Parameters: Surface accessibility: The surface area of the molecule that is exposed to the solvent, derived from the complete structure. VDW surface: the surface area of an atom. Connolly surface: the interface between the molecule and the solvent sphere (conventionally with r = 1.4 Å ). Solvent accessible surface: the path of the center of the solvent sphere rolled over the VDW surface. Relative accessibility = (SAS)/(maxSAS) maxSAS = SAS(Gly-X-Gly)

8 From Sequence to Structure Additional Parameters: Coordination number: The number of structure stabilizing contacts each residue in the structure makes.The number of structure stabilizing contacts each residue in the structure makes. Computation: encapsulating an AA with a sphere, centered at the residue ’ s center of mass, and counting the number of residues falling inside this sphere.Computation: encapsulating an AA with a sphere, centered at the residue ’ s center of mass, and counting the number of residues falling inside this sphere. Usually done with different cutoff radii.Usually done with different cutoff radii.

9 From Sequence to Structure Protein Folding: Luckily, nature works out with these sorts of numbers and the correct conformation of a protein is reached within seconds. If each conformation were sampled in the shortest possible time (time of a molecular vibration ~ 10 -13 s) it would take an astronomical amount of time (~10 77 years) to sample all possible conformations, in order to find the Native State. The Levinthal paradox: The Levinthal paradox: [Levinthal C.; J. Chym. Phys. (1968)] Assume a protein is comprised of 100 AAs. Assume each AA ’ s backbone can take up 10 different conformations, defined by ф and ψ values. Altogether we get: 10 100 conformations. NPC even in the 2D case

10 From Sequence to Structure Folding Models: The Backbone-Centric view: фψpropensitiesSequence order dependent interactions (фψ - propensities and H- bonds), produce local secondary structure elements (SSEs). Local SSEs later overgo longer- range interactions to form supersecondary structures. Supersecondary structures of ever-increasing complexity thus grow, ultimately into the native conformation.

11 From Sequence to Structure Folding Models: The Sidechain-Centric view: Hydrophobic sidechain interactions are the strongest for AAs in a water solution.Hydrophobic sidechain interactions are the strongest for AAs in a water solution. A few key hydrophobic residues are responsible for a “ hydrophobic collapse ” to the “ molten globule ” state.A few key hydrophobic residues are responsible for a “ hydrophobic collapse ” to the “ molten globule ” state. The “ molten globule ” might not include SSEs, yet about this structure the remainder of the polypeptide chain condenses.The “ molten globule ” might not include SSEs, yet about this structure the remainder of the polypeptide chain condenses. The conformation space is viewed as “ funnel shaped ”.The conformation space is viewed as “ funnel shaped ”. Molten globule states

12 From Sequence to Structure Folding Models: The Sidechain-Centric view - Larger proteins: Intermediate states exist, which are highly populated. These states may assist in finding the Native Structure or may serve as traps that inhibit the folding process. Structurally aligning intermediate states against the SCOP found the corresponding Native Structures to have the highest scores. But, many features were missing: Well defined SSEs. A well formed hydrophobic core. High RMSDs (7-10 Å). [Dobson C. M.; TRENDS in Biochemical Sciences; Jan 2005]

13 From Sequence to Structure Folding Models: Post-translationalVs. Co-translational Denaturation-Renaturation experiments are biased. An AA is added to the polypeptide chain in: 10 -2 s. The rate at which an SSE is formed is: 10 -7 – 10 -4 s. Anfinsen ’ s experiments: Exposure of a purified RNase-A enzyme to a concentrated urea solution in the presence of a reducing agent denaturizes the folded conformation resulting in a complete loss of catalytic activity. Removal of the urea and reducing agent causes the enzyme to accurately refold to its native structure and restore its catalytic activity. [Anfinsen C. et al.; PNAS (1961)]

14 Determining the Structure Crystallization: Assembling a solution of protein molecules into a periodic lattice. Assembling a solution of protein molecules into a periodic lattice. F F X-Ray Diffraction: The crystal is bombarded with X-ray beams. The crystal is bombarded with X-ray beams. The collision of the beams with the electrons creates a diffraction pattern. The collision of the beams with the electrons creates a diffraction pattern. The diffraction pattern is transformed into an electron density map of the protein from which the 3D locations of the atoms can be deduced. The diffraction pattern is transformed into an electron density map of the protein from which the 3D locations of the atoms can be deduced.

15 Determining the Structure Nucleotide Magnetic Resonance: A solution of the protein is placed in a magnetic field. A solution of the protein is placed in a magnetic field. spins align parallel or anti-parallel to the field. spins align parallel or anti-parallel to the field. RF pulses of electromagnetic energy shifts spins from their alignment. RF pulses of electromagnetic energy shifts spins from their alignment. Upon radiation termination spins re-align while emitting the energy they absorbed. Upon radiation termination spins re-align while emitting the energy they absorbed. The emission spectrum contains information about the identity of the nuclei and their immediate environment. The emission spectrum contains information about the identity of the nuclei and their immediate environment. The result is an ensemble of models rather than a single structure. The result is an ensemble of models rather than a single structure.

16 Structure Similarity Protein Families: Structures seem to be preserved much more than sequences, which is easily explainable due to neutral mutations. Structures seem to be preserved much more than sequences, which is easily explainable due to neutral mutations. 1BRU: Pancreatic Elastase (Sus scorfa). 1CHG: Chymotrysinogen (Bos taurus). Global Alignment: 39% identity 1BRU1CHG Rigid C α Alignment: RMSD 1.26Å 1CHG 1BRU

17 Structure Similarity http://scop.mrc-lmb.cam.ac.uk/scop/ Protein Families: Structures seems to be preserved much more than sequences, which is easily explainable due to neutral mutations. Structures seems to be preserved much more than sequences, which is easily explainable due to neutral mutations. Structural Biologists claim that there are a limited number of ways in which protein domains fold. There may be as few as ~2000 different folds (differing by their backbone topology). Structural Biologists claim that there are a limited number of ways in which protein domains fold. There may be as few as ~2000 different folds (differing by their backbone topology). Nearly a 1000 different folds have already been resolved. Nearly a 1000 different folds have already been resolved.

18 Structure Prediction Homology (Comparative) Modeling: Guideline: At least 30% sequence identity is needed between probe and template. 1.Template Assignment: creating a robust probe- template alignment (PWA/MSA). 2.Model Construction: a.Generation of coordinates for conserved segments: superimposing/averaging/restrain based. b.Generation of coordinates for variable segments: DB scanning/Ab Initio/restrain based. c.Generation of coordinates for sidechain atoms: superimposing/rotamer libraries/restrain based. 3.Model Evaluation: a.Assessment of to the ability to functionally identify the active site of the model. b.Assessment of physico-chemical or structural environment based on statistical analyses of DBs for characteristics such as:  Intramolecular packing.  Bond geometry.  Solvent accessibility. bFGF [Peitsch et al. (1999)]

19 Structure Prediction Threading (Sequence-Structure Alignment): Identifying evolutionary unrelated proteins that have converged to similar folds. Scoring Scheme: describes the propensity of each AA for its structural/physico- chemical environment: SS type, solvent accessibility, coordination number, etc …Scoring Scheme: describes the propensity of each AA for its structural/physico- chemical environment: SS type, solvent accessibility, coordination number, etc … Profile construction: encoding the template ’ s AAs structural features to a 1D profile and predicting such a profile for the probe.Profile construction: encoding the template ’ s AAs structural features to a 1D profile and predicting such a profile for the probe. Threading Algorithm: Aligning the 1D profiles of the template and the probe using DP and the defined scoring scheme.Threading Algorithm: Aligning the 1D profiles of the template and the probe using DP and the defined scoring scheme. No adjustments to the template profile can be made thus substantial rearrangements are ignored But: No adjustments to the template profile can be made thus substantial rearrangements are ignored templateprobe [Bryant, Lawrence; Proteins (1993)]

20 Structure Prediction Ab Initio Techniques: Simulating the folding process Simulating the folding process Simplifying the energy landscape: Reducing the number of degrees of freedom: Reducing the number of degrees of freedom: Representing a group of atoms by a single atom.Representing a group of atoms by a single atom. Reducing the number of atom interactions.Reducing the number of atom interactions. Sampling the conformation space: Sampling the conformation space: Monte Carlo sampling.Monte Carlo sampling. Genetic Algorithm.Genetic Algorithm. Simulated Annealing.Simulated Annealing. Hierarchical folding simulation. Hierarchical folding simulation.

21 Blind Prediction Critical Assessment of Protein Structure Prediction – CASP Goal: “ to obtain an in-depth and objective assessment of our current abilities and inabilities in the area of protein structure prediction ”. Goal: “ to obtain an in-depth and objective assessment of our current abilities and inabilities in the area of protein structure prediction ”. Groups use their tools to model proteins with pre-published structures. Groups use their tools to model proteins with pre-published structures. The predictions are thus evaluated against the subsequently determined structures. The predictions are thus evaluated against the subsequently determined structures. CASP6 (2004) shows limited improvements compared to CASP5 (2003). CASP6 (2004) shows limited improvements compared to CASP5 (2003).


Download ppt "Protein Structure Nimrod Rubinstein Bioinformatics Seminar."

Similar presentations


Ads by Google