ProteinStructuralDatabases. Proteins are built from amino-acids. Introduction H | NH2-c-CO2H | R.

Slides:



Advertisements
Similar presentations
Proteins: Structure reflects function….. Fig. 5-UN1 Amino group Carboxyl group carbon.
Advertisements

Review.
Review of Basic Principles of Chemistry, Amino Acids and Proteins Brian Kuhlman: The material presented here is available on the.
Lactate dehydrogenase + 38 ATP + 2 ATP. How does lactate dehydrogenase perform its catalytic function ?
Applications of knowledge discovery to molecular biology: Identifying structural regularities in proteins Shaobing Su Supervisor: Dr. Lawrence B. Holder.
5’ C 3’ OH (free) 1’ C 5’ PO4 (free) DNA is a linear polymer of nucleotide subunits joined together by phosphodiester bonds - covalent bonds between.
An overview of amino acid structure Topic 2. Biomacromolecule A naturally occurring substance of large molecular weight e.g. Protein, DNA, lipids etc.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
Amino Acids and Proteins 1.What is an amino acid / protein 2.Where are they found 3.Properties of the amino acids 4.How are proteins synthesized 1.Transcription.
Sequence analysis June 18, 2008 Learning objectives-Understand the concept of sliding window programs. Understand difference between identity, similarity.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein structure Anne Mølgaard, Center for Biological Sequence Analysis.
Scoring Matrices June 19, 2008 Learning objectives- Understand how scoring matrices are constructed. Workshop-Use different BLOSUM matrices in the Dotter.
Thomas Blicher Center for Biological Sequence Analysis
Lipids A. Classified based on solubility (like dissolves like) 1. insoluble in polar solvents 2. soluble in nonpolar solvents 3. lipids are hydrophobic.
It & Health 2009 Summary Thomas Nordahl Petersen.
Scoring Matrices June 22, 2006 Learning objectives- Understand how scoring matrices are constructed. Workshop-Use different BLOSUM matrices in the Dotter.
The Protein Data Bank (PDB)
Protein: Linear chain of amino acids called residues (4 in this toy protein) Ser Trp Leu O N N N N O O C C C C O O CαCα CαCα CαCα CαCα Lys H H H H H The.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Other structure programs Insight II Swiss PDB-Viewer QUANTA/CHARMm and Quanta 2005 for proteins SYBYL with FUGUE and FlexX for drugs O or FRODO for x-ray.
Protein structure Classification Ole Lund, Associate professor, CBS, DTU.
Protein Structure Prediction II
1 Computational Biology, Part 11 Retrieving and Displaying Macromolecular Structures Robert F. Murphy Copyright  1996, 1999, All rights reserved.
The relative orientation observed for  helices packed on ß sheets.
Protein Structure FDSC400. Protein Functions Biological?Food?
You Must Know How the sequence and subcomponents of proteins determine their properties. The cellular functions of proteins. (Brief – we will come back.
Amino Acids, Peptides, and Proteins.. Classification of Amino Acids.
Marlou Snelleman 2012 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
Protein Structural Prediction. Protein Structure is Hierarchical.
1.What makes an enzyme specific to one type of reaction (in other words, what determines the function of a protein)? –SHAPE determines the function of.
Structure and Function of Proteins Lecturer: Dr. Ora Furman Oct 2009 Winter 2009/10 Teaching Assistants: Miraim Oxsman Sivan Pearl.
Part II : Introduction To Protein Structure Kong Lesheng Victor Tong Joo Chuan National University of Singapore.
Structural alignment Protein structure Every protein is defined by a unique sequence (primary structure) that folds into a unique.
Proteins account for more than 50% of the dry mass of most cells
The.pdb file format, and other resources for structural information Topic 5 Chapter 10 & 11, Du and Bourne “Structural Bioinformatics”
Protein Sequences. The Genetic Code The natural extension of the genetic code…
BIOCHEMISTRY REVIEW Overview of Biomolecules Chapter 4 Protein Sequence.
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid.
AMINO ACIDS.
Amino Acids & Side Groups Polar Charged ◦ ACIDIC negatively charged amino acids  ASP & GLU R group with a 2nd COOH that ionizes* above pH 7.02nd COOH.
Secondary structure prediction
Learning Targets “I Can...” -State how many nucleotides make up a codon. -Use a codon chart to find the corresponding amino acid.
1 10/26/2015 MOLECULES. 2 10/26/2015 H 2 N-CH-C-OH O R Monomer E.g. protein Monomer vs polymer amino acid monomer R is a side group.
Welcome Back! February 27, 2012 Sit in any seat for today. You will have assigned seats tomorrow Were you absent before the break? Plan on coming to tutorial.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
A program of ITEST (Information Technology Experiences for Students and Teachers) funded by the National Science Foundation Background Session #3 DNA &
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
Marlou Snelleman 2011 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
Proteins.
Proteins Structure of proteins Proteins are made of C, H, O and nitrogen and may have sulfur. The monomers of proteins are amino acids An amino acid.
Chapter 3 Proteins.
X-ray detection xray/facilities.html.
©CMBI 2008 Databases Data must be in a certain format for software to recognize Every database can have its own format but some data elements are essential.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Principles of Protein Structure. AMINOACIDS Estereoisomer L Side-chain (-CH 3 ) }carboxyl-COOH amino amino -NH 2.
Arginine, who are you? Why so important?. Release 2015_01 of 07-Jan-15 of UniProtKB/Swiss-Prot contains sequence entries, comprising
Useful shell commands head/tail, cut, sort, uniq Virginie Orgogozo March 2011.
Useful shell commands head/tail, cut, sort, uniq Virginie Orgogozo March 2011.
Introduction to Protein Structure
Cathode (attracts (+) amino acids)
Figure 3.14A–D Protein structure (layer 1)
Haixu Tang School of Inforamtics
Packet #9 Supplement.
Amino Acids Amine group -NH2 Carboxylic group -COOH
Packet #9 Supplement.
Levels of Protein Structure
How to Test an Assertion
Presentation transcript:

ProteinStructuralDatabases

Proteins are built from amino-acids. Introduction H | NH2-c-CO2H | R

Schema of a peptide bond Peptide bonds are rigid and planar NH2 CH R1 C O N H CH C O N R2 R3 HOOC N-terminus C-terminus Peptide bonds

THE MAIN STRUCTURE DATABASES PDB: The repository for all known structural proteins. SCOP: Structural Classification of Proteins CATH: Class(C), Architecture(A), Topology(T) and Homologous superfamily (H).

PDB

HEADER PLANT SEED PROTEIN 30-APR-81 1CRN 1CRND 1 COMPND CRAMBIN 1CRN 4 Seqres: Sequence of protein. Be aware: Not always all 3D coordinates are present for all the amino acids in SEQRES SEQRES 1 46 THR THR CYS CYS PRO SER ILE VAL ALA ARG SER ASN PHE 1CRN 51 SEQRES 2 46 ASN VAL CYS ARG LEU PRO GLY THR PRO GLU ALA ILE CYS 1CRN 52 SEQRES 3 46 ALA THR TYR THR GLY CYS ILE ILE ILE PRO GLY ALA THR 1CRN 53 SEQRES 4 46 CYS PRO GLY ASP TYR ALA ASN 1CRN 54 Filename=accession number=PDB code Filename is 4 positions (often 1 digit & 3 letters, i.e.: 1CRN) (if you see 1abcD, it means chain D of structure 1abc) PDB RECORD (1) Header: Describes molecule & gives deposition date CMPND: Name of the molecule

ATOM 1 N THR CRN 70 ATOM 2 CA THR CRN 71 ATOM 3 C THR CRN 72 ATOM 4 O THR CRN 73 ATOM 5 CB THR CRN 74 ATOM 6 OG1 THR CRN 75 ATOM 7 CG2 THR CRN 76 ATOM 8 N THR CRN 77 ATOM 9 CA THR CRN 78 ATOM 10 C THR CRN 79 ATOM 11 O THR CRN 80 ATOM 12 CB THR CRN 81 ATOM 13 OG1 THR CRN 82 ATOM 14 CG2 THR CRN 83 ATOM 15 N CYS CRN 84 ATOM 16 CA CYS CRN ATOM 324 CG ASN CRN 393 ATOM 325 OD1 ASN CRN 394 ATOM 326 ND2 ASN CRN 395 ATOM 327 OXT ASN CRN 396 TER 328 ASN 46 1CRN 397 ATOM: one line for each atom with its unique name and its, x, y, z, coordinates PDB RECORD (2)

FOLD: Major structural similarity. Proteins are defined as having a common fold if they have the same major secondary structures in the same arrangement and with the same topological connections SUPERFAMILY: Probable common evolutionary origin. Proteins that have low sequences identities, but whose structural and functional features suggest that a common evolutionary origin is probable. FAMILY: Clear evolutionary relationship. Proteins clustered together into families are clearly evolutionarily related. Generally, this means that pairwise residue identities between the proteins are 30% greater. SCOP (Murzin, Brenner, Hubbard, Clothia, 1995) Classification according to visual analysis and comparison. STRUCTURAL CLASSIFICATION OF PROTEINS

Only crystal structures solved to resolution better than 3.0 angstroms are considered

CATH hierarchy Class: Seconday structure and packing Architecture: overall shape domain structure and orientation (no conenctivities between the secondary structures) Topology (FOLD family): overall shape and connectivities. Homologous superfamily : prots are thought to share common ancestor Similarities by seq. alignment and then by structure comparison using SSAP