Topic 1 Roland Dunbrack. Modeling of Biological Units Model data files of single proteins may require –sequence alignment(s) to templates (entry and chain)

Slides:



Advertisements
Similar presentations
Refinement of a pdb-structure and Convert A. Search for a pdb with the closest sequence to your protein of interest. B. Choose the most suitable entry.
Advertisements

Hydrophobic: tending to repel and not absorb water; tending not to dissolve in or mix or be wetted by water.
Dictionaries and Ontologies in Structural Biology.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Protein Structure, Databases and Structural Alignment
Slide 1 Powerpoint Assignment for Micbio 565 Eric Martz Yeast Gal4 transcriptional regulator, 1d66. X-ray crystallography, resolution 2.7 Angstroms. 2.
Slide 1 Powerpoint Assignment for Micbio 565 Eric Martz Yeast Gal4 transcriptional regulator, 1d66. X-ray crystallography, resolution 2.7 Angstroms. 2.
High Throughput Processing of the Structural Information of the Protein Data Bank Zoltán Szabadka, Vince Grolmusz Department of Computer Science Eötvös.
1 Computational Biology, Part 13 Retrieving and Displaying Macromolecular Structures Robert F. Murphy Copyright  1996, 1999, All rights reserved.
1 Computational Biology, Part 11 Retrieving and Displaying Macromolecular Structures Robert F. Murphy Copyright  1996, 1999, All rights reserved.
2.7 DNA Replication, transcription and translation
Management and Distribution of Chemical Data in the Protein Data Bank John Westbrook, Dimitris Dimitropoulos, Jasmine Young, Peter Rose, Philip E. Bourne.
Structure Representation and Coordinates Format Lecture 3 Structural Bioinformatics Dr. Avraham Samson
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Structure and Function of Proteins Lecturer: Dr. Ora Furman Oct 2009 Winter 2009/10 Teaching Assistants: Miraim Oxsman Sivan Pearl.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Visualization of Biological Macromolecules Shuchismita Dutta, Ph.D.
Number of released entries Year. Growth of Molecular Complexity Number of Chains Year Number of Structures Containing that Number of Chains.
Part II : Introduction To Protein Structure Kong Lesheng Victor Tong Joo Chuan National University of Singapore.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Being a binding site: Characterizing Residue-Composition of Binding Sites on Proteins joint work with Zoltán Szabadka and Gábor Iván, Protein Information.
Evaluation of Structure Quality Using RCSB PDB Tools Kyle Burkhardt, Lead Data Annotator The RCSB PDB at Rutgers University.
The.pdb file format, and other resources for structural information Topic 5 Chapter 10 & 11, Du and Bourne “Structural Bioinformatics”
Pages 34 to 36.  Can form 4 covalent bonds  Can form rings or long chains – allowing for complex structures.
Data quality and model parameterisation Martyn Winn CCP4, Daresbury Laboratory, U.K. Prague, April 2009.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Macromolecular Visualization or… Where to go when ChemDraw just isn’t enough Martin Case Chem
Transcription & Translation Do Now: 1.Get out yesterday’s homework (10-1 review) 2.If a DNA strand has the nucleotide sequence TCC-GAT-AAT, what will the.
Question 1: Name, PDB codes Eric Martz (no research lab) Macromolecular visualization I chose*: 3onz HUMAN TETRAMERIC HEMOGLOBIN:
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid.
PROTEIN SYNTHESIS THE FORMATION OF PROTEINS USING THE INFORMATION CODED IN DNA WITHIN THE NUCLEUS AND CARRIED OUT BY RNA IN THE CYTOPLASM.
Common File Formats in Rosetta Steven Combs. The Files Flags/Option files Resfiles Params PDB Silent Atom tree diffs.
Copyright OpenHelix. No use or reproduction without express written consent1.
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Structure database: PDB Tuomas Hätinen. Protein Data Bank A repository for 3-D biological macromolecular structure. It includes proteins, nucleic acids.
Protein Data Bank: An Introduction Learning to Use the RCSB PDB Portal.
EBI is an Outstation of the European Molecular Biology Laboratory. Quaternary Structure.
Cell Protein Production. Transcription : process of mRNA formation. 1. Triggered by chem. messengers from cytoplasm which bind to DNA 2. This causes release.
Question 1: Name, PDB codes Eric Martz Keiichi Namba Macromolecular visualization I chose: 3onz HUMAN TETRAMERIC HEMOGLOBIN: PROXIMAL.
EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Deposition, Validation, Search and Analysis Services.
Modeling tRNA’s Translator Function
X-ray detection xray/facilities.html.
Motivational Lecture: UNIX and computer-aided design of new medicines. Alexey Onufriev.
Chemistry 1011 Slot 51 Chemistry 1011 TOPIC Acids and Bases TEXT REFERENCE Masterton and Hurley Chapter 4.2 (Review), 13, 14.1, 15.1 (page 427), 21.2 (page589)
2.3 notes Carbon Compounds. Organic chemistry- study of compounds that contain bonds between C atoms Carbon: -can bond with 4 e- to another atom -can.
Marlou Snelleman 2012 Protein structure. Overview Sequence to structure Hydrogen bonds Helices Sheets Turns Hydrophobicity Helices Sheets Structure and.
What is a macromolecule? There are four main types of biological molecules called macromolecules. The four types of macromolecules are carbohydrates, lipids,
PUMPKIN Ideas. Objectives Development and application of methods to:  Model the dynamics of domain motion in large proteins  Incorporate experimental.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Metals. bulk eliments trace eliments for some species Periodic Table.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Introduction to RCSB PDB Data, Tools and Resources
Hydrophobic: tending to repel and not absorb water; tending not to dissolve in or mix or be wetted by water.
Number of released entries
Section 1 Powerpoint Assignment for Micbio 565, 2012
Biological Molecules.
Voltaic Cells How They work.
Section 1: Identity Powerpoint Assignment for Micbio 565, 2014
DNA Structure and Replication REVIEW
COMPOUNDS OF LIVING THINGS
Ligand Binding to the Voltage-Gated Kv1
COMPOUNDS OF LIVING THINGS
COMPOUNDS OF LIVING THINGS
COMPOUNDS OF LIVING THINGS
COMPOUNDS OF LIVING THINGS
Nucleic Acid Molecules -DNA, RNA and ATP -Structure and Functions
Chapter 2 The Chemistry of Life
Crystal structure description
Presentation transcript:

Topic 1 Roland Dunbrack

Modeling of Biological Units Model data files of single proteins may require –sequence alignment(s) to templates (entry and chain) –sequence-coordinate correspondence –methods description –information on ligands Models of biological units -- homo-N-mers, protein complexes, with ligands, ions, etc. would require more: –sequence alignments for each protein to templates –source of biological unit (RCSB or PQS) –which protein of model based on which chain of asymmetric unit and which symmetry operator used to build it –which ligands associated with which chains PDB format is inadequate We should use XML (preferably) or CIF, using same entity_ID/asym_ID/seq_id schema used by RCSB with additional tags specific for models of various types (e.g., for methods and target-template alignments)

The XML Files and Data Uniformity RCSB project to remake all the PDB files and set standards for all new files fixing format errors and completing information missing in many old files sequence-coordinate correspondence unique identifiers for all molecules and atoms no fixed width fields as in “PDB format” contents of XML files for experimental structures: –asymmetric unit contents –biological unit contents –covalent attachments and modified residues –non-covalent ligands, DNA, RNA –structural quality -- missing residues, resolution, R-factors, B-factors add modeling-specific tags –sequence alignments to template(s) –methods: alignment, coordinate-building, docking, refinement, assessment, etc. –asym_ID of model corresponding to asym_ID of templates+symmetry operator

ATOM C CB HIS A HIS A CB 1 ATOM 3016 N MET B N ATOM 3017 CA MET B C ATOM 3018 C MET B C ATOM 3019 O MET B O ATOM 3020 CB MET B C ATOM 3021 CG MET B C ATOM 3022 SD MET B S ATOM 3023 CE MET B C ATOM 3024 OXT MET B O

Entity 1 PBGS Molecule types Molecules in the asymmetric unit AsymID AAsymID B Entity 2 Levulinic acid AsymID CAsymID D Entity 3 Zn 2+ AsymID KAsymID L Entity 4 water AsymID W AsymID MAsymID N (res 1, res 2, res 3…)

AsymID W (res 1, res 2, res 3…) AsymID AAsymID B AsymID CAsymID D AsymID KAsymID L AsymID MAsymID N AsymID AAsymID B AsymID CAsymID D AsymID KAsymID L AsymID MAsymID N AsymID AAsymID B AsymID CAsymID D AsymID KAsymID L AsymID MAsymID N AsymID AAsymID B AsymID CAsymID D AsymID KAsymID L AsymID MAsymID N Biological Unit: protein homooctamer=4 copies of asymmetric unit

Unit PDB Un# # asym auth ent polymer polymertype source name asym 1jk9 0 1 A A 1 polymer polypeptide(L) pdb superoxide dismutase asym 1jk9 0 1 C C 1 polymer polypeptide(L) pdb superoxide dismutase asym 1jk9 0 1 B B 2 polymer polypeptide(L) pdb copper chaperone for superoxide dismutase asym 1jk9 0 1 D D 2 polymer polypeptide(L) pdb copper chaperone for superoxide dismutase asym 1jk9 0 1 E A 3 non-polymer - pdb ZINC ION asym 1jk9 0 1 F C 3 non-polymer - pdb ZINC ION asym 1jk9 0 1 G _ 3 non-polymer - pdb ZINC ION asym 1jk9 0 1 H _ 3 non-polymer - pdb ZINC ION asym 1jk9 0 1 I _ 4 non-polymer - pdb SULFATE ION asym 1jk9 0 1 J _ 4 non-polymer - pdb SULFATE ION asym 1jk9 0 1 K _ 5 water - pdb water rcsb 1jk9 1 1 A A 1 polymer polypeptide(L) pdb superoxide dismutase rcsb 1jk9 1 1 B B 2 polymer polypeptide(L) pdb copper chaperone for superoxide dismutase rcsb 1jk9 1 1 E A 3 non-polymer - pdb ZINC ION rcsb 1jk9 1 1 G _ 3 non-polymer - pdb ZINC ION rcsb 1jk9 1 1 H _ 3 non-polymer - pdb ZINC ION rcsb 1jk9 1 1 I _ 4 non-polymer - pdb SULFATE ION rcsb 1jk9 1 1 J _ 4 non-polymer - pdb SULFATE ION rcsb 1jk9 1 1 K _ 5 water - pdb water rcsb 1jk9 2 1 C C 1 polymer polypeptide(L) pdb superoxide dismutase rcsb 1jk9 2 1 D D 2 polymer polypeptide(L) pdb copper chaperone for superoxide dismutase rcsb 1jk9 2 1 F C 3 non-polymer - pdb ZINC ION pqs 1jk9 1 1 A A 1 polymer polypeptide(L) pdb superoxide dismutase pqs 1jk9 1 1 C C 1 polymer polypeptide(L) pdb superoxide dismutase pqs 1jk9 1 1 B B 2 polymer polypeptide(L) pdb copper chaperone for superoxide dismutase pqs 1jk9 1 1 D D 2 polymer polypeptide(L) pdb copper chaperone for superoxide dismutase pqs 1jk9 1 1 E A 3 non-polymer - pdb ZINC ION pqs 1jk9 1 1 F C 3 non-polymer - pdb ZINC ION pqs 1jk9 1 1 G _ 3 non-polymer - pdb ZINC ION pqs 1jk9 1 1 H _ 3 non-polymer - pdb ZINC ION pqs 1jk9 1 1 I _ 4 non-polymer - pdb SULFATE ION pqs 1jk9 1 1 J _ 4 non-polymer - pdb SULFATE ION pqs 1jk9 1 1 K _ 5 water - pdb water Content of Asymmetric and Biological Units