Web Resources for Bioinformatics Vadim Alexandrov and Mark Gerstein.

Slides:



Advertisements
Similar presentations
Protein Structure.
Advertisements

Protein Structure and Physics. What I will talk about today… -Outline protein synthesis and explain the basic steps involved. -Go over the Chemistry of.
1 Welcome to the Protein Database Tutorial This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
Nucleic Acid Database By Pooja Awatramani. Database Utilities Provides structural references in the form of base pair annotation for DNA, RNA, and some.
Pfam(Protein families )
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
©CMBI 2005 Exploring Protein Sequences - Part 2 Part 1: Patterns and Motifs Profiles Hydropathy Plots Transmembrane helices Antigenic Prediction Signal.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
1 (c) Mark Gerstein, 1999, Yale, bioinfo.mbb.yale.edu BIOINFORMATICS Introduction Mark Gerstein, Yale University bioinfo.mbb.yale.edu/mbb452a.
Tema 14. Bases of protein structure and structural prediction. Structural data bank. Protein Data Bank. Molecular Visualization Tools for 3D. Prediction.
Intro to Bioinformatics Summary. What did we learn Pairwise alignment – Local and Global Alignments When? How ? Tools : for local blast2seq, for global.
Strict Regularities in Structure-Sequence Relationship
Protein-DNA interactions: amino acid conservation and the effects of mutations on binding specificity Nicholas M. Luscombe and Janet M. Thornton JMB (2002)
Protein structure (Part 2 of 2).
The Protein Data Bank (PDB)
ProteinStructuralDatabases. Proteins are built from amino-acids. Introduction H | NH2-c-CO2H | R.
Protein structure prediction May 30, 2002 Quiz#4 on June 4 Learning objectives-Understand difference between primary secondary and tertiary structure.
Protein Structure Analysis - I
BMI 731 Protein Structures and Related Database Searches.
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Protein Structure Prediction II
© Wiley Publishing All Rights Reserved. Biological Sequences.
Protein Structure Prediction and Analysis
Using 3D-SURFER. Before you start 3D-Surfer can be accessed at For visualization.
Pattern databasesPattern databasesPattern databasesPattern databases Gopalan Vivek.
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Protein Tertiary Structure Prediction
Macromolecular structure
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Protein 3D-structure analysis Exercises. Practicals Find update frequency for RCSB PDB: weekly. When was the last update? How many protein structures.
EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme.
Gene Annotation and Analysis Lab Work Reference: European Multimedia Bioinformatics Educational Resource.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
CATH – a hierarchic classification of protein domain structures Rui Kuang.
PROTEIN STRUCTURE CLASSIFICATION SUMI SINGH (sxs5729)
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
BLOCKS Multiply aligned ungapped segments corresponding to most highly conserved regions of proteins- represented in profile.
Copyright OpenHelix. No use or reproduction without express written consent1.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Data Bank: An Introduction Learning to Use the RCSB PDB Portal.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
EBI is an Outstation of the European Molecular Biology Laboratory. Quaternary Structure.
Central dogma: the story of life RNA DNA Protein.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Homology modeling with SWISS-MODEL
3DM: Protein engineering Super-family platforms Bio-Prodict DM super-family systems Henk-Jan Joosten Remko Kuipers Tom v/d Bergh Bas Vroling.
March 28, 2002 NIH Proteomics Workshop Bethesda, MD Lai-Su Yeh, Ph.D. Protein Scientist, National Biomedical Research Foundation Demo: Protein Information.
Bioinformatics Project BB201 Metabolism A.Nasser
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Examining Protein Folding Process Simulation and.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
InterPro Sandra Orchard.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBe Search Services (PDBelite, PDBePro and BIObar) Sanchayita Sen, Ph.D. PDB Depositions.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Bio/Chem-informatics
PDBemotif A web based integrated search service to understand ligand binding and secondary structure properties in macromolecular structures.
Demo: Protein Information Resource
Getting the Most out of the PDBe
From: Structural database resources for biological macromolecules
Protein Structure Prediction and Protein Homology modeling
BIOINFORMATICS Introduction
BIOINFORMATICS Summary
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Web Resources for Bioinformatics Vadim Alexandrov and Mark Gerstein

What is Bioinformatics? (Molecular) Bio - informatics One idea for a definition? Bioinformatics is conceptualizing biology in terms of molecules (in the sense of physical- chemistry) and then applying “informatics” techniques (derived from disciplines such as applied math, CS, and statistics) to understand and organize the information associated with these molecules, on a large-scale. Bioinformatics is “MIS” for Molecular Biology Information. It is a practical discipline with many applications.

Web Resources : Molecules –Sequence, Structure, Function Algorithms –HMMs –alignments –simulations Databases

0. Good Starting Point

Web tour of UCL tools and resources

Web tour of UCL tools and resources

1. PDBsum capabilities PDBsum: Starting point for looking at PDB structure Each entry contains: a. View - Schematic pictures of the entry - Interactive views (RasMol/VRML) b. Details - Name, date and description of macromolecules in PDB entry - Authors, resolution and R-factor c. Links - PDB header information - PDB, NDB, SWISSPROT - PQS (protein quaternary structure), MMDB - CATH, SCOP, FSSP - Structure check reports - PROCHECK, WHATIF - Many others – enzyme, PRINTS etc

PDBsum capabilites, continued d. Each chain - CATH classification - Plot of sequence, secondary structure and domain assignments - PROMOTIF analysis - TOPS topology diagram - SAS – annotated FASTA alignment of related sequences in PDB - PROSITE pattern e. Nucleic acid ligands - Base sequence - NUCPLOT diagram of interactions f. Small molecule ligands - Schematic diagram of ligand - LIGPLOT diagram of interactions

2. SAS (Sequence Annotated by Structure): Annotation of protein sequences by structural information. a.Input for FASTA search of rest of PDB -PDB code -SWISS-PROT code -Paste sequence -Upload own alignment b. Annotation -Residue type -Ligand contacts -Active site residues -CATH domains -Residue similarity c. Options -Select inclusion in alignment -Colour/b&w, secondary structure d.View 3D structural superposition -coloured by SAS annotation

3. CATH: Hierarchical domain classification of protein structures in the PDB. Four basic levels: a. Class (automated): secondary structure composition and packing within structure -mainly- , mainly- , mixed , low secondary structure b.Architecture (manual): overall shape of the domain structure as determined by the orientations of the secondary structures. Connectivity is ignored - e.g. barrel, sandwich etc. c.Topology (semi-automated): fold families determined by shape and connectivity of secondary structures -e.g. Mainly-b two-layer sandwich d.Homologous superfamily (semi-automated): domains of common ancestors determined by sequence and structural similarity e.Sequence family (automated): highly similar structures and function as determined by sequence identity

4. Other classification databases a.Enzyme structures database: -PDB enzymes structures classified by E.C. number b.Protein-DNA database: -PDB complex structures classified by binding motif

5. Protein sequence analysis: Protein sequence search using protein fingerprints - group of conserved sequence motifs used to characterize a protein family.

6.Gross-level protein properties Protein-protein interaction server: Protein-DNA interaction server:

7. Atomic-level protein properties a.PROCAT: -Database of 3D enzyme active sites b.Hydrogen bond atlas: -Graphical summary of hydrogen-bonding properties of amino acids c.Atlas of side chain-side chain/side chain-base interactions: -interaction geometries of side chain and side chain-base pairs

8. Publicly available software (protein structure/interaction) a.HBPLUS - calculation of interactions in PDB structures b.LIGPLOT - schematic diagrams of protein-ligand interactions c.NUCPLOT - schematic diagrams of protein-DNA interactions d.PROMOTIF - analyze protein secondary structural motifs e.NACCESS - calculate atomic accessibilities of protein surfaces f.SURFNET - visualization of molecular surfaces, cavities etc g.PROCHECK - check stereochemical quality of protein structures h.THREADER - prediction of protein tertiary structure i.MEMSAT - prediction of transmembrane protein structure j-z BROWSE THE WEB AT YOUR SPARE TIME AND BOOKMARK ‘EM!

‘Domestic’ resources: