Organization of RNA structural motifs: Lessons from SCOR

Slides:



Advertisements
Similar presentations
Satellite Workshop on RNA Ontology RNA 2005 Tenth Annual Meeting of the RNA Society Banff, May 23st and May 24rth.
Advertisements

Towards RNA structure prediction: 3D motif prediction and knowledge-based potential functions Christian Laing Tamar Schlick’s lab Courant Institute of.
Protein Structure C483 Spring 2013.
Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
The amino acids in their natural habitat. Topics: Hydrogen bonds Secondary Structure Alpha helix Beta strands & beta sheets Turns Loop Tertiary & Quarternary.
Nucleic Acid Database By Pooja Awatramani. Database Utilities Provides structural references in the form of base pair annotation for DNA, RNA, and some.
Nucleic Acids - Informational Polymers
Bioinformatics Master Course II: DNA/Protein structure-function analysis and prediction Lecture 12: DNA/RNA structure Centre for Integrative Bioinformatics.
Predicting the 3D Structure of RNA motifs Ali Mokdad – UCSF May 28, 2007.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
LECTURE 5: DNA, RNA & PROTEINS
Functional RNA - Introduction Biochemistry 4000 Dr. Ute Kothe.
Introduction to Bioinformatics Lecture 20: Sequencing genomes.
Mesoscopic Modeling of RNA Structure and Dynamics Hin Hark Gan A. Fundamentals of RNA structure 1. Hierarchical folding 2. Folding timescales B. Issues.
RNA structure can be specific, stable and complex. (As a result, RNA mediates specific recognition and catalytic reactions.) Principles/ideas--RNAs contain.
A. niger 2aaa:1-353 Acid  -amylase B. cereus J. Biochem 113: Oligo-1,6 glucosidase B. circulans 1cdg:1-3821cgt:1-382 Cyclodextrin glycosyltransferase.
Deoxyribonucleic Acid (DNA). The double helix Nitrogenous Bases and Pentose Sugars.
Types of Proteins Proteomics - study of large sets of proteins, such as the entire complement of proteins produced by a cell E. coli has about 4000 different.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Nucleic Acids Structures 1-Discovery of DNA structure 2- A, B and Z conformations of dsDNA/dsRNA Not treated: -DNA topology -DNA Sequencing 5- Principles.
RNA Secondary Structure Prediction. 16s rRNA RNA Secondary Structure Hairpin loop Junction (Multiloop)Bulge Single- Stranded Interior Loop Stem Image–
From DNA to Protein. Knowledge of Nucleic Acid Chemistry Is Essential to the Understanding of DNA Structure.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein structure and function Part - I
Chapter 3. Protein structure and function. Proteins are the most versatile macromolecules in living systems. serve crucial functions in essentially all.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
RNA Structure 1.RNA = DNA 2.Elements & Motifs : RNA :: Helices : DNA 3. Sequence Dependence ElementNo MotifYes /
DNA Structure DNA is usually a double-helix and has two strands running in opposite directions. (There are some examples of viral DNA which are single-stranded).
(CHAPTER 9- Brooker Text) Molecular Structure of DNA & RNA Sept 9 & 11, 2008 BIO 184 Dr. Tom Peavy.
Structure of proteins by X-ray crystallography
Chapter 10 Outline 10.1 Genetic Material Possesses Several Key Characteristics, All Genetic Information Is Encoded in the Structure of DNA or.
Protein backbone Biochemical view:
The Discovery of DNA as the genetic material. Frederick Griffith.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Enzymes SADIA SAYED. Enzymes are proteins  All enzymes are proteins  Strings of amino acids folding up into distinct structures  The properties of.
DNA HISTORY, STRUCTURE, & REPLICATION. WHAT IS DNA? Deoxyribose Nucleic Acid Polymer made out of sugars (deoxyribose), phosphates, and nitrogen bases.
Lecture 47: Structure II -- Proteins. Today’s Outline The monomers: amino acids – Side chain characteristics – Acid-base equilibria and pK a Peptide backbone.
1 Advanced Biochemistry 高等生化學 Nucleotides and Nucleic Acids 陳威戎.
Structural organization of proteins
Mir Ishruna Muniyat. Primary structure (Amino acid sequence) ↓ Secondary structure ( α -helix, β -sheet ) ↓ Tertiary structure ( Three-dimensional.
Riboswitch Structures: Purine Ligands Replace Tertiary Contacts
Structure of Nucleic Acids
Biochemical Organization &Functions of DNA
Le repliement de l’ARN I
Protein Structure BL
From Gene to Protein pp Discover Biology: C15 From Gene to Protein pp
PROTEINS Polymers (long chains) of AMINO ACIDS
There are four levels of structure in proteins
A Detailed View of a Ribosomal Active Site
Mechanical Force and Biomolecules Lecture 2: Overview of biomolecular structure.
DNA and Its Role in Heredity
Topic 2 Molecular biology
RNA 2D and 3D Structure Craig L. Zirbel October 7, 2010.
Molecular Basis of Box C/D RNA-Protein Interactions
Protein structure prediction.
Solution structure of the donor site of a trans-splicing RNA
College of Chemistry and Molecular Sciences,
Tertiary and Quaternary Protein Structure
Volume 13, Issue 9, Pages (December 2015)
RNA bulges as architectural and recognition motifs
LECTURE 5: DNA, RNA & PROTEINS
The Dynamic Landscapes of RNA Architecture
Crystal Structure of SRP19 in Complex with the S Domain of SRP RNA and Its Implication for the Assembly of the Signal Recognition Particle  Chris Oubridge,
Volume 84, Issue 6, Pages (June 2003)
Hinge-Like Motions in RNA Kink-Turns: The Role of the Second A-Minor Motif and Nominally Unpaired Bases  Filip Rázga, Jaroslav Koča, Jiří Šponer, Neocles.
Fundamentals of Organic Chemistry
Dr. Israa ayoub alwan Lec – 3-))
Sequence-Specific RNA Binding by a Nova KH Domain
RNA enzymes: Putting together a large ribozyme
Presentation transcript:

Organization of RNA structural motifs: Lessons from SCOR Donna K. Hendrix Department of Plant and Microbial Biology University of California, Berkeley and Physical Biosciences Division Lawrence Berkeley National Laboratory dkhendrix@lbl.gov

Structural classification of RNA http://scor.lbl.gov Search by PDB or NDB id primary sequence key word Directed Acyclic Graph Architecture SCOR 1.2 location, and what it looks like. SCOR lives today.

Classification principles Base pairing Watson Crick non-canonical Base stacking Backbone conformation Sequence Backbone interactions backbone-backbone backbone-base

SCOR 2.0 classification Tertiary interactions Ribose zippers Structural classification Hairpin loops Internal loops Tertiary interactions Ribose zippers Coaxial helices, Tetraloop-receptor, A-minor motif, Kissing hairpin, Pseudoknots Functional classification Molecular function Motif function Structural models Add location http://scor.lbl.gov Functional classification

RNA structural classification Conserved patterns and relationships sequence structure Organize data for non-specialist Classification for RNA model-building, engineering

How to give yourself eye strain SCOR 2.0.3 update: 102 new structures 20 structures removed from SCOR 2.0.2 85 structures previously in SCOR but not functionally annotated Moved server from LBL; cleaned up the code a little bit; upgraded OS/tomcat; Eric added apache services.

What defines an RNA structural motif? Conserved, repeated structural features sequence fold (backbone, stacking) interactions (hydrogen bonds, stacking)

Primary structure Identify by Specify by sequence: conservation of sequence binding or stability Specify by sequence: GUAUGA (Box C of C/D Box snoRNA) CUCAGUACGAGAGGAAC (sarcin-ricin loop) M. Tamura and S.R Holbrook JMB 320:455 (2002)

Secondary structure motifs Specify by Watson Crick base pairing internal loops hairpin loops junction loops some tertiary interactions (pseudoknots) 1euy, Sherlin, et. Al. JMB 299:431 (2000)

Structural, or 3-d motifs Distinguished from secondary structural motifs by three-dimensional features and interactions bases: pairing, stacking, base-backbone backbone: backbone-backbone, torsion angles (including chi), pseudotorsion Described by sequence, secondary structure features as well 1euy, Sherlin, et. Al. JMB 299:431 (2000)

Organization of structural motifs: hierarchical classification from SCOR 1.1 and 1.2 Internal Loops Non-Watson Crick paired stacked duplexes Several looped-out bases One Looped out base Base triple, no dinucleotide platform Unpaired, unstacked looped in bases Trans- glycosidic bond(s) Loops with unpaired stacked bases, no triples or dinucleotide platforms One looped-out base with stacked non-Watson Crick base pairs Loops with Dinucleotide platform

Limitations of the hierarchical classification (SCOR 1.1, 1.2) Internal Loops Non-Watson Crick paired stacked duplexes Several looped-out bases One Looped out base Base triple, no dinucleotide platform Unpaired, unstacked looped in bases Trans- glycosidic bond(s) Loops with unpaired stacked bases, no triples or dinucleotide platforms One looped-out base with stacked non-Watson Crick base pairs Loops with Dinucleotide platform 1i6u: c:10-11, c:28 (A-U)A Tishchenko, et al., JMB 311:311 (2001) 1exy:a:9,20,22 (G,C,A) Jiang, et al. Structure 7:1461 (1999)

Organization of structural motifs: SCOR 2.0 and the DAG classification Use a directed acyclic graph (DAG) to represent the relationships among motifs Increase searching options: by sequence, strand, PDB or NDB identifier, residue number and key words

Limitations of the hierarchical classification(SCOR 1.1, 1.2) Internal Loops Non-Watson Crick paired stacked duplexes Several looped-out bases One Looped out base Base triple, no dinucleotide platform Unpaired, unstacked looped in bases Trans- glycosidic bond(s) Loops with unpaired stacked bases, no triples or dinucleotide platforms One looped-out base with stacked non-Watson Crick base pairs Loops with Dinucleotide platform 1i6u: c:10-11, c:28 (A-U)A Tishchenko, et al., JMB 311:311 (2001) 1exy:a:9,20,22 (G,C,A) Jiang, et al. Structure 7:1461 (1999)

SCOR 2.0 DAG: internal loop base triples Internal Loops Loops with dinucleotide platforms Loops with base triples Loops with base triples, no dinucleotide platform Loops with simple dinucleotide platform Loops with a dinucleotide platform in a triple

Limitations of the DAG No clean way to present orthogonal attributes “hairball” Multiple DAGs Not easily searchable Inherent awkwardness to browsing Inherent akwardness to browsing

Organization of structural motifs: hierarchically organized queryable attributes PDB ID: 1dul Location: chain b, res 146-150; chain b, res 161-165 Sequence 146-UCAGG-150 165-GACGA-161 Base pairings 146-165; U∙G; cis WC-WC 147-164; C∙A; trans WC/Hoogsteen 148-163; A∙C; trans WC/sugar edge 149-162; G∙G; trans bifurcated/Hoogsteen 150-161; G∙A; cis WC-WC Base stacking Adjacent: 145-146, 146-147, 148-149, 149-150… Non-adjacent: 147-162, 148-164 (stack swap) Pseudotorsions Residue η θ χ 146.B 169.3 195.0 203.9 147.B 160.9 144.3 217.6 148.B 110.7 155.2 228.2 RNA “Rotamers” … Identify motifs that consist of these more atomic attributes. Organization of queryable attributes will, itself, be structured 1dul:146-150.b, 161-165.b E. coli SRP/RNA Batey, et al., Science 287:1232 (2000)

Feature-based structural classification *Sequence *Loop length Base pairings Pseudotorsion angles Hydrogen bonds Stacking adjacent and non-adjacent Classification of structural elements by features Feature-based searching and characterization of motifs

RNA Structural Elements Characteristic Element Loop Motifs Tertiary Interaction Motifs Size Small, local May span entire loop Multiple loops, stems involved Sequence Conservation Little or none Often have sequence preferences/isosteric Interaction sites Structural Conservation By definition Often conserved Evolutionarily conserved Features (pairing, stacking, etc.) Usually single feature Multiple features/elements Multiple in each interacting motif Occurrence Found within various motifs Not nested; may occur in tertiary interaction motifs May include multiple elements and motifs

Element Name(s) Description Found In Reference U turn/Uridine turn/Pi turn A sharp bend in the phosphate-sugar backbone between the first and second nucleotides, followed by characteristic stacking of the second and third nucleotides. Original descriptions include a stabilizing hydrogen bond between the first and third residues. Hairpin loops (e.g., GNRA, T--C loop) and internal loops (Holbrook et al., 1978; Kim and Sussman, 1976; Klosterman et al., 2004b; Quigley and Rich, 1976) A-minor interaction The insertion of minor groove edges of an adenine into the minor groove of neighboring helices. Four types have been identified. Ribose zipper, kink-turn (Nissen et al., 2001) S-turn Two consecutive bends in the phosphate-sugar backbone characterized by backbone distortions and inverted sugar puckers, resulting in an "S" shape. Loop E motif Sarcin-ricin loop (Correll et al., 1999; Szewczak et al., 1993; Wimberly et al., 1993) Dinucleotide platform Two adjacent, covalently linked, co-planar residues that form a non-Watson Crick pairing. Internal loops, often involved in a base triple (Klosterman et al., 2004b) Base triples Three hydrogen-bonded, coplanar bases with two of the bases sometimes forming a Watson-Crick pair or dinucleotide platform. Loop E motif, Sarcin-ricin loop Cross-strand stack A base on one strand stacks with a base on the opposing strand, rather than stacking with the adjacent bases on its own strand. Internal loops, e.g., Bacterial Loop E motif (Correll et al., 1997) Non-canonical base pairs Two bases of any type interacting in a generally planar arrangement can form hydrogen bonds in characteristic patterns. Double helices (Leontis and Westhof, 2001) (Nagaswamy et al., 2002) Extruded helical single strand Two or three unpaired bases extruded from the main double helical stack forming an independent stack. Internal and hairpin loops Backbone rotamers Commonly occurring RNA backbone conformations. Double helices, hairpin, internal and junction loops (Duarte et al., 2003; Hershkovitz et al., 2003; Murray et al., 2003; Schneider et al., 2004)

Annotation issues: What is a motif? Recurrent structure Conserved structure Conserved function? I know it when I see it. Definition (glossary)

Annotation issues: Assessment Canonical Variations (-like, pseudo-, reverse-, inverse-) eVal

Annotation issues: Who is it for? Student. Naïve in knowledge of structural motifs, but expert in biology. Expert. Computer-readable, human-interpretable? But what about my favorite structure (sequence, motif)? BLAST?