RiboSearch Ben Daniel Ariel Kirshner Naomi

Slides:



Advertisements
Similar presentations
Prokaryotic Gene Regulation:
Advertisements

Prokaryotic Gene Regulation: Lecture 5. Introduction The two types of transcription regulation control in prokaryotic cells The lac operon an inducible.
Riboswitches Sharon Epstein 30/03/2006 Frontiers in Metabolome sciences Feinberg Graduate School.
Nucleic Acids Nucleic Acid Basics Contain instructions to build proteins 2 types: – DNA – RNA Composed of smaller units called nucleotides – Monomer:
DEOXYRIBONUCLEIC ACID DNA. O.L Lesson Objectives At the end of this lesson you should be able to 1. Outline the simple structure of DNA – 2 strands and.
NUCLEIC ACIDS Journey to the tiny world of DNA. Nucleic Acids  Organic molecules, include C, H, O, N and P elements.  Have various roles in metabolic.
Nucleic Acids. Elements Nucleic Acids Contain C, H, O,N, P.
Section 8.6: Gene Expression and Regulation
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Pattern Discovery in RNA Secondary Structure Using Affix Trees (when computer scientists meet real molecules) Giulio Pavesi& Giancarlo Mauri Dept. of Computer.
A Basic Introduction to SFold Kevin MacDonald December 7, 2004 BI420 Final Presentation.
Molecular Testing and Clinical Diagnosis
Bioinformatics Lecture 2. Bioinformatics: is the computational branch of molecular biology Using the computer software to analyze biological data The.
Tutorial 5 Motif discovery.
Discovery of RNA Structural Elements Using Evolutionary Computation Authors: G. Fogel, V. Porto, D. Weekes, D. Fogel, R. Griffey, J. McNeil, E. Lesnik,
Introduction to Bioinformatics - Tutorial no. 5 MEME – Discovering motifs in sequences MAST – Searching for motifs in databanks TRANSFAC – The Transcription.
. Class 5: RNA Structure Prediction. RNA types u Messenger RNA (mRNA) l Encodes protein sequences u Transfer RNA (tRNA) l Adaptor between mRNA molecules.
Nucleic Acids Nucleic Acid Basics Contain instructions to build proteins 2 types: – DNA – RNA Composed of smaller units called nucleotides – Monomer:
Experiment 4- Gene Expression Study in Arabidopsis Thaliana.
1 Vocabulary Review Nucleic Acids. 2 Enzyme that unwinds & separates the DNA strands Helicase.
Chapter 10 – DNA, RNA, and Protein Synthesis
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
CSE 6406: Bioinformatics Algorithms. Course Outline
DEOXYRIBONUCLEIC ACID DNA. O.L Lesson Objectives At the end of this lesson you should be able to 1. Outline the simple structure of DNA – 2 strands and.
Genome organization. Nucleic acids DNA (deoxyribonucleic acid) and RNA (ribonucleic acid) store and transfer genetic information in living organisms.
The MOLECULAR BASIS OF INHERITANCE
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
End Show Slide 1 of 39 Copyright Pearson Prentice Hall 12-3 RNA and Protein Synthesis RNA and Protein Synthesis.
Molecular Biology 2.6 Structure of DNA and RNA. Nucleic Acids The nucleic acids DNA and RNA are polymers of nucleotides.
Ben-Gurion University Research focus on riboswitches Bioinformatics Research Focus: Searching for Novel Riboswitches in Newly Sequenced Genomes Danny Barash†
KEY CONCEPT DNA structure is the same in all organisms.
© Wiley Publishing All Rights Reserved. RNA Analysis.
Nucleic Acids.
RNA Structure Prediction
DNA Structure The Chemical Composition of DNA DNA is made of 3 different components: a deoxyribose sugar, a phosphate group, and a nitrogenous.
Questions?. Novel ncRNAs are abundant: Ex: miRNAs miRNAs were the second major story in 2001 (after the genome). Subsequently, many other non-coding genes.
Motif discovery and Protein Databases Tutorial 5.
Transcription and mRNA Modification
Characteristic of Life!!
Motif Search and RNA Structure Prediction Lesson 9.
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine 朱林娇 14S
Introduction to Molecular Biology
Gene Expression Role of DNA. Where is DNA? In the chromosomes in the nucleus.
Introduction to Bioinformatics - Tutorial no. 5 MEME – Discovering motifs in sequences MAST – Searching for motifs in databanks TRANSFAC – the Transcription.
DNA Structure. Essential Questions for Today What is DNA? What is a gene? What is the basic structure of DNA? What is the function of DNA?
DNA and Genes. Prokaryotes VS Eukaryotes Prokaryotes: no defined nucleus and a simplified internal structure Eukaryotes: membrane limited nucleus and.
General, Organic, and Biological Chemistry Copyright © 2010 Pearson Education, Inc.1 Chapter 21 Nucleic Acids and Protein Synthesis 21.3DNA Double Helix.
RNA Structure Prediction
Rapid ab initio RNA Folding Including Pseudoknots via Graph Tree Decomposition Jizhen Zhao, Liming Cai Russell Malmberg Computer Science Plant Biology.
Analyzing Promoter Sequences with Multilayer Perceptrons Glenn Walker ECE 539.
DNA The nucleic acids DNA and RNA are polymers of nucleotides Nucleic acids  first discovered in material extracted from the nucleus  2 types.
L. Bahiya Osrah LAB 1 INTRODUCTION TO NUCLEIC ACIDS STRUCTURAL PROPERTIES.
Ch. 11: DNA Replication, Transcription, & Translation Mrs. Geist Biology, Fall Swansboro High School.
bacteria and eukaryotes
Lab 8.3: RNA Secondary Structure
M.B.Ch.B, MSC, DCH (UK), MRCPCH
Intro to DNA DNA = Deoxyribonucleic acid
DNA Structure.
Gene Expression I pp
Bioinformatics Vicki & Joe.
DNA Structure.
a. Distinguish between DNA and RNA
Noncoding RNA roles in Gene Expression
Deoxyribonucleic Acid
Department of Computer Science Ben-Gurion University
Department of Computer Science Ben-Gurion University
Presentation transcript:

RiboSearch Ben Daniel Ariel Kirshner Naomi Instructor : Dr. Danny Barash Adaya Cohen

Introduction Biological Introduction Method Layout “The merge strategy” Results and Conclusions

RNA A single-stranded nucleic acid made up of 4 nucleotides : Purines : adenine (A), guanine (G) Pyramidines: cytosine (C), and uracil (U). WC pairs: A-U G-C

Introduction Biological Old scheme Protein carry out all biological functions RNA : only a stage between DNA to protein with no catalytic function DNA RNA Protein

Biological introduction New scheme Since the discovery of self-splicing RNAs in the early 1980’s, a number of new structural and catalytic RNAs have been discovered. Recent studies focusing on non-coding and small RNAs have led to discovery of RNA molecules that posses essential regulatory functions DNA RNA Protein

RNA Secondary Structure Hairpin Internal loop Bulge loop Junction Stem (double strand) pseudoknot The secondary structure of many RNAs is usually more conserved than their sequence

Riboswitch Aptamer Coding section 3’ 5’ Expression platform 5’ UTR 3’ UTR RNA control elements that regulates gene expression, without the participation of proteins Utilize a unique mechanism where by small molecules bind to aptamer/box region causing a conformational switch Were found initially in 5’ UTR of bacteria with successive discoveries in prokaryotes There are evidence suggesting riboswitches could be found in eukaryotes.

Riboswitch mechanism Guanine bind to aptamer region with cause conformational change in the expression platform, which regulates the guanine metabolism.

G-box Regulates genes related to purine metabolism and transport Binds purines Consists of 2 hairpins and 1 internal junction

RiboSearch Goal Finding G-box in eukaryotic genomes Method Combining existing search methods into one overall package

Search Methods Whiffer – CS department, BGU RNAMotif – Macke et al. , 2001 RNAProfile – Pavesi et al. , 2004 STR2 – CS department, BGU

Whiffer Input Pattern that consists of : Output Sequence information Variable gaps Base pairing brackets representing WC pairs Output Candidates locations that meet constraints imposed by the method <<<< [2] TA [5] GTNTCTAC [3] <<<<< [3] CCNNNAA [3] >>>>> [5] >>>>

Whiffer Method Uses simple matching ,based on the constraints ,as opposed to dynamic programming.

RNAMotif Input Database of nucleotide sequences Description file that consists of: Descriptor section Score section (optional) Output Candidates that meet the conditions of the descriptor and the scoring scheme

RNAMotif Sample descriptor file : descr h5 (minlen=6, maxlen=8) ss (minlen=4, maxlen=6) h3 score { gcnt = 0; glen = 0; for( i = 1; i <= NSE; i++ ){ llen=length( se[i] ); glen=glen+llen; for( j = 1; j <= glen; j++ ){ b = se[i,j,1]; if( b == "g" || b == "c" ) gcnt++; { SCORE = 1.0 * gcnt / glen; if( SCORE < .4 ) REJECT; } ss h5 h3

RNAMotif Method Two-stage algorithm Stage I : Compilation stage Analyzing the specific motif, called a descriptor and converting it into a search tree based on the helical nesting of the motif

RNAMotif Method Two-stage algorithm Stage II : DFS Depth first search of the tree that was created by the compilation stage Each time a complete solution to the descriptor is found, the candidate is passed to an optional score section for scoring and ranking In absence of score section the candidate is accepted

RNAProfile Input Number of distinct hairpins a motif has to contain Set of unaligned RNA sequences expected to share a common motif

RNAProfile Output Regions that are most conserved throughout the sequences, according to sequence of the regions Secondary structure that can be formed according to base-pairing and thermodynamic rules

RNAProfile Method Two phases Phase I : Extracting a set of candidate regions from each input sequence, whose predicted optimal secondary structure contains the number of hairpins given as input Phase II : The regions selected are compared with each other to find the group of most similar ones, formed by a region taken from each sequence

Method Summery Whiffer RNAMotif RNAProfile Combines sequence and structure similarity Very high specifity – potential candidates may be ruled out RNAMotif Similarity based mostly on structural elements, according to the descriptor RNAProfile Similarity based on both sequence and structure Recommended as a post-processing step

Structure (bracket notation) The merge strategy Query: Sequence Structure (bracket notation) Input (((..((((…)))).)) Parsing Whiffer RNAMotif Parsing Candidates

Candidates The location contained within a gene The gene is relevant to the requested function (purine metabolism) Filtering RNAProfile Post processing Final candidates

Biological experiments Final candidates Sequence alignment Biological experiments

Results – prokaryote Bacillus Halodurans Merge RNAMotif Whiffer 7 4 Candidates 2 True positives 3 5 False positives False negatives

Results – eukaryote Arabidopsis Thaliana Merge RNAMotif Run #2 Run #1 Whiffer - 70000 30 Candidates 11 17 Final candidates

Results – eukaryote Arabidopsis Thaliana Most promising candidates Arabidopsis Thaliana

c2__11199940_11199996 queryGBox CGTGGATATGGCACGCAAGTTTCTACCGGGCACCGTAAATGTCCGACTAT 50 c2__11199940_11199996_ --TTCAGGTC-CATCTTTGGCTAGACCGAAGTCAGATAATTTGGCGTTAT 47 * * * ** * * **** * * *** * *** queryGBox G-------- 51 c2__11199940_11199996_ AGTCCTGAA 56

c3_20894864_20894920 c3_sequences GGATGAGGAACCAATTGACCCTGGATTTCAAGATT-TACAAAAGAACGTA 49 queryGBox -------------CGTGGATATGGCACGCAAGTTTCTACCGGGCACCGTA 37 ** *** **** ** *** * **** c3_sequences AGCATCC------- 56 queryGBox AATGTCCGACTATG 51 * ***

RiboSearch - Conclusions Filters false positives Sequences are by far less conserved within eukaryotes than prokaryotes The merge strategy is essential in eukaryotic genomes search

Our thanks Dr. Danny Barash Adaya Cohen