For Prediction of microRNA Genes Vertebrate MicroRNA Genes Lee P. Lim, et. al. SCIENCE 2003 The microRNAs of Caenorhabditis elegans Lee P. Lim, et al GENES.

Slides:



Advertisements
Similar presentations
Improving miRNA Target Genes Prediction Rikky Wenang Purbojati.
Advertisements

MiRNA in computational biology 1 The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig C. Mello for their discovery of "RNA interference.
RNA Structure Prediction
A turbo intro to (the bioinformatics of) microRNAs 11/ Peter Hagedorn.
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
The Molecular Genetics of Gene Expression
Computational biology seminar
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
Noble Prize.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
MicroRNA genes Ka-Lok Ng Department of Bioinformatics Asia University.
Eukaryotic Gene Finding
Small RNA World microRNAs(miRNAs) ECES458 FALL 2001 Student: Kai He Instructor: Dr. Sahinalp.
Predicting RNA Structure and Function
UTR motifs and microRNA analysis 曾 大 千 助 理 教 授 10/28/2008.
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
microRNA computational prediction and analysis
Gene expression.
 ribose  Adenine  Uracil  Adenine  Single.
GENETICS ESSENTIALS Concepts and Connections SECOND EDITION GENETICS ESSENTIALS Concepts and Connections SECOND EDITION Benjamin A. Pierce © 2013 W. H.
NcRNAs What Genomes are Telling Us ncrna.ppt. ncRNA genes are difficult to discover! small an annotational and statistical concern no ORFs and no polyadenylation.
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
Identifying and classifying functional small RNAs from pine Ryan Morin BC Genome Sciences Centre (presenting research conducted in the lab of Dr. Peter.
13.1 RNA.
Transfer RNA SS, folded upon themselves into DS section with cloverleaf structure 3’ end of tRNA has CCA terminus added after transcription, for AA binding.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
MicroRNA identification based on sequence and structure alignment Presented by - Neeta Jain Xiaowo Wang†, Jing Zhang†, Fei Li, Jin Gu, Tao He, Xuegong.
RNA Folding. RNA Folding Algorithms Intuitively: given a sequence, find the structure with the maximal number of base pairs For nested structures, four.
Computational Identification of Drosophila microRNA Genes Journal Club 09/05/03 Jared Bischof.
Stefan Aigner Christian Carson Rusty Gage Gene Yeo Crick-Jacobs Center Salk Institute Analysis of Small RNAs in Stem Cell Differentiation.
Changes to Syllabus: Quizzes put back: Change Oct. 3 to Oct. 17
1 RNA Bioinformatics Genes and Secondary Structure Anne Haake Rhys Price Jones & Tex Thompson.
What is RNA interference?
Welcome Everyone. Self introduction Sun, Luguo ( 孙陆果) Contact me by Professor in School of Life Sciences & National Engineering.
Drosha. 121th Lab meeting 석사 2 년 박 은 실.
UNIT 3 Transcriptionand Protein Synthesis. Objectives Discuss the flow of information from DNA to RNA to Proteins Discuss the flow of information from.
SmallRNAs Small molecules, big functions. Brief history The first described microRNA, lin-4 was cloned and characterised as a translational repressor.
© Wiley Publishing All Rights Reserved. RNA Analysis.
Fig.1.8 DNA STRUCTURE 5’ 3’ Antiparallel DNA strands Hydrogen bonds between bases DOUBLE HELIX 5’ 3’
Youngae Lee Identification of microRNA function by target prediction and expression profiling.
RNA Structure Prediction
MicroRNAs and Other Tiny Endogenous RNAs in C. elegans Annie Chiang JClub Ambros et al. Curr Biol 13:
Gene Regulation and Expression. Learning Objectives  Describe gene regulation in prokaryotes.  Explain how most eukaryotic genes are regulated.  Relate.
Nature, 2008, Doi: /nature07103 Semrah Kati
GENE REGULATION RESULTS IN DIFFERENTIAL GENE EXPRESSION, LEADING TO CELL SPECIALIZATION Eukaryotic DNA.
RNA and Gene Expression BIO 224 Intro to Molecular and Cell Biology.
Motif Search and RNA Structure Prediction Lesson 9.
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine 朱林娇 14S
MicroRNA Prediction with SCFG and MFE Structure Annotation Tim Shaw, Ying Zheng, and Bram Sebastian.
IB Saccharomyces cerevisiae - Jan Major model system for molecular genetics. For example, one can clone the gene encoding a protein if you.
Abstract Premise Figure 1: Flowchart pri-miRNAs were collected from miRBase 10.0 pri-miRNAs were compared to hsa and ptr genomes using BlastN and potential.
RNA Structure Prediction
CAMPBELL BIOLOGY IN FOCUS © 2014 Pearson Education, Inc. Urry Cain Wasserman Minorsky Jackson Reece Lecture Presentations by Kathleen Fitzpatrick and Nicole.
Transcription and RNA processing Fall, Transcription Outline Notes RNA Polymerase Structures Subunits Template versus coding strands Polymerase.
Mestrado Integrado em Medicina Biologia Celular e Molecular II
Genetic Algorithm(GA)
Building Excellence in Genomics and Computational Bioscience miRNA Workshop: miRNA biogenesis & discovery Simon Moxon
Gene Expression - Transcription
Gene Regulation and Expression
Nobel price 2006 Andrew Z. Fire Craig C. Mello
miRNA genomic organization, biogenesis and function
RNA Secondary Structure Prediction
Non-coding RNA April 11, 2018.
Non-coding RNA October 25, 2017.
MicroRNAs: regulators of gene expression and cell differentiation
mRNA Degradation and Translation Control
Noncoding RNA roles in Gene Expression
Volume 116, Issue 2, Pages (January 2004)
Prediction of Plant MicroRNA Targets
Presentation transcript:

For Prediction of microRNA Genes Vertebrate MicroRNA Genes Lee P. Lim, et. al. SCIENCE 2003 The microRNAs of Caenorhabditis elegans Lee P. Lim, et al GENES & DEVELOPMENT 2003 presented by Nam jin-wu

Direction of research Learning of RNA Structural Grammar using Genetic Programming KISS 2003 (only structure) Learning of Structure & Words CBGI 2003 (Deadline ) Prediction of miRNA site in Precursor sequence Wha-Jin Lee Confimation of hypothetic miRNA with Experiments Dr Kim Prediction of promoter region of miRNA Prediction of miRNA target

Why miRNA? rRNA miRNA RNAs have active roles in gene expression !!! – “ miRNA ” RNAs have passive roles in gene expression ??? arrest

miRNAs generated by biogenesis of precursor Polycistronic structure in miRNA pre precursor miRNA precursor ~70nt precursor which has a hairpin fold

miRNAs generated by biogenesis of precursor

miRNA? ~20nt miRNA molecule from Dicer cutting the stem loop

How did they discover miRNA? P : predicted with BLAST C : Cloned N : Northern blot confirmed

All Small RNAs are negative regulator? Positive regulation by small RNAs.

Vertebrate MicroRNA Genes Lee P. Lim, et. al. SCIENCE 2003

Problems of miRScan For predicting miRNAs, must need a other species ’ miRNA sequences having high homology but if it is not … noble miRNAs having new feature are difficult to be detected need common structural learning …

Distant sequence and structure similarity >gb|AE002602|AE trna11-AlaAGC ( ) Ala (AGC) 73 bp Sc: GGGGATGTAGCTCAGATGGTAGAGCGCTCGCTTAGCATGTGAGAGGTACGGGGATCGATG CCCCGCATCTCCA >gb|AE002708|AE trna16-AlaAGC ( ) Ala (AGC) 73 bp Sc: GGGGATGTAGCTCAGATGGTAGAGCGCTCGCTTAGCATGTGAGAGGTACGGGGATCGATA CCCCGCATCTCCA >gb|AE002575|AE trna6-AlaCGC ( ) Ala (CGC) 72 bp Sc: GGGGACGTAGCTCAGTGGTAGAGCGCTCGCTTCGCATGTGAGAAGTCCCGGGTTCAAACC CCGGCGTCTCCA >gb|AE002787|AE trna12-AlaTGC ( ) Ala (TGC) 72 bp Sc: GGGGATGTAGCTCAGTGGTAGAGCGCTCGCTTTGCATGTGAGAGGCCCCGGGTTCGATCC CCGGCATCTCCA >gb|AE002708|AE trna66-ArgACG ( ) Arg (ACG) 73 bp Sc: GGTCCTGTGGCGCAATGGATAACGCGTCTGACTACGGATCAGAAGATTCCAGGTTCGACT CCTGGCAGGATCG >gb|AE002769|AE trna10-ArgACG ( ) Arg (ACG) 73 bp Sc: GGTCCTGTGGCGCAATGGATAACGCGTCTGACTACGGATCAGAAGATTCCAGGTTCGACT CCTGGCAGGATCG >gb|AE002602|AE trna11-AlaAGC ( ) Ala (AGC) 73 bp Sc: 59. GGGGATGTAGCTCAGATGGTAGAGCGCTCGCTTAGC ATGTGAGAGGTACGGGGATCGATGCCCCGCATC >gb|AE002708|AE trna16-AlaAGC ( ) Ala (AGC) 73 bp Sc: GGGGATGTAGCTCAGATGGTAGAGCGCTCGCTTAGC ATGTGAGAGGTACGGGGATCGATACCCCGCATC f1f2f1f2 f1 f2 root h5 (minlen=2, maxlen=3) h5 (len=5, mispair=1) ss (len=2) h3 ss (len=4) h5 (minlen=3, maxlen=5) ss (len=3) h3 ss Discovering using PSSM Discovering with Grammar

Distant sequence and structure similarity Small RNA precursors have low similarity of sequence But, high similarity of structure Structural Learning is important

Schematic Overview First Screened Data (no false negative) Learning of RNA Structural Grammar using Genetic Programming Prediction of microRNA genes Learning both sequence and structure with a GA or others Learning of RNA Structural Grammar using Genetic Programming Learning both sequence and structure with a GA or others

Learning of RNA Structural Grammar using Genetic Programming Genetic programming Positive Data Set (tRNA || miRNA..) Negative Data Set (no tRNA || no miRNA) RNAmotif Grammar Parser (Counter of Hit) Evaluation Step Data set

Learning of RNA Structural Grammer using Genetic Programming descr H5 (minlen=8, maxlen=16, mispair=1) ss ( len=7 ) H3

Structural conversion of Functional Tree FunGrammarVariable f1h5 (f1 or f2) h3minlen/maxlen, len, mispair f2ssminlen/maxlen, len rootdescr h5 (minlen=2, maxlen=3) h5 (len=5, mispair=1) ss (len=2) h3 ss (len=4) h5 (minlen=3, maxlen=5) ss (len=3) h3 ss f1f2f1f2 f1 f2 root

RNA Structural Learning Algorithms using GP 1.Create Initial Population of Tree 2.Converse Tree to Grammar 3.Evaluation with RNAmotif 4. Repeat next steps until all new individuals created 4.1 Add Best fitness tree to new population 4.2 Select top 50% 3.3 Variation of selected trees 4.4 Add variation tree to new population 5. If it arrive end condition, stop it, but exchange old population to new population, repeat 2~5step 3.1 Local Search from X generation 6. Local Search with 7mer words C -> C++

Fitness Fitness = spC*Specificity + stC*Sensitivity + Complexity (1) spC + stC = 1 (spC=0.9, stC = 0.1) (2) Specificity = TP/(TP+FP) (3) Sensitivity = TP/(TP+FN) (4) Complexity = 1/(NS+PS)^2 X (iComp/ibestComp) (5) iComp = TreeDepthX10 + NodeNum (6) Fitness = (NH+1)/(PH+1)*(NS/PS) - Complexity

Result Average Fitness(Red) Best Fitness(Blue) descr h5(len=1 ) h5(minlen=4, maxlen=17) ss(minlen=4, maxlen=21 ) h3 ss(minlen=4, maxlen=24 ) h3 h5( mispair=5) h5(minlen=4, maxlen=21) ss( len=7) h3 ss( len=13) h3 Best Fitness Grammar

Additional Study Local search with words - almost done Local search of top 5 tree of each generation Make 7tuple words with training sequence Multiplication of best 100 trees on all generation to trees including word Learning both sequence and structure with GA – have ideahave idea Application of miRNA prediction Bioinformatics application notes miRNA site prediction in precursor sequence Sequence specific Dicer site Structure specific Dicer site miRNA promoter prediction – controversialcontroversial Promoter prediction of Polycistronic gene miRNA target prediction 3 ’ UTR of mRNA have low similarity of miRNA

Learning both sequence and structure with a GA

miRNAs may locate in Intron miRNA may locate in intron previously identified miRNAs found within annotated introns (Lau et al. 2001) 10 of 12 known Celegans miRNAs predicted to be in introns are in the same orientation as the predicted mRNAs, some of these miRNAs are not transcribed from their own promoters but instead derive from the excised pre-mRNA introns (Lee P. Lim, et al 2003 Genes & Development)