Improving Intergenic miRNA Target Genes Prediction Rikky Wenang Purbojati.

Slides:



Advertisements
Similar presentations
Gene Regulation in Eukaryotic Cells. Gene regulation is complex Regulation, and therefore, expression of a gene is complex. Regulation of these genes.
Advertisements

Methods for identifying microRNA binding motifs
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
Prediction of Therapeutic microRNA based on the Human Metabolic Network Ming Wu, Christina Chan Bioinformatics Advance Access Published January 7, 2014.
Two short pieces MicroRNA Alternative splicing.
RNA Interference (RNAi) Pathway
Improving miRNA Target Genes Prediction Rikky Wenang Purbojati.
Naveen K. Bansal and Prachi Pradeep Dept. of Math., Stat., and Comp. Sci. Marquette University Milwaukee, WI (USA)
04/02/2006RECOMB 2006 Detecting MicroRNA Targets by Linking Sequence, MicroRNA and Gene Expression Data Joint work with Quaid Morris (2) and Brendan Frey.
Author: Jim C. Huang etc. Lecturer: Dong Yue Director: Dr. Yufei Huang.
MiRNA in computational biology 1 The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig C. Mello for their discovery of "RNA interference.
RNA Structure Prediction
Predicting RNA Structure and Function. Non coding DNA (98.5% human genome) Intergenic Repetitive elements Promoters Introns mRNA untranslated region (UTR)
Predicting RNA Structure and Function
Computational biology seminar
Predicting RNA Structure and Function. Nobel prize 1989Nobel prize 2009 Ribozyme Ribosome RNA has many biological functions The function of the RNA molecule.
Presenting: Asher Malka Supervisor: Prof. Hermona Soreq.
MicroRNA Target Prediction Using Muscle Atrophy Genes As Models Caltech Wold Lab Mentors: Dr. Barbara Wold Diane Trout Brandon King Gilberto Hernandez,
Accurate Method for Fast Design of Diagnostic Oligonucleotide Probe Sets for DNA Microarrays Nazif Cihan Tas CMSC 838 Presentation.
Predicting RNA Structure and Function
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
Introduction Hepatitis C Virus
Regulating Gene Expression from RNA to Protein. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
MicroRNA implication in urinary bladder cancer A. Zaravinos 1, J. Radojicic 3, G.I. Lambrou 2, D. Volanis 1,4, D. Delakas 4, E.N. Stathopoulos 3, D.A.
MicroRNA identification based on sequence and structure alignment Presented by - Neeta Jain Xiaowo Wang†, Jing Zhang†, Fei Li, Jin Gu, Tao He, Xuegong.
RNA Folding. RNA Folding Algorithms Intuitively: given a sequence, find the structure with the maximal number of base pairs For nested structures, four.
Computational Identification of Drosophila microRNA Genes Journal Club 09/05/03 Jared Bischof.
More regulating gene expression. Combinations of 3 nucleotides code for each 1 amino acid in a protein. We looked at the mechanisms of gene expression,
1 Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine Chenghai Xue, Fei Li, Tao He,
MBP1007/ Nucleic Acids A functional mRNA: The cytoplasmic story Objectives (1) To discuss the iNUTS and iBOLTS of how mRNAs function in the cytoplasm.
What is RNA interference?
Welcome Everyone. Self introduction Sun, Luguo ( 孙陆果) Contact me by Professor in School of Life Sciences & National Engineering.
Marco Magistri , Journal Club. A non-coding RNA (ncRNA) is any RNA molecule that is not translated into a protein “Structural genes encode proteins.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
RNA Structure Prediction
Human microRNAs co-silence in well-separated groups and have different essentialities Gábor Boross, Katalin Orosz, Illés J. Farkas Eotvos University and.
MicroRNAs and Other Tiny Endogenous RNAs in C. elegans Annie Chiang JClub Ambros et al. Curr Biol 13:
Nature, 2008, Doi: /nature07103 Semrah Kati
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
AP Biology Control of Prokaryotic (Bacterial) Genes.
Pre-mRNA secondary structures influence exon recognition Michael Hiller Bioinformatics Group University of Freiburg, Germany.
A Software Tool for Generating Non-Crosshybridizing libraries of DNA Oligonucleotides Russell Deaton, junghuei Chen, hong Bi, and John A. Rose Summerized.
Motif Search and RNA Structure Prediction Lesson 9.
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine 朱林娇 14S
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
杜嘉晨 PlantMiRNAPred: efficient classification of real and pseudo plant pre-miRNAs.
Abstract Premise Figure 1: Flowchart pri-miRNAs were collected from miRBase 10.0 pri-miRNAs were compared to hsa and ptr genomes using BlastN and potential.
RNA Structure Prediction
NCode TM miRNA Analysis Platform Identifies Differentially Expressed Novel miRNAs in Adenocarcinoma Using Clinical Human Samples Provided By BioServe.
Expression profile of oncomiRs and tumor-suppressor miRs in urothelial carcinoma of the bladder Apostolos Zaravinos 1, George I. Lambrou 2, Jelena Radojicic.
Homework #2 is due 10/17 Bonus #1 is due 10/24 Office hours: M 10/ :30am 2-5pm in Bio 6.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Date of download: 7/10/2016 Copyright © 2016 McGraw-Hill Education. All rights reserved. Basic steps of gene expression—transcription factors regulate.
Building Excellence in Genomics and Computational Bioscience miRNA Workshop: miRNA biogenesis & discovery Simon Moxon
The Transcriptional Landscape of the Mammalian Genome
A validation of a two-step control model
From: Emerging Roles for MicroRNAs in Perioperative Medicine
ANIMAL TARGET PREDICTION - TIPS
miRNA genomic organization, biogenesis and function
Predicting RNA Structure and Function
Control of Gene Expression
Large Scale Data Integration
Ab initio gene prediction
Identification and Characterization of pre-miRNA Candidates in the C
A PASport to Cellular Proliferation
Computational Discovery of miR-TF Regulatory Modules in Human Genome
Presentation transcript:

Improving Intergenic miRNA Target Genes Prediction Rikky Wenang Purbojati

miRNA  MicroRNA (miRNA) is a class of RNA which is believed to play important roles in gene regulation.  It’s a short (21- to 23-nt) RNAs that bind to the 3 ′ untranslated regions (3 ′ UTRs) of target genes.

miRNA Characteristics  Short (22-25nts)  miRNA plays a major role in RNA Induced Silencing Complex (RISC).  miRNAs control the expression of large numbers of genes by:  mRNA degradation  Translational repression  Expression of miRNA will reduce the expression of its target genes  Intergenic miRNA gene is located outside gene bodies

Basic miRNA problem  Finding miRNA true target genes is not a trivial task  One approach is to make a computational prediction before validating it in wet-lab experiments  one basic challenge of miRNA: Given a miRNA sequence, what is its target genes?

miRNA sequence target prediction  Several requirements for matching:  Strong Watson-Crick base pairing of the 5’ seed (2-8 nts)  Conservation of the miRNA binding site across species  Local miRNA-mRNA interaction with positive balance of minimum free energy  Available tools for target genes prediction: PicTar, TargetScan, miRanda,microT, etc.  Most tool’s prediction does not complement each other, because they use different criteria

Problem and Opportunity  Problem: Pure computational target genes prediction produces a lot of candidates  Most of them are not validated  Common assumption is that most of them are false positives  Can we shorten the list to include only the strong candidates ?  Opportunity: Lots of publicly available experimental dataset i.e. cDNA microarray, miRNA microarray, etc.  Use the dataset to computationally invalidate some of the target genes

Assumptions  miRNA works by silencing target genes, thus miRNA gene and target genes should be anti-correlated  Intragenic miRNA are expressed along with the host gene.  a host gene should be anti-correlated with a target gene  Intergenic miRNA does not have a host gene, but its real target genes should be correlated together  The real target genes should be down-expressed whenever the intergenic miRNA is expressed.

How to invalidate a target gene prediction  A target gene prediction can be invalidated by using a set of microarray datasets  For Intragenic miRNA target gene:  If a target gene’s expressions has no correlation with the host gene’s expression, we assume that the target gene does not influenced by the host gene  For Intergenic miRNA target gene:  If a target gene behaves inconsistently compared to other target genes, we assume that it might not be affected by the miRNA gene

Filtering Intergenic miRNA Target Gene Prediction  Use a combination of 8 prediction tools to produce the initial predictions (union & intersection)  Use a collection of 190 microarray datasets to invalidate some of the predictions  Use a greedy method to approximate the final subset of high-confidence target genes

Consistent Target Genes  We need to establish the meaning of consistent target genes  In this context, target gene A and target gene B is consistent if:  For all microarray datasets in which gene A is down-regulated, then gene B is also down-regulated M1M2M3M4…Mk DHX9 ↑↓↑↓↑ ASTE1 ↓↓↓↓↑ C20ORF133 ↓↓↑↓↓ PARP11 ↑↓↓↓↓ SLC32A1 ↓↓↓↓↑ PPAPDC2 ↓↓↑↓↓ SCHIP1 ↑↓↓↓↑ MPST ↑↓↓↓↓

Greedy Method  Given a set of target gene predictions, and a collection of microarray dataset:  We wanted to find:  The longest subset of consistent target genes  The highest number of down-regulated target genes in the subset

Reasoning  Why we wanted to find:  The longest subset of consistent target genes?  Consistent target genes, on large number of microarray dataset with different experiments, might indicate that they are affected by a common factor, which may be microRNA  The longest subset ensures high probability of including the true target genes  The highest number of down-regulated target genes in the subset?  Since miRNA works by down-regulating target genes, it is desirable to find the largest subset of consistently down-regulated target genes

Current Algorithm for i = 0 to K A <- G[i] SigA <- signature(A) Temp_Subset = {SigA} down = countDownExpressedMicroarray(A) for j = 0 to K B <- G[j] SigB <= signature(B) if SigA == SigB Temp_Subset U {SigB} end if end for if (length(Temp_subset) > length(Subset)) && (down > downexpr_cnt) subset = Temp_Subset downexpr_cnt = down end if end for

Algorithm Limitations  The algorithm result might be biased based on the first pivot gene expression signature :  Might get stuck on local maxima  Can be solved by prioritizing, sorting of target gene down- expression value, or random selection of pivot gene  The subset is an approximation of high-confidence target genes, but it doesn’t necessarily include all real target genes (because of supporting data limitation)

Benchmarking  Compare the performance with other prediction tools, based on:  Number of correct predictions (based on validated target genes)  Number of predictions  The algorithm will use an initial target predictions with:  2, 3, and 4 prediction tools support

Performance Comparison

Sensitivity-Specificity Comparison

Conclusion  In general, the approximation method shows better sensitivity compared to other prediction tools  Specificity can be improved by including only target gene that is supported by more than 2 prediction tools

Further Work  Adjusting the scoring function to find the optimum balance between the length of the subset and the number of down-regulated target genes  Implementing a threshold on target gene signaturing to further reduce the specificity