01 The sRNA Workbench project date 20/07/2012 Matthew. B. Stocks.

Presentation transcript:

01 The sRNA Workbench project. Date: 20/07/2012. Matthew B. Stocks

02 PROJECT AIMS
Based on original algorithms developed for the web-based UEA sRNA Tools (Moxon et al., 2008)
Easy to use: designed for biologists with limited access to bioinformatics support, perhaps on a desktop computer (but extendable to servers or high-performance clusters)
Cross-platform support
Multiple analysis techniques available from within a single program through an MDI (Multiple Document Interface)
Performance improvements: speed and memory
Still accessible through the command line
Code maintainability

03 WHAT WE ALREADY OFFER

04 HELPER TOOLS
Adapter Removal: removes 3' and/or 5' adapters from raw high-throughput sequence data (FASTA or FASTQ) and provides the sRNA length distribution
Filter: filters sRNA sequences from high-throughput data according to user-defined criteria
Sequence Alignment: aligns sRNA sequences to a reference genome or other long read file
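As a rough illustration of what the Adapter Removal and Filter helpers do, here is a minimal Python sketch. It is not the Workbench's own code: the function names, the exact-match trimming rule, the `min_match` seed length, the length bounds and the adapter string are all assumptions made for this example.

```python
from collections import Counter

def trim_3prime_adapter(read, adapter, min_match=8):
    """Cut the read at the first exact match of the adapter's leading bases.

    Hypothetical rule for illustration: returns None when no adapter is found.
    """
    idx = read.find(adapter[:min_match])
    if idx == -1:
        return None
    return read[:idx]

def length_distribution(seqs):
    """Count how many sequences there are of each length."""
    return dict(sorted(Counter(len(s) for s in seqs).items()))

def filter_by_length(seqs, min_len, max_len):
    """Keep sequences whose length falls inside a user-defined range."""
    return [s for s in seqs if min_len <= len(s) <= max_len]

# Illustrative data only: an arbitrary adapter string and made-up reads.
ADAPTER = "TCGTATGCCGTCTTCTGCTTG"
raw_reads = [
    "ACGTACGTACGTACGTACGTA" + ADAPTER,   # 21 nt insert
    "ACGTACGTACGTACGTACGTACGT" + ADAPTER,  # 24 nt insert
    "ACGTACGT" + ADAPTER,                # 8 nt insert, too short to keep
    "ACGTACGTACGTACGTACGTACGTACGT",      # no adapter present
]

trimmed = [t for t in (trim_3prime_adapter(r, ADAPTER) for r in raw_reads)
           if t is not None]
kept = filter_by_length(trimmed, min_len=16, max_len=35)
dist = length_distribution(kept)
```

Real adapter trimmers usually tolerate mismatches and partial adapters at the read end; exact matching is used here only to keep the sketch short.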

05 ANALYSIS TOOLS

06 miRCat
miRCat (micro RNA Categorisation) (Moxon et al., 2008; Stocks et al., 2012)
Identifies novel miRNAs in high-throughput sequence data
Identifies known miRNAs found in miRBase in the result set (Griffiths-Jones, 2010)
View miRNA statistics
One-click viewing of the predicted miRNAs as they appear on the genome
One-click viewing of the miRNA precursor secondary structure plot
Concurrency...

07 miRCat threading
Diagram labels: sRNA, CPU Core, sRNA Database
Thread Pool: FIFO data structure (queue). The first sequence added to the list (first in) is the first item processed (first out).
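The FIFO thread-pool idea on this slide can be sketched in a few lines of Python. This is an assumption-laden illustration, not miRCat's implementation: `process_srna` is a hypothetical stand-in for the real per-sequence analysis, and the worker count is arbitrary.

```python
import queue
import threading

def process_srna(seq):
    """Hypothetical placeholder for the per-sequence analysis."""
    return (seq, len(seq))

def run_pool(seqs, n_workers=4):
    """Process sequences with worker threads that pull from one FIFO queue."""
    tasks = queue.Queue()          # queue.Queue is first in, first out
    for s in seqs:
        tasks.put(s)

    results = []
    lock = threading.Lock()        # protects the shared results list

    def worker():
        while True:
            try:
                seq = tasks.get_nowait()
            except queue.Empty:
                return             # no work left for this thread
            outcome = process_srna(seq)
            with lock:
                results.append(outcome)

    threads = [threading.Thread(target=worker) for _ in range(n_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results

single = run_pool(["AAA", "CCCC", "GG"], n_workers=1)
multi = run_pool(["AAA", "CCCC", "GG"], n_workers=3)
```

With one worker the results come back in insertion order, which makes the first-in, first-out behaviour visible; with several workers the sequences are still taken from the queue in that order, but they may finish in any order.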

08 PERFORMANCE TEST

09 TARGET PREDICTION
Part of a parallel BBSRC tools and resources project
A high-throughput technique known as Parallel Analysis of RNA Ends (PARE) is used to sequence mRNA cleavage products on a large scale
Sequences the 5' end of uncapped mRNA (the degradome), including transcripts targeted by sRNAs and subjected to endonucleolytic cleavage
The sRNA and degradome data can be used to identify interactions between sRNAs and their target mRNAs
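To make the degradome idea concrete, the sketch below tallies where the 5' ends of degradome reads fall on a transcript. Positions with many read starts are candidate cleavage sites. This is only an illustration under simplifying assumptions (exact string matching, a made-up transcript and reads, a hypothetical function name), not the method used by any particular tool.

```python
from collections import Counter

def five_prime_end_counts(transcript, degradome_reads):
    """Count degradome read 5' ends at each 0-based transcript position.

    Assumes exact matches only; every occurrence of a read is counted.
    """
    counts = Counter()
    for read in degradome_reads:
        start = transcript.find(read)
        while start != -1:
            counts[start] += 1
            start = transcript.find(read, start + 1)
    return counts

# Made-up data for illustration.
transcript = "AAACCCGGGTTTACGT"
reads = ["CCGGG", "CCGGG", "TTTAC"]

ends = five_prime_end_counts(transcript, reads)
peak_position, peak_abundance = ends.most_common(1)[0]
```

A position with a high count would then be compared against where an sRNA is predicted to pair with the transcript.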

10 PAREsnip
PAREsnip can be used to search for genome-wide interactions between all sRNAs and transcripts, as well as to predict targets of small groups of miRNAs
PAREsnip outputs all potential sRNA/target duplexes evidenced through the degradome
Features:
Categorisation system (Addo-Quaye et al., 2009)
Data structures based on m-way search trees and multi-threaded optimisation
~90 mins to process 100k sequences of A. thaliana data
At least as sensitive in detecting targets as other tools designed for this type of analysis (Folkes et al., 2012, NAR)

11 PAREsnip
Each coloured rectangle represents an interaction
The taller the rectangle, the higher the abundance
Tool tips provide additional information
T-plots can also be generated
Target information is given on the right
Navigation controls allow the user to click through the plots or output them to PDF for publication

12 OTHER TOOLS
miRProf (micro RNA Profiler)
Detects all known miRNAs found in miRBase within your dataset (Griffiths-Jones, 2010)
Determines normalised expression levels of known miRNAs found in miRBase
Can be used to compare expression levels across multiple samples
TA-SI prediction (Trans Acting Small Interfering RNA prediction)
Predicts phased ta-siRNAs in plant datasets (Chen et al., 2007)
Requires an sRNA database and a genome database
View phased sRNAs as they appear on the genome
SiLoCo...
Multi-sample general loci prediction
Expression profile
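The slide does not say how miRProf normalises expression levels. One common choice, shown here purely as an assumption, is reads per million: each miRNA count is scaled by the sample's total read count so that samples of different sequencing depth can be compared. The function name, miRNA names and numbers below are invented for the example.

```python
def reads_per_million(counts, total_reads):
    """Scale raw counts to reads per million of the sample's total reads."""
    return {name: count * 1_000_000 / total_reads
            for name, count in counts.items()}

# Invented counts for two samples sequenced to different depths.
sample_a = reads_per_million({"miR156": 50, "miR166": 150}, total_reads=1_000_000)
sample_b = reads_per_million({"miR156": 100, "miR166": 150}, total_reads=2_000_000)

# After scaling, miR156 is at the same level in both samples even though
# the raw count in sample B is twice as large.
same_level = sample_a["miR156"] == sample_b["miR156"]
```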

18 ONLINE AVAILABILITY

19 srna-workbench.cmp.uea.ac.uk

20 STATS
Since launch on 27th Jan 2011:
Total visitors: 3,925
Page views: 14,650 (3.73 pages per visit)
Average time on site: 4.27 mins
Total Version 2 downloads: 1,200
Around 30 collated requests for the RSS feed per day

21 STATS
Visits from 57 countries/territories
The UK is the top visitor, then the USA...

22 PUBLICATIONS
Stocks, M. B.; Moxon, S.; Mapleson, D.; Woolfenden, H. C.; Mohorianu, I.; Folkes, L.; Schwach, F.; Dalmay, T. & Moulton, V. (2012) The UEA sRNA Workbench: A Suite of Tools for Analysing and Visualising Next Generation Sequencing microRNA and Small RNA Datasets. Bioinformatics
Folkes, L.; Moxon, S.; Woolfenden, H. C.; Stocks, M. B.; Szittya, G.; Dalmay, T. & Moulton, V. (2012) PAREsnip: a tool for rapid genome-wide discovery of small RNA/target interactions evidenced through degradome sequencing. Nucleic Acids Research

23 ACKNOWLEDGEMENTS
Vincent Moulton, Tamas Dalmay, Simon Moxon, Frank Schwach, Irina Mohorianu, Dan Mapleson, Hugh Woolfenden, Leighton Folkes

24 ANY QUESTIONS?