HOMER – a one stop shop for ChIP-Seq analysis

Slides:



Advertisements
Similar presentations
In Silico Primer Design and Simulation for Targeted High Throughput Sequencing I519 – FALL 2010 Adam Thomas, Kanishka Jain, Tulip Nandu.
Advertisements

Visualising and Exploring BS-Seq Data
The Maize Inflorescence Project Website Tutorial Nov 7, 2014.
Asking translational research questions using ontology enrichment analysis Nigam Shah
ChIP-seq analysis Ecole de bioinformatique AVIESAN – Roscoff, Jan 2013.
Advanced ChIP-seq Identification of consensus binding sites for the LEAFY transcription factor Explain that you can use your own data Explain that data.
Finding approximate palindromes in genomic sequences.
High Throughput Sequencing
Bioinformatics Analysis Team McGill University and Genome Quebec Innovation Center
Supplementary Material Supplementary Tables Supplementary Table 1. Sequencing statistics for ChIP-seq samples. Supplementary Table 2. Pearson correlation.
Before we start: Align sequence reads to the reference genome
NGS Analysis Using Galaxy
Whole Exome Sequencing for Variant Discovery and Prioritisation
Li and Dewey BMC Bioinformatics 2011, 12:323
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
Advanced ChIPseq Identification of consensus binding sites for the LEAFY transcription factor.
RNAseq analyses -- methods
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
NGS data analysis CCM Seminar series Michael Liang:
Next Generation DNA Sequencing
RNA-seq workshop ALIGNMENT
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
EDACC Primary Analysis Pipelines Cristian Coarfa Bioinformatics Research Laboratory Molecular and Human Genetics.
Chip – Seq Peak Calling in Galaxy Lisa Stubbs Chip-Seq Peak Calling in Galaxy | Lisa Stubbs | PowerPoint by Casey Hanson.
EDACC Quality Characterization for Various Epigenetic Assays
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
A B IL-4(+) IL-4(-) IL-4(+) IL-4(-) ChIP-Seq (STAT6) Ramos IL-4 (+) P-value Ramos IL-4 (-) P-value BEAS2B IL-4 (+) P-value BEASB IL-4 (-) P-value fold.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Analyzing digital gene expression data in Galaxy Supervisors: Peter-Bram A.C. ’t Hoen Kostas Karasavvas Students: Ilya Kurochkin Ivan Rusinov.
Investigate Variation of Chromatin Interactions in Human Tissues Hiren Karathia, PhD., Sridhar Hannenhalli, PhD., Michelle Girvan, PhD.
 CHANGE!! MGL Users Group meetings will now be on the 1 st Monday of each month 3:00-4:00 Room Note the change of time and room.
Overview of ENCODE Elements
No reference available
First of all: “Darnit Jim, I’m a doctor not a bioinformatician!”
Accessing and visualizing genomics data
Objectives Genome-wide investigation – to estimate alternate Poly-Adenylation (APA) usage on 3’UTR – to identify polymorphism of Downstream Sequence Elements.
Introduction of the ChIP-seq pipeline Shigeki Nakagome November 16 th, 2015 Di Rienzo lab meeting.
The Genome Genome Browser Training Materials developed by: Warren C. Lathe, Ph.D. and Mary Mangan, Ph.D. Part 2.
CyVerse Workshop Transcriptome Assembly. Overview of work RNA-Seq without a reference genome Generate Sequence QC and Processing Transcriptome Assembly.
Chip – Seq Peak Calling in Galaxy Lisa Stubbs Lisa Stubbs | Chip-Seq Peak Calling in Galaxy1.
User-friendly Galaxy interface and analysis workflows for deep sequencing data Oskari Timonen and Petri Pölönen.
TRACKSTER &CIRCSTER DEMO Slides: /g/funcgen/trainings/visualization/Demos/Trackster+Circster.ppt Galaxy: Galaxy Dev:
Using Galaxy to build and run data processing pipelines Jelle Scholtalbers / Charles Girardot GBCS Genome Biology Computational Support.
Canadian Bioinformatics Workshops
Visualizing data from Galaxy
Practice:submit the ChIP_Streamline.pbs 1.Replace with your 2.Make sure the.fastq files are in your GMS6014 directory.
Using command line tools to process sequencing data
Day 5 Mapping and Visualization
Simon v1.0 Motif Searching Simon v1.0.
Regulatory Genomics Lab
RNA-Seq analysis in R (Bioconductor)
S1 Supporting information Bioinformatic workflow and quality of the metrics Number of slides: 10.
GE3M25: Data Analysis, Class 4
TSS Annotation Workflow
Analysing ChIP-Seq Data
Dynamic epigenetic enhancer signatures reveal key transcription factors associated with monocytic differentiation states by Thu-Hang Pham, Christopher.
ChIP-Seq Data Processing and QC
Exploring and Understanding ChIP-Seq data
Epigenetics System Biology Workshop: Introduction
Maximize read usage through mapping strategies
Simon V Motif Searching Simon V
Volume 19, Issue 4, Pages (April 2014)
Regulatory Genomics Lab
Introduction to RNA-Seq & Transcriptome Analysis
Regulatory Genomics Lab
Chromatin basics & ChIP-seq analysis
Quality Control & Nascent Sequencing
Presentation transcript:

HOMER – a one stop shop for ChIP-Seq analysis Graham Thomas 25th June 2015

Why do people normally use HOMER? HOMER (Hypergeometric Optimization of Motif EnRichment) is a set of tools for DNA motif discovery and over-representation analysis. Works very well to identify significantly enriched 8-20bp motifs in promoters (provide gene list) or a set of sequences (BED or Pos file) relative to genomic DNA background control. HOMER controls for CpG content in sequences to remove bias and also performs some low complexity filtering to remove nonsense results. De novo motifs are then reduced to non-redundant set, p-values for enrichment calculated, and compared against known motif library to help identify.

Other features of HOMER Peak calling DE analysis (Output file formats) UCSC/WashU visualization Basic arithmetic Peak position histogram RNA-Seq, HiC type anlaysis. One of my favorite things about HOMER is that it is very well documented

HOMER workflow HOMER FASTQ files BAM files Tag directory Visualization tracks (bedGraph / bigWig) Peak calling & merging Peak centering/extending HOMER Motif analysis Quantitation / Annotation Matrix and table generation for histograms/scatterplots GO analysis DE Analysis

findPeaks.pl 7 basic operation modes (parameters can be altered) Factor, histone, groseq, tss (5’GRO-Seq data), dnase, super, mC, Hi-C type data Calls peaks from single/multiple tag directories and generates .pos file of peaks/regions along with descriptive statistics to indicate if experiment successful. Peaks called in similar manner to MACS – using enrichment relative to 1) sequenced input control and 2) local background within sample 3) clonal amplification (PCR bias).

mergePeaks.pl annotatePeaks.pl Merges and/or compares peak position files. Will report overlapping, unique or co-bound peaks Statistics regarding significance of overlaps (based on genome size) annotatePeaks.pl Workhorse program for HOMER. Takes .pos file or BED file, alternatively option ‘TSS’ for promoter-centric analysis. Annotates features with nearby gene info -d - TagDirectory – counts reads in features -m - motif – counts motifs/ locates motif positions in features -p - peak/BED file – another way to identify overlapping features Generate histograms of read density over regions – !

makeUCSCfile.pl, makeBigWig.pl & makeMultiWigHub.pl Scripts for making files to view in genome browser included. makeMultiBigWigHub.pl creates overlay tracks getDiffExpression.pl Wrapper script for edgeR. findMotifsGenome.pl Finds de novo motifs from input peak file relative to genome/background position file.

Pipeline tools to help automation batchParallel.pl batchMakeTagDirectory.pl batchFindMotifsGenome.pl analyzeChIP-Seq.pl Wrapper for the following:- makeTagDirectory.pl makeUCSCfile.pl findPeaks.pl findMotifsGenome.pl annotatePeaks.pl

Thanks!