Chip – Seq Peak Calling in Galaxy Lisa Stubbs Lisa Stubbs | Chip-Seq Peak Calling in Galaxy1.

Slides:



Advertisements
Similar presentations
Before we start Login to the laptop: user: crgcomu Password: crgcomu Login to the network: Wifi: carretwifi Password : Login to galaxy (ldap):
Advertisements

IMGS 2012 Bioinformatics Workshop: RNA Seq using Galaxy
Overview Lesson 1. Objectives Step-by-Step: Start Excel 1.Click the Start menu, and then click All Programs. 2.On the list of programs, click Microsoft.
DAP-1520 FAQ’s Wireless AC750 Dual Band Range Extender.
Benchmark and Java Applet Test Scenario Presentation Outline Introduction to Benchmark Testing Procedure to create the test Benchmark Playback Results.
The Maize Inflorescence Project Website Tutorial Nov 7, 2014.
ChIP-seq analysis Ecole de bioinformatique AVIESAN – Roscoff, Jan 2013.
Variant Calling Workshop Chris Fields Variant Calling Workshop v2 | Chris Fields1 Powerpoint by Casey Hanson.
Dreamweaver Basics In this section you will learn how to:
1 of 4 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Working with SharePoint Document Libraries. What are document libraries? Document libraries are collections of files that you can share with team members.
Bacterial Genome Assembly | Victor Jongeneel Radhika S. Khetani
NGS Analysis Using Galaxy
Introduction to RNA-Seq and Transcriptome Analysis
Copyright © Texas Education Agency, All rights reserved. 1 Web Technologies Website Development with Dreamweaver.
Web Technologies Website Development Trade & Industrial Education
Variant Calling Workshop Chris Fields Variant Calling Workshop | Chris Fields | PowerPoint by Casey Hanson.
Washington Campus Compact New Time Log Database Note to users: You should use Internet Explorer to use this database. In other programs (i.e. Firefox)
An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.
Genomics Virtual Lab: analyze your data with a mouse click Igor Makunin School of Agriculture and Food Sciences, UQ, April 8, 2015.
Computer Lab (I) Introduction of galaxy and UCSC genome browser.
Website Development with Dreamweaver
15.1 Fundamentals of HTML.
Introduction to RNA-Seq & Transcriptome Analysis
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
High Throughput Sequence (HTS) data analysis 1.Storage and retrieving of HTS data. 2.Representation of HTS data. 3.Visualization of HTS data. 4.Discovering.
This tutorial teaches Microsoft Word basics. Although knowledge of how to navigate in a Windows environment is helpful, this tutorial was created for.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
NGS data analysis CCM Seminar series Michael Liang:
Next Generation DNA Sequencing
Welcome to DNA Subway Classroom-friendly Bioinformatics.
CMPS 1371 Introduction to Computing for Engineers MatLab.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
Chip – Seq Peak Calling in Galaxy Lisa Stubbs Chip-Seq Peak Calling in Galaxy | Lisa Stubbs | PowerPoint by Casey Hanson.
The Next Generation. Parent Access Grade History and Attendance.
1 PROJECT 5 WEB/HTML COMPUTER PURCHASE FORM Management Information Systems, 9 th edition, By Raymond McLeod, Jr. and George P. Schell © 2004, Prentice.
Sackler Medical School
IPlant Collaborative Discovery Environment RNA-seq Basic Analysis Log in with your iPlant ID; three orange icons.
FrontPage Tutorial Part 2 Creating a Course Web Site.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Trinity College Dublin, The University of Dublin GE3M25: Data Analysis, Class 4 Karsten Hokamp, PhD Genetics TCD, 07/12/2015
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
Using geWorkbench: Working with Sets of Data Fan Lin, Ph. D. Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT.
Trinity College Dublin, The University of Dublin Data download: bioinf.gen.tcd.ie/GE3M25/project Get.fastq.gz file associated with your student ID
Introduction to KE EMu Unit objectives: Introduction to Windows Use the keyboard and mouse Use the desktop Open, move and resize a.
How to download and PRINT your Moving ON Audits Monthly Audit and Indicator reports (click on the ▼button on your computer to continue this presentation)
Introducing Dreamweaver. Dreamweaver The web development application used to create web pages Part of the Adobe creative suite.
This is how you invoke the Microsoft Visual Studio 2010 Software. All Programs >> Microsoft Visual Studio 2010.
RNA-Seq in Galaxy Igor Makunin DI/TRI, March 9, 2015.
How to use the Hand-In Folder. Click on the Folder icon at the bottom of the screen.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Accessing and visualizing genomics data
Welcome to the combined BLAST and Genome Browser Tutorial.
User-friendly Galaxy interface and analysis workflows for deep sequencing data Oskari Timonen and Petri Pölönen.
TRACKSTER &CIRCSTER DEMO Slides: /g/funcgen/trainings/visualization/Demos/Trackster+Circster.ppt Galaxy: Galaxy Dev:
Using Galaxy to build and run data processing pipelines Jelle Scholtalbers / Charles Girardot GBCS Genome Biology Computational Support.
IGV Demo Slides:/g/funcgen/trainings/visualization/Demos/IGV_demo.ppt Galaxy Dev: 0.
Using command line tools to process sequencing data
WS9: RNA-Seq Analysis with Galaxy (non-model organism )
NGS Analysis Using Galaxy
Regulatory Genomics Lab
Chip – Seq Peak Calling in Galaxy
Yonglan Zheng Galaxy Hands-on Demo Step-by-step Yonglan Zheng
Cheat Sheet CSCI 100 JW Ryder
Cheat Sheet CSCI 100 JW Ryder
ChIP-seq Robert J. Trumbly
Regulatory Genomics Lab
Introduction to RNA-Seq & Transcriptome Analysis
Regulatory Genomics Lab
Chip – Seq Peak Calling in Galaxy
Presentation transcript:

Chip – Seq Peak Calling in Galaxy Lisa Stubbs Lisa Stubbs | Chip-Seq Peak Calling in Galaxy1

Introduction This goals of the lab are as follows: 1.Gain experience using Galaxy. 2.Teach how to map next generation (NSG) reads to a reference genome using Bowtie. 3.Demonstrate how to call peaks from Chip-Seq data. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy2

Step 1A: Logging into Galaxy Go to: biocluster.igb.illinois.edu Click Enter Click Login Input your login credentials. Click Login. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy3

Step 1B: Galaxy Start Screen The resulting screen should look like the figure below: Lisa Stubbs | Chip-Seq Peak Calling in Galaxy4

Step 2A: Importing the Data In this step, we will import the following data files: Lisa Stubbs | Chip-Seq Peak Calling in Galaxy5 FilenameMeaning G1E_ER4_CTCF_(chr19).fastqsanger A sample ChIP-seq dataset on CTCF in G1E_ER4 cells, reads have been reduced to those mapping to chr19 for demonstration use. G1E_ER4_input_(chr19).fastqsangerControl DNA taken from chr19. G1E_CTCF.fastqsangerCTCF Chip for G1E line. G1E_input.fastqsangerControl for G1E line. Note: G1E cell lines are erythroid, red blood cell, cell lines missing the GATA-1 gene. GATA-1 is crucial for the maturation of erythroid cells. G1E_E4R cell lines conditionally express GATA-1 in the presence of estradiol, enabling erythdoi maturation.

Step 2B: Import Data in Galaxy Go to Click Import History and on the next page click Start Using History. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy6

Step 2C: Import Data into Galaxy Your Galaxy page should look like the following now: Lisa Stubbs | Chip-Seq Peak Calling in Galaxy7 Tools Pane History Pane Main Pane

Read Mapping and Peak Calling In this exercise, we will map ChIP Reads to a reference genome and call peaks among the mapped reads using MACs. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy8

Step 3A: Summary Statistics In this step, we will gather summary statistics of ChIP data for quality control. Click NGS: QC and manipulation from the Tools pane. Then click FASTQ Summary Statistics. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy9

Step 3B: FASTQ Summary Statistics On the next page, make sure 1: G1E_ER4_CTCF_(chr..fastqsanger) is selected. Press Execute. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy10

Step 3C: FASTQ Summary Statistics The summary file will be the 6 th file in the History pane. Click to display the file in the Main pane. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy11 Discussion How long are these reads? What is the median quality at the last position?

Step 4A: Map ChIP-Seq Reads to MM9 Genome Next, we will map the reads in G1E_E4R_CTCF_(chr9).fastqsanger to the mouse genome. Select NGS: Mapping Then select Map with Bowtie for Illumina Lisa Stubbs | Chip-Seq Peak Calling in Galaxy12

Step 4B: Map ChIP-Seq Reads to MM9 Genome Make sure to select mm9 as the reference genome. Make sure 1: G1E_ER4_CTCF_(chr..fastqsanger) is selected. Hit Execute. It will take a few moments to complete. When done, click the view icon. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy13

Step 4C: Map ChIP-Seq Reads to MM9 Genome Your Main pane and History pane should look like the following: Lisa Stubbs | Chip-Seq Peak Calling in Galaxy14

Step 5A: Calling Peaks with MACs With our mapped ChiP-Seq reads, we now want to call peaks. Select NGS: Peak Calling Select MACS Lisa Stubbs | Chip-Seq Peak Calling in Galaxy15

Step 5B: Calling Peaks with MACs Run MACs with the default parameters. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy16

Step 5C: Calling Peaks with MACs When done, MACs will create two files. 9: Macs on data 8 (html report) is an html document with information on the peak calling process. 8: MACs on data 7 (peaks:bed) is a BED file with coordinates and scores of ChiP-Seq peaks in chr19. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy17

Step 5D: Calling Peaks with MACs In the 8: MACs on data 7 (peaks:bed) section in the History pane click main next to display at UCSC browser. The result should look similar to below: Lisa Stubbs | Chip-Seq Peak Calling in Galaxy18 Discussion 1. Look at the BED file. How many peaks were found?

Call Chip-Seq Peaks with a Control Sample We will perform the same procedure we did in the previous exercise. This time though, we will work with a control sample instead of an experimental one. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy19

Step 6A: Map Control ChIP-Seq Reads to MM9 Genome Let’s map the reads in G1E_E4R_input_(chr19).fastqsanger to the mouse genome. Select NGS: Mapping Then select Map with Bowtie for Illumina Lisa Stubbs | Chip-Seq Peak Calling in Galaxy20

Step 6B: Map Control ChIP-Seq Reads to MM9 Genome Make sure to select mm9 as the reference genome. Make sure 2: G1E_ER4_input_(chr..fastqsanger) is selected. Click Execute. It will take a few moments to complete. When done, click the view icon. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy21

Step 6C: Map Control ChIP-Seq Reads to MM9 Genome Your Main pane and History pane should look like the following: Lisa Stubbs | Chip-Seq Peak Calling in Galaxy22

Step 7A: Calling Peaks with MACs on Control Chip-Seq Reads Like before, we want to call peaks in our mapped Control ChiP-Seq reads. Select NGS: Peak Calling Select MACS Lisa Stubbs | Chip-Seq Peak Calling in Galaxy23

Step 7B: Calling Peaks with MACs on Control Chip-Seq Reads Select Map with Bowtie for Illumina on data 1 (the experimental aligned reads) for the Chip-Seq Tag File. Select Map with Bowtie for Illumina on data 2 (the control aligned reads) for the Chip-Seq Control File. Click Execute Lisa Stubbs | Chip-Seq Peak Calling in Galaxy24

Step 7C: Calling Peaks with MACs on Control Chip-Seq Reads Once again, MACs creates a BED file containing the peak coordinates and an HTML file containing information on the peak calling process. Discussion 1.Examine the BED track. 2.How many peaks are called when using a control sample? 3.How does this compare to the previous situation where we only had experimental Chip-Seq reads? Lisa Stubbs | Chip-Seq Peak Calling in Galaxy25

Workspace Extraction In this exercise, we will automate the Bowtie runs of our other 2 datasets. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy26

Step 1A: Workspace Extraction Click the located in the History pane. From the drop down menu, select Extract Workshop. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy27

Step 1B: Workspace Extraction In the resulting window, give the workflow a new name. Click Uncheck All. Ensure the following are checked and press Create Workflow. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy28

Step 1C: Workspace Extraction Select All Workflows at the bottom of the Tool pane. Select chipseq2 On the next page, ensure the following settings and click Run. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy29

Step 1D: Workspace Extraction Lisa Stubbs | Chip-Seq Peak Calling in Galaxy30 Click when done

Identifying Differential Binding Sites In this exercise, we will identify binding sites exclusive to undifferentiated and differentiated cell lines as well as those common to both. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy31

Step 1: Subtract Peaks Between Cell Lines Select Operate on Genomic Intervals and Subtract. Choose your G1E MACs are your 2 nd dataset and your G1E-ER4 as your 1 st dataset. Click Execute. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy32

Step 2: Subtract Peaks Between Cell Lines. The resulting BED file contains peaks exclusive to the differentiated cell line (G1E-ER4). Discussion 1.How many peaks are exclusive to G1E-ER Redo Step1 only SWITCH the input order. Choose your G1E MACs are your 1 st dataset and your G1E-ER4 as your 2nd dataset. Lisa Stubbs | Chip-Seq Peak Calling in Galaxy33

Step 3: Intersect Peaks Between Cell Lines Lisa Stubbs | Chip-Seq Peak Calling in Galaxy34 Select Operate on Genomic Intervals and Subtract. Choose your G1E MACs are your 2 nd dataset and your G1E-ER4 as your 1 st dataset. Click Execute.