I519 Introduction to Bioinformatics, Fall, 2012

Slides:



Advertisements
Similar presentations
Functional Genomics with Next-Generation Sequencing
Advertisements

Chromatin Immunoprecipitation and the Chip on Chip technique. Fredrik Fagerström Billai B E A- Core Facility for Bioinformatics and Expression Analysis.
Methods to read out regulatory functions
Epigenetics Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520.
Tingwen Chen (陳亭妏) Bioinformatics center CGU
Sequencing Genomes 1) Map the genome 2) Prepare an AC library 3) Order the library 4) Subdivide each AC into lambda contigs 5) Subdivide each lambda into.
Detecting DNA-protein Interactions Xinghua Lu Dept Biomedical Informatics BIOST 2055.
Understanding the Human Genome: Lessons from the ENCODE project
Analysis of ChIP-Seq Data
Canadian Bioinformatics Workshops
Organization of DNA Within a Cell from Lodish et al., Molecular Cell Biology, 6 th ed. Fig meters of DNA is packed into a 10  m diameter cell.
Greg Phillips Veterinary Microbiology
Hybridization Diagnostic tools Nucleic acid Basics PCR Electrophoresis
ChIP-seq QC Xiaole Shirley Liu STAT115, STAT215. Initial QC FASTQC Mappability Uniquely mapped reads Uniquely mapped locations Uniquely mapped locations.
“An integrated encyclopedia of DNA elements in the human genome” ENCODE Project Consortium. Nature 2012 Sep 6; 489: Michael M. Hoffman University.
High Throughput Sequencing
Next generation sequencing platforms Applications
Organization of DNA Within a Cell from Lodish et al., Molecular Cell Biology, 6 th ed. Fig meters of DNA is packed into a 10  m diameter cell.
ChIP-chip Data, Model and Analysis Ying Nian Wu Dept. Of Statistics UCLA Joint with Ming Zheng, Leah Barrera, Bing Ren.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
The virochip (UCSF) is a spotted microarray. Hybridization of a clinical RNA (cDNA) sample can identify specific viral expression.
An Introduction to ENCODE Mark Reimers, VIPBG (borrowing heavily from John Stamatoyannopoulos and the ENCODE papers)
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Kristen Horstmann, Tessa Morris, and Lucia Ramirez Loyola Marymount University March 24, 2015 BIOL398-04: Biomathematical Modeling Lee, T. I., Rinaldi,
Massive Parallel Sequencing
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
Vidyadhar Karmarkar Genomics and Bioinformatics 414 Life Sciences Building, Huck Institute of Life Sciences.
Nozomu TAKAHASHI June 11th, 2012
ChIP-chip Data. DNA-binding proteins Constitutive proteins (mostly histones) –Organize DNA –Regulate access to DNA –Have many modifications Acetylation,
Next Generation Sequencing
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
Starting Monday M Oct 29 –Back to BLAST and Orthology (readings posted) will focus on the BLAST algorithm, different types and applications of BLAST; in.
Analysis of protein-DNA interactions with tiling microarrays
Control of Eukaryotic Genome
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Trends Biomedical Science
Lecture-5 ChIP-chip and ChIP-seq
Analysis of ChIP-Seq Data Biological Sequence Analysis BNFO 691/602 Spring 2014 Mark Reimers.
Biol 456/656 Molecular Epigenetics Lecture #5 Wed. Sept 2, 2015.
STAT115 STAT225 BIST512 BIO298 - Intro to Computational Biology.
Outline Molecular Cell Biology Assessment Review from last lecture Role of nucleoporins in transcription Activators and Repressors Epigenetic mechanisms.
March 6, 2016 EpiQ Chromatin Analysis Kit A New Tool for Epigenetic Research Gábor Kohut PhD Field Application Specialist Central and Eastern Europe.
Agenda  Epigenetics and microRNAs – Update –What’s epigenetics? –Preliminary results.
Quest for epigenetic determinants of local coexpression clusters Wieslawa Mentzen Labrador and Corces, 2002.
Peak Calling for ChIP-Seq data Larry Meyer UCSC Bioinformatics Dept. BME 230 January 11, 2011.
Engineering magnetosomes to express novel proteins Which ones? Tweaking p18 Linker Deleting or replacing GFP Something else? TRZN Oxalate decarboxylases.
Introduction to Next Generation Sequencing. Strategies For Interrogating the Transcriptome Known genes Predicted genes Surrogate strategy Exon verification.
ChIP-seq Downstream Analysis Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520.
Il principio della ChIP: arricchimento selettivo della frazione di cromatina contenente una specifica proteina La ChIP può anche esser considerata.
Presented by: John Lawson Developed by: John Lawson, Nathan Sheffield
Epigenetics Continued
EPIGENETICS Textbook Fall 2013.
Organization of DNA Within a Cell
Analysis of genomes and transcriptomes using ChIP-seq and RNA-seq
Figure 1. Annotation and characterization of genomic target of p63 in mouse keratinocytes (MK) based on ChIP-Seq. (A) Scatterplot representing high degree.
Dynamic epigenetic enhancer signatures reveal key transcription factors associated with monocytic differentiation states by Thu-Hang Pham, Christopher.
Simon v ChIP-Seq Analysis Simon v
Epigenetics Study of the modifications to genes which do not involve changing the underlying DNA
Organization of DNA Within a Cell
High-Resolution Profiling of Histone Methylations in the Human Genome
Epigenetics Heritable alteration of gene expression without a change in nucleotide sequence.
Chromosome Architecture
High-Resolution Profiling of Histone Methylations in the Human Genome
Volume 7, Issue 9, Pages (September 2014)
Volume 67, Issue 6, Pages e6 (September 2017)
Dynamic Regulation of Nucleosome Positioning in the Human Genome
A Role for Mammalian Sin3 in Permanent Gene Silencing
Presentation transcript:

I519 Introduction to Bioinformatics, Fall, 2012 From ChIP-chip to ChIP-Seq: the study of mammalian transcription factor binding sites and epigenetics

From Chip-Chip to Chip-Seq ChIP-chip (ChIP on tiled microarrays) ChIP-sequencing (ChIP-seq) combines chromatin immunoprecipitation (ChIP) and massively parallel sequencing to identify mammalian DNA sequences bound by transcription factors in vivo.

Chromatin immunoprecipitation (ChIP) between the side chains of two lysines between lysine & cytosine Formaldehyde (CH2O) is a very reactive dipolar compound (the carbon atom is the nucleophilic center). Amino and imino groups of proteins (e.g., the side chains of lysine and arginine) and of nucleic acids (e.g., cytosine) react with formaldehyde, leading to the formation of a Schiff base (reaction I)

Chip-Seq workflow Solexa sequencing technology provided short read length sequences of approx 30 base pairs that were ideal for characterizing ChIP-derived fragments. Nature Methods - 4, 613 - 614 (2007)

Advantages of ChIP-Seq Single base-pair resolution of direct sequencing ChIP-seq data are likely to have less noise or artifacts potential binding regions need not be specified prior to experiment lower cost, minimal hands-on processing and a requirement for fewer replicate experiments as well as less input material. Epigenetics meets next-generation sequencing. Epigenetics. 2008 Nov;3(6):318-21

Next generation sequencing (NGS) techniques Illumina/Solexa ABI SOLiD Sequencing Chemistry Pyrosequencing Polymerase-based sequence-by-synthesis Ligation-based sequencing Amplification approach Emulsion PCR Bridge amplification Paired end (PED) separation 3 kb 200-500 bp Mb per run 100 Mb 1300 Mb 3000 Mb Time per PED run <0.5 day 4 days 5 days Read length (update) 250-400 bp 35, 75 and 100 bp 35 and 50 bp Cost per run $ 8,438 USD $ 8,950 USD $ 17,447 USD Cost per Mb $ 84.39 USD $ 5.97 USD $ 5.81 USD

Tools for extracting transcription factor targets from ChIP-Seq data CisGenome uses a conditional binomial model to identify enriched regions when a control data set is provided (Nat. Biotechnol. 26:1293–1300, 2008) MACS (Model-based Analysis of ChIP-Seq) uses the control dataset to model the tag distribution across the genome using the Poisson distribution lBG (Genome Biol, 9:R137, 2009) PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls (Biotechnol, 27:66–75, 2009) QuEST (Quantitative Enrichment of Sequence Tags) Nat. Methods, 5:829–834, 2008 GLITR (GLobal Identifier of Target Regions) identifies enriched regions in target data by calculating a fold-change based on random samples of control (input chromatin) data

Why peak detection is difficult PeakSeq: Biotechnol, 27:66–75, 2009 The signal for a given transcription factor is the 'convolution' of various effects: the density of mappable bases in a region, the underlying chromatin structure and the actual signal from transcription factor binding. Some fraction of the peaks in the ChIP-seq signal map for a transcription factor might be due to the nature of the open chromatin structure instead of the presence of transcription factor binding--one must compare the signal against one from a control.

PeakSeq scoring procedure Biotechnol, 27:66–75, 2009

High-Resolution Profiling of Histone methylations in the human genome Ref: Cell, 129(4):823-837, 2007 Generated high-resolution maps for the genome-wide distribution of 20 histone lysine and arginine methylations and others across the human genome using the Solexa 1G sequencing technology (The cells were digested with MNase to generate mainly mononucleosomes with minor fraction of dinucleosomes for histone modification mapping) Typical patterns of histone methylations exhibited at promoters, insulators, enhancers, and transcribed regions are identified. The monomethylations of H3K27, H3K9, H4K20, H3K79, and H2BK5 are all linked to gene activation trimethylations of H3K27, H3K9, and H3K79 are linked to repression. H2A.Z (a Histone variant) associates with functional regulatory elements, and CTCF marks boundaries of histone methylation domains. …

BS-seq for epigenetic profiling BS-seq (bisulphite sequencing) combines bisulphite treatment of genomic DNA with ultra-high-throughput sequencing Cytosine DNA methylation is important in regulating gene expression and in silencing transposons and other repetitive sequences

Bisulphite sequencing

References Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nature Methods - 4, 651 - 657 (2007)