Affymetrix Microarray and Illumina/ Solexa NextGen Sequencing Yuannan Xia, Ph.D Genomics Core Research Facility 10.27.2009.

Slides:



Advertisements
Similar presentations
The Good, Bad, and Ugly of Next-Gen Sequencing
Advertisements

Application of available statistical tools Development of specific, more appropriate statistical tools for use with microarrays Functional annotation of.
Microarray Simultaneously determining the abundance of multiple(100s-10,000s) transcripts.
Canadian Bioinformatics Workshops
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Greg Phillips Veterinary Microbiology
DNA Microarray: A Recombinant DNA Method. Basic Steps to Microarray: Obtain cells with genes that are needed for analysis. Isolate the mRNA using extraction.
Additional Powerful Molecular Techniques Synthesis of cDNA (complimentary DNA) Polymerase Chain Reaction (PCR) Microarray analysis Link to Gene Therapy.
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
The SOLiD System: Next-Generation Sequencing Overview of the SOLiD System –  Scalable  Accurate Ultra High Throughput  Flexible  Mate Pairs.
Central Dogma 2 Transcription mRNA Information stored In Gene (DNA) Translation Protein Transcription Reverse Transcription SELF-REPAIRING ARABIDOPSIS,
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Introduce to Microarray
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
High Throughput Sequencing
Microarrays: Basic Principle AGCCTAGCCT ACCGAACCGA GCGGAGCGGA CCGGACCGGA TCGGATCGGA Probe Targets Highly parallel molecular search and sort process based.
and analysis of gene transcription
Diabetes and Endocrinology Research Center The BCM Microarray Core Facility: Closing the Next Generation Gap Alina Raza 1, Mylinh Hoang 1, Gayan De Silva.
NGS Data Generation Dr Laura Emery. Overview The NGS data explosion Sequencing technologies An example of a sequencing workflow Bioinformatics challenges.
Next Now-Generation Genomics: methods and applications for modern disease research Aaron J. Mackey, Ph.D. Center for Public Health.
Affymetrix vs. glass slide based arrays
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
‘Omics’ - Analysis of high dimensional Data
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
High Throughput Sequencing Methods and Concepts
BUDDING TECHNOLOGIES AND BUDDING YEAST 2012 HHMI Summer Workshop for High School Science Teachers.
Library Preparation Application dependant, using standard molecular biological techniques. Fragment library oligo kit: (per library)$35 GeneAmp dNTP blend:
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Data Type 1: Microarrays
Restriction Nucleases Cut at specific recognition sequence Fragments with same cohesive ends can be joined.
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Technology for Systems Biology. Nucleic Acid Hybridization In principle complementary strands will associate Chemistry is quite different on surfaces.
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
Microarrays and Their Uses Brad Windle, Ph.D
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
Genomics I: The Transcriptome
GeneChip® Probe Arrays
Introduction to Microarrays.
Transcriptomics Sequencing. over view The transcriptome is the set of all RNA molecules, including mRNA, rRNA, tRNA, and other non coding RNA produced.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Introduction to Microarrays. The Central Dogma.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Lecture-5 ChIP-chip and ChIP-seq
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Introduction to Oligonucleotide Microarray Technology
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
Microarray: An Introduction
Green with envy?? Jelly fish “GFP” Transformed vertebrates.
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
Introduction to Illumina Sequencing
The Central Dogma. Life - a recipe for making proteins DNA protein RNA Translation Transcription.
DNA Sequencing Second generation techniques
Next generation sequencing
Hybridization.
Microarray Technology and Applications
CHAPTER 12 DNA Technology and the Human Genome
Introduction to Microarrays.
Next-generation DNA sequencing
Data Type 1: Microarrays
Presentation transcript:

Affymetrix Microarray and Illumina/ Solexa NextGen Sequencing Yuannan Xia, Ph.D Genomics Core Research Facility

1.High density oligo array: a single array containing 1 – 6 million features generates 1 – 6 million probe hybridization data points for summaring to values of 20K – 50K genes. 2. Hybridization Oven 640: Hybridize samples to array 3. Fluidics Station 450 : washing and staining 4.Scanner G: Confocal laser scanning; High pixel resolution at 0.5 micron level 5. Data generating and processing software: GCOS Software pipeline for data mining and annotation: Bioconductor Rosetta, AffyMiner, Ingenuity, Netaffix Affymetrix GeneChip Microarray System

Affymetrix Expression Arrays Array typeNumber of transcriptsNumber of genes Human U133 plus 2.0 >47,00038,500 Mouse >39,000>34,000 Rat >31,000>28,000 Arabidopsis ATH122,500>24,000 Drosophila18,50018,000 Wheat61,12755,052 Rice51,279 Soybean58,000 Barley25,50022,000

Illumina/Solexa Genome Analyzer II System Flow cell – A glass slide with 8 channels (lanes) and 16 manifold ports for performing all PCR and sequencing reactions inside each channel. Cluster generation station – Perform PCR bridge amplification to generate clusters inside the channels of flow cell and prepare flow cell ready for sequencing GAII and Paired End Module – Perform sequencing, imaging, cluster modification for paired end read 2 sequencing

Two Key Chemistries used in Solexa Sequencing Technology 1.PCR bridge amplification of individual templates in a shotgun library to generate clusters (DNA polymerase colony) High cluster density: 10 – 20 million/Lane 80 – 150 million/Run 2.Reversible Terminator Sequencing Chemistry Allow to incorporate only ONE nucleotide at each cycle Generate accurate (>99.5%) sequences: 300 – 800 Megabases/lane 3 – 6 Gigabases/Run

Bridge Amplification of Individual Templates by PCR

Cluster Generation

>All 4 bases with Reversible Terminators >4 labeling colors >Terminators can be removed >Add all 4 nucleotides in one reaction >No problem with homopolymer repeats >Higher accuracy Sequencing by Synthesis Using Reversible Terminators

A. Extend first base T, read, and deblock. B & C, Repeat step A to extend strand. D. Generate base calling. Steps of Sequencing by Synthesis AB C D

Base Calling From Image Raw Data The identity of each base of a cluster is read off from sequential images Cluster-a (x a,y a ) Cluster-b (x b,y b ) Read a Read b Cycles 1 - 9

Genomic Resequencing Yeast genome (V. Gladyshev; AGP Corn Processing) Fugus genome Aspergillus (S. Harris) ChIP Sequencing Arabidopsis CHlP DNA (Fromm; Cerutti) mRNA Sequencing - Transcritome Arabidopsis transcriptom (H. Cerutti) Human KSHV cell transcrptome (C. Wood) Chlorella/Virus transcriptome (J. Van Etten) Mole rat transcriptome (V. Gladyshev and D.Fomenko) Fugus Aspergillus transcriptome (S. Harris) Paired End Sequencing Arabidopsis mitochondrial genome (S. Mackenzie) Small RNA sequencing Several UNL faculty have expressed strong interest. (Y.Bin, H. Cerutti, J. Mower, J. Alfano) Solexa Sequencing Applications at UNL

Genomic Resequencing Data of Resequencing of 19 yeast genomes

 Chromatin immunoprecipitation sequencing  (ChIP-seq) Genome-wide analysis Gene Regulation and Control Epigenetic modifications DNA-protein interactions Nucleus Crosslink IP Sequencing Map binding sites

Transcriptome Analysis – mRNA-Seq Relative expression of transcripts Analysis of splice variants/coding SNPs Analysis of non-coding RNAs Transcript discovery

Paired End and Mate Pair Sequencing Provides long range information – Repeat sequences – Characterize copy number variants & rearrangements – De novo assembly Increases output per flow cell

Genomic DNA, total RNA, CHlP DNA (exp design, QC) DNA shotgun library preparation (SR, PE, cDNA) Cluster generation (35 PCR amplification cycles) Sequencing of clusters on GAII (1 TB machine, sequencing, imaging, image processing, base calling) Data analysis on remote server at Bioinfornatics Core Facility (8 TB machine, base calling, read alignment using Illumina pipeline software) Workflow of the service

Cost of Gene Expression Profiling Microarray $500 - $650/array/sample $ $4000 for 2 treatments-3replicats 6 samples experiment (6 arrays) Illumina $1300/lane/sample (400Mb sequence), $2100/2 lanes/sample (800Mb) $ 4200 for 2 treatment 2 samples 4 FC lanes experiment (without replicates)

Challenges More than 40 billion nucleotide sequences have been generated, will be double soon. Need solutions to - Sample preparation (e.g small RNA libraries, CHlP pulldown) - Further extracting sequencing data - Biological annotations - Data storage and management - Drafting publications Budget: - Maintaining both Affymetrix System and Illumina Solexa System is expensive. - Cost for upgrading the system.

ACKNOWLEDGMENTS Dr. Mike Fromm Drs. Jean-Jack Riethoven Ms. Mei Chen