CSU IDRC Next Generation Sequencing Core Genomic Sequencing Services.

Slides:



Advertisements
Similar presentations
Next-Generation Sequencing: Methodology and Application
Advertisements

Metabarcoding 16S RNA targeted sequencing
Next-generation sequencing
The 454 and Ion PGM at the Genomics Core Facility Dr. Deborah Grove, Director for Genetic Analysis Genomics Core Facility Huck Institutes of the Life Sciences.
The data flood: We need a bigger boat James A. Foster The Initiative for Bioinformatics and Evolutionary Studies (IBEST) Biological Sciences, Bioinformatics.
Practical Bioinformatics Community structure measures for meta-genomics István Albert Bioinformatics Consulting Center Penn State.
Workshop in Bioinformatics 2010 Class # Class 8 March 2010.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
Sequence comparison: Significance of similarity scores Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
Zachary Bendiks. Jonathan Eisen  UC Davis Genome Center  Lab focus: “Our work focuses on genomic basis for the origin of novelty in microorganisms (how.
GENOME SEQUENCING. I. Genome sequencing The Sanger Method (1977) Denaturation +priming Polymerization.
Molecular Biology Dr. Chaim Wachtel April 4, 2013.
Metagenomics Binning and Machine Learning
Databases and tools to study the genomes of hundreds of pathogens, plants, and mammals Richard H. Scheuermann, Ph.D. Director of Informatics J. Craig Venter.
Metagenomic Analysis Using MEGAN4
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
Discovery of new biomarkers as indicators of watershed health and water quality Anamaria Crisan & Mike Peabody.
From Metagenomic Sample to Useful Visual Anna Shcherbina 01/10/ Anna Shcherbina Bioinformatics Challenge Day 02/02/2013 From Metagenomic Sample to.
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
Introduction to next generation sequencing Rolf Sommer Kaas.
MES Genome Informatics I - Lecture IV. NGS basics Sangwoo Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute, Yonsei University.
“Comparative Human Microbiome Analysis” Remote Video Talk to CICESE Big Data, Big Network Workshop Ensenada, Mexico October 10, 2013 Dr. Larry Smarr Director,
The NIH Roadmap and the Human Microbiome Project Francis S. Collins, M.D., Ph.D. National Human Genome Research Institute April 22, 2007.
Integration and analysis of multi-type high-throughput data for biomolecular knowledge discovery Dr. Erik Bongcam-Rudloff SGBC-SLU Uppsala, Sweden.
Biodiversity initiative: Integrating Taxonomy, Genomics and Biodiversity ++ = ????? Speaker: Benjamin Linard Alfried Vogler Team.
David R. McWilliams, Ph.D. Section of Statistical Genetics, Department of Biostatistical Sciences, Center for Public Health Genomics Bioinformatician IV.
Advancing Science with DNA Sequence Metagenome definitions: a refresher course Natalia Ivanova MGM Workshop September 12, 2012.
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
Metagenomic Analysis Using MEGAN4 Peter R. Hoyt Director, OSU Bioinformatics Graduate Certificate Program Matthew Vaughn iPlant, University of Texas Super.
Error model for massively parallel (454) DNA sequencing Sriram Raghuraman (working with Haixu Tang and Justin Choi)
DNA sequencing, big data and health Mikael Huss Science for Life Laboratory / Stockholm Follow the Data blog:
Tsute (George) Chen Bioinformatics Core Department of Microbiology The Forsyth Institute March 24 th, 2015 HOMD A Tour to the Data and Tools.
Genomes To Life Biology for 21 st Century A Joint Initiative of the Office of Advanced Scientific Computing Research and Office of Biological and Environmental.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
An Investigation into Implementations of DNA Sequence Pattern Matching Algorithms Peden Nichols Computer Systems Research April,
The metagenomics sequencing service CD Genomics. Metagenomics: Metagenomics is the study of metagenomes, genetic material recovered directly from environmental.
Running BLAST on the cluster system over the Pacific Rim.
Analyzing Time Course Data: How can we pick the disappearing needle across multiple haystacks? IEEE-HPEC Bioinformatics Challenge Day Dr. C. Nicole Rosenzweig.
Analysis and comparison of very large metagenomes with fast clustering and functional annotation Weizhong Li, BMC Bioinformatics 2009 Present by Chuan-Yih.
Bioinformatics Lecture to accompany BLAST/ORF finder activity
Metagenome analysis Natalia Ivanova MGM Workshop February 2, 2012.
DOE Network PI Meeting 2005 Runtime Data Management for Data-Intensive Scientific Applications Xiaosong Ma NC State University Joint Faculty: Oak Ridge.
Big Data Bioinformatics By: Khalifeh Al-Jadda. Is there any thing useful?!
__________________________________________________________________________________________________ Fall 2015GCBA 815 __________________________________________________________________________________________________.
What is BLAST? Basic BLAST search What is BLAST?
DNA Sequencing Technology and its Applications in Evolution Research Julie Urban, Ph.D. Assistant Director, Genomics & Microbiology Laboratory NC Museum.
Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = floating.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Bioinformatics activity Christophe BLANCHET.
Big Data in Indian Agriculture D. Rama Rao Director, NAARM.
Introducing DOTUR, a Computer Program for Defining Operational Taxonomic Units and Estimating Species Richness Patric D. Schloss and Jo Handelsman Department.
Bioinformatics Computation in the Cloud A Joint Collaboration Between Microsoft’s External Research and eXtreme Computing Groups
Bioinformatics Shared Resource Bioinformatics : How to… Bioinformatics Shared Resource Kutbuddin Doctor, PhD.
Discussion on Genomic/Metagenomic Data for ANGUS Course Adina Howe.
Real time metagenomics Ross Overbeek Bob Olson Terry Disz Liz Dinsdale.
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
What is BLAST? Basic BLAST search What is BLAST?
Metagenomic Species Diversity.
Research Paper on BioInformatics
Basics of BLAST Basic BLAST Search - What is BLAST?
Toward Next Generation Biodiversity Research
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Comparative Genomics.
Parallel System for BLAST
Dissemination of the mcr-1 colistin resistance gene
Evolution of Genomes Chapter 21.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Portable Performance for Many-Core Particle Advection
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

CSU IDRC Next Generation Sequencing Core Genomic Sequencing Services

Semiconductor DNA Sequencing Ion Proton Ion Torrent “Sequencing on a Chip”

Semiconductor Sequencing in a Nutshell “It’s a computational pH meter”

Metagenomics Environmental samples of communities of organisms water, soil samples human & animal microbiomes mine tailings, oil spills deep sea, polar ice etc.

Metagenomics Pipeline CSU Cray supercomputer; Oak Ridge Titan supercomputer Torrent/Proton sequencers Megan NCBI nucleotide databases

Metagenomics Tools Ion Proton Sequencer In: Sample DNA Out: 50M DNA fragments NCBI nucleotide database DNA fragments 15M+ records Do the math: 50M * 15M = queries mpiBLAST Highly parallelized Blast algorithm NGS sample DNA Query NCBI DB CSU Cray XT6m 2,016 CPU cores

Metagenomics Dr. Toni Piaggio, National Wildlife Research Center, Fort Collins Florida Everglades water samples (4) “What species are in the water?” CSU NextGen Sequencing Core: Ion Proton; 2 weeks CSU Cray: 1,000 cores, 24-hours, 4 runs; 1 week Results

Metagenomics Rarefaction curves Estimate species richness Asymptotic? Find rare species

Computational Resources Oak Ridge Titan Cray XK7 Supercomputer 300K CPU cores; 50M GPU cores mpiBlast NCBI nucleotide DB Query 100% of sample DNA CSU Cray XT6m Supercomputer 2,016 CPU cores mpiBlast NCBI nucleotide DB Query 1% of sample DNA Strong scaling

Summary Big Data Issues Semiconductor sequencer data Large-scale database queries High-performance computing