Towards Personal Genomics Tools for Navigating the Genome of an Individual Saul A. Kravitz J. Craig Venter Institute Rockville, MD Bio-IT World 2008.

Slides:



Advertisements
Similar presentations
Mapping analysis software Dr Ian Carr PhD. MCSD. Leeds Institute of Molecular Medicine St Jamess University Hospital.
Advertisements

Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Genetic Approaches to Rare Diseases: What has worked and what may work for AHC Erin L. Heinzen, Pharm.D, Ph.D Center for Human Genome Variation Duke University.
Corso di Genomica lezione laurea magistrale Biotecnologia Industriale Giovedì 9 dicembre 2010 aula 6 orario : Martedì ore
Genetics notes For makeup. A gene is a piece of DNA that directs a cell to make a certain protein. –Homozygous describes two alleles that are the same.
1 of 25 Sequence Variation in Ensembl. 2 of 25 Outline SNPs SNPs in Ensembl Linkage disequilibrium SNPs in BioMart DAS sources.
Fatchiyah, PhD Dept Biology UB Fatchiyah.lecture.ub.ac.id
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
Outline to SNP bioinformatics lecture
MENDEL’S GENETICS CH. 5-1 How Traits Are Inherited 1.Sex cells with a haploid number of chromosomes are united during fertilization to form a zygote.
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Variant discovery Different approaches: With or without a reference? With a reference – Limiting factors are CPU time and memory required – Crossbow –
The 1000 Genomes Project Gil McVean Department of Statistics, Oxford.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
The Extraction of Single Nucleotide Polymorphisms and the Use of Current Sequencing Tools Stephen Tetreault Department of Mathematics and Computer Science.
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
A Computational Analysis of the H Region of Mouse Olfactory Receptor Locus 28 Deanna Mendez SoCalBSI August 2004.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
SNPs DNA differs between humans by 0.1%, (1 in 1300 bases) This means that you can map DNA variation to around 10,000,000 sites in the genome Almost all.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.
Population Genetics 101 CSE280Vineet Bafna. Personalized genomics April’08Bafna.
RExPrimer Pongsakorn Wangkumhang, M.Sc. Biostatistics and Informatics Laboratory, Genome Institute, National Center for Genetic Engineering and Biotechnology.
GeVab: Genome Variation Analysis Browsing Server Korean BioInformation Center, KRIBB InCoB2009 KRIBB
Todd J. Treangen, Steven L. Salzberg
Genetic Variations Lakshmi K Matukumalli. Human – Mouse Comparison.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
How I learned to quit worrying Deanna M. Church Staff Scientist, Short Course in Medical Genetics 2013 And love multiple coordinate.
Loss-of-co-Homozygosity mapping and exome sequencing of a Syrian pedigree identified the candidate causal mutation associated with rheumatoid arthritis.
DAN LAWSON BRC 2011 – ANNUAL MEETING UT SOUTHWESTERN MEDICAL CENTER DALLAS, TX SEPTEMBER 2011 Challenges and opportunities of new sequencing technologies.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Jeopardy Genes and Chromosomes Basics
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
CSE280Vineet Bafna CSE280a: Algorithmic topics in bioinformatics Vineet Bafna.
Online Mendelian Inheritance in Man (OMIM): What it is & What it can do for you Knowledge Management & Eskind Biomedical Library January 27, 2012 helen.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
© 2010 by The Samuel Roberts Noble Foundation, Inc. 1 The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK, 73401, USA 2 National Center.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Sahar Al Seesi and Ion Măndoiu Computer Science and Engineering
SCRIPPS GENOME ADVISER Galina Erikson Senior Bioinformatics Programmer The Scripps Translational Science Institute Scripps Translational Science Institute.
GENETICS REVIEW. A physical trait that shows as a result of an organism’s particular genotype. PHENOTYPE.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
The International Consortium. The International HapMap Project.
Vineet Bafna CSE280A CSE280Vineet Bafna. We will cover topics from Population Genetics. The focus will be on the use of algorithms for analyzing genetic.
Copyright OpenHelix. No use or reproduction without express written consent1.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
1 of 28 Evaluating Genes and Transcripts (“Genebuild”)
GSVCaller – R-based computational framework for detection and annotation of short sequence variations in the human genome Vasily V. Grinev Associate Professor.
Welcome to the combined BLAST and Genome Browser Tutorial.
Canadian Bioinformatics Workshops
分子診斷學概論  第一章 綜說 overview 疾病發生原因的影響層次 DNA 、 RNA 或蛋白質 分子診斷的目的 偵測這些致病因子是那個層次發生變化 本書著重 DNA 、 RNA 的變化 蛋白質層次由原文書章節提供 The Application of Proteomics To Disease.
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Genetics: Inheritance. Meiosis: Summary  Diploid Cells (2n): Cells with two sets of chromosomes, (aka “homologous chromosomes”)  One set of chromosomes.
Genetics: Inheritance
Towards Personal Genomics
Mendelian Inheritance
Genetics: Inheritance
BF528 - Genomic Variation and SNP Analysis
Gene Safari (Biological Databases)
BF528 - Whole Genome Sequencing and Genomic Variation
Human Genome Project Seminal achievement. Scientific milestone.
Presentation transcript:

Towards Personal Genomics Tools for Navigating the Genome of an Individual Saul A. Kravitz J. Craig Venter Institute Rockville, MD Bio-IT World 2008

Personal Genomics: The future is now

Outline HuRef Project: Genome of an Individual HuRef Research Highlights The HuRef Browser – Towards Personal Genomics Browsers Conclusions and Credits

Genome of a Single Individual: Goals Provide a diploid genome that could serve as a reference for future individualized genomics Characterize the individual’s genetic variation – HuRef vs NCBI – HuRef haplotypes Understand the individual’s risk profile based on their genomic data

How does HuRef Differ? NCBI Genome – Multiple individuals – Collapsed Haploid Sequence of a Diploid Genome – No haplotype phasing or inference possible HuRef – Single individual – Can reconstruct haplotypes of diploid genome Haplotype Blocks – Segment of DNA inherited from one parent

The HuRef Genome PLoS Biology :e254 September 4, 2007

DNA from a single individual De Novo Assembly – 7.5x Coverage Sanger Reads Diploid Reconstruction – Half of genome is in haplotype blocks of >200kb HuRef Data Released – NCBI: Genome Project – JCVI: The HuRef Genome

Variants: NCBI-36 vs HuRef NCBI-36 vs HuRef yields Homozygous Variants SNPMN P InsertionDeletion NCBI HuRef variant: G/Avariant: TA/AT variant:

Reads ACCTTTGTAATTCCC ACCTTTGTAATTCCC ACCTTTGTAATTCCC ACCTTTACAATTCCC ACCTTTACAATTCCC ACCTTTACAATTCCC Computing Allelic Contributions Consensus generation conflates alleles Haploid Consensus ACCTTTGCAATTCCC

Computing Allelic Contributions Consensus generation conflates alleles Consensus generation modified to separate alleles Bioinformatics Apr 15;24(8):

Reads ACCTTTGTAATTCCC ACCTTTGTAATTCCC ACCTTTGTAATTCCC ACCTTTACAATTCCC ACCTTTACAATTCCC ACCTTTACAATTCCC Computing Allelic Contributions Modified Consensus generation separates allele Compare HuRef alleles to identify SNP, MNP, Indel Variants True Diploid Alleles ACCTTTGTAATTCCC ACCTTTACAATTCCC Haploid Consensus ACCTTTGCAATTCCC AC / GT MNP Variant

HuRef Variations 4.1 Million Variations (12.3 Mbp) 1.2 Million Novel Many non-synonymous changes ~700 indels and ~10,000 total SNPs Indels and non-SNP Sequence Variation 22% of all variant events, 74% of all variant bases % difference between haploid genomes 5-10x higher than previous estimates

HuRef Browser Why do this? Research tool focused on variation Verify assembly and variants Show ALL the evidence High Perfomance Features Use HuRef or NCBI as reference Genome vs Genome Comparison Drill down from chromosome to reads and alignments Overlay of Ensembl and NCBI Annotation Links from HuRef features in NCBI (e.g., dbSNP) Export of data for further analysis

Search by Feature ID or coordinates Navigate by Chromosome Band

Zinc Finger Protein Chr19: Assembly Structure Variations Transcript Gene Haplotype Blocks NCBI-36 HuRef Assembly-Assembly Mapping

chr19: Protein Truncated by 476 bp Insertion Homozygous SNP Heterozygous SNP

Assembly Structure

Drill Down to Multi-sequence Alignment Validation of Phased A/C Heterozygous SNPs in HuRef

14kbp Inversion Spanning TNFRSF14 chr1:

Browser for Multiple Genomes Expand on existing features – Variants and haplotype blocks in individuals – Structural variation among individuals – Genetic traits of variants related to diseases Required Features – Which genome/haplotype is the reference? – Correlation with phenotypic, medical, and population data – Correlation within families

Future Challenges Data volumes – read data included from new technologies – Multiplication of genomes Enormous number of potential comparisons – Populations, individuals, variants Dynamic generation of views in web time Use cases are evolving

Conclusion A high performance visualization tool for an individual genome – Validation of variants – Comparison with NCBI-36 Planned extensions for multi-genome era Website: Contact:

HuRef Browser: Nelson Axelrod, Yuan Lin, and Jonathan Crabtree Scientific Leadership: Sam Levy, Craig Venter, Robert Strausberg, Marvin Frazier Sequence Data Generation and Indel Validation: Yu-Hui Rogers, John Gill, Jon Borman, JTC Production, Tina McIntosh, Karen Beeson, Dana Busam, Alexia Tsiamouri, Celera Genomics. Data Analysis: Sam Levy, Granger Sutton, Pauline Ng, Aaron Halpern, Brian Walenz, Nelson Axelrod, Yuan Lin, Jiaqi Huang, Ewen Kirkness, Gennady Denisov, Tim Stockwell, Vikas Basal, Vineet Bafna, Karin Remington, and Josep Abril CNV, Genotyping, FISH mapping: Steve Scherer, Lars Feuk, Andy Wing Chun Pang, Jeff MacDonald Funding: J. Craig Venter FoundationDNA: J. Craig Venter Acknowledgements