1000G Phase 1 Release chr20 call sets. Ryan Poplin, Genome Sequencing and Analysis, Medical and Population Genetics. January 25, 2011.

Presentation transcript:

Data and Definitions: Pipeline

- Full indel cleaning process, including known indels
- BAQ calculation using the GATK implementation of H. Li's method
- Called by main continental analysis panel (AP) and by admixed+ AP
- Variant quality score recalibration (VQSR)
- Quality cut chosen using HapMap3.3 + Omni 2.5M chip sensitivity
- Cut at 99.2% of accessible sites
- Genotype refinement not yet done
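The quality cut described above can be illustrated with a minimal sketch. It assumes that each truth site (for example, a HapMap3.3 or Omni 2.5M chip site) carries a recalibrated quality score and that the cut is the strictest score still retaining the target fraction of those sites. The function name, the input format, and the reading of 99.2% as a truth-site sensitivity target are assumptions made for illustration, not details taken from the slides or from GATK.

```python
import math


def threshold_for_sensitivity(truth_scores, target=0.992):
    """Return the strictest score cutoff that keeps at least `target` of truth sites.

    `truth_scores` is a hypothetical list of recalibrated quality scores,
    one per truth site; higher scores mean higher confidence.
    """
    if not truth_scores:
        raise ValueError("need at least one truth-site score")
    if not 0 < target <= 1:
        raise ValueError("target must be in (0, 1]")
    # Rank from most to least confident, then walk down until enough sites are kept.
    ranked = sorted(truth_scores, reverse=True)
    keep = math.ceil(target * len(ranked))
    return ranked[keep - 1]


def sensitivity_at(truth_scores, cutoff):
    """Fraction of truth sites whose score passes the cutoff."""
    return sum(1 for s in truth_scores if s >= cutoff) / len(truth_scores)
```

A call such as `threshold_for_sensitivity(scores, 0.992)` returns a cutoff, and `sensitivity_at(scores, cutoff)` can then confirm that at least the requested fraction of truth sites passes it.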

Data and Definitions: 1004 Samples

- ASN = CHB + CHS + JPT
- ASN+ = CHB + CHS + JPT + MXL + CLM + PUR
- EUR = CEU + FIN + GBR + TSI + IBS
- EUR+ = CEU + FIN + GBR + TSI + IBS + MXL + CLM + PUR + ASW
- AFR = LWK + YRI + ASW
- AFR+ = LWK + YRI + ASW + CLM + PUR
- AMR = MXL + CLM + PUR
- AMR+ = MXL + CLM + PUR + ASW

Note: these definitions differ from those used by the other groups.
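The panel definitions above map naturally onto a small lookup table. The sketch below encodes them exactly as listed and adds a helper that reports which analysis panels a given population contributes to. The names `PANELS` and `panels_for` are hypothetical and chosen only for this illustration.

```python
# Analysis panel definitions as listed on the slide.
PANELS = {
    "ASN": {"CHB", "CHS", "JPT"},
    "ASN+": {"CHB", "CHS", "JPT", "MXL", "CLM", "PUR"},
    "EUR": {"CEU", "FIN", "GBR", "TSI", "IBS"},
    "EUR+": {"CEU", "FIN", "GBR", "TSI", "IBS", "MXL", "CLM", "PUR", "ASW"},
    "AFR": {"LWK", "YRI", "ASW"},
    "AFR+": {"LWK", "YRI", "ASW", "CLM", "PUR"},
    "AMR": {"MXL", "CLM", "PUR"},
    "AMR+": {"MXL", "CLM", "PUR", "ASW"},
}


def panels_for(population):
    """Return the sorted names of all panels that include `population`."""
    return sorted(name for name, pops in PANELS.items() if population in pops)


if __name__ == "__main__":
    # ASW is called with the AFR panel and with several admixed (+) panels.
    print(panels_for("ASW"))
```

Keeping the populations as sets makes it easy to check properties of the design, for instance that every admixed "+" panel is a superset of its continental panel.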

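Final chr20 callsets including fragment-based calling and contrastive VQSR clustering

| # samples | Analysis Panel | Total # variants | dbSNP % (129) | # knowns | Known ti/tv | # novels | Novel ti/tv | Novel non-CpG ti/tv |
| 266 | ASN | 264,… | | | | | | |
| | ASN+ | 446,… | | | | | | |
| | EUR | 300,… | | | | | | |
| | EUR+ | 516,… | | | | | | |
| | AFR | 475,… | | | | | | |
| | AFR+ | 529,… | | | | | | |
| | AMR | 350,… | | | | | | |
| | AMR+ | 452,… | | | | | | |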
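The ti/tv columns report the transition/transversion ratio of each callset. As a rough sketch of how such a ratio is computed, assuming the input is a list of biallelic SNPs given as hypothetical `(ref, alt)` base pairs: a transition swaps a purine for a purine (A, G) or a pyrimidine for a pyrimidine (C, T), and every other substitution is a transversion. This is an illustration, not the code used to produce the slide.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def is_transition(ref, alt):
    """True when ref and alt are both purines or both pyrimidines."""
    pair = {ref.upper(), alt.upper()}
    return pair == PURINES or pair == PYRIMIDINES


def titv_ratio(snps):
    """Compute ti/tv for a list of (ref, alt) pairs.

    Assumes every pair is a valid SNP with ref != alt.
    Returns infinity when there are no transversions.
    """
    snps = list(snps)
    ti = sum(1 for ref, alt in snps if is_transition(ref, alt))
    tv = len(snps) - ti
    return ti / tv if tv else float("inf")
```

For example, `titv_ratio([("A", "G"), ("C", "T"), ("A", "C"), ("G", "A")])` counts three transitions and one transversion.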