Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA.

Slides:



Advertisements
Similar presentations
applications of genome sequencing projects
Advertisements

What is an association study? Define linkage disequilibrium
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Single Nucleotide Polymorphism Copy Number Variations and SNP Array Xiaole Shirley Liu and Jun Liu.
OMICS Group Contact us at: OMICS Group International through its Open Access Initiative is committed to make genuine and.
1 of 25 Sequence Variation in Ensembl. 2 of 25 Outline SNPs SNPs in Ensembl Linkage disequilibrium SNPs in BioMart DAS sources.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Ferdinand van ’t Hooft Cardiovascular Genetics and Genomics Group Karolinska Institutet, Stockholm, Sweden Genome-Wide Association Study GWAS
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Outline to SNP bioinformatics lecture
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
SNP Resources: Finding SNPs Discovery and Databases Mark J. Rieder, PhD SeattleSNPs Workshop March 20-21, 2006.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
SNP Resources: Finding SNPs, Databases and Data Extraction Debbie Nickerson
NIEHS SNPs Workshop Introduction Debbie Nickerson Department of Genome Sciences University of Washington.
Picking SNPs Application to Association Studies Dana Crawford, PhD SeattleSNPs PGA University of Washington March 20, 2006.
SNP Resources: Finding SNPs, Databases and Data Extraction Debbie Nickerson NIEHS SNPs Workshop.
SNP Resources: Variation Discovery, HapMap and the EGP Mark J. Rieder Department of Genome Sciences NIEHS SNPs Workshop Jan 10-11,
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD Robert J. Livingston, PhD NIEHS Variation Workshop January 30-31, 2005.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Single nucleotide polymorphisms and applications Usman Roshan BNFO 601.
SNP Selection University of Louisville Center for Genetics and Molecular Medicine January 10, 2008 Dana Crawford, PhD Vanderbilt University Center for.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD SeattleSNPs Variation Workshop March 20-21, 2006.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Georgia Wiesner, MD CREC June 20, GATACAATGCATCATATG TATCAGATGCAATATATC ATTGTATCATGTATCATG TATCATGTATCATGTATC ATGTATCATGTCTCCAGA TGCTATGGATCTTATGTA.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Genetic Variations Lakshmi K Matukumalli. Human – Mouse Comparison.
Single Nucleotide Polymorphisms Mrs. Stewart Medical Interventions Central Magnet School.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
A single-nucleotide polymorphism tagging set for human drug metabolism and transport Kourosh R Ahmadi, Mike E Weale, Zhengyu Y Xue, Nicole Soranzo, David.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
Molecular & Genetic Epi 217 Association Studies
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
SeattleSNPs Variation Discovery Resource Materials prepared by: Mary E. Mangan, PhD Updated: Q Version 1.
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
ABC for the AEA Basic biological concepts for genetic epidemiology Martin Kennedy Department of Pathology Christchurch School of Medicine.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte, Xin Liu & Mark Pletcher.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
GVS: Genome Variation Server Materials prepared by: Warren C. Lathe, PhD Updated: Q Version 2.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
The International Consortium. The International HapMap Project.
Association of functional polymorphisms of Bax and Bcl2 genes with schizophrenia Kristina Pirumya, PhD, Laboratory of Human Genomics and Immunomics Institute.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
Identification of Single Nucleotide Polymorphisms Predictive of Sporadic Prostate Cancer Susceptibility Using a Case-Control Association.
Copyright OpenHelix. No use or reproduction without express written consent1.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Consideration for Planning a Candidate Gene Association Study With TagSNPs Shehnaz K. Hussain, PhD, ScM Epidemiology 243: Molecular.
School of Pharmacy, University of Nizwa
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Single Nucleotide Polymorphisms
By Michael Fraczek and Caden Boyer
Polymorphisms in Cytokine Genes Are Associated With Higher Levels of Fatigue and Lower Levels of Energy in Women After Breast Cancer Surgery  Kord M.
Medical genomics BI420 Department of Biology, Boston College
Medical genomics BI420 Department of Biology, Boston College
Selecting a Maximally Informative Set of Single-Nucleotide Polymorphisms for Association Analyses Using Linkage Disequilibrium  Christopher S. Carlson,
SNPs and CNPs By: David Wendel.
Presentation transcript:

Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA Epidemiology 244: Cancer Epidemiology Methods

Objectives  Molecular genetics primer  Databases and tools to conduct in silico analyses for tagSNP selection/prioritization

Central dogma DNA Protein mRNA A T C G

What are SNPs?  More than 99% of all nucleotides are the same in all humans  1% of nucleotides are polymorphic  SNPs>> insertions-deletions  Bi-nucleotide – T (80%) A (20%)  Where do SNPs occur?  Exons  Introns  Flanking regions

? T ? G ? A A T G G A A T T C G T A What are haplotypes?  A haplotype is the pattern of nucleotides on a single chromosome  Two “copies” of each chromosome  The haplotype inference problem TA TT CG GG TA AA

What is linkage disequilibrium?  Linkage disequilibrium (LD) describes the non- random association of nucleotides on the same chromosome in a population  One nucleotide at one position (locus) predicts the occurrence of another nucleotide at another locus No LD LD

Disease Susceptibility Locus Disease Phenotype Test for genetic association between the phenotype and the DSL Marker loci (SNPs) LD Test for association between phenotype and marker loci What are markers? Candidate gene

What are tagSNPs?  TagSNPs are a subset of all SNPs in a gene that mark groups of SNPs in LD  Avoids redundant genotyping Disease Susceptibility Locus LD Marker loci (SNPs)

The joint effect of tagSNPs in cytokine genes and cigarette smoking in cervical cancer risk

T-cell proliferation IL-2gene IFNγgene Activated T-cell Proliferation of TH1-cells IL-2 IFNγ IL-2 receptor IL-2gene Activated T-cell IL-2 -2gene Activated T-cell IL-2 -2 receptor IL-2 receptor IFNγgene IFNγ γgene IFNγ

Background  Cigarette smoking ↑ 1.5- to 3-fold cancer risk  Cigarette smoking ↓ levels of IL-2 and IFNγ (cervical and circulating)  ↓ levels of IL-2 and IFNγ  HPV persistence in the cervix  Cervical neoplasia  Decreased survival from invasive cervical cancer

Cigarette smoking HPV-associated squamous cell cervical cancer Model SNPs in IL-2, IL-2R, and IFNG

 Study design  Population-based case-only study  Subjects  308 Caucasian squamous cell cervical cancer cases diagnosed  Residing in 3 western Washington counties  Data collection  Structured in–person interviews  DNA isolated from buffy coats Methods

Multi-stage tagSNP design Re-sequence panel, identify SNPs (many markers, few subjects) Choose tagSNPs Genotype tagSNPs in main study (few markers, many subjects) Select reference panel

 A sample of your study population  Most representative  Samples from the Coriell Repository  Ability to integrate your data with other resources 1. Select reference panel = Candidate gene SNPs= HapMap SNPs

2. Re-sequence reference panel PolyPhred Phred Phrap (Nickerson, 1997) (Ewing, 1998) Amplify and Sequence DNA Gene

Alternatives to re-sequencing  Program for Genomic Applications (PGA)  SeattleSNPs – inflammation  NIEHS SNPs – environmental response  Innate Immunity  International HapMap Project  5 million SNPs in four ethnically distinct populations

3. Choose tagSNPs Option LDSelect (Carlson, 2002) Tagger (de Bakker, 2005) r 2 thresholdYes SNP exclusions/inclusionsNoYes SNP design scoreNoYes

LDSelect output for IL-2 SeattleSNPs, r 2 ≥0.80, MAF ≥0.05, Caucasian Bin Total Number of Sites TagSNPs 12 rs rs rs rs rs rs rs

 Exons (cSNPs)  SIFT (Ng, 2002)  PolyPhen (Ramensky, 2002)  Upstream flanking region  Intron-exon junctions Genomic context

Sequence conservation Repeat regionUnique region  UCSC Genome Browser, PhasCons (Siepel, 2005) Score

TagSNP summary  Efficient yet comprehensive coverage of the genetic variation in our candidate genes  Reduce costs  Preference should be given to putatively functional variants:  Literature, gene context, sequence conservation

Thanks for your attention! Questions?