Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology

Slides:



Advertisements
Similar presentations
Lecture 2 Strachan and Read Chapter 13
Advertisements

CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
A coalescent computational platform for tagging marker selection for clinical studies Gabor T. Marth Department of Biology, Boston College
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Polymorphism Structure of the Human Genome Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
The informatics of SNPs and haplotypes Gabor T. Marth Department of Biology, Boston College Cold Spring Harbor Laboratory Advanced Bioinformatics.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
The informatics of SNPs and haplotypes Gabor T. Marth Department of Biology, Boston College Cold Spring Harbor Laboratory Advanced Bioinformatics.
Lecture X.X1. 2 The informatics of SNPs and Haplotypes Gabor T. Marth Department of Biology, Boston College
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Human Genome Sequence and Variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary,
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Polymorphism discovery informatics Gabor T. Marth Department of Biology Boston College Chestnut Hill, MA
Sequence Variation Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Population Genetics 101 CSE280Vineet Bafna. Personalized genomics April’08Bafna.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Software tools for the analysis of medically important sequence variations Gabor T. Marth, D.Sc. Boston College Department of Biology
Medical variations Gabor T. Marth Boston College Biology Department BI543 Fall 2013 February 5, 2013.
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
A single-nucleotide polymorphism tagging set for human drug metabolism and transport Kourosh R Ahmadi, Mike E Weale, Zhengyu Y Xue, Nicole Soranzo, David.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Conservation of genomic segments (haplotypes): The “HapMap” n In populations, it appears the the linear order of alleles (“haplotype”) is conserved in.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Gene Hunting: Linkage and Association
Genome-Wide Association Study (GWAS)
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Lecture 7.01 The informatics of SNPs and haplotypes Gabor T. Marth Department of Biology, Boston College CGDN Bioinformatics Workshop June.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
The International Consortium. The International HapMap Project.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Notes: Human Genome (Right side page)
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Pharmacogenetics/Pharmacogenomics. Outline Introduction  Differential drug efficacy  People react differently to drugs Why does drug response vary?
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Discovery tools for human genetic variations
Genome organization and Bioinformatics
BI820 – Seminar in Quantitative and Computational Problems in Genomics
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Incorporating changing population size into the coalescent
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Medical genomics BI420 Department of Biology, Boston College
Medical genomics BI420 Department of Biology, Boston College
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
Research for medical discovery at the Computational Genomics Laboratory at Boston College Biology Gabor T. Marth Department of Biology, Boston College.
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Presentation transcript:

Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology

We study genetic variations because… … they underlie phenotypic differences … cause heritable diseases and determine responses to drugs … allow tracking ancestral human history

Our current projects investigate three essential aspects of genetic variations… how to discover inherited genetic polymorphisms that lead to disease? how to model human polymorphism structure to inform medical research? how to select the best genetic markers for clinical case-control association studies?

inherited (germ line) polymorphisms are important as they can predispose to disease the most common type of human polymorphisms are single-nucleotide polymorphisms (SNPs) and short insertion-deletions (INDELs) We build computer tools for variation discovery… we have developed a computer package, PolyBayes©, for accurate discovery of DNA polymorphisms in clonal sequences Marth et al. Nature Genetics 1999

… we are currently expanding our polymorphism detection capabilities. Homozygous T Homozygous C Heterozygous C/T for automated detection of somatic single base pair mutations in diploid samples to make the software available for genome centers with high-performance systems and small Biology labs with desktop computers to include our new knowledge of human variation structure into the detection algorithms

2. We measure genome-wise distributions of DNA polymorphism data… 1. marker density (MD): distribution of number of SNPs in pairs of sequences “rare” “common” 2. allele frequency spectrum (AFS): distribution of SNPs according to allele frequency in a set of samples

… we build models of these distributions under competing scenarios of human demographic history… past present stationaryexpansioncollapse MD (simulation) AFS (direct form) history bottleneck

… and determine the best-fitting models. European data African data genetic bottleneck modest but uninterrupted expansion Marth et al. PNAS 2003; Genetics 2004

3. The HapMap project aims to map out human polymorphism structure to aid gene mapping… However, the variation structure observed in the reference DNA samples genotyped by the HapMap project… … often does not match the structure in another set of samples such as clinical samples used to find disease genes and disease-causing genetic variants

… we generate “quasi-samples” with computational means to study sample-to-sample variability… Instead of genotyping additional sets of (clinical) samples with costly experimentation, and comparing the variation structure of these consecutive sets directly… … we generate additional samples with computational means, based on our Population Genetic models of demographic history, using the Coalescent process.

… and to optimize tag SNP (marker) selection for clinical association studies. 2. generate computational samples for this genome region 3. test the performance of markers across consecutive sets of computational samples 1. select markers (tag SNPs) with standard methods

We are developing projects to expand… from single-nucleotide DNA changes to developing computer tools for the detection of other types of genomic and epigenetic changes (e.g. in cancer) to developing visualization and statistical tools for the integration of diverse genetic and epigenetic data (Image from Nature Reviews Genetics) to using the fruits of the HapMap project, dense SNPs, Linkage Disequilibrium, and haplotype markers to help predict individual responses to drugs, including adverse drug reactions

Detecting SNPs in medical re-sequencing data, short insertions / deletions detection in new data types produced by the latest, super-high throughput sequencing technologies (i.e. 454 Life Sciences sequencing machines) that will be used for individual medical re-sequencing reliable detection of INDELs and microsatellite polymorphisms, both in clonal and in diploid sequence data, e.g. to detect repeat instabilities

Using SNP array data intelligently to detect chromosomal aberrations Speicher & Carter, NRG 2005

Software development for other genetic and epigenetic data (focus on data confidence) copy number detection chromatin structure Sproul, NRG 2005 methylation profile Laird, NRC 2005

Integrate genetic and epigenetic data from varied sources to find “common themes” during cancer development chromosome rearrangements chromatin structure gene expression profile copy number changes methylation profile repeat expansions

Using new haplotype resources to connect genotype and clinical outcome in pharmaco-genetic systems the HapMap was designed as a tool to detect high-frequency (common) phenotypic (e.g. disease-causing) alleles important drug metabolizing enzymes are relatively few in number, well studied, are at known genome locations, many associated phenotypes are well described many functional alleles are known, and of high frequency (common) multi-SNP alleles are highly predictive of metabolic phenotype clinical phenotype (adverse drug reaction) less predictable ideal candidate for applying haplotype resources

Multi-marker haplotypes as accurate markers for ADRs? functional allele (known metabolic polymorphism) genetic marker (haplotype) in genome regions of drug metabolizing enzyme (DME) genes molecular phenotype (drug concentration measured in blood plasma) clinical endpoint (adverse drug reaction) computational prediction based on haplotype structure

Resources specifics of enzyme- drug interactions LD and haplotype structure in the HapMap reference samples, based on high-density SNP map functional alleles existing DME P genotyping chips

Evolutionary / PopGen questions mutation age? mutations single-origin or recurrent? geographic origin of mutations? analysis based on complete local variation structure and haplotype background of functional mutations specifics of the selection process that led to specific functional alleles?

Proposed steps of analysis haplotypes vs. metabolic phenotype? complete polymorphic structure? ethnicity? additional functional SNPs? haplotypes vs. functional alleles? haplotype block? functional allele (genotype) metabolic phenotype clinical phenotype (ADR) haplotype haplotypes vs. ADR phenotype?

Funding sources / plans polymorphism discovery + medical re-sequencing data analysis: 5-year NIH R01 research grant awarded pop-gen modeling + haplotype analysis + marker selection system: NIH R01 application pending informatics tools for genomic and epigenetic changes in cancer: need a postdoc to establish project (startup or NIH R21 or private funding) haplotypes in Pharmacogenomics: need a postdoc to establish project (startup or NIH R21 or private funding)