Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.

Slides:



Advertisements
Similar presentations
IDENTIFYING CAUSAL GENES AND DYSREGULATED PATHWAYS IN COMPLEX DISEASES Nov. 6 th, 2010 YOO-AH KIM NIH / NLM / NCBI.
Advertisements

Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
1 Harvard Medical School Mapping Transcription Mechanisms from Multimodal Genomic Data Hsun-Hsien Chang, Michael McGeachie, and Marco F. Ramoni Children.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Florian Markowetz markowetzlab.org Joining the dots… Network analysis of gene perturbation data.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Functional genomics and inferring regulatory pathways with gene expression data.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
6. Gene Regulatory Networks
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Epistasis Analysis Using Microarrays Chris Workman.
Identification of obesity-associated intergenic long noncoding RNAs
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Department of Biomedical Informatics Bioinformatics and Genetics Kun Huang Department of Biomedical Informatics OSUCCC Biomedical Informatics Shared Resource.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
Statistical Bioinformatics QTL mapping Analysis of DNA sequence alignments Postgenomic data integration Systems biology.
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
Geuvadis RNAseq analysis at UNIGE Analysis plans
Characterizing the role of miRNAs within gene regulatory networks using integrative genomics techniques Min Wenwen
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Radiogenomics in glioblastoma multiforme
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Reconstruction of Transcriptional Regulatory Networks
Variable gene expression and reduced penetrance in familial adenomatous polyposis Rohlin A* 1, Wernersson J 1, Björk J 2, and Nordling M 1 Dept of Clinical.
Regulation of gene expression in the mammalian eye and its relevance to eye disease Todd Scheetz et al. Presented by John MC Ma.
Supplementary Figure S1 eQTL prior model modified from previous approaches to Bayesian gene regulatory network modeling. Detailed description is provided.
Signatures of Accelerated Somatic Evolution in Gene Promoters in Multiple Cancer Types Update Talk Kyle Smith De Lab.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Affymetrix 2.0 miRNA arrays on lung tissue RNA n=3-4 mice/strain 92 differentially expressed miRNAs 38 miRNAs both differentially and highly expressed.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Identifying Causal Genes and Dysregulated Pathways in Complex Diseases Discussion leader: Nafisah Islam Scribe: Matthew Computational Network Biology BMI.
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
1 Paper Outline Specific Aim Background & Significance Research Description Potential Pitfalls and Alternate Approaches Class Paper: 5-7 pages (with figures)
Stitching the Tutorials Together Introduction to Systems Biology Course Chris Plaisier Institute for Systems Biology.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
6/11/20161 Graph models and efficient exact algorithms in studying cancer signaling pathways Songjian Lu, Lujia Chen, Chunhui Cai Department of Biomedical.
Transcriptional Enhancers Looking out for the genes and each other Sridhar Hannenhalli Department of Cell Biology and Molecular Genetics Center for Bioinformatics.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Increasing Power in Association Studies by using Linkage Disequilibrium Structure and Molecular Function as Prior Information Eleazar Eskin UCLA.
Understanding GWAS SNPs Xiaole Shirley Liu Stat 115/215.
Brendan Burke and Kyle Steffen. Important New Tool in Genomic Medicine GWAS is used to estimate disease risk and test SNPs( the most common type of genetic.
EQTLs.
Journal club Jun , Zhen.
A graph-based integration of multiple layers of cancer genomics data (Progress Report) Do Kyoon Kim 1.
1. SELECTION OF THE KEY GENE SET 2. BIOLOGICAL NETWORK SELECTION
Reconstructing the Evolutionary History of Complex Human Gene Clusters
Integromic Analysis of Genetic Variation and Gene Expression Identifies Networks for Cardiovascular Disease PhenotypesCLINICAL PERSPECTIVE by Chen Yao,
Dept of Biomedical Informatics University of Pittsburgh
Gene Hunting: Design and statistics
Linking Genetic Variation to Important Phenotypes
Complex Traits Qualitative traits. Discrete phenotypes with direct Mendelian relationship to genotype. e.g. black or white, tall or short, sick or healthy.
Schedule for the Afternoon
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Linkage analysis and genetic mapping
Volume 11, Issue 5, Pages (May 2015)
Principle of Epistasis Analysis
Towfique Raj, Manik Kuchroo, Joseph M
One SNP at a Time: Moving beyond GWAS in Psoriasis
Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS  Xin He, Chris K. Fuller, Yi Song, Qingying Meng, Bin Zhang,
A systems view of genetics in chronic kidney disease
Evan G. Williams, Johan Auwerx  Cell 
Principles and Strategies for Developing Network Models in Cancer
On the Dependency of Cellular Protein Levels on mRNA Abundance
Presentation transcript:

Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype

GWAS studies – Genome wide scan for genotype - phenotype association

Expression as quantitative trait

expression Quantitative Trait Loci analysis (eQTL) 4 Control 1 Control 2 Control 3 Case 1 Case 2 Case 3 Case 4 Case 5 Case 6 Case 7 Case 8 Gene 1 Gene 2 Gene 3. Gene 3 Phenotype eQTL Putative target gene … SNP 1 SNP 2 SNP 4 Putative causal gene/loci Individuals

Importance of expression as quantitative trait Provides huge array of phenotypes Identifies putative regulatory regions It can be combined with “higher level” phenotypic variations such as diseases

Challenges 6 Limited statistical power due to multiple testing The expression of a gene might be influenced by many loci in additive or non-additive way While we assume that the genetic variation is the cause and expression change is the effect, we don’t know molecular mechanism behind this relation For genotype variation defined by changes of gene copy number, what is the impact of copy number variation on the expression of a given gene?

Challenges 7 Limited statistical power due to multiple testing Yang et al. ISMB 2009; Bioinformatics 2009 The expression of a gene might be influenced by many loci in additive or non-additive way Yang et al. in preparation While we assume that the genetic variation is the cause and expression change is the effect, we don’t know molecular mechanism behind this relation Kim et al. RECOMB 2010 What is the impact of copy number variation on the expression of a given gene? Malone, Cho et al. in preparation

Challenges 8 Limited statistical power due to multiple testing Yang et al. ISMB 2009; Bioinformatics 2009 The expression of a gene might be influenced by many loci in additive or non-additive way Yang et al. in preparation While we assume that the genetic variation is the cause and expression change is the effect, we don’t know molecular mechanism behind this relation Kim et al. RECOMB 2010 For genotype variation defined by changes of gene copy number, what is the impact of copy number variation on the expression of a given gene? Malone, Cho et al. in preparation

Copy number variations in cancer BSOSC Review, November 20089

10 Gene 1 Gene 2 Gene 3. Gene 3 controls Disease Cases Disease Associated over/under expressed genes?

11 Gene 1 Gene 2 Gene 3. Gene 3 loci … … controls Disease Cases Gene 1 Gene 2 Gene 3. Gene 3 eQTL

Candidate genes Gene NetworkTarget Gene C1 C2 C3 C4 C5 Case 1 Case 2 Case 7 … … Genotypic variations Current flow + -

Candidate genes Gene NetworkTarget Gene C1 C2 C3 C4 C5 Case 1 Case 2 Case 7 … … Genotypic variations Current flow + - Adding resistance R is set to be reversely proportional to the average correlation of the expression of the two genes with copy number variation of C2

BSOSC Review, November controls Disease Cases 14 Gene 1 Gene 2 Gene 3. Gene 3 1 … 234 Select subset that “explains” the disease

BSOSC Review, November Case Putative Causal gene Causal gene has copy number variation in the given case, low p-value pathway connecting it to a target gene that is differentially express in the same case # of such causal target genes = edge weight

Tree important sets of genes of interest Disease genes Causal geneses Disease hubs – genes that appear on many disease related pathways (pathways from a causal gene to a diseases gene) BSOSC Review, November

BSOSC Review, November

BSOSC Review, November

BSOSC Review, November Caveats: Some edges (e.g. transcription regulation) have direction At the end of each path there must be a transcription factor which directly affects gene expression Design appropriate permutation test to support the results The current flow needs to be solved on a huge network

BSOSC Review, November Caveats: Some edges (e.g. transcription regulation) have direction At the end of each path there must be a transcription factor which directly affects gene expression Design appropriate permutation test to support the results The current flow needs to be solved on a huge network

Dropping the restriction that last last but one node on the pathway is a TF target genes overlap causal genes overlap BSOSC Review, November

BSOSC Review, November Network distances nodes In the two sets

Effect of copy number variation of a gene on expression of this gene: Expected: But sometimes we observe : BSOSC Review, November Copy #Expression Copy #Expression Example CDK2, negative correlation -0.28

Impact of gene copy number variation (CNV) on gene expression; GLIOMA (this work) DrosDel (collaboration with experimental group of Brian Oliver NIDDK) Copy number variations caused by: Somatic cell mutation Experimental knock-out of one copy of a region (drosDel lines) How changes in copy number propagate trough the cellular system : Phenotype  Genotype Identify “causal” CNV and dys- regulated pathways Genotype  Phenotype How the organism reacts to the change in gene dosage

DrosDel lines profiled chr2L 8 MB and ~ 700 genes deficient

How fly responses to gene deletion BSOSC Review, November Genotype Phenotype +/+ Dose Network Cascade Df/+ ? ? ?

How fly responses to gene deletion Genotype Phenotype +/+ Dose Network Cascade Df/+ ? ? ?

Females Males log 2 Mean Df/+ / +/+ Expression -33 log 2 Mean Df/+ / +/+ Expression Distribution of Expression Fold Changes

Females Males To network GenotypeDose FEEDBACK Df/+ Adjusted dose Less feedback Reduced adjusted dose Network Buffering? Df/+ To network

Acknowledgments Przytycka’s group Yoo-ah Kim Collaboration Stefan Wuchty NCBI Przytycka’s group Dong Yeon Cho Brian’s Oliver group (NIDDK / NIH) John Malone; Justen Andrews Indiana University Thanks to other members of Przytycka’s group Yang Huang, Damian Wojtowicz, Jie Zhang, Dong Yeon Cho Funding NIH intramural program

Height - Quantitative trait aaAaAA height

BSOSC Review, November Starting from selecting “disease genes” we identified copy number variations that associate with expression changes of these genes and putative pathways that propagate the genetic perturbation from copy number variation to the disease genes

33 I computed p-values in the different levels of our algorithm and the following table shows the results. * GBM genes listed in AceView. 93 genes are listed. ** results with the best p-value among experiments with different parameters BSOSC Review, November Gene 1 Gene 2 Gene 3. Gene 3 … A.Number of GenesA.AceViewA.DAVID Association (75)0.027 (56) Circuit flow algorithm (10) 1.3  (25) Circuit flow + set cover  (6)9.9  (8)