Www.polyomx.org Introduction Hereditary predisposition (mutations in BRCA1 and BRCA2 genes) contribute to familial breast cancers. Eighty percent of the.

Slides:



Advertisements
Similar presentations
Regulation of Consumer Tests in California AAAS Meeting June 1-2, 2009 Beatrice OKeefe Acting Chief, Laboratory Field Services California Department of.
Advertisements

Which Phenotypes Can be Predicted from a Genome Wide Scan of Single Nucleotide Polymorphisms (SNPs): Ethnicity vs. Breast Cancer Mohsen Hajiloo, Russell.
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Clinical Trial Designs for the Evaluation of Prognostic & Predictive Classifiers Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer.
1 Single Nucleotide Polymorphisms (SNP) Gary Jones SPE, Technology Center 1600 (703)
Hereditary Factors in Breast Cancer
Recursive Partitioning Method on Survival Outcomes for Personalized Medicine 2nd International Conference on Predictive, Preventive and Personalized Medicine.
The Genetics of Breast and Ovarian Cancer Susceptibility Patricia Tonin, PhD Associate Professor Depts. Medicine, Human Genetics & Oncology McGill University.
Breast Cancer Risk and Risk Assessment Models
Estimating the penetrances of breast and ovarian cancer in the carriers of BRCA1/2 mutations Silvano Presciuttini University of Pisa, Italy.
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Assuming normally distributed data! Naïve Bayes Classifier.
Introduction to Medical Genetics Fadel A. Sharif.
A Perspective on the Data Ajit Paul Singh M.Sc. Candidate Dept. of Computing Science University of Alberta.
Environmental Health III. Epidemiology Shu-Chi Chang, Ph.D., P.E., P.A. Assistant Professor 1 and Division Chief 2 1 Department of Environmental Engineering.
Chapter 5 Human Heredity by Michael Cummings ©2006 Brooks/Cole-Thomson Learning Chapter 5 Complex Patterns of Inheritance.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Introduction to Molecular Epidemiology Jan Dorman, PhD University of Pittsburgh School of Nursing
BRCA Mutations and Breast Cancer Ruth Phillips and Patty Ashby.
Jeff Howbert Introduction to Machine Learning Winter Classification Bayesian Classifiers.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Genomics Alexandra Hayes. Genomics is the study of all the genes in a person, as well as the interactions of those genes with each other and a person’s.
Prospective Study Cohort Study Assis.Prof.Dr Diaa Marzouk Community Medicine.
Chapter 13 Carrier Screening. Introduction Carrier screening involves testing of individuals for heterozygosity for genes that would produce significant.
Analysis of Molecular and Clinical Data at PolyomX Adrian Driga 1, Kathryn Graham 1, 2, Sambasivarao Damaraju 1, 2, Jennifer Listgarten 3, Russ Greiner.
From Genomic Sequence Data to Genotype: A Proposed Machine Learning Approach for Genotyping Hepatitis C Virus Genaro Hernandez Jr CMSC 601 Spring 2011.
Precision Medicine A New Initiative. The Concept of Precision Medicine (PM) The prevention and treatment strategies that take individual variability into.
Case(Control)-Free Multi-SNP Combinations in Case-Control Studies Dumitru Brinza and Alexander Zelikovsky Combinatorial Search (CS) for Disease-Association:
Conceptual Issues in Risk Prediction Colin B. Begg Memorial Sloan-Kettering Cancer Center.
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
Wang Y 1,2, Damaraju S 1,3,4, Cass CE 1,3,4, Murray D 3,4, Fallone G 3,4, Parliament M 3,4 and Greiner R 1,2 PolyomX Program 1, Department.
Machine Learning II 부산대학교 전자전기컴퓨터공학과 인공지능연구실 김민호
BRCA1 and BRCA2 are genes which normally help to prevent breast cancer by making proteins which inhibit abnormal cellular growth. While a woman with an.
Quantitative Genetics
Polysubstance Abuse–Vulnerability Genes: Genome Scans for Association, Using 1,004 Subjects and 1,494 Single-Nucleotide Polymorphisms Uhl, Liu, Walther,
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
HUMAN GENOME PROJECT International effort of 13 years (1990 – 2003) Identified all the approximate 20,000 – 25,000 genes in human DNA Determined the sequences.
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
Breast Cancer. Breast cancer is a disease in which malignant cells form in the tissues of the breast – “National Breast Cancer Foundation” The American.
Paolo Vineis University of Torino and ISI Foundation, Torino, Italy address: GENE-ENVIRONMENT INTERACTIONS IN CANCER.
Identification of Single Nucleotide Polymorphisms Predictive of Sporadic Prostate Cancer Susceptibility Using a Case-Control Association.
Pancreatic Cancer In 2012 there were 43,920 cases of pancreatic cancer. 10% of these cases have a family clustering of pancreatic cancers and associated.
Breast cancer affects 1 in 8 women during their lives. 1 Population Statistics.
A Comprehensive Genomic Approach to the Identification of Predictive Markers using DNA and Tissue Repair Gene Polymorphisms in Radiation.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Unit 1 – Living Cells.  The study of the human genome  - involves sequencing DNA nucleotides  - and relating this to gene functions  In 2003, the.
Notes: Human Genome (Right side page)
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
What Percentage of Cancer is Considered to be Hereditary?
Genetic Counseling Yahwardiah Siregar Sry Suryani W Mutiara Indah Sari.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Cell Biology & Cancer Unit Objective 1 Cancer types, incidence, pre-disposition, and risk factors Biomedical Technology.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Single Nucleotide Polymorphisms (SNPs
Classification with Gene Expression Data
Results for all features Results for the reduced set of features
Chapter 6 Classification and Prediction
What are reduced penetrance and variable expressivity
Biomedical Technology
Family Tree Presentation
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Transcriptional Signature of Histone Deacetylases in Breast cancer
Somi Jacob and Christian Bach
The decision tree with the two alternatives.
Coverage and Reimbursement to Genetic Testing
Biomedical Technology
Objective 1: Use Weka’s WrapperSubsetEval (Naïve Bayes
Using Bayesian Network in the Construction of a Bi-level Multi-classifier. A Case Study Using Intensive Care Unit Patients Data B. Sierra, N. Serrano,
Presentation transcript:

Introduction Hereditary predisposition (mutations in BRCA1 and BRCA2 genes) contribute to familial breast cancers. Eighty percent of the breast cancers are sporadic in nature, with contributions from several yet to be identified low penetrance genes and their interactions with carcinogenic environmental exposures. Identification of the molecular signatures would help target at- risk populations for early screening and preventive efforts. Hereditary predisposition (mutations in BRCA1 and BRCA2 genes) contribute to familial breast cancers. Eighty percent of the breast cancers are sporadic in nature, with contributions from several yet to be identified low penetrance genes and their interactions with carcinogenic environmental exposures. Identification of the molecular signatures would help target at- risk populations for early screening and preventive efforts. SNPs (Single Nucleotide Polymorphisms) are commonly occurring genetic variations. SNPs may affect an individual's susceptibility to disease or response to particular treatment by altering the expression of the gene in which it occurs. SNPs (Single Nucleotide Polymorphisms) are commonly occurring genetic variations. SNPs may affect an individual's susceptibility to disease or response to particular treatment by altering the expression of the gene in which it occurs Analysis of Single Nucleotide Polymorphisms in Candidate Genes and Application of Machine Learning Techniques for assessing Susceptibility to Breast Cancer in Alberta Women Data In an earlier study, we have validated 98 SNPs from candidate genes in a retrospective cohort of 174 cases and 158 apparently healthy individuals (controls) from the Edmonton region. The present study attempts to reproduce these earlier findings using 169 newly diagnosed breast cancer patients, coupled with the same control population. Information Gain is a concept coming from the information and decision tree theory. It defines the increase in information which is caused by adding a new attribute node to a rule or decision tree. Usually an attribute with high information gain should be preferred over other attributes. Results Cancer Prediction: we report high reproducibility of genetic markers identified in each of the two independent studies with a prediction accuracy of 63-65%. Machine Learning: The field of machine learning is concerned with the question of how to construct computer programs that automatically perform better with (experience) training[1]. The techniques are designed to find patterns in training data and classify new data. K-fold Cross Validation is a common method used for model checking. ( Example: when K=3) Reference [1] Mitchell, T. Machine Learning. McGraw-Hill, Boston, Future Work We plan to genotype these SNPs using a larger number of cases and controls (750 each) in a prospective study with emphasis also to environmental factors influencing the breast cancer risk. Acknowledgements Dufour J 1, Wang Y 1,2, Cass CE 1,3,4, Greiner R 1,2, Mackey J 1,3 and Damaraju S 1,3,4 PolyomX Program 1, Department of Computing Science 2 and Oncology 3, U of A and Cross Cancer Institute 4 Method We used information gain to rank the quality of the SNPs and then considered classifiers based on the top k SNPs, for different "k”s. We used 20-fold cross validation to estimate the quality (predictive accuracy) of each classifier with each feature subset, as a way to identify the best classification system. Significant SNPs: We found SNPs in MSH6, MLH1, P53, ADPRT, AGTR, RET, BCL6, CYP19A1, CYP1B1 and CYP11B2 as informative. Prediction Accuracy Early Study 63% Present Study 65% Rank Early StudyPresent Study IG valueSNP Name IG value CYP11B2_4_T201CMSH6_4_T201C CYP1B1_1_C201GMLH1_1_A132G BCL6_2_C201TBDKRB2_2_C152T ARO1_1_T201CCYP11B2_4_T201C CYP11B2_9_G201ATp53_2_G201A MLH1_1_A132GCD38_1_A177C MSH6_4_T201CCYP11B2_11_G201A AGTR1_1_T201CCYP1B1_2_A201G RET_1_G201ACYP11B1_6_G135A CYP17_1_G201TCYP1B1_1_C201G CD38_1_A177CRET_1_G201A ADPRT_3_C201TGRB10_1_A201G ERCC2_2_C201TCYP11B2_17_T201C CD68_1_G201ABRCA2_13_A201G CYP11B1_6_G135AADPRT_3_C201T CYP11B2_12_C201TAGTR1_1_T201C CYP11B2_17_T201CCYP11B2_12_C201T Tp53_6_G201ABRCA2_3_C201A if attribute “a” is NOT correlated with class “c” Positive if correlated Naïve Bayes Classifier assumes that attributes are independent given a class. It computes the probability of each label, given the evidence and returns the most likely prediction. This work was funded by the Alberta Cancer Foundation and the Alberta Cancer Board.