Groupe de travail athérosclérose 1 STULONG Discovery Challenges Feedback Marie Tomečková EuroMISE – Cardio This work is supported by the project LN00B107.

Slides:



Advertisements
Similar presentations
Mining for Patterns Based on Contingency Tables by KL-Miner First Experience Jan Rauch Milan Šimůnek (PhD. student) Václav Lín (student) University of.
Advertisements

SETTINGS AS COMPLEX ADAPTIVE SYSTEMS AN INTRODUCTION TO COMPLEXITY SCIENCE FOR HEALTH PROMOTION PROFESSIONALS Nastaran Keshavarz Mohammadi Don Nutbeam,
Data Mining in Computer Games By Adib Adam Hussain & Mohammed Sarfraz.
Groupe de travail athérosclérose 1 Project STULONG Project STULONG Some analytical work at EuroMISE and University of Economics, Prague Jan Rauch EuroMISE.
SDS-Rules and Association Rules March 17, 2004Nicosia, Cyprus Tomáš Karban 1 Jan Rauch 2 Milan Šimůnek 2 1 Charles University, Prague Dept. of Software.
PKDD Discovery Challenges short review Jan Rauch EuroMISE – Cardio University of Economics, Prague This work is supported by the project LN00B107 of the.
1 Ronald C. Kessler, Ph.D. Department of Health Care Policy Harvard Medical School March 6, 2008 Comorbidity of Anxiety Disorders in the National Comorbidity.
AtherEx: an Expert System for Atherosclerosis Risk Assessment Petr Berka, Vladimír Laš University of Economics, Prague Marie Tomečková Institute of Computer.
Active subgroup mining for descriptive induction tasks Dragan Gamberger Rudjer Bošković Instute, Zagreb Zdenko Sonicki University of Zagreb.
WRSTA, 13 August, 2006 Rough Sets in Hybrid Intelligent Systems For Breast Cancer Detection By Aboul Ella Hassanien Cairo University, Faculty of Computer.
Shipi Kankane Prashanth Nakirekommula.  Applying analytics and risk- management capabilities to health insurance through LexisNexis data platforms. 
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
Harvard University Initiative for Global Health Global Health Challenges Social Analysis 76: Lecture 3.
Learning From Data Chichang Jou Tamkang University.
Cohort Studies.
Aim : Develop prediction model that can be used to facilitate clinicians in targeting patients at high or low risk of mortality. Method : Logistic Regression.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
April 6, o What is cancer? o Cancer statistics o Cancer prevention and early detection o Cancer disparities o Cancer survivorship o Cancer research.
SEWEBAR - a Framework for Creating and Dissemination of Analytical Reports from Data Mining Jan Rauch, Milan Šimůnek University of Economics, Prague, Czech.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Infectious Disease Epidemiology Sharyn Orton, Ph.D. American Red Cross, Rockville, MD Suggested reading: Modern Infectious Disease Epidemiology (1994)
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Data Mining Techniques
Data Mining Chun-Hung Chou
Sociological Research Methods and Techniques
Unit 1: Overview of HIV/AIDS Case Reporting #6-0-1.
Association between Systolic Blood Pressure and Congestive Heart Failure in Hypertensive Patients Mrs. Sutheera Intajarurnsan Doctor of Public Health Student.
Trend Analysis in Stulong Data The Gerstner laboratory for intelligent decision making and control Jiří Kléma, Lenka Nováková, Filip Karel, Olga Štěpánková.
Presented by Dr. Soe Sandi Tint
Evaluation of craniocerebral traumatisms treated at the Mures County Emergency Hospital between Author: Duka Ede-Botond Supervisor: PhD Dr. Madaras.
PhD Committee J. Vanthienen (promotor, K.U.Leuven) J. Vandenbulcke
Analysis of Death Causes in the STULONG Data Set Jan Burian, Jan Rauch EuroMISE – Cardio University of Economics Prague.
A three-step approach for STULONG database analysis: characterization of patients’ groups O. Couturier, H. Delalin, H. Fu, E. Kouamou, E. Mephu Nguifo.
HASAR : Mining Sequential Association Rules for Atherosclerosis Risk Factor Analysis Laurent Brisson, Nicolas Pasquier, Céline Hebert, Martine Collard.
Mrs. Watcharasa Pitug ID The Association between Waist Circumference and Renal Insufficiency among Hypertensive Patients 15/10/58 1.
Mrs. Watcharasa Pitug ID The Association between Waist Circumference and Renal Insufficiency among Hypertensive Patients 20/10/58 1.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
YEREVAN STATE MEDICAL UNIVERSITY INCIDENCE AND STRUCTURAL PATTERN OF CANCER IN RURAL POPULATION OF ARARAT VALLEY, ARMENIA E. Matevosyan, MPH student A.
Trend Analysis and Risk Identification 1 The Gerstner laboratory for intelligent decision making and control, Czech Technical University, Prague Lenka.
Association between Systolic Blood Pressure and Congestive Heart Failure Complication among Hypertensive and Diabetic Hypertensive Patients Mrs. Sutheera.
Acknowledgements Contact Information Anthony Wong, MTech 1, Senthil K. Nachimuthu, MD 1, Peter J. Haug, MD 1,2 Patterns and Rules  Vital signs medoids.
Bi-directional incremental evolution Dr Tatiana Kalganova Electronic and Computer Engineering Dept. Bio-Inspired Intelligent Systems Group Brunel University.
Descriptive Epidemiology
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
BY SANDY. WHAT IS DATAMINING TYPES OF DATAMINING TOOLS OVERVIEW OF TIBCO TIBCO SPOTFIRE MINER DATA ANALYSIS EXPLORE DATA MANIPULATE DATA CHART VIEW.
Learning from Positive and Unlabeled Examples Investigator: Bing Liu, Computer Science Prime Grant Support: National Science Foundation Problem Statement.
ECML/PKDD 2003 Discovery Challenge Attribute-Value and First Order Data Mining within the STULONG project Anneleen Van Assche, Sofie Verbaeten,
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
Research and Methodology
Causes elimination of specific causes of death among registered migrants to Thailand from Myanmar, the Lao PDR, and Cambodia Nucharee Srivirojana Writing.
Association between Systolic Blood Pressure and Congestive Heart Failure in Hypertensive Patients Mrs. Sutheera Intajarurnsan Doctor of Public Health Student.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Discovery Challenge – ECML/PKDD2004 September 20, 2004, Pisa, Italy Atherosclerosis Marie Tomečková EuroMISE Centre – Cardio Institute of Computer Science,
1 Mining Episode Rules in STULONG dataset N. Méger 1, C. Leschi 1, N. Lucas 2 & C. Rigotti 1 1 INSA Lyon - LIRIS FRE CNRS Université d’Orsay – LRI.
Csci 418/618 Simulation Models Dr. Ken Nygard, IACC 262B
Mary McClure, EdD, LPC Class 3. Multiple approaches to instruction are probably appropriate… Diversity of student populations is increasing – more inclusive.
INTER-SPOUSE COMMUNICATION AND CONTRACEPTIVE BEHAVIOR IN CAMEROON: A COUPLE-BASED ANALYSIS MBELLA MBELLA Cédric Stéphane Ministry of Economy, Planning.
DIABETES AS A RISK FACTOR FOR COGNITIVE DECLINE IN OLDER PATIENTS F. Limongi, A. Marseglia, S. Maggi, M. Noale, G. Romanato, P. Siviero, G. Crepaldi CNR,
SDS-Rules and Classification Tomáš Karban ECML/PKDD 2003 – Dubrovnik (Cavtat) September 22, 2003.
Data Mining is the process of analyzing data and summarizing it into useful information Data Mining is usually used for extremely large sets of data It.
Pulmonary Hypertension Is Associated with Mortality and Cardiovascular Events in Chronic Kidney Disease Patients Am J Nephrol 2017;45: DOI: /
Surveillance. Public Health Approach Surveillance What is the Problem ? Problem Risk Factor Identification : What Is the Cause ? Intervention Evaluation.
Kaplan-Meier survival curves comparing survival between both time periods according to management strategy. Survival in patients with infective endocarditis.
Epidemiological Terms
EPIDEMIOLOGY AS A TOOL TO EVALUATE QUALITY OF CARE
Kaplan-Meier survival curves for non-fatal stroke or mortality based on IMRS categories among (A) women (p-trend
Baseline Clinical Characteristics of All Patients and Patients Grouped by Statin Therapy - Part I H. Fukuta et al. Circulation 2005;112:
Presentation transcript:

Groupe de travail athérosclérose 1 STULONG Discovery Challenges Feedback Marie Tomečková EuroMISE – Cardio This work is supported by the project LN00B107 of the Ministry of Education of the Czech Republic

Groupe de travail athérosclérose2 STULONG Challenges – Medical feedback STULONG = acronym LONGitudinal STUdy Main aims of the study: To determine prevalence of the risk factors of atherosclerosis in middle-age men To follow up the development of the risk factors To follow up the development of the risk factors To asses the possibilities and the influence of the complex intervention on the incidence and values of the risk factors and on the cardiovascular mortality To asses the possibilities and the influence of the complex intervention on the incidence and values of the risk factors and on the cardiovascular mortality

Groupe de travail athérosclérose3 Atherosclerosis: a total complicated disease of all over organism a dynamic process, it begins in childhood and adolescence and continues for the whole life opinions on the origin and progress of the disease are developing interaction and influence of genetic predisposition and exterior environment the influence of so-called risk factors is still regarded some so-called protective factors exist

Groupe de travail athérosclérose4 STULONG - analysis Statistical - descriptive statistics - logistic regression - survival analysis Data mining - different methods - resulting in different conclusions

Prevalence of risk factors in RG

Mortality caused by atherosclerotic CVD

Mortality caused by atherosclerotic CVD depending on the number of RFA

Kaplan-Meier analysis in RG (at age of years) depending on the number of RFA

Kaplan-Meier analysis in RG (at age of years) depending on the number of RFA

The relative risk of death caused by other. CVD

Groupe de travail athérosclérose11 univaried and bivaried data analysis univaried and bivaried data analysis association rules association rules trend analysis trend analysis analysis so called time windows analysis so called time windows ROC analysis ROC analysis Miner tool SDS, WEKA tool, STATISTICA tool Miner tool SDS, WEKA tool, STATISTICA tool genetic approach genetic approach standard attribute-value data mining techniques standard attribute-value data mining techniques inductive logic programming technique inductive logic programming technique Different approaches to solve the analytic questions

Groupe de travail athérosclérose12 Different approaches to solve the analytic questions – continue: fuzzy approximate dependencies fuzzy approximate dependencies explicit relations - functional dependencies explicit relations - functional dependencies the inductive logic programming technique the inductive logic programming technique Rough Set Exploration System to solve both classification and descriptive tasks Rough Set Exploration System to solve both classification and descriptive tasks approach to generate a mathematical algebraic model – discriminate function - Werner, Kalganova approach to generate a mathematical algebraic model – discriminate function - Werner, Kalganova the selection of = interesting = emerging patterns (strong emerging patterns the selection of = interesting = emerging patterns (strong emerging patterns

Groupe de travail athérosclérose13 Challenge 2003 Some approaches to solve the analytic questions Genetic approach – function based on the Area Under the ROC curve - Conclusions very good understanable Azé,J. - Lucas,N. - Sebag,M.: A New Medical Test for Atherosclerosis Detection GeNo Fuzzy Approximate Dependencies – Fuzzy logic – Interesting relations – for discussion Berzal,F. - Cubero,J.C.- Sanchez,D. - Serrano,J.M. - Vila,M.A.: Finding Fuzzy Approximate Dependencies within STULONG Data

Groupe de travail athérosclérose14 Challenge 2003 – cont. Some approaches to solve the analytic questions Association rules – see later (Prague) Burian,J. - Rauch,J.: Analysis of Death Causes in the STULONG Data Set Strongest Emerging Patterns – very interesting approach, results to discuss Cremilleux,B.- Soulet,A. - Rioult,F.: Mining the Strongest Emerging Patterns Characterizing Patients Affected by Diseases Due to Atherosclerosis Rough Set Exploration System (RSES) – experimental tool, not yet implemented, without explications Hoa,N.S. - Son,N.H.: Analysis of STULONG Data by Rough Set Exploration System (RSES)

Groupe de travail athérosclérose15 Challenge 2003 – cont. Some approaches to solve the analytic questions SDS rules (Set Differs of Set) – some very interesting results – diferences among the groups in more than two variables – good conclusions Karban,T.: SDS-Rules and Classification on PKDD2003 Discovery Challenge Trend analysis, analysis so called time windows - interesting approache, some conlusiones to discuss from medical point of view Novakova,L. - Klema,J. - Jakob,M.- Rawles,S. - Stepankova,O.: Trend Analysis and Risk Identification

Groupe de travail athérosclérose16 Challenge 2003 – cont. Some approaches to solve the analytic questions WEKA tool, ACE data tool – very good presentation with ilustrative explanations Van Assche,A. - Verbaeten,S. - Krzywania,D. - Struyf,J. - Blockeel,H.: Attribute-Value and First Order Data Mining within the STULONG Project Discriminate function Discriminate function Werner,J.C. - Kalganova,T.: Risk Evaluation using Evolvable Discriminate Function