StatMaster – An Update Kartik Vishwanath Chintan Patel Yugyung Lee UMKC William Drake Richard Stroup Steve Simon Childrens Mercy Hospital, Kansas City,

Slides:



Advertisements
Similar presentations
Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Advertisements

BOAT - Optimistic Decision Tree Construction Gehrke, J. Ganti V., Ramakrishnan R., Loh, W.
Active subgroup mining for descriptive induction tasks Dragan Gamberger Rudjer Bošković Instute, Zagreb Zdenko Sonicki University of Zagreb.
A Paradigm for Space Science Informatics Kirk D. Borne George Mason University and QSS Group Inc., NASA-Goddard or
Data Mining: A Closer Look Chapter Data Mining Strategies.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
EPSY 430 Behavioral Constructs Behavioral Constructs Three Behavioral Domains.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Data Mining By Archana Ketkar.
Data Mining: A Closer Look Chapter Data Mining Strategies (p35) Moh!
Data Mining: A Closer Look Chapter Data Mining Strategies.
Thanks to Nir Friedman, HU
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Data Mining: A Closer Look
Data Mining: A Closer Look Chapter Data Mining Strategies 2.
Chapter 5 Data mining : A Closer Look.
Data Mining and Decision Tree CS157B Spring 2006 Masumi Shimoda.
ROUGH SET THEORY AND FUZZY LOGIC BASED WAREHOUSING OF HETEROGENEOUS CLINICAL DATABASES Yiwen Fan.
Enterprise systems infrastructure and architecture DT211 4
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Basic Data Mining Techniques
Data Mining Techniques
Hypothesis Testing A hypothesis is a conjecture about a population. Typically, these hypotheses will be stated in terms of a parameter such as  (mean)
Sampling Distributions and Hypothesis Testing. 2 Major Points An example An example Sampling distribution Sampling distribution Hypothesis testing Hypothesis.
1 An Introduction to Data Mining Hosein Rostani Alireza Zohdi Report 1 for “advance data base” course Supervisor: Dr. Masoud Rahgozar December 2007.
An Excel-based Data Mining Tool Chapter The iData Analyzer.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Chapter 1 Introduction to Data Mining
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Using Text Mining and Natural Language Processing for Health Care Claims Processing Cihan ÜNAL
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
AI Week 14 Machine Learning: Introduction to Data Mining Lee McCluskey, room 3/10
Data Mining: An Introduction Billy Mutell. “The Library of Babel” Analogy Network of bookshelves with every book ever written All the books one could.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
I2B2 Shared Task 2011 Coreference Resolution in Clinical Text David Hinote Carlos Ramirez.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 2 Data Mining: A Closer Look Jason C. H. Chen, Ph.D. Professor of MIS School of Business Administration.
Fox MIS Spring 2011 Data Mining Week 9 Introduction to Data Mining.
Risk Assessment Farrokh Alemi, Ph.D.. Session Objectives 1.Discuss the role of risk assessment in the TQM process. 2.Describe the five severity indices.
Decision Trees. Decision trees Decision trees are powerful and popular tools for classification and prediction. The attractiveness of decision trees is.
Data Management and Database Technologies 1 DATA MINING Extracting Knowledge From Data Petr Olmer CERN
Acknowledgements Contact Information Anthony Wong, MTech 1, Senthil K. Nachimuthu, MD 1, Peter J. Haug, MD 1,2 Patterns and Rules  Vital signs medoids.
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
Introduction of Data Mining and Association Rules cs157 Spring 2009 Instructor: Dr. Sin-Min Lee Student: Dongyi Jia.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Understanding User’s Query Intent with Wikipedia G 여 승 후.
Christoph F. Eick Questions and Topics Review November 11, Discussion of Midterm Exam 2.Assume an association rule if smoke then cancer has a confidence.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 4 An Excel-based Data Mining Tool (iData Analyzer) Jason C. H. Chen, Ph.D. Professor of MIS.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Data Mining and Decision Support
Modeling of Core Protection Calculator System Software February 28, 2005 Kim, Sung Ho Kim, Sung Ho.
1. ABSTRACT Information access through Internet provides intruders various ways of attacking a computer system. Establishment of a safe and strong network.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
MULTI DISEASE CLASSIFICATION BASED ON EFFECTIVE ANALYTICAL TECHNIQUES Guide: Mr.R. Nandhi kesavan S.Aabitha Banu A.Karthika.
Swamy Kurra Stephen Albanese Patrick Cahill Randal Betz
Data Mining Functionalities
By Arijit Chatterjee Dr
MIS2502: Data Analytics Advanced Analytics - Introduction
Introduction to Data Mining
MIS5101: Data Analytics Advanced Analytics - Introduction
Discovering Constrained Association Rules to Predict Heart Disease
Presentation transcript:

StatMaster – An Update Kartik Vishwanath Chintan Patel Yugyung Lee UMKC William Drake Richard Stroup Steve Simon Childrens Mercy Hospital, Kansas City, MO 07 June 2004

Defining Data Mining The automated extraction of hidden predictive information from (large) databases Three key words: Automated Hidden Predictive Implicit is a statistical methodology Data mining lets you be proactive Prospective rather than Retrospective

Kinds of Data Mining Problems Classification: Finding a set of models that describe or distinguish data classes. Clustering: Grouping objects by minimizing interclass similarity and maximizing intraclass similarity. Association: Discovery of association rules showing attribute-value conditions that occur frequently.

Examples of Data Mining

A Healthy Class Rule for the Cardiology Patient Dataset IF 169 <= Maximum Heart Rate <=202 THEN Concept Class = Healthy Rule accuracy: 85.07% Rule coverage: 34.55% The rule works correctly 85% of the time % of all healty patients meet the conditions specified in this rule

A Sick Class Rule for the Cardiology Patient Dataset IF Thal = Rev & Chest Pain Type = Asymptomatic THEN Concept Class = Sick Rule accuracy: 91.14% Rule coverage: 52.17%

Drawing Conclusions Recall the rule: IF 169 <= Maximum Heart Rate <=202 THEN Concept Class = Healthy Possible interpretations If patient’s max heart rate is low, s/he might have a heart attack? If patient had a heart attack, his max heart rate would decrease? A low max heart rate causes a heart attack? Only a medical expert can tell.

Another Example Hypoplastic Left Heart Syndrome Case Study Affects infants and is uniformly fatal without surgery. Extremely complex relationships among physiologic parameters in a given patient. Temporal datasets

Parameters continuously measured

Parameters intermittently measured and the Interventions

Some rules extracted by mining

Results Wellness score predicted with accuracy of 94.57%. Incorrect predictions for 1.60% of new cases (with unknown value of the wellness score) 2.22% of new cases the decision rules could not make any predications.

Discussion !! Discussion !!!