An Inteligent System to Diabetes Prediction

Slides:



Advertisements
Similar presentations
Naïve-Bayes Classifiers Business Intelligence for Managers.
Advertisements

G53MLE | Machine Learning | Dr Guoping Qiu
Application of Stacked Generalization to a Protein Localization Prediction Task Melissa K. Carroll, M.S. and Sung-Hyuk Cha, Ph.D. Pace University, School.
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
SUPPORT VECTOR MACHINES PRESENTED BY MUTHAPPA. Introduction Support Vector Machines(SVMs) are supervised learning models with associated learning algorithms.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
1 Lecture 5: Automatic cluster detection Lecture 6: Artificial neural networks Lecture 7: Evaluation of discovered knowledge Brief introduction to lectures.
Supervised classification performance (prediction) assessment Dr. Huiru Zheng Dr. Franscisco Azuaje School of Computing and Mathematics Faculty of Engineering.
Ensemble Learning: An Introduction
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
Lecture 10 Comparison and Evaluation of Alternative System Designs.
On the Application of Artificial Intelligence Techniques to the Quality Improvement of Industrial Processes P. Georgilakis N. Hatziargyriou Schneider ElectricNational.
Rohit Ray ESE 251. What are Artificial Neural Networks? ANN are inspired by models of the biological nervous systems such as the brain Novel structure.
SPAM DETECTION USING MACHINE LEARNING Lydia Song, Lauren Steimle, Xiaoxiao Xu.
CSCI 347 / CS 4206: Data Mining Module 06: Evaluation Topic 01: Training, Testing, and Tuning Datasets.
METU Informatics Institute Min720 Pattern Classification with Bio-Medical Applications Lecture Notes by Neşe Yalabık Spring 2011.
DATA MINING : CLASSIFICATION. Classification : Definition  Classification is a supervised learning.  Uses training sets which has correct answers (class.
CLassification TESTING Testing classifier accuracy
Cost-Sensitive Bayesian Network algorithm Introduction: Machine learning algorithms are becoming an increasingly important area for research and application.
Data Mining Practical Machine Learning Tools and Techniques Chapter 4: Algorithms: The Basic Methods Section 4.6: Linear Models Rodney Nielsen Many of.
Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Ensemble Learning Spring 2009 Ben-Gurion University of the Negev.
School of Engineering and Computer Science Victoria University of Wellington Copyright: Peter Andreae, VUW Image Recognition COMP # 18.
Tony Jebara, Columbia University Advanced Machine Learning & Perception Instructor: Tony Jebara.
A.N.N.C.R.I.P.S The Artificial Neural Networks for Cancer Research in Prediction & Survival A CSI – VESIT PRESENTATION Presented By Karan Kamdar Amit.
Class Imbalance Classification Implementation Group 4 WEI Lili, ZENG Gaoxiong,
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Machine Learning in Practice Lecture 10 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Combining multiple learners Usman Roshan. Decision tree From Alpaydin, 2010.
An Effective Hybridized Classifier for Breast Cancer Diagnosis DISHANT MITTAL, DEV GAURAV & SANJIBAN SEKHAR ROY VIT University, India.
Developing outcome prediction models for acute intracerebral hemorrhage patients: evaluation of a Support Vector Machine based method A. Jakab 1, L. Lánczi.
Computational Intelligence: Methods and Applications Lecture 15 Model selection and tradeoffs. Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Modeling of Core Protection Calculator System Software February 28, 2005 Kim, Sung Ho Kim, Sung Ho.
Overfitting, Bias/Variance tradeoff. 2 Content of the presentation Bias and variance definitions Parameters that influence bias and variance Bias and.
Screening System for Hypertension and Diabetes at Primary Care Level
Data Science Credibility: Evaluating What’s Been Learned
Ensemble Classifiers.
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
Machine Learning: Ensemble Methods
CS 9633 Machine Learning Support Vector Machines
Data-intensive Computing Algorithms: Classification
Machine Learning – Classification David Fenyő
Evaluating Classifiers
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
International Workshop
CSSE463: Image Recognition Day 11
Trees, bagging, boosting, and stacking
Classification with Perceptrons Reading:
Machine Learning Week 1.
Management of Type II Diabetes
A Modified Naïve Possibilistic Classifier for Numerical Data
CSSE463: Image Recognition Day 11
network of simple neuron-like computing elements
Classification of class-imbalanced data
Advanced Artificial Intelligence Classification
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Evaluating Impacts: An Overview of Quantitative Methods
Ensemble learning.
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
CSSE463: Image Recognition Day 11
CSSE463: Image Recognition Day 11
Physics-guided machine learning for milling stability:
MAS 622J Course Project Classification of Affective States - GP Semi-Supervised Learning, SVM and kNN Hyungil Ahn
Using Clustering to Make Prediction Intervals For Neural Networks
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Outlines Introduction & Objectives Methodology & Workflow
Austin Karingada, Jacob Handy, Adviser : Dr
Evaluation David Kauchak CS 158 – Fall 2019.
Presentation transcript:

An Inteligent System to Diabetes Prediction Zhilbert Tafa Presented at Mediterranean Conference on Embedded Computing MECO ‘2015, Budva, Montenegro

MECO'2015, Budva, June 2015, Mentenegro Outline Facts Objectives Materials Methods Results Conclusions MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro Facts Diabetes recently affects around 346 million people. Also the mayor cause for: Heart stroke Kidney failure Lower-limb amputation Blindness One-third go undetected in early stage. Early detection and treatment - substantial health benefits, (avoiding or minimizing the mentioned complications). MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro Objectives Final: contribution to reducing the time between diabetes onset and clinical diagnosis. Showing the efficiency of computer-based methods Improving the reliability by using a hybrid semi-automatic system for diabetes prediction. MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro Existing approaches Machine learning - the area of artificial intelligence that uses the statistical data analyses. recognized to be a promising area that can help in patient classification regarding the medical conditions. Many methods have been used so far: SVM, K Node Neighbors, Naïve Bayes, Neural Networks… Some studies have shown: SVM over performs other algorithms in detecting Diabetes. MECO'2015, Budva, June 2015, Mentenegro

Overview of the materials/approach Dataset is acquired from 402 patients located in three different locations in Kosovo Dataset contains some new features (physical activity, diet) as recognized to be important in medical examinations Sample size assures for the confidence level of 95 % and the confidence interval of 5%. Approach: individual and joint implementation of SVM Naïve Bayes classification Although some other methods were also tested… MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro Dataset description Regular diet - patients were asked to estimate if: - they take their meals in approx. same daily intervals - they take their meals at least 3 times/day - their meals are not voluminous The adult person is considered to be physically active if he/she conducts the 150-200 min of physical activities a week Attribute Value range From To BMI 15 40 Pre meal glucose 3.5 19 Post meal glucose 4.9 22.8 Diastolic blood pressure 55 110 Systolic blood pressure 90 200 Family history of diabetes No (0) Yes (1) Regular diet Physical activities MECO'2015, Budva, June 2015, Mentenegro

SVM and Naïve Bayes Classification SVM creates classifier by maximizing the margin between classes and placing the hyper plane classifier between support vectors Naïve Bayes Classifier relies on statistical analysis: The classifier is build upon the probabilities for each feature to belong to either class A or B. Even though based on unrealistic assumption (independency, importance eq.), the idea leads to a simple scheme that works well in practice. MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro The experiment (1) The application were built in MATLAB Data were randomly divided into train-set and test set. The classifiers (SVM and NB) were build on the train-set. The instances of the test-set were used to test the classifiers. To avoid the bias of the “wrong split” the procedure was repeated 100 times with randomly different sets. MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro The experiment (2) Given the output results, the individual average classification measures were then estimated. Also, on each iteration the common true and wrong answers as well as different classifications (of the SVM and NB) were also evaluated. This is related to the hybrid implementation where two conditions are considered: When two algorithms point to the different output. When both algorithms point to the same result. Finally, Given the output results, the average classification measures of the hybrid system were then also estimated. MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro The experiment (3) GUI – two modes. MECO'2015, Budva, June 2015, Mentenegro

Results – individual implementation The average performance of SVM and NB individually. Precision: on all the X’s found, how many of them were actually correct. Recall: of all the real X’s, how many of them were found. MECO'2015, Budva, June 2015, Mentenegro

Results – hybrid implementation After 100 iterations, the result show that, in 94, 8 % of cases, both classifiers will point to the same result. Under this condition,   Precision (NO) Overall accuracy Hybrid 99.3% 97,6% MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro Discussion (1) We consider the overall accuracy and the class NO precision as more important YES will lead to the further (periodic) examinations while NO will often mean no further examinations. Regarding these two criteria, the algorithms can be used individually with the SVM having a slightly better performances. SVM performance – 95,52 %, NO class precision – 97 % NB performance – 94,53 %, NO class precision – 98 % MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro Discussion (2) Regarding these two criteria, the hybrid algorithm improves performances: If both algorithms point to the different result (in around 5% of cases) the user is notified and the patient is led to the further med. ex. If both algorithms show to the same result (in around 95 % of cases), the average system performance is around 98 % while the class NO precision will be 99,3 %. MECO'2015, Budva, June 2015, Mentenegro

Conclusions (1) Both SVM classifier and NB classifier perform well on given dataset The hybrid classifier increases the reliability of the computer-based diagnostic process.

MECO'2015, Budva, June 2015, Mentenegro Conclusions (2) The presented approaches can contribute to the timely and more precise (pre) diabetes diagnosis. The presented approaches are flexible – data could be updated periodically to follow the changes that happen due to various factors This can dynamically improve the classifiers MECO'2015, Budva, June 2015, Mentenegro

MECO'2015, Budva, June 2015, Mentenegro Conclusions (3) Therefore: the presented systems could be used as a support to medical decision making in health care environments The system could also be adapted for use in home environment. Users can use home glucose-meters to measure the level of blood glucose They can calculate the BMI their self User can provide other input parameters as well. However: Needs further adjustments and clinical testing. MECO'2015, Budva, June 2015, Mentenegro

THANK YOU Q&A Zhilbert Tafa tafaul@t-com.me