Presentation is loading. Please wait.

Presentation is loading. Please wait.

Breast Cancer Diagnosis via Linear Hyper-plane Classifier Presented by Joseph Maalouf December 14, 2001 December 14, 2001.

Similar presentations


Presentation on theme: "Breast Cancer Diagnosis via Linear Hyper-plane Classifier Presented by Joseph Maalouf December 14, 2001 December 14, 2001."— Presentation transcript:

1 Breast Cancer Diagnosis via Linear Hyper-plane Classifier Presented by Joseph Maalouf December 14, 2001 December 14, 2001

2 Problem Description Breast Cancer is second only to lung cancer as a tumor-related cause of death in women. Although there exists reasonable agreement on the criteria for benign/malignant (Benign means that the lump or other problem was not cancer, and Malignant means that the tissue does contain cancer cells) diagnoses using fine needle aspirate (FNA) and mammogram data, the application of these criteria are often quite subjective and a time consuming task for the physician. The idea of the project is to come up with a discriminant function (a separating plane in this case) to determine if an unknown sample is benign or malignant. The project will use the Wisconsin Diagnosis Breast Cancer Database (WDBC) made publicly available by Dr.William H.Wolberg of the Department of Surgery of the University of Wisconsin Medical School. There are 569 samples (357 benign, 212 malignant) with 32 attributes (patient ID, diagnosis type, 30 real-valued input features). The most effective pair of the attributes in determining a correct diagnosis will be determined and used to plot all the testing set points on a two dimensional figure.

3 Solution Methods Formally, given two sets B and M in the 30-dimensional real space R 30, we wish to construct a discriminant function f, from R 30 into R, such that: f (x) > 0 => x  M, f (x)  0 => x  B, Two approaches will be used to find the optimal hyper-plane: Linear Optimization Quadratic Optimization Linear Optimization Problem Formulation: The discriminant function f can be given by : f(x) = w’x - , determining a plane w’x =  that separates, to the extent possible, malignant points from benign ones in R 30. It remains to show how to determine w  R 30 and   R from the training data. If we let the sets of m points, M, be represented by a matrix M  R mxn and the set of k points, B, be represented by a matrix B  R kxn,

4 Solution Methods (continued) then the problem becomes one of choosing w and  to: min w,  (1/m)*|| (-M*w + e*  +e) + || 1 + (1/k)*|| (B*w - e *  +e) + || 1 This can be implemented using matlab optimization tool box lp. Quadratic Optimization Problem Formulation: The function f is given by : f(x) = w’x + b. For x i being a support vector, For x i  M  d i = 1, f(x i ) = w ’ * x i + b =  |w|  w o ’ * x i + b o = 1 For x i  B  d i =  1, f(x i ) = w ’ * x i + b =  |w|  w o ’ * x i + b o =  1 The objective is to find w and b such that  (w o ) = (w o ’ * w o )/2 is minimized subject to the constraints: d i * (w o ’ * x i + b o )  1, where 1  i  N (number of support vectors) To implement the quadratic optimization solution, I ’ ll use the OSU support vector machine.

5 Results The linear optimization method determined a correct diagnosis with a success rate of 97.2 %.. For the quadratic optimization method, I tried different types of kernels with different parameters. oIt seems as far, the SVM classifier with polynomial kernel of order 4 gives a better result, with a success rate of 91.4 %. oTraining a SVM classifier involves a time consuming. The following plot shows a figure of one of the most effective pair of the attributes in determining a correct diagnosis.


Download ppt "Breast Cancer Diagnosis via Linear Hyper-plane Classifier Presented by Joseph Maalouf December 14, 2001 December 14, 2001."

Similar presentations


Ads by Google