Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)

1 Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)

2 Correlation coefficient Two continuous variables

3 Correlation coefficient (CC)
What we need is a single summary number that answers the following questions:
– Does a relationship exist?
– If so, is it a positive or a negative relationship?
– Is it a strong or a weak relationship?
Correlation coefficient: a single summary number that gives you a good idea about how closely one variable is related to another variable

4 Correlation coefficient Two-way scatter plot

5 The mortality rate tends to decrease as the percentage of children immunized increases

6 Correlation coefficient Pearson's correlation coefficient
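
A minimal sketch of Pearson's correlation coefficient, assuming the usual sample definition r = Sxy / sqrt(Sxx * Syy), where Sxy is the sum of products of deviations from the means and Sxx, Syy are the sums of squared deviations. The function name pearson_r and the data are hypothetical; the numbers are made up only to mimic the downward trend of the immunization/mortality scatter plot.

```python
import math


def pearson_r(xs, ys):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("r is undefined when one variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data (not from the slides): percent immunized vs. mortality rate.
immunized = [50, 60, 70, 80, 90]
mortality = [120, 100, 85, 60, 40]

r = pearson_r(immunized, mortality)
print(f"r = {r:.3f}")  # negative: mortality falls as immunization rises
```

The sign of r answers "positive or negative?", and its magnitude (between 0 and 1) answers "strong or weak?".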

9 Correlation coefficient Correlation coefficient is not a percent

10 Correlation coefficient Coefficient of determination
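
A rough illustration of why r is not a percent while r squared can be read as one. The sketch assumes a simple least-squares line y = a + b*x and checks that r squared equals the share of the variance of y that the line explains (1 - SSE/SST). The function name and the data are hypothetical.

```python
def r_squared_two_ways(xs, ys):
    """Return (r squared, share of variance explained by the fitted line)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)  # total sum of squares (SST)

    r2 = sxy ** 2 / (sxx * syy)

    # Least-squares line y = intercept + slope * x.
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    explained = 1 - sse / syy
    return r2, explained


# Hypothetical data with r = 0.8.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]

r2, explained = r_squared_two_ways(xs, ys)
print(f"r^2 = {r2:.2f}, explained share = {explained:.2f}")
```

Here r = 0.8 does not mean "80% related"; it is r squared = 0.64 that says about 64% of the variation in y is accounted for by the line.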

12 Statistical test

13 Correlation coefficient Statistical inference
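
One common way to do inference on r, sketched here as an assumption rather than as the method shown on the slide: test H0 "the population correlation is zero" with t = r * sqrt((n - 2) / (1 - r^2)), which follows a t distribution with n - 2 degrees of freedom when the pairs are a random sample from a roughly bivariate normal population. The function name and the example values r = 0.8, n = 10 are hypothetical; 2.306 is the two-sided 5% critical value for 8 degrees of freedom from a standard t table.

```python
import math


def correlation_t_statistic(r, n):
    """t statistic for H0: population correlation = 0, with n - 2 degrees of freedom."""
    if n < 3:
        raise ValueError("need at least 3 pairs")
    if not -1 < r < 1:
        raise ValueError("r must lie strictly between -1 and 1")
    return r * math.sqrt((n - 2) / (1 - r * r))


# Hypothetical example: r = 0.8 observed on n = 10 pairs.
t = correlation_t_statistic(0.8, 10)

# Two-sided 5% critical value of the t distribution with 8 degrees of freedom.
T_CRIT_DF8 = 2.306
reject_null = abs(t) > T_CRIT_DF8

print(f"t = {t:.3f}, reject H0 at the 5% level: {reject_null}")
```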

15 Correlation coefficient Limitations
– It quantifies only the strength of the linear relationship between two variables
– Care must be taken when the data contain any outliers, or pairs of observations that lie considerably outside the range of the other data points
– A high correlation between two variables does not imply a cause-and-effect relationship
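
The first two limitations can be seen on small made-up data sets. This is only a sketch: pearson_r is a hypothetical helper using the usual sample definition, and none of the numbers come from the slides.

```python
import math


def pearson_r(xs, ys):
    """Sample Pearson correlation coefficient (usual definition)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# 1) Linear only: y = x^2 is a perfect relationship, yet r is zero
#    because the relationship is not linear.
curve_x = [-3, -2, -1, 0, 1, 2, 3]
curve_y = [x * x for x in curve_x]
r_curve = pearson_r(curve_x, curve_y)

# 2) Outliers: five weakly (negatively) related points...
clean_x = [1, 2, 3, 4, 5]
clean_y = [3, 1, 2, 3, 1]
r_clean = pearson_r(clean_x, clean_y)

# ...plus a single far-away pair that dominates the result.
r_outlier = pearson_r(clean_x + [20], clean_y + [20])

print(f"y = x^2:         r = {r_curve:.3f}")
print(f"without outlier: r = {r_clean:.3f}")
print(f"with outlier:    r = {r_outlier:.3f}")
```

A single extreme pair turns a weak negative correlation into a strong positive one, which is why a scatter plot should be checked before r is trusted.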

17 Correlation coefficient Spearman's rank CC
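
A minimal sketch of Spearman's rank CC, assuming the common definition: replace each value by its rank (tied values share the average rank) and compute Pearson's coefficient on the ranks. All function names and data are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values receive the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Sample Pearson correlation coefficient (usual definition)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def spearman_rho(xs, ys):
    """Spearman's rank correlation: Pearson's r computed on the ranks."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Hypothetical monotonic but non-linear data: y = x^3.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]

rho = spearman_rho(xs, ys)
r = pearson_r(xs, ys)
print(f"Spearman = {rho:.3f}, Pearson = {r:.3f}")
```

Because only the ordering matters, the rank coefficient reaches 1 for any strictly increasing relationship, while Pearson's r stays below 1 unless the relationship is exactly linear.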

18 Any questions about correlation coefficient?

19 Statistical inference
Basic tests
– tests about proportions
– tests about one mean
– tests of the equality of two means
– tests for variances
– references (pp. 27-33) Distribution.PPT
More advanced tests
– ANOVA (analysis of variance)
– goodness of fit (Wilcoxon test, Kolmogorov-Smirnov test, …)

20 Multivariate analysis
Statistics
– ANOVA
– Multiple linear regression
– PCA (principal component analysis)
– ICA (independent component analysis)
– LDA (linear discriminant analysis)
So far, all techniques belong to statistics. You can find them in most statistical software, such as MATLAB, R, SPSS, …
Machine learning
– Naïve Bayes (pp. 13-27)
– LIBSVM
– RVKDE

21 Let's see an Excel tutorial

22 Let's see the data

23 Points to a good final project

24 Raise some interesting issues
– from observations
– you have at least two trap issues (next slide)
Design good analyses
– make sure that your analyses fit your issues
– do the results concur with your speculations?
– design further analyses

25 Predict masked disease codes
There are some masked disease codes
– for example, disease #14 has no acode, class, or name
First, predict the masked disease names
Second, some masked diseases have names that are not in the file (namely, novel diseases). Try to identify them and, if possible, figure out what diseases they are
acode class:name

26 The final project includes
Presentation
– slides (.ppt) and how you present them
– convincing for me and your classmates
– reasonably evaluate other works (voting on others' works if we have time)
Project
– scripts (executable)
– results (.txt, .xls, …)
– a step-by-step README of how you get the results from cd.dat (.txt, .doc, …)
Report
– a more detailed document of your slides (.doc)
– the duty of each group member
– anything worthy of extra credit

27 Final grade
– Email all the materials before 2011/6/20
– The raw grade will be available as soon as the final project of your group is received
– Ask me about your grade using your NCKU email
– The final (adjusted) grades must wait for all groups (2011/6/21, I hope)
– You have about one week to double-check the grade, and the final grades will be submitted around 2011/6/27
