Presentation is loading. Please wait.

Presentation is loading. Please wait.

Active subgroup mining for descriptive induction tasks Dragan Gamberger Rudjer Bošković Instute, Zagreb Zdenko Sonicki University of Zagreb.

Similar presentations


Presentation on theme: "Active subgroup mining for descriptive induction tasks Dragan Gamberger Rudjer Bošković Instute, Zagreb Zdenko Sonicki University of Zagreb."— Presentation transcript:

1 Active subgroup mining for descriptive induction tasks Dragan Gamberger Rudjer Bošković Instute, Zagreb Zdenko Sonicki University of Zagreb

2 Talk overview: - descriptive induction - active subgroup mining - subgroup discovery - data mining server - a real medical example

3 Descriptive induction is aimed at generating (inducing) knowledge that is understandable (interpretable) by humans. It is different from classification aimed induction where the main goal is high classification quality (but induced classification schemes are typically too complex for human interpretation).

4 Main properties of descriptive induction: - simple rules - reasonable prediction quality (both on available and future cases) Main problem: overfitting functional genomics domain has 150 examples with 16000 measured attribute values

5 - descriptive induction - active subgroup mining - subgroup discovery - data mining server - a real medical example

6 Active subgroup mining is a data analysis approach specially developed for medical applications (but applicable also for other domains). It is based on the observation that expert knowledge (in medical domains it means knowledge and experience of medical doctors) is very important for the quality of obtained results.

7 In active subgroup mining the expert is positioned in the center of the process and machine learning (subgroup discovery) is only a tool that helps him in the data analysis process.

8 definition of task(s) induction of models presentation visualization integration statistical evaluation selection of models expert subgroup discovery

9 - descriptive induction - active subgroup mining - subgroup discovery - data mining server - a real medical example

10 ++++++ + + + + + + + + + + + + classical versus subgroup discovery induction

11 + + + + + + very specific subgroup very sensitive subgroup generality – the main parameter of the subgroup induction process

12 Subgroup discovery is a beam search algorithm which generates short rules in the form of conjunctions of conditions. Conditions are based on the values of available attributes. example: CHD 53 AND T.CH > 6.1 AND BMI < 30

13

14 - descriptive induction - active subgroup mining - subgroup discovery - data mining server - a real medical example

15 dms.irb.hr

16

17 meningoencephalitis domain subgroup describing bacteria in contrast to the virus type disease

18 - descriptive induction - active subgroup mining - subgroup discovery - data mining server - a real medical example

19

20 Conclusions: -descriptive induction and active subgroup mining are novel concepts potentially very interesting for data analysis and knowledge induction in medical applications - active and central role of medical experts is essential

21 - we have extensive and positive experience with these methodology on different medical domains but no experience in constructing medical guidelines. For such applications potentially useful might be: - detection of decision points for numerical attributes - detection of apparent but significant contradictions - explicit noise detection


Download ppt "Active subgroup mining for descriptive induction tasks Dragan Gamberger Rudjer Bošković Instute, Zagreb Zdenko Sonicki University of Zagreb."

Similar presentations


Ads by Google