Presentation is loading. Please wait.

Presentation is loading. Please wait.

Intelligent Database Systems Lab Presenter : Chang,Chun-Chih Authors : David Milne *, Ian H. Witten 2012, AI An open-source toolkit for mining Wikipedia.

Similar presentations


Presentation on theme: "Intelligent Database Systems Lab Presenter : Chang,Chun-Chih Authors : David Milne *, Ian H. Witten 2012, AI An open-source toolkit for mining Wikipedia."— Presentation transcript:

1 Intelligent Database Systems Lab Presenter : Chang,Chun-Chih Authors : David Milne *, Ian H. Witten 2012, AI An open-source toolkit for mining Wikipedia

2 Intelligent Database Systems Lab Outlines Motivation Objectives Methodology Experiments Conclusions Comments

3 Intelligent Database Systems Lab Motivation The online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked articles. For developers and researchers it represents a giant multilingual database of concepts and semantic relations, a potential resource for natural language processing

4 Intelligent Database Systems Lab Objectives The Wikipedia Miner toolkit, an open-source software system that allows researchers and developers to integrate Wikipedia’s rich semantics into their own applications. Wikipedia Miner is intended to be a platform for sharing data mining techniques.

5 Intelligent Database Systems Lab Methodology - Architecture of the wikipedia Miner toolkit

6 Intelligent Database Systems Lab Methodology - Measuring relatedness between concepts

7 Intelligent Database Systems Lab Methodology - Measuring relatedness between concepts

8 Intelligent Database Systems Lab Methodology -Features for measuring artucle relatedness

9 Intelligent Database Systems Lab Experiments - Impact of thresholds for disambiguation and detection

10 Intelligent Database Systems Lab Experiments - Impact of relatedness dependencies

11 Intelligent Database Systems Lab Experiments - Impact of traning data

12 Intelligent Database Systems Lab Experiments - performance of the disambiguator

13 Intelligent Database Systems Lab Experiments - performance of the detector

14 Intelligent Database Systems Lab Conclusions Our aim in releasing this work open source is not to provide a complete and polished product, but rather a resource for the research community to collaborate around and continue building together.

15 Intelligent Database Systems Lab Comments Advantages Applications - wikipedia - Disambiguation - Annotation


Download ppt "Intelligent Database Systems Lab Presenter : Chang,Chun-Chih Authors : David Milne *, Ian H. Witten 2012, AI An open-source toolkit for mining Wikipedia."

Similar presentations


Ads by Google