Presentation is loading. Please wait.

Presentation is loading. Please wait.

Author : Jochen Dijrre, Peter Gerstl, Roland Seiffert Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,

Similar presentations


Presentation on theme: "Author : Jochen Dijrre, Peter Gerstl, Roland Seiffert Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,"— Presentation transcript:

1 Author : Jochen Dijrre, Peter Gerstl, Roland Seiffert Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, California, August 15-18, 1999, 398-401. Presented by Xxxxxx

2 Outline  Motivation  Methodology  Feature Extraction  Clustering and Categorizing  Data Mining VS Text Mining  Conclusion

3 Motivation  Problem: Most of data in a company is unstructured or semi-structured  Examples:  Letters  Emails  Phone transcripts  Contracts

4 Definition and Application  Text mining: The discovery by computer of new, previously unknown information, by automatically extracting information from different written resources.  Applications:  Summarizing documents  Discovering/monitoring relations among people  Customer profile analysis  Trend analysis  Documents summarization

5 Methodology  Aspect 1: Knowledge Discovery  Aspect 2: Information Distillation Approaches: Extraction Analysis

6 Feature Extraction  Recognize and classify significant vocabulary items from the text  Categories of vocabulary Proper names Multiword terms Abbreviations Relations Other useful things

7 Clustering Model

8 Categorization Model

9 Data Mining VS Text Mining Data MiningText Mining GoalDiscover hidden modelsDiscover hidden facts MethodTries to generalize all of data into a single model Tries to understand the details, cross reference between individual instances FieldsMarketing, medicine, health care Biosciences, customer profile analysis

10 Conclusion  Introduction of text mining  Differences between data mining and text mining  Overview of IBM’s Intelligent Miner for Text  The tools and methods used in the past


Download ppt "Author : Jochen Dijrre, Peter Gerstl, Roland Seiffert Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,"

Similar presentations


Ads by Google