 ODM provides infrastructure for developing data mining applications suitable for addressing a variety of business problems involving text. Among these,

Slides:



Advertisements
Similar presentations
Using CLUUZ. © 2008 Sprylogics International Corp. Enter your search term/terms. By default, CLUUZ will extract and display people, companies, phone numbers,
Advertisements

This document contains information and data that AAUM considers confidential. Any disclosure of Confidential Information to, or use of it by any other.
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
® IBM Research © 2006 IBM Corporation Faceted Logic, Ontologies, and Wikis: Possible Approaches for ONTOLOG Content John Boz Handy-Bosma, Ph.D., Senior.
Reinventing using REST. Anything addressable by a URI is called a resource GET, PUT, POST, DELETE WebDAV (MOVE, LOCK)
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.
Chapter 5: Introduction to Information Retrieval
Taxonomies, Lexicons and Organizing Knowledge Wendi Pohs, IBM Software Group.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Benjamin J. Deaver Advisor – Dr. LiGuo Huang Department of Computer Science and Engineering Southern Methodist University.
Information Retrieval in Practice
Creating Concept Hierarchies in a Customer Self-Help System Bob Wall CS /29/05.
Data Mining Jessica Jackson Kimberli Klein Kevin Wood.
Media Planning: Advertising and the Internet
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
XML on Semantic Web. Outline The Semantic Web Ontology XML Probabilistic DTD References.
Exploring Windows 98 and Essential Computing Concepts - Chapter 2 1 Exploring the Internet Chapter 2 Search Engines: Finding Information on the Web.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Knowledge Process Outsourcing1 Turning Information into Knowledge... for YOU The Gyaan Team.
Chapter 2 Survey of Designing Web and Tools. Objectives To be able to: Describe what are the considerations in the Web design Describe your website’s.
Search Engine Optimization
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
Text Analytics And Text Mining Best of Text and Data
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
Web Design Using HTML Codes. WHAT DO I NEED TO BEGIN DESIGNING A HOME PAGE? 1.YOU NEED A FOLDER (also called a DIRECTORY) You should set up a folder or.
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
Basic Web Applications 2. Search Engine Why we need search ensigns? Why we need search ensigns? –because there are hundreds of millions of pages available.
Computational Linguistics WTLAB ( Web Technology Laboratory ) Mohsen Kamyar.
6 October Information Retrieval and Knowledge Management Within the VisIT Web Services Architecture.
CMPF124:Basics Skills for Knowledge Workers Overview of other Windows Software.
Exploring Microsoft Office Search Engines: Finding Information on the Web1 Exploring the Internet Chapter 2 Search Engines: Finding Information.
Text mining
Text Feature Extraction. Text Classification Text classification has many applications –Spam detection –Automated tagging of streams of news articles,
New Ideas for IA Readings review - How to manage the process Content Management Process Management - New ideas in design Information Objects Content Genres.
Data Mining By Dave Maung.
Presenter: Shanshan Lu 03/04/2010
Mining fuzzy domain ontology based on concept Vector from wikipedia category network.
Lecture 5: Writing the Project Documentation Part III.
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Search Tools and Search Engines Searching for Information and common found internet file types.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Web Review The Web Web 1.0 Web 2.0 Future of the Web Internet Programming - Chapter 01:XHTML1.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by ICTs transforming agricultural science, research & technology generation.
MICROSOFT SEMANTIC ENGINE Unified Search, Discovery and Insight.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Evaluating Educational Resources W301. What are Educational Resources? There are many kinds of resources available to you: –Productivity Resources are.
Content Analytics – Uncovering Critical Insight YellowBrix 3/2/20161.
Evaluating Educational Resources W301. What is learning? / Does Google make us stupid???
Microsoft Office 2008 for Mac – Illustrated Unit D: Getting Started with Safari.
HTML5 SEMANTICS TO OR NOT TO THAT IS THE QUESTION BY WILLIAM MURRAY.
Cruising the Semantic Web with Noadster Lloyd Rutledge and Jacco van Ossenbruggen With Noadster, we demonstrate a hypermedia-oriented Semantic Web browser.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Data Mining Concept Submitted TO: Mrs. MONIKA SUBMITTED BY: SHALU 4717.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Future-oriented Benchmarking Through Social Media Analysis
Introduction to Data Mining
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
BIS 221 RANK Education for Service-- bis221rank.com.
Rob Gleasure robgleasure.com
Rob Gleasure robgleasure.com
BIS 221 Great Wisdom/tutorialrank.com. BIS 221 All Assignments For more course tutorials visit BIS 221 Week 2 Assignment Business.
International Marketing and Output Database Conference 2005
Jonathan Griffin, Managing Director, IFIS Publishing &
Semi-Automatic Data-Driven Ontology Construction System
Presentation transcript:

 ODM provides infrastructure for developing data mining applications suitable for addressing a variety of business problems involving text. Among these, the following specific technologies provide key elements for addressing problems that require text mining :

 A large number of document classification applications fall into one of the following:  Assigning multiple labels to a document. ODM does not support this case.  Assigning a document to one of many labels..

 Clustering is used frequently in text mining; the main applications of clustering in text mining are  Taxonomy generation  Topic extraction  Grouping the hits returned by a search engine

There are two kinds of text mining problems for which feature extraction is useful:  Extract features from actual text. Oracle Text is designed to solve this kind of problem.  Extract semantic features or higher-level features from the basic features uncovered when features are extracted from actual text.

 Association models can be used to uncover the semantic meaning of words. For example, suppose that the word sheep co-occurs with words like sleep, fence, chew, grass, meadow, farmer, and shear. An association model would include rules connecting sheep with these concepts.

 Text mining methods and software is also being researched and developed by major firms, including IBM and Microsoft, to further automate the mining and analysis processes, and by different firms working in the area of search.IBMMicrosoft

 Text mining is being used by large media companies, such as the Tribune Company, to clarify information and to provide readers with greater search experiences, which in turn increases site "stickiness" and revenue. Tribune Company