Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Understanding Tables on the Web Jingjing Wang. Problem to Solve A wealth of information in the World Wide Web Not easy to access or process by machine.
C6 Databases.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Chapter 6 UNDERSTANDING AND DESIGNING QUERIES AND REPORTS.
Documentation Generators: Internals of Doxygen John Tully.
1 Knowledge Technologies 2001 Siemens Automation and Drive Help Desk: A Knowledge Work-Place with Self-Service Norman Zimmer empolis NA, Inc. Burlington,
Page 1 Integrating Multiple Data Sources using a Standardized XML Dictionary Ramon Lawrence Integrating Multiple Data Sources using a Standardized XML.
OneGeology-Europe - the first step to the European Geological SDI INSPIRE Conference 2010, Session Thematic Communities: Geology Krakow, June 24 th 2010.
Information and Business Work
Information Retrieval in Practice
Semantic Search Jiawei Rong Authors Semantic Search, in Proc. Of WWW Author R. Guhua (IBM) Rob McCool (Stanford University) Eric Miller.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Overall Information Extraction vs. Annotating the Data Conference proceedings by O. Etzioni, Washington U, Seattle; S. Handschuh, Uni Krlsruhe.
Web Logs and Question Answering Richard Sutcliffe 1, Udo Kruschwitz 2, Thomas Mandl University of Limerick, Ireland 2 - University of Essex, UK 3.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
11/8/20051 Ontology Translation on the Semantic Web D. Dou, D. McDermott, P. Qi Computer Science, Yale University Presented by Z. Chen CIS 607 SII, Week.
Simfund Filing Training Introduction First Look Step by Step Training.
INTEGRATION INTEGRATION Ramon Lawrence University of Iowa
CS580: Building Web Based Information Systems Roger Alexander & Adele Howe The purpose of the course is to teach theory and practice underlying the construction.
1 Chapter 11 Developing Custom Help. 11 Chapter Objectives Use HTML to create customized Help topics for an application Use the HTML Help Workshop to.
Information Extraction from Documents for Automating Softwre Testing by Patricia Lutsky Presented by Ramiro Lopez.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Accessible Word Document Training Microsoft Word 2010.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
Chapter 11 Databases.
ACS1803 Lecture Outline 2 DATA MANAGEMENT CONCEPTS Text, Ch. 3 How do we store data (numeric and character records) in a computer so that we can optimize.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
Classroom User Training June 29, 2005 Presented by:
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
Lesson 4: Using HTML5 Markup.  The distinguishing characteristics of HTML5 syntax  The new HTML5 sectioning elements  Adding support for HTML5 elements.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
GLOSSARY COMPILATION Alex Kotov (akotov2) Hanna Zhong (hzhong) Hoa Nguyen (hnguyen4) Zhenyu Yang (zyang2)
Ontology-Based Information Extraction: Current Approaches.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Flexible Text Mining using Interactive Information Extraction David Milward
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.
Data Mining By Dave Maung.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
OWL Representing Information Using the Web Ontology Language.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Towards a Glossary of Activities in the Ontology Engineering Field Mari Carmen Suárez-Figueroa and Asunción Gómez-Pérez {mcsuarez, Ontology.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
A wiki is a collaborative web application which allows people to add and edit content using a browser… …it creates communities and empowers users as they.
INTRODUCTION JavaScript can make websites more interactive, interesting, and user-friendly.
IRS Tax Map Electronic Research Tool David Brown Internal Revenue Service Media and Publications Division David Brown Internal Revenue Service Media and.
Delivering written prose, over the web, using semantic web technology. 22 October 2014 Joint Meeting of the Supply Process Review Committee (PRC) and Joint.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
Applying ontology and linguistics to automate reading, writing, and reporting of knowledge and information in a semantic wiki Chuck Rehberg Semantic Insights.
© 1990—2006 Visual Knowledge Software® | Private and Confidential | 2 Semantic Agent Wikis For Engineering.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
Of 24 lecture 11: ontology – mediation, merging & aligning.
Thinking of Drupal 8? Get started with the resources.
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Social Knowledge Mining
Presentation transcript:

Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights

Slide 2 Approach Automate the capture of content and knowledge and extraction of information of interest to the community in a semantic form – based on training and reading styles. Provide semantic search, access, across a corpus of content that has previously been read to create information of interest. For example a plug-in. (IRS tax info, for example, or other G2C or G2B needs.) Produce reports (in various styles, formats) from information of interest based on training and writing and reporting styles

Slide 3 Training the system to read To Automate the Reading process you need: –A Reading Style States how to linguistically identify your specific Concepts –An Ontology Defines your specific Concepts and how those Concepts are related –A Dictionary is useful too

Slide 4 Reading Styles and Ontologies A “Reading Style” contains the linguistic patterns used to identify the links between natural language text and the Concepts in the Ontology We provide three approaches to creating a Reading Style and Ontology –Manual Approach –Semi-Automatic Approach –Automated Approach Each of these approaches begins with the selection of one or more training documents

Slide 5 Reading Style: Manual Approach Starting with a training document The user indicates the concepts of interest by highlighting words or phrases, whole sentences or groups of sentences. The system analyzes the linguistic structure within and between sentences, and: –Identifies or Creates newly discovered Concepts in the Ontology –Adds relationships and characteristics of those Concepts –Links the key words or phrases to Concepts in the Ontology

Slide 6 Reading Styles: Semi-Automated Approach Provide a list of key words or phrases representing concepts of interest Identify one or more documents containing those key words The system reads through the documents and identifies each sentence or group of sentences containing the key words As with the manual approach, the system analyzes the linguistic relationships, and creates the Ontology and Reading Style.

Slide 7 Reading Style: Automated Approach Identify one or more documents containing Concepts of Interest (as yet not specified) The system reads through the documents and identifies possible key words or phrases of interest (Concepts) in each sentence or group of sentences. As with the Semi-Automated Approach the system analyzes the linguistic relationships, and creates the Ontology and Reading Style.

Slide 8 Automated Document Reading The system generates a new Document Reader from your Reading Style and Ontology. You identify the document corpus to read The Document Reader harvests and stores the Information of Interest from the document corpus

Slide 9 Automated Report Generation Starting with the results of reading –You may select a subset of the information read on which to report –You then apply a report template and writing style to generate a fresh report covering the information read Reports can do many things, for example; –Extract business rules from policy documentation –Identify complex relationships between any set of concepts

Slide 10 Report Specification Reports may include –Document structuring (sections, subsections, headers, etc,.) –Narrative prose –Lists –Tables and Diagrams –Glossaries, dictionaries, references The form, content, and presentation of a report are specified using Report Templates, and both Presentation and Writing Styles.

Slide 11 Different Points of View The need to Harmonize Different Points of View arises when: –Different Reading Styles are mapped to the same Ontology –Different Reading Styles are mapped to different Ontologies –A single Reading Style developed over time by one or more people come to have conflicting mappings

Slide 12 Harmonizing when Reading Styles are mapped to the same Ontology By analyzing how Concepts are mapped between the different Reading Styles the system can: –Detect different understandings (semantic conflict) –Detect common understandings (semantic overlap)

Slide 13 Harmonizing when Reading Styles are mapped to different Ontologies Assist the users in identifying where the same words or phrases –Map to essentially the same concept in each Ontology –Map to different concepts –Have overlapping mapping to concepts

Slide 14 Harmonizing within a single Reading Style By analyzing how each Concept is mapped within a single Reading Style the system can identify when: –A single Concept is identified by multiple different linguistic constructs (many ways of saying the same thing) –A single linguistic construct identifies more than one Concept (multiple meanings for the same language)

Slide 15 Reading Wikis and other information sources Standard information source encodings include: –pdf, html, doc, ppt, txt, wiki –Results from standard web search engines Additional information source encodings can be added as plug-ins

Slide 16 Narrowing down Report Content The results of reading may generate a large amount of IOI. To identify the useful subset of information the user may –Identify the specific set of concepts to include/exclude –Pose a Natural Language Query

Slide 17 Interactive Query Dialogue Another form of reporting is to interactively query the results by using natural language queries –For example; What relationships exist between President Bush and Herbert A. Allen III? The answers come back according to the chosen Writing Style. *Note: Without the addition of Reasoning Rules the results are limited only to what was read in the documents read.

Slide 18 Building on the IOI through Reasoning Rules So far we have heard about Reading Rules Another type of rules are, Reasoning Rules –Reasoning Rules operate on IOI to produce new IOI –The resulting IOI can represent conclusions, inferences, trends, etc. –This new IOI is available for query, selection, and reporting as before

Slide 19 Concept Demo SIRA LITE – A browser plug-in that invokes a SIRA reader to “read” one or more web pages and produce a filtered report containing the information of interest Note: Due to where we are in product development, this demo is not live.

Slide 20 Questions