An Ontological Approach to Assessing IC Need to Know Phillip BurnsCTA Inc. Prof. Amit ShethLSDIS Lab, University of Georgia Presented to ARDA PI Meeting,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
Semantic Web Thanks to folks at LAIT lab Sources include :
An Ontological Approach to the Document Access Problem of Insider Threat ISI 2005, (May 20) Boanerges Aleman-Meza 1 Phillip Burns 2 Matthew Eavenson 1.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
Swoogle Swoogle Semantic Search Engine Web-enhanced Information Management Bin Wang.
Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications WWW2004 (New York, May 22,
Redefining Perspectives A thought leadership forum for technologists interested in defining a new future June COPYRIGHT ©2015 SAPIENT CORPORATION.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
© 2011 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 1 August 15th, 2012 BP & IA Team.
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering Jing Zhao University of Southern California Sep 19 th,
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection Boanerges Aleman-Meza, Meenakshi Nagarajan,
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
Ranking Documents based on Relevance of Semantic Relationships Boanerges Aleman-Meza LSDIS labLSDIS lab, Computer Science, University of Georgia Advisor:
Ranking Relationships on the Semantic Web Budak Arpinar This work is funded by NSF-ITR-IDM Award# titled '‘SemDIS: Discovering Complex Relationships.
Rohit Aggarwal, Kunal Verma, John Miller, Willie Milnor Large Scale Distributed Information Systems (LSDIS) Lab University of Georgia, Athens Presented.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Science Research: Journey to 10,000 Sources Presented by: Abe Lederman, President and Founder Deep Web Technologies, Inc. Special Libraries Association.
Grant Number: IIS Institution of PI: Arizona State University PIs: Zoé Lacroix Title: Collaborative Research: Semantic Map of Biological Data.
Ontologies for the Integration of Geospatial Data Michael Lutz Workshop: Semantics and Ontologies for GI Services, 2006 Paper: Lutz et al., Overcoming.
SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza,
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Query Expansion By: Sean McGettrick. What is Query Expansion? Query Expansion is the term given when a search engine adding search terms to a user’s weighted.
NLP And The Semantic Web Dainis Kiusals COMS E6125 Spring 2010.
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
Querying Structured Text in an XML Database By Xuemei Luo.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Web Mining: Phrase-based Document Indexing and Document Clustering Khaled Hammouda, Ph.D. Candidate Mohamed Kamel, Supervisor, PI PAMI Research Group University.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
Ranking of Web Services Eyhab Al-Masri. Outline Discovery of Web Services 1 Ranking of Web Services 2 Approaches 3 Conclusion 4 Q & A 5.
SemRank: Ranking Complex Relationship Search Results on the Semantic Web Kemafor Anyanwu, Angela Maduko, Amit Sheth LSDIS labLSDIS lab, University of Georgia.
Semantic (Web) Technology in Action - today The Semantic Web – Scientific American article considered harmful? WWW2003 Panel (PN2), Budapest, May 21, 2003.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
OntoQA: Metric-Based Ontology Quality Analysis Samir Tartir, I. Budak Arpinar, Michael Moore, Amit P. Sheth, Boanerges Aleman-Meza IEEE Workshop on Knowledge.
Searching and Ranking Documents based on Semantic Relationships PaperPaper presentation ICDE Ph.D. Workshop 2006 April 3rd, 2006, Atlanta, GA, USA This.
1 Context-Aware Internet Sharma Chakravarthy UT Arlington December 19, 2008.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
Jed Hassell, Boanerges Aleman-Meza, Budak ArpinarBoanerges Aleman-MezaBudak Arpinar 5 th International Semantic Web Conference Athens, GA, Nov. 5 – 9,
Context Aware Semantic Association Ranking SWDB Workshop Berlin, September 7, 2003 Boanerges Aleman-MezaBoanerges Aleman-Meza, Chris Halaschek, I. Budak.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Ontology Quality by Detection of Conflicts in Metadata Budak I. Arpinar Karthikeyan Giriloganathan Boanerges Aleman-Meza LSDIS lab Computer Science University.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
An Ontology-based Approach to Context Modeling and Reasoning in Pervasive Computing Dejene Ejigu, Marian Scuturici, Lionel Brunie Laboratoire INSA de Lyon,
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
A System for Automatic Personalized Tracking of Scientific Literature on the Web Tzachi Perlstein Yael Nir.
An Ontological Approach to Financial Analysis and Monitoring.
Ontology Evaluation and Ranking using OntoQA Samir Tartir and I. Budak Arpinar Large-Scale Distributed Information Systems Lab University of Georgia The.
Discovering and Ranking Semantic Associations over a Large RDF Metabase Chris Halaschek, Boanerges Aleman- Meza, I. Budak Arpinar, Amit P. Sheth 30th International.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Data mining in web applications
By: Chris Halaschek Advisors: Dr. I. Budak Arpinar Dr. Amit P. Sheth
User Characterization in Search Personalization
Data-Driven Educational Data Mining ---- the Progress of Project
Evaluating Adaptive Authoring of AH
Knowledge Discovery in the Semantic Web
Associative Query Answering via Query Feature Similarity
Gong Cheng, Yanan Zhang, and Yuzhong Qu
ece 627 intelligent web: ontology and beyond
Managing Semantic Content for the Web
Amit Sheth, CTO, Semagix Inc
Context-Aware Internet
Presentation transcript:

An Ontological Approach to Assessing IC Need to Know Phillip BurnsCTA Inc. Prof. Amit ShethLSDIS Lab, University of Georgia Presented to ARDA PI Meeting, Myrtle Beach, February Contract # NBCHC030083

6/21/2004 A thought to begin with … You cannot separate two facets of information retrieval (“systematic serendipity)— information recovery and information discovery.  Eugene Garfield … in essays of an Information Scientist

6/21/2004 Objective & Approach Determine if (classified) documents reviewed by an IC analyst satisfy his/her “need to know”  Characterization of “need to know” w.r.t. ontology  Characterizing document content in terms of ontology  Discovering weighted semantic relationships between document content and “need to know” characterization

6/21/2004 Characterizing “Need to Know” using a Semantic Approach (using Ontology) Requires domain ontology  models important concepts & relationships of domain (schema), captures factual knowledge (instances) Relate analyst’s need to know to concepts & relationships in ontology  e.g. terrorist organization, funding sources, facilitators, members, methods

6/21/2004 Characterizing document content in terms of ontology: “Semantic Annotation” Correlate words/phrases from document with concepts/relationships in ontology Meta-data added to document (from associated ontological knowledge) Active area of research but practically useful technology now available (e.g., Semagix Freedom)

6/21/2004 Semantic Relationships between Document & “Need to Know” Semantic associations: relationships between document concepts & “need to know” concepts are discovered and ranked Ranking based on multiple factors  no. of links, types of links, location in ontology, … Ranking indicates degree of semantic “closeness”  and therefore, how related document is to “need to know”

6/21/2004 Research Content Discovery & ranking of semantic associations Characterizing “need to know” in terms of ontological concepts & relationships (context of investigation) While applying emerging technologies for Ontology design and population Meta-data annotation of heterogeneous documents  correlation of document content with concepts in ontology

6/21/2004 Relevance Ranking of Documents Four groups of document-ranking: -Not Related Documents -unable to determine relation to context -Ambiguously Related Documents -some relationship exists to the context -Closely Related Documents -Entities are closely related to the context -Highly Related Documents -Entities are a direct match to the context Cut-off values determine grouping of documents w.r.t. relevance -These are customizable cut-off values (more control and more meaningful parameters compared to say automatic classification or statistical approaches) “Inspection” of a document is possible via (a) original document or (b) original document with highlighted entities

6/21/2004 Relevance Function (w.r.t. Context) “Closely related entities are more relevant than distant entities” E = {e | e  Document } E k = {f | distance(f, e  E) = k }

6/21/2004 IA Context of Investigation (characterization of “Need to Know”) We define the context of investigation as a combination of the following: A set of entity classes and relationships, and/or a negation of a set of entity classes and relationships A set of entity instance names, and/or a negation of a set of entity instance names A set of keyword values that might appear at any attribute of the populated instance data, and/or a negation of a set of keyword values

6/21/2004 Context of Investigation (cont) Goal is to capture, at a high level, the types of entities, (or relationships), that are considered important. Relationships can be constrained to be associated with specified class types  E.G. It can be specified that a relation ‘affiliated with’ is part of the context only when it is connected with an entity that belongs to a specific class, say, ‘Terror Organization’

6/21/2004 graph-based creation of a context of investigation 26,489 entities 34,513 (explicit) relationships Add relationship to context

6/21/2004 Additional Semantic Constraints

6/21/2004 Components of Document Relevance (specific entities) Abu Abdallah Turkmenistan Konduz Province … Context of Investigation Entities belong to classes in the Context type(entity)  Context 1. Relationships constrains Relationship  [Class] 2. Entities match a list of entities of interest (in the Context) entity  Entities-List 3.

6/21/2004 Some thoughts along the way “An object by itself is intensely uninteresting.” Grady Booch, Object Oriented Design with Applications, 1991 I might as well join my better known colleagues: “Relationship is at the heart of semantics. Ontology is at the hear of the Semantic Web.”

6/21/2004 Schematic of Ontological Approach to the Legitimate Access Problem Semagix Freedom

6/21/2004 Show me the stuff … here you go … demonstrationdemonstration

6/21/2004

Security and Terrorism Part of SWETO Ontology

6/21/2004 Semantic Annotation Document searched for entity names (or synonyms) contained in ontology Then document entities are annotated with additional information from corresponding entities in ontology including named relationships to other entities Following chart is example  Highlighted text are entities found corresponding to concepts in ontology  XML is corresponding meta-data annotation

6/21/2004

Relevance Measures for Documents (relating document content to IA “need to know” Relevance engine input  the set of semantically annotated documents  the context of investigation for the assignment  the ontology schema represented in RDFS, and the ontology instances represented in RDF Relevance measure function used to verify whether the entity annotations in the annotated document can be fit into the entity classes, entity instances, and/or keywords specified in the context of investigation.

6/21/2004 Relevance Measures for Documents (relating document content to IA “need to know” (cont) Documents classified as:  Highly relevant Document entities directly related  Closely related Document entities related through strong semantic associations  Ambiguous Document entities related through weak semantic associations  Not relevant Document entities not related to “need to know”  Undeterminable Document entities not found in ontology

6/21/2004 Challenges we have addressed -Discovery of Semantic Associations per entity per document -Input/Visualization/Management of Context of Investigation -Scalability on number of documents & ontology size -Performs well (in terms of time and scalability) with thousands of documents and for scenarios when a IA investigation has involved hundreds of documents -No systematic measure of quality for this specific application/scenario (general evaluation of research is done)

6/21/2004 Challenges to be addressed -Scalability to a million+ documents (possibly with preprocessing/filtering) -Further development/enrichment of the ontology -Improved measure of the strength of Semantic Associations -Evaluations by human subjects -Visualization and interactive discovery

6/21/2004 References 1. B. Aleman-Meza, C. Halaschek, I.B. Arpinar, A. Sheth, Context-Aware Semantic Association Ranking. Proceedings of Semantic Web and Databases Workshop, Berlin, September , pp B. Aleman-Meza, C. Halaschek, A. Sheth, I.B. Arpinar, and G. Sannapareddy. SWETO: Large-Scale Semantic Web Test-bed. Proceedings of the 16th International Conference on Software Engineering and Knowledge Engineering (SEKE2004): Workshop on Ontology in Action, Banff, Canada, June 21-24, 2004, pp R. Anderson and R. Brackney. Understanding the Insider Threat. Proceedings of a March 2004 Workshop. Prepared for the Advanced Research and Development Activity (ARDA) K. Anyanwu and A. Sheth ρ-Queries: Enabling Querying for Semantic Associations on the Semantic Web The Twelfth International World Wide Web Conference, Budapest, Hungary, 2003, pp K. Anyanwu, A. Maduko, A. Sheth, SemRank: Ranking Complex Relationship Search Results on the Semantic Web, In Proceedings of the 14th International World Wide Web Conference, Japan 2005 (accepted, to appear) 6. K. Anyanwu, A. Maduko, A. Sheth, J. Miller. Top-k Path Query Evaluation in Semantic Web Databases. (submitted for publication), C. Halaschek, B. Aleman-Meza, I.B. Arpinar, A. Sheth Discovering and Ranking Semantic Associations over a Large RDF Metabase Demonstration Paper, VLDB 2004, 30th International Conference on Very Large Data Bases, Toronto, Canada, 30 August - 3 September, B. Hammond, A. Sheth, and K. Kochut, Semantic Enhancement Engine: A Modular Document Enhancement Platform for Semantic Applications over Heterogeneous Content, in Real World Semantic Web Applications, V. Kashyap and L. Shklar, Eds., IOS Press, December 2002, pp

6/21/2004 References (cont) 9. M. Rectenwald, K. Lee, Y. Seo, J.A. Giampapa, and K. Sycara. Proof of Concept System for Automatically Determining Need-to-Know Access Privileges: Installation Notes and User Guide. Technical Report CMU-RI-TR-04-56, Robotics Institute, Carnegie Mellon University, October, _3.pdf 10. C. Rocha, D. Schwabe, M.P. Aragao. A Hybrid Approach for Searching in the Semantic Web, In Proceedings of the 13th International World Wide Web, Conference, New York, May 2004, pp M.A. Rodriguez, M.J. Egenhofer, Determining Semantic Similarity Among Entity Classes from Different Ontologies, IEEE Transactions on Knowledge and Data Engineering (2): A. Sheth, C. Bertram, D. Avant, B. Hammond, K. Kochut, and Y. Warke. Managing Semantic Content for the Web. IEEE Internet Computing, (4): A. Sheth, B. Aleman-Meza, I.B. Arpinar, C. Halaschek, C. Ramakrishnan, C. Bertram, Y. Warke, D. Avant, F.S. Arpinar, K. Anyanwu, and K. Kochut. Semantic Association Identification and Knowledge Discovery for National Security Applications. Journal of Database Management, Jan-Mar 2005, 16 (1): Boanerges Aleman-Meza, Phillip Burns, Matthew Eavenson,Devanand Palaniswami, Amit Sheth. An Ontological Approach to the Document Access Problem of Insider Threat

6/21/2004 Conclusions New Semantic Approach to a class of challenging problems: vendor vetting, knowledge discovery, …. Viability demonstrated on a small scale (comprehensive demonstration) Significant new research that builds upon the latest Semantic Platform

6/21/2004 A parting thought “Discovery commences with an awareness of anomaly …”  Thomas S. Kuhn, in The Structure of Scientific Revolutions