2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Data Fusion and Semantic Web: Meta-Models of Distributed Data and Decision.

Slides:



Advertisements
Similar presentations
Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.
Advertisements

Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Provenance in Open Distributed Information Systems Syed Imran Jami PhD Candidate FAST-NU.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
UML CASE Tool. ABSTRACT Domain analysis enables identifying families of applications and capturing their terminology in order to assist and guide system.
1 Lecture 13: Database Heterogeneity Debriefing Project Phase 2.
IASW – 2005, Jyväskylä, FinlandUniversity of Vaasa, Department of Computer Science, Finland INFORMATION ARCHITECTURES FOR SEMANTIC WEB APPLICATIONS Kimmo.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
A Heterogeneous Network Access Service based on PERMIS and SAML Gabriel López Millán University of Murcia EuroPKI Workshop 2005.
Report on Multi-agent Data Fusion System: Design and implementation issues 1 By Ganesh Godavari.
Building Knowledge-Driven DSS and Mining Data
Ontology-based Access Ontology-based Access to Digital Libraries Sonia Bergamaschi University of Modena and Reggio Emilia Modena Italy Fausto Rabitti.
Architectural Design Establishing the overall structure of a software system Objectives To introduce architectural design and to discuss its importance.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Overview of Distributed Data Mining Xiaoling Wang March 11, 2003.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
ONTOLOGY SUPPORT For the Semantic Web. THE BIG PICTURE  Diagram, page 9  html5  xml can be used as a syntactic model for RDF and DAML/OIL  RDF, RDF.
Ana Chanaba Robert Huylo
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments Charalampos Doulaverakis Centre for Research and Technology Hellas.
Chapter 4 System Models A description of the various models that can be used to specify software systems.
Chapter 8 Architecture Analysis. 8 – Architecture Analysis 8.1 Analysis Techniques 8.2 Quantitative Analysis  Performance Views  Performance.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 6 Slide 1 Requirements Engineering Processes l Processes used to discover, analyse and.
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
V. Gorodetsky IADM-06, Discussion, Hong Kong, December 18, 2006 Vladimir Gorodetsky Head of Laboratory of Intelligent Systems
Dart: A Meta-Level Object-Oriented Framework for Task-Specific Behavior Modeling by Domain Experts R. Razavi et al..OOPSLA Workshop DSML‘ Dart:
Data Management Information Management Knowledge Management Data and Applications Security Challenges Bhavani Thuraisingham October 2006.
Odyssey A Reuse Environment based on Domain Models Prepared By: Mahmud Gabareen Eliad Cohen.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
IBIS-Admin New Mexico’s Web-based, Public Health Indicator, Content Management System.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Chapter 7 System models.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
IBISAdmin Utah’s Web-based Public Health Indicator Content Management System.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
Workpackage 3 New security algorithm design ICS-FORTH Ipswich 19 th December 2007.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Hybrid Intelligent Systems for Detecting Network Anomalies Lane Thames ECE 8833 Intelligent Systems.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
An Ontology-based Approach to Context Modeling and Reasoning in Pervasive Computing Dejene Ejigu, Marian Scuturici, Lionel Brunie Laboratoire INSA de Lyon,
Erik Jonsson School of Engineering and Computer Science The University of Texas at Dallas Cyber Security Research on Engineering Solutions Dr. Bhavani.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Sharing personal knowledge over the Semantic Web ● We call personal knowledge the knowledge that is developed and shared by the users while they solve.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
FROM THE ESSENCE OF AN ENTERPRISE TOWARDS ENTERPRISE SUPPORTING INFORMATION SYSTEMS Tanja Poletaeva Tutors: Habib Abdulrab Eduard Babkin.
Intelligent Systems Development
CCNT Lab of Zhejiang University
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Distributed and Grid Computing Research Group
Toward an Ontology-Driven Architectural Framework for B2B E. Kajan, L
Presentation transcript:

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Data Fusion and Semantic Web: Meta-Models of Distributed Data and Decision Fusion. Project Report Vladimir Gorodetski, Oleg Karsaev, Vladimir Samoilov Intelligent System Laboratory of the St. Petersburg Institute for Informatics and Automation {gor, ok,

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Title of the Project “Autonomous Information Collection, Knowledge Discovery Techniques and Software Tool Prototype for Knowledge-Based Data Fusion” Project from European Office of Aerospace Research and Development (EOARD) –AFRL/IF (USA) (December December 2003)

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Outline of the Project Presentation 1. Outline of the Data and Information Fusion problems 2. Project research objectives 3. Examples of case studies and applications used 4. Ontology-centered meta-model of data sources 5. Meta-model of decision fusion 6. Multi-agent architecture 7. Conclusion

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Tasks and Applications of Data and Information Fusion Application Fields Critical areas of human society security, life support, security of critical state infrastructures, large-scale logistics, natural and man-made disasters, etc. Examples of Applications  Assessment and prediction of situations,  Resource management and rescue operation planning in large scale natural and man-made disasters,  Decision making and planning of rescue operations in systems like US 911, Situational awareness and prediction for terrorist intents and anti-terrorist activity planning,  Military situation assessment,  Safeguard of critical plants like nuclear power stations, electrical power grids, etc.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Information Fusion-Definition “…data fusion is a formal framework in which means and tools for the alliance of data originating from different sources are expressed. It aims at obtaining information of greater quality; the exact definition of “greater quality” will depend on the application” (JDL-Joint Directors of Laboratories model, USAF) Level 1-Object assessment Level 2- Situation assessment Level 3- Impact assessment Distributed data sources Level 4- Process refinement Data Base Management System Support DB Fusion DB Level 5-User refinement Distributed information sources Human- Computer interface Sensor management, resource management (Erik Blash, Fusion-2002, July, 2002, Annapolis, USA) Areas of the current and Future research projects are yellowed Sensor 1 Sensor 2 Sensor N … Level 0-Pre-processing of sensor data

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Project Research Objectives Development of DF software tool providing support for design (first of all, for learning!) and implementation of DF applications of broad spectrum, in particular, providing support for :  Development of ontology-based meta-models of data sources, meta-model of decision fusion and conceptual model of DF software tool,  Development of Multi-agent architecture and  Design and implementation of applications of broad spectrum.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Examples of case studies and application used in Projects Case studies -KDD Cup99 dataset -- Preprocessed relational data specifying Intrusion Detection task -Landsat Multi-Spectral Scanner image dataset -STULONG dataset– Longitudinal Study of Atherosclerosis Risk Factors Application Application to be used in debugging and validation of MAS DK-DF - Intrusion detection learning system (Project also funded by EOARD/AFRL)

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Subtasks of the Project matching Semantic Web Mining area 1. Design and implementation of meta-model of data sources caused by heterogeneity and distribution of data to be fused. 2. Design and implementation of meta-model of distributed learning.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Multiplicity of Data Sources Presenting User’s Activity in Intrusion Detection system

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Interrelation of Semantic Web and Ontology-oriented Research within the Project Semantic Web considers development and standardization of the ontology specification languages (XML, RDF, DAML+OIL), ontology-based query languages, ontology editors, etc). Semantic Web Mining considers specific problems of ontology design technology for (Web-based) Data Mining systems. Any DF system technology supposes (Web-based) distributed Data Mining and KDD and that is why it is a sub- area of the Semantic Web Mining. Ontology-based Data and Information Fusion system design put a number of specific problems of technological sort. Among them, the most important one is a technology for distributed design of distributed ontology.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland What is distributed design of distributed ontology? Data Sources Meta-model Data Source Sensor Data Source Manager Data Source management agent Data SourceSensor Data Source management agent Data Source Sensor Data Source management agent Data SourceSensor Data Source Manager Meta-data manager “KDD Master” Agent Ontology-based meta-model of Data sources Data Source Manager Data Source management agent ……. Meta-model =Ontology + Data source models at meta-level supporting a unified view of data of particular sources

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland DF system ontology DF Problem ontology … Private component of application ontology of data source 1 Private component of application ontology of data source k Private component of application ontology of data source 2 Tower of DF application ontology components Shared component of Application ontology

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Agent 1 Agent 2Agent 3 Agent k Distributed Ontology and Protocols for Distributed Ontology Design “KDD Master” Agent Problem and shared components of application ontology Data Source 2 Data Source 1 Shared component of application ontology Private component of application ontology-3 Shared component of application ontology Private component of application ontology-k DS- 1 management agent KDD agent of source 1 DS- 2 management agent KDD agent of source 2 Meta-level KDD Agent Data Source k Shared component of application ontology Private component of application ontology-3 Shared component of application ontology Private component of application ontology-k Data Source 3 DS- 3 management agent KDD agent of source 3 DS- k management agent KDD agent of source k Protocols, Functions …….

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Particular Tasks to Be Solved on the Basis of Meta- model of Data Sources Providing for monosemantic understanding of terminology used in data specification by distributed analysts; Solution of the entity identification problem; Providing consistency of data representation (in case if the same attributes are presented differently in different data sources); Providing a gateway between ontology and distributed databases accessibility making possible interaction between ontology and distributed databases, and several other tasks.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Data Sources: => Monosemantic understanding of terminology Meta-model of Data Sources: Ontology + Protocols => Monosemantic understanding of terminology Monosemantic understanding of terminology among DF system components is provided by shared vocabulary used by DF system distributed entities for communication. This excludes different naming of the same entities and their properties in different sources, and equal naming of different entities within different data sources thus providing integrity and consistency of shared vocabulary. Protocols Supports distributed collaborative design of coherent ontology by distributed analysts.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Example of Application Ontology: High-level Part of Intrusion Detection Domain Ontology Network attack A Reconnaissance R Implantation and threat realization I Collection of Information Identification of hosts Identification of services Identification of OS Resource Enumeration Users and Groups Enumeration IH IS IO CI RE UE Applications and Banners Enumeration ABE Escalating Privilege Threat Realization Covering Tracks Getting Access to Resources GAR ER GAD TR CT Creating Back Doors CBD Gaining Additional Data Network Ping Sweeps DC Port Scanning SPIH TCP connect scan ST TCP SYN scan SS Notions of micro-layer TCP FIN scan SF TCP Xmas Tree scan SX Proxy scanning Dumb host scan Scanning 'FTP Bounce' PS DHS SFB TCP Null scan SN Half scan HS UDP scan SU "Part of" relationship N o t i o n s o f l o w e r l e v e l s “ Subclass of" relationship CD ID DOS Confidentiality destruction Integrity destructio n Denial of Service

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland The Simplest ("top-down") Meta-protocol for Collaborative Ontology Design Source 1. Local source expert Source 1: Data preparation agent Source N: Data preparation agent Meta-data description agent Application domain expert Source N: Local source expert Forming the basic variant of ontology Sending the basic variant Analysis of the suggested basic variant Modifying and expanding the ontology Synchronization of modifications by the basic protocol Modifying and expanding the ontology Synchronization of modifications by the basic protocol …

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Ontology Synchronization Protocol Represented in Terms of UML-sequence Diagram Legend: 1. Local source expert 2. Local source data managing agent 3. Local source ontology 4. Local source: buffer of temporary changes 5. KDD master (Meta- data description agent) 6. Shared ontology 7. Meta-level agent: buffer of temporary changes 8. Application expert (meta-level) 9. Local source determining the modified ontology part

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Data Sources: Entity Identification Problem # of case Attributes of Data Source # of case Attributes of Data source # of case Attributes of Data source Data Source 1 Data Source 3 Data Source 2 Explanation of Entity Identification Problem

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Demonstration of Entity Identification Problem: Intrusion Detection Application

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland A Technique for Entity Identification Problem In the DF problem ontology, for each instance of an object to be classified, the notion of entity identifier ("ID entity") is introduced. This entity identifier plays the role of the primary key of the instance (in analogy with the primary key of a table). For each such identifier, a rule as a component of the shared part of application ontology is defined, which can be used to calculate the value of the instance key. A rule is a function which arguments are chosen from the set of this entity attributes. A rule is defined for each local data source to uniquely connect the entity identifier and the local primary key in this source. This rule specifies: how to derive the local primary key of instance from the entity identifier value; how to derive the entity identifier value from the value of the local primary key of an instance of the source.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Data Sources: Diversity of Measurement Scales of the Same Attributes in Different Data Sources Let X be an attribute in application ontology that is measured differently in different sources. 1.In the shared component of application ontology, the type and the measurement unit of the attribute X are determined. Selection of attribute X specification within shared part of application ontology is made by experts during negotiations according to a synchronization protocol. 2.In all the sources where X is present, expressions are determined for this attribute, through which it can further be converted into the same scale in all the sources. This allows using the values of attributes on the meta- level regardless of the data source from which they originated.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Data Sources: Interaction of Ontology and Databases of Sources The task arises due to the fact that application ontology entities are specified in terms of ontology notions but their instances are represented in terms of database language. To provide interaction of ontology and databases of sources (accessibility of data requested in ontology terms), a special gateway is developed. Application Access via VIEW objects Database objects Local data source Client-gateway DF Application ontology Local source data properties DF problem ontology Three-level hierarchy of access to the database objects DF application ontology

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Distributed Learning Components of meta-model of distributed learning: Meta-model of decision making and combining decisions of multiple base-level classifiers; Model of distributed data management (allocation training and testing data sets for learning particular classifiers; management by computation of meta- data for upper level example-based learning, etc.); Approaches and formal techniques used for combining decisions.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Data Fusion: Hierarchy of Classifiers and Combining Decisions Meta-level classifier of source Local database (database of source) Base classifier 1 Base classifier 2 Variant 2... Base classifier k To DF system meta-level classifier Local database (database of source) Base classifier 1 Base classifier 2... Base classifier k Variant 1 To DF system meta-level classifier Meta-level classifier of source Local database (database of source) Base classifier 1... Base classifier k Variant 3

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Data Fusion: Distributed data management Distributed data management that is allocation training and testing data sets for learning particular classifiers; management by computation of meta-data for upper level example-based learning, etc. These tasks are solved through using in DF system special agents operating on source- located components and meta-level component of DF system. These agents solve the task in question through special negotiation protocol under management of local source and meta-level analysts.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-models of Training and Testing Data: An Example Results of distributed data management for the case study KDDCup-99 for two data sources BC4 Attributes - 16 Examples BC5 Attributes– 7 Examples Testing data of the local source DS2 Examples BC6 Attributes– 7 Examples BC6 Attributes– 7 Examples BC1 Attributes - 16 Examples BC2 Attributes - 17 Examples Testing data of the local source DS2 Examples BC3 Attributes - 15 Examples Data source DS1Data source DS2

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Data Fusion: Approaches for combining decisions-1 1. Meta-classification scheme for combining decisions (based on “stacked generalization”) Legend : Data KDD algorithmsResulting classifiers Meta-learning level Training and testing data set (source k) Training and testing data set (source 2) Training and testing data set (source 1) … Result: Meta-classifier Algorithm for learning meta- classifier Meta-classifier’s training and testing data ("meta-data") Algorithms for Base classifier learning Base Classifier 2 to be learned Base Classifier 1 to be learned Base Classifier k to be learned … Testing data

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Meta-model of Data Fusion: Approaches for Combining Decisions-2 Competence-based Approach Classifier 1 Classifier 2 Classifier K …….. Referee 1 Referee 2 Referee K …….. Training and testing data Decision of the most competent classifier Examples of class 1 Examples of class 2 Correctly classified examples Erroneously classified examples Partition of learning data for classifier training Partition of learning data for referee training Selection of the most competent classifier and its decision

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Architecture of DF Software Tool Architecture of the source-based component of DF software tool Local data source KDD agent Testing Training User interface Local classification agents of DF system Base classifier Meta-classifier Referee To the KDD Master To the Meta-classification agent Server (library) of learning methods Data source managing agent

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Architecture of DF system Architecture of meta-level component of DF software tool To the Data source managing agent KDD Master agent Agent-classifier of meta- level Inference engine User interface Meta-classifier Referee To the KDD agent Local classification agents Meta-level KDD agent Server of learning methods

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland Conclusion: Future work. 1. Development of sophisticated ontology editor supporting distributed design of a distributed ontology. 2. Further design and Implementation of Data Fusion System software tool for development and implementation of particular distributed applications in Data Fusion area.

2nd Semantic Web Mining Workshop at ECML/PKDD-2002, August 2002, Helsinki, Finland For more information and related publications please contact Acknowledgement This research is funded by AFRL/IF (EOARD), Thank you!