Goal 2 Activities 4, 6, 7 Letizia Tanca Politecnico di Milano.

Slides:



Advertisements
Similar presentations
Chungnam National University DataBase System Lab
Advertisements

Università di Modena e Reggio Emilia ;-)WINK Maurizio Vincini UniMORE Researcher Università di Modena e Reggio Emilia WINK System: Intelligent Integration.
An Overview of the Integration of the UCSF Dept. of Radiology Teaching File with MIRC Wyatt M. Tellis University of California San Francisco Departments.
1 Copyright ©2007 Sandpiper Software, Inc. Vocabulary, Ontology & Specification Management at OMG Elisa Kendall Sandpiper Software
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
DAML Ontology Library Mike Dean OntoLog Forum 28 February
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
1 Ontolog Open Ontology Repository Review 19 February 2009.
INTO: A way to represent institutional knowledge base by Biswanath Dutta Documentation Research and Training Centre Indian.
Dr. Bruce A. Scharlau, AHDIT, ES2002 E-Business Workshop AHDIT: Ad Hoc Data Interoperability Tool Dr. Bruce A. Scharlau Dept. of Computing Science University.
Semantic Web based Collaborative Knowledge Management LSL, ECS Feng (Barry) Tao A generic SOA for managing semantics driven domain knowledge.
A Domain Level Personalization Technique A. Campi, M. Mazuran, S. Ronchi.
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
ACACIA in short… Objectives: Offer methodological and software support (i.e. models, methods and tools) for construction, management and diffusion of.
Ontology-based User Modeling for Web-based Information Systems Anton Andrejko, Michal Barla and Mária Bieliková {andrejko, barla,
26/10/2008 SWESE'08 1 Enhanced Semantic Access to Software Artefacts Danica Damljanović and Kalina Bontcheva.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Dr. Bruce A. Scharlau, AHDIT, August 2002 AHDIT: Ad Hoc Data Interoperability Tool Dr. Bruce A. Scharlau Dept. of Computing Science University of Aberdeen.
Querying Integrated Observation and Measurement data SONet June 8,
CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
The Semantic Web – WEEK 4: RDF
Logics for Data and Knowledge Representation Projects and thesis introduction.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
XML Technology in E-Commerce
27 January Semantically Coordinated E-Market Semantic Web Term Project Prepared by Melike Şah 27 January 2005.
CS570 Artificial Intelligence Semantic Web & Ontology 2
RDF Tutorial.
© Copyright IBM Corporation 2014 Getting started with Rational Engineering Lifecycle Manager queries Andy Lapping – Technical sales and solutions Joanne.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Reference scenario and demo plan. Networked enterprises and multi- organization Web applications (1)  Networked enterprises can significantly benefit.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 DCS861A-2007 Emerging IT II Rinaldo Di Giorgio Andres Nieto Chris Nwosisi Richard Washington March 17, 2007.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
New trends in Semantic Web Cagliari, December, 2nd, 2004 Using Standards in e-Learning Claude Moulin UMR CNRS 6599 Heudiasyc University of Compiègne (France)
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Concept demo System dashboard. Overview Dashboard use case General implementation ideas Use of MULE integration platform Collection Aggregation/Factorization.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
SICoP Presentation A story about communication Michael Lang BEARevelytix May 2, 2007.
Knowledge based Learning Experience Management on the Semantic Web Feng (Barry) TAO, Hugh Davis Learning Society Lab University of Southampton.
Logics for Data and Knowledge Representation
DBrev: Dreaming of a Database Revolution Gjergji Kasneci, Jurgen Van Gael, Thore Graepel Microsoft Research Cambridge, UK.
Digital Enterprise Research Institute HADA – An Access Controlled Application for Publishing and Discovering Linked Government Data Owen Sacco.
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
NLP And The Semantic Web Dainis Kiusals COMS E6125 Spring 2010.
10/18/2015Page 1 Introduction to Semantic Web Design B. Ramamurthy.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Application Ontology Manager for Hydra IST Ján Hreňo Martin Sarnovský Peter Kostelník TU Košice.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
SICoP Presentation A story about communication Michael Lang BEARevelytix April 25, 2007.
An Ontological Approach to Financial Analysis and Monitoring.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
ECOTerm April 15, 2005, SNS Semantic Network Service Applications of Environmental Semantics Maria Rüther, Federal Environmental.
BBY 464 Semantic Information Management (Spring 2016) Semantic Query Languages Yaşar Tonta & Orçun Madran [yasartonta, Hacettepe.
Middleware independent Information Service
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Big Data Quality the next semantic challenge
Peer–Mediated Distributed Knowledge Management
Analyzing and Securing Social Networks
LOD reference architecture
Introduction to Semantic Web Design
Presentation transcript:

Goal 2 Activities 4, 6, 7 Letizia Tanca Politecnico di Milano

Goal 2: Knowledge Management (Polimi) Goal 2: Knowledge Management (Polimi) Activity 4: Activity 4: Knowledge extraction from natural language actions (Polimi + IBM + Bari) Activity 6: Activity 6: Knowledge extraction, modeling and integration from semi-structured information sources, driven by domain ontologies (PoliMI) Activity 7: Activity 7: Knowledge fusion, tailoring and dissemination for business model redesign (PoliMI)

Context-aware Web Portal

Contextual data analysis At GialloRosso the oenologist and the agronomist interact with the data related to harvesting and to the wine ageing –the information they interact with depend on their role and on the workflow phase –The agronomist inserts information related to the nature of the natural phoenomena –The agronomist and the oenologist ask information related to the phase At BiancoRosso the sales manager: – analyzes sales data – in a different moment analyzes the market trends, then –reads similar information in natural language from the web GialloRosso performs market analyses by accessing its own information combined with market information collected by its ally BiancoRosso

GialloRosso Logical Schema

BiancoRosso Logical Schema VINO(ID_Vino, nome, vinificazione, invecchiamento, denominazione, temperatura, min_temp, note) EVENTO(ID_Evento, nome, tipo, data, luogo) TRENDSETTER(ID_Trend, nome, professione) FONTE(ID_Fonte, nome, uri, tipo, rilevanza, provenienza, descrizione) DOCUMENTO(ID_Doc, riassunto, url, data, autore, titolo, argomento, descrittore, ID_Fonte) VALUTAZIONEMERITO(ID_valutazione, descrizione, giudizio, lingua) RISULTATORICERCA(ID_risultato, ID_vino, ID_evento, ID_trend, ID_fonte, ID_doc, posizione, ID_valutazione)

Context-aware data tailoring

Data tailoring via view composition

Context Dimension Tree

Some relevant areas

AT GIALLOROSSO THE OENOLOGIST AND THE AGRONOMIST INTERACT WITH THE DATA RELATED TO CULTIVATION AND TO THE CELLAR

A PORTION OF THE CDT OF OUR SCENARIO oenologist

Some contextual views C1= C2 = C3= C4 =

Some contextual queries The agronomist during the harvesting phase (context C1) wants to collect all the available information coming from sensors: SELECT m.date_time,m.value,s.s_id,s.meas_unit FROM sensor s, measure_data m WHERE s.s_id=m.s_id; S/he obtains only the information from sensors placed in the vineyards (see Rel(C1))

Some contextual queries The oenologist during the harvesting phase (context C3) wants to collect all the available information about bottles of Aglianico wine: SELECT * FROM bottle b WHERE b.appellation="aglianico"; But the query is out of context, in the context C3 only information about vineyard and grapevine are available for the oenologist.

Some more contextual queries The previous query makes sense in context C4, where the oenologist is in the ageing phase: SELECT * FROM bottle b WHERE b.appellation="aglianico"; Produces a non- empty result.

AT BIANCOROSSO: 1.THE SALES MANAGER ANALYZES SALES DATA 2.THE OENOLOGIST ANALYZES WINE FEATURES TO DESIGN A NEW WINE 3.THEN S/HE READS SIMILAR INFORMATION IN NATURAL LANGUAGE FROM THE WEB 4.ALSO INTENSIONAL QUERIES ARE PERFORMED

Sales and promotions planning (Q1) Sales and promotions planning for events and festivals The sales manager of BiancoRosso wants to select the wines to promote for each event or festival –For each event or type of event he/she needs to identify the most related wines –Interesting wines for each event can be obtained by analyzing frequent rules in the form EventType=value Wine=value E.g., EventType=Summer party Wine=White wine support=20%, confidence=36%

Sales and promotions planning (Q2) Sales and promotions planning depending on time periods The sales manager wants to plan specific promotions for each time period of the year –For each time period (e.g., month) the manager needs to select the most related wines –Interesting wines can be obtained by analyzing frequent rules in the form Month=value Wine=value E.g., Month=June Wine=White wine support=20%, confidence=36%

Design of wine (Q3) Analysis of the main characteristics of wines The oenologist of BiancoRosso wants to produce new wines He/she needs to know the main characteristics of each wine to select the most interesting wines to produce –He/she obtains the characteristics of each wine by exploiting rules in the form Wine=value Characteristic=value E.g., Wine=White wine Characteristic=Mainly drunk in a specific time period support=6%, confidence=100%

Design of wine (Q4) Identification of correlations between wines and time periods The time period in which each wine is mainly consumed is useful to select the wines to produce For each wine the oenologist wants to obtain the time period (e.g., month) in which the wine is mainly consumed –Allows selecting wines related to time periods not already covered by the wines currently produced by BiancoRosso –He/she uses rules in the form Wine=value Month=value E.g., Wine=White wine Month=June support=20%, confidence=100%

Design of wine (Q5) Identification of correlations between wines and information sources Once the oenologist has selected the new wines to be produced, he/she needs to identify the sources containing documents related to the selected wines –The oenologist identifies the sources containing information about the wines of his/her interest by exploiting the following rules Wine=value Source=value E.g., Wine=Montello e colli asolani cabernet superiore Source=Gambero Rosso support=11%, confidence=100%

DIESIRAE A semantic search engine based on Natural Language Processing

Knowledge Management

Knowledge Indexing & Extraction: Goals Domain model Ontology (W3C OWL standard) –Describes the concepts of the domain Domain vocabulary Semantic Network –Describes the lemmas of the domain Mapping model Stochastic model –2° order HMM-inspired model –Transition probs approximated by means of MaxEnt models –Solves mapping ambiguities Queries: –Keyword-based (AND/OR; max probability/exaustive) –Phrase-based (Disambiguated Word queries and Ontological queries)

Knowledge indexing & extraction: Functionalities TrainingIndexing, querying, and extending

Knowledge indexing & extraction: Information Extraction Engine TrainingIndexing, querying, and extending Linguistic Context Extractor: –Calls linguistic tools (Stanford Parser, FreeLing, JavaRAP,…) –words W i (lemmas L i, linguistic context information I i ) MaxEnt Models: –Calculates HMM transition probabilities (takes in account the linguistic context info) Extended Viterbi: –(L i, I i ) concepts C i TF-IDF: –Document ranking, based on concept frequencies

Art deco Wine Domain Ontology

Keyword-based queries Sequence of isolated words –No linguistic structure Exhaustive AND/OR keywords –No concept disambiguation –Searches for multiple tuples –Example: light wine several meanings found… country wine search for instances… taste wine search for subclasseses… Max probability AND/OR keywords –Searches for a single tuple –Exploits the a-priori concept probabilities –Example: [light wine] max probability meaning

Phrase-based queries Phrase –Linguistic structure –Context-based disambiguation Disambiguated Word queries –Context used for concept disambiguation Index the phrase ( extract concepts) Search for AND-ed concepts –Example: (fruit taste) disambiguates fruit Ontological queries –Context used to select the request to the ontology Indexes the sentences Select the request; searches the ontology for the mapped concepts –Example: type of tannins in wine instance list

GIALLOROSSO PERFORMS MARKET ANALYSES BY ACCESSING ITS OWN INFORMATION COMBINED WITH MARKET INFORMATION COLLECTED BY ITS ALLY BIANCOROSSO

The Integration problem from the user point of view DATA SOURCE 1 (RDBMS) DATA SOURCE 2 (XML) DATA SOURCE 3 (WWW) GLOBAL KNOWLEDGE INTERFACE queryanswer DATA SOURCE 4 (Base station) User APPLICATION

Information integration in ART DECO Information integration in ART DECO

Knowledge retrieval from the sources In order to integrate the two original sources, we define the following query to populate the ontology: PREFIX rdf: PREFIX rdfs: PREFIX fn: PREFIX do: file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl#file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl# SELECT ?w1 ?w2 ?wn1 ?wn2 ?wb ?bq ?dse ?dso ?sn FROM file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owlfile:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl WHERE { ?w1 rdf:type do:WineInFarm. ?wb rdf:type do:WineBottle. ?wb do:containsWine ?w1. ?wb do:bottleQuantity ?bq. ?w1 do:appellationInFarm ?wn1. ?w2 do:appellationInDocument ?wn2. ?w2 rdf:type do:WineInDocument. ?dse rdf:type do:DocSearch. ?dso rdf:type do:DocSource. ?dse do:searchWineID ?w2. ?dse do:searchSrcID ?dso. ?dso do:docSrcName ?sn. }

Query 1 Quantity of bottles (in the GialloRosso DB) available for each wine cited by the web source Percorsi di Vino (stored in the BiancoRosso DB): PREFIX rdf: PREFIX rdfs: PREFIX fn: PREFIX do: file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl#file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl# SELECT ?wine_name sum(?bottle_quantity) FROM file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owlfile:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl WHERE { ?w1 rdf:type do:WineInFarm. ?wb rdf:type do:WineBottle. ?wb do:containsWine ?w1. ?wb do:bottleQuantity ?bottle_quantity. ?w1 do:appellationInFarm ?wn1. ?w2 do:appellationInDocument ?wine_name. ?w2 rdf:type do:WineInDocument. ?dse rdf:type do:DocSearch. ?dso rdf:type do:DocSource. ?dse do:searchWineID ?w2. ?dse do:searchSrcID ?dso. ?dso do:docSrcName ?source_name. FILTER regex(?source_name, PercorsiDiVino") FILTER fn:contains(?wine_name, ?wn1) } GROUP BY ?wine_name ?source_name

Query 2 Which sources (from BiancoRosso) cite wines of which we (GialloRosso) have at least a bottle available? PREFIX rdf: PREFIX rdfs: PREFIX fn: PREFIX do: file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl#file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl# SELECT ?wine_name ?source_name FROM file:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owlfile:///home/lele/workspace/TIS-RewSparQL_progetto/ontologies/art- deco/wineDomain.owl WHERE { ?w1 rdf:type do:WineInFarm. ?wb rdf:type do:WineBottle. ?wb do:containsWine ?w1. ?wb do:bottleQuantity ?bottle_quantity. ?w1 do:appellationInFarm ?wn1. ?w2 do:appellationInDocument ?wine_name. ?w2 rdf:type do:WineInDocument. ?dse rdf:type do:DocSearch. ?dso rdf:type do:DocSource. ?dse do:searchWineID ?w2. ?dse do:searchSrcID ?dso. ?dso do:docSrcName ?source_name. FILTER (?bottle_quantity > 0) FILTER fn:contains(?wine_name, ?wn1) } GROUP BY ?wine_name ?source_name

Q & A (If you see this slide weve not run out of time)

Part 3 of the book Ontology-based knowledge elicitation: an architecture (Chapter editor Licia Sbattella, Roberto Tedesco, Giorgio Orsi, Politecnico di Milano, Marcello Montedoro, IBM Italia) Knowledge extraction from Natural Language (Chapter editor Licia Sbattella, Roberto Tedesco, Politecnico di Milano) Knowledge extraction from event flows (Chapter editor Alberto Sillitti, Università di Bolzano) Context-aware knowledge querying in a networked enterprise (Chapter editor Cristiana Bolchini, Elisa Quintarelli, Fabio A. Schreiber, Politecnico di Milano, Teresa Baldassare, Università di Bari) On-the-fly and Context-Aware Integration of Heterogeneous Data Sources (Chapter editors Giorgio Orsi, Letizia Tanca, Politecnico di Milano) A methodology for context-driven data-warehouse design (Chapter editor Cristiana Bolchini, Elisa Quintarelli, Letizia Tanca, Politecnico di Milano)