KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Semantic Web Thanks to folks at LAIT lab Sources include :
© Copyright IBM Corporation 2014 Getting started with Rational Engineering Lifecycle Manager queries Andy Lapping – Technical sales and solutions Joanne.
International Workshop Linked Open Data & the Jewish Cultural Heritage Rome, 20 th January 2015 International Workshop Linked Open Data & the Jewish Cultural.
Dewey Summaries as Multilingual Linked Data Dewey Breakfast/Update ALA Annual July 11, 2009.
OntoBlog: Linking Ontology and Blogs Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of Informatics, Japan 2 Asian.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Actual Trends Semantic Web Lecture WS 2010/2011. What‘s next? W3C view: Look at Semantic Web activity:
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
Enterprise Linked Data Seán O’Riain Domain of eBusiness Digital Enterprise Research Institute - National University of Ireland, Galway  Copyright 2010.
Behshid Behkamal Ferdowsi University of Mashhad Web Technology Lab.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Linked Data The Short Version. Linked Data is a set of best practices for publishing and deploying instance and class data using the RDF data model, naming.
Mendeley What is it? How is it different from other “Bibliographic databases” like End Note and Reference.
Metadata and identifiers for e- journals Copenhagen Juha Hakala Helsinki University Library
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Linked Open Data: a new resource for eResearch Dr Anne Cregan eResearch Analyst, Intersect and ANDS
IFLA Satellite Meeting, 13 August 2014, Frankfurt-am-Main, Germany
1 © Netskills Quality Internet Training, University of Newcastle Metadata Explained © Netskills, Quality Internet Training.
Michalis Vafopoulos NTUA, GFOSS & The transformers GREEN CITY HACKATHON.
Semantic Publishing Update Second TUC meeting Munich 22/23 April 2013 Barry Bishop, Ontotext.
Scotland's Environment Web Data Journey Dave Watson, Duncan Taylor.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
UPData A data curation experiment at the University of Porto using DSpace João Rocha da SilvaFEUP Cristina RibeiroDEI- FEUP / INESC-Porto João Correia.
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
Taking Action: Linked Data for Digital Library Managers Silvia Southwick and Cory Lampert UNLV Digital Collections American Library Association Annual.
Boris Villazón-Terrazas, Ghislain Atemezing FI, UPM, EURECOM, Introduction to Linked Data.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Keyword Query Routing.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
You sexy beast. Ok, inappropriate. How about: Web of links to Web of Meaning Hello Semantic Web!
Linked Data: Emblematic applications on Legacy Data in Libraries.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
Auditing Grey in a CRIS Environment
Dr. Lowell Vizenor Ontology and Semantic Technology Practice Lead Alion Science and Technology Semantic Technology: A Basic Introduction.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ESIP Semantic Web Products and Services ‘triples’ “tutorial” aka sausage making ESIP SW Cluster, Jan ed.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
Linked Data Profiling Andrejs Abele National University of Ireland, Galway Supervisor: Paul Buitelaar.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Paloma Marín Arraiza 17 th International Conference on Grey Literature 1 st and 2 nd December 2015, Amsterdam (Netherlands) SCIENTIFIC AUDIOVISUAL MATERIALS.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
Characterizing Knowledge on the Semantic Web with Watson Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Sofia Angeletou, Marta Sabou, Enrico Motta.
© Copyright 2015 STI INNSBRUCK PlanetData D2.7 Recommendations for contextual data publishing Ioan Toma.
STAR, STELLAR and SKOS Ceri Binding, Phil Carlisle, Keith May, Doug Tudhope, Andreas Vlachidis University of Glamorgan and English Heritage.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Shared innovation Linking Distributed Data across the Web Dr Tom Heath Researcher, Platform Division Talis Information Ltd t
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
INHA UNIVERSITY, KOREA Rainer Simon Austrian Institute of Technology.
SemTech 2010: My feedback & ideas
Linked Data Web that can be processed by machines
Cloud based linked data platform for Structural Engineering Experiment
Presented at Archives Records 2016, session 510
DBpedia 2014 Liang Zheng 9.22.
LOD reference architecture
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Linked Data Ryan McAlister.
Introduction of Linked Data – From Cataloging to Catalinking
Classifications and Linked Open Data Formalizing the structure and content of statistical classifications Item 9.1 Standards Working Group Luxembourg,
Presentation transcript:

KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research, PES University 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 1

Managing Semantic Data in Research Data Services  Trend: Publish research data along with paper  Digital library of research data  How do we manage this data?  E.g., our research requires:  Several Tera Bytes of data  5 billion data elements so far 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 2

Inverting the Publication Model Past:  Description of research results in English  Show samples of data  “Results, Discussion, Conclusion” framework Present:  Publish article and entire dataset  No links between article and data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 3

The Inverted Publication Model Future:  Inverted model:  Publish self-contained data  Publish data analytics  Annotate the data with English descriptions where needed  Rich linkage between datasets  Web of linked data… 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 4

Illustration of Publishing Data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 5

1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 6

1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 7

Self-Contained Dataset Requirements:  Have a proper and consistent structure;  Define each element both syntactically and semantically;  Specify all the semantic constraints on permissible data values, their types and cardinalities; and  Specify data provenance, etc. 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 8

Ontology of Research Data  In other words, an ontology of research data  Where is the “Dublin Core” of research data?  E.g., CERIF 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 9

Why Semantic Data Management?  Epistemology of science: Verifying research results  Making sense of someone else’s data  Documenting the usage scenario of data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 10

1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 11

 Ontology-based multi-domain metadata for research data management using triple stores  Full Text: PDF Buy this Article Authors: João Rocha da Silva Universidade do Porto/INESC TEC, Portugal Cristina Ribeiro DEI, Universidade do Porto/INESC TEC, Portugal João Correia Lopes DEI, Universidade do Porto/INESC TEC, PortugalBuy this ArticleJoão Rocha da SilvaUniversidade do Porto/INESC TEC, PortugalCristina RibeiroDEI, Universidade do Porto/INESC TEC, PortugalJoão Correia LopesDEI, Universidade do Porto/INESC TEC, Portugal 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 12

Data on the Web: 5-Star Rating System * Data on the Web: E.g., data published as a set of scanned images ** Machine-Readable Data: E.g., data published as a spreadsheet *** Non-Proprietary Format: E.g., data published as a CSV file **** RDF Data: E.g., a drug database published in RDF ***** Linked RDF Data: Links to other people’s data are included. E.g., the Dbpedia dataset extracted from wikipedia 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 13

Linked Open Data: Principles  Use URIs as names of things: E.g, mention author by URI, not just name.  Use HTTP URIs so that people can look up those names.  When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL).  Include links to other URIs, so people can discover more things. Sir Tim Berners-Lee 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 14

Linked Open Research Data Services Requirements:  Uniquely identify all entities used in datasets such as experiments, specimens, locations, organizations, etc.;  Interlink parts of datasets with precise parts of an article in both directions;  Classify datasets using a suitable universal classification scheme;  Cite other datasets, i.e., refer to them through links;  Manage multiple versions and revisions of datasets; and  Incorporate a suitable controlled vocabulary or ontology. 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 15

Architecture of Digital Library of Data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 16

An Ontology for Research Data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 17

Concluding Remarks  Publishing and citing research data will be a common practice  Digital libraries need to manage research data  Data needs to be self-contained, therefore semantic  Linked open data is promising  We need a proper ontology of research data  Keyword search may be good enough for documents, but not for datasets 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 18

Questions?  Thank you! 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 19

1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 20

How?  By applying Natural Language Generation Techniques on structure and semantics of Linked Open Datasets and underlying Ontologies. 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Input Triples SubjectPredicateobject 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Ontology for Discourse Structuring 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Classes 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Subclasses 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Individuals 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Ontology as a Chart 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

A few snapshots of the “ MECHANISM ” ontology, in the protégé software, are shown: 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

The 12 subgroups: 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

The functions accommodated under every subgroup: 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Hierarchy of mechanisms: 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Object & data properties being added to each mechanism: 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 33

Subclasses and their Descriptions 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Object properties 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Data properties added 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Linked Open Data Tools Pallavi Karanth ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Data  Data ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Web for Data Discovery ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Web for Data Discovery ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Machine Understandable Data ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Machine Understandable Data ©KAnOE, PES Institute of Technology Ram Nickname DOB Location Bangalore 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Open Data and Linked Data  Open Data - open access  Linked Data  Semantic  Machine Readable ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Linked Open Data  Five Star Rating of Linked Data by Tim Berners Lee  ★ make your stuff available on the Web (whatever format) under an open license  ★★ make it available as structured data (e.g., Excel instead of image scan of a table)  ★★★ use non-proprietary formats (e.g., CSV instead of Excel)  ★★★★ use URIs to denote things, so that people can point at your stuff  ★★★★★ link your data to other data to provide context  About 50 billion triples as of 2013 ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

LOD - IT (Kappa)  For Software Developers  Technical Helpdesk  LOD-IT Video LOD-IT Video  LOD-IT Demo LOD-IT Demo ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

LODScape  Ontology based Multiple LOD Object Browser  DbPedia and Freebase datasets used  LODScape Demo LODScape Demo ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Semantic Smart-Aleck  Automatic Fact Generator  Based on Interestingness Algorithm  Uses Dbpedia and Yago datasets  SemanticSmartAleck Demo SemanticSmartAleck Demo ©KAnOE, PES Institute of Technology 1/25/ (c) Dr. Kavi Mahesh; Do not copy or distribute

Acknowledgments 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 48

Suggestions?  Thank you! 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 49