RDF and triplestores CMSC 461 Michael Wilson. Reasoning  Relational databases allow us to reason about data that is organized in a specific way  Data.

Slides:



Advertisements
Similar presentations
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Advertisements

SPARQL Dimitar Kazakov, with references to material by Noureddin Sadawi ARIN, 2014.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Setting Up Your Very Own Semantic Web Developer Environment (It’s easier than you think)
Semantic Web Introduction
Master Informatique 1 Semantic Technologies Part 0Course Organization Semantic Technologies Werner Nutt.
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Progress Update Semantic Web, Ontology Integration, and Web Query Seminar Department of Computing David George.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
You Cannot ReSIST Hugh Glaser Electronics & Computer Science University of Southampton DSSE, 28th February 2007.
Intelligent Systems Semantic Web. Aims of the session To introduce the basic concepts of semantic web ontologies.
Semantic Web Presented by: Edward Cheng Wayne Choi Tony Deng Peter Kuc-Pittet Anita Yong.
Presented by Gentre Dozier and Spencer Dille management.com/newsletters/database_metadata_unstructured_data_triple_store html.
Confidential 111 Semantics Repository Case Study, Findings, Wider implications Mike Bennett, Head of Semantics and Standards, EDM Council July 21-22, 2010.
Triple Stores.
Linked Data Visualizations for Eurostat Linked Data Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
ONTOLOGY SUPPORT For the Semantic Web. THE BIG PICTURE  Diagram, page 9  html5  xml can be used as a syntactic model for RDF and DAML/OIL  RDF, RDF.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
RDA and Linking Library Data VuStuff III Conference Villanova University, Villanova, PA October 18, 2012 Dr. Sharon Yang Rider University.
TOOLS FOR LLD Vocabularies, linking, and application programming.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
Web 2.0 meet Semantic Web at Yahoo! Dave Beckett Yahoo! Media Group November 8, 2006.
By: Dan Johnson & Jena Block. RDF definition What is Semantic web? Search Engine Example What is RDF? Triples Vocabularies RDF/XML Why RDF?
CubicWeb – The Semantic Web is a construction game! Student: Uglješa Milić University of Belgrade School of Electrical.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
Semantic Web - an introduction By Daniel Wu (danielwujr)
ToolMatch Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Products Patrick West 1 Nancy Hoebelheinrich.
Linked Data: Emblematic applications on Legacy Data in Libraries.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
RDF and Relational Databases
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Triple Storage. Copyright  2006 by CEBT Triple(RDF) Storages  A triple store is designed to store and retrieve identities that are constructed from.
Interface for Glyco Vault Functionality and requirements. Initial proposal. Maciej Janik.
Fall CSE330/CIS550: Introduction to Database Management Systems Prof. Susan Davidson Office: 278 Moore Office hours: TTh
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
1 A Medical Information Management System Using the Semantic Web Technology Networked Computing and Advanced INFORMATION MANAGEMENT, NCM '08. Fourth.
1 The Semantic Web Jonathan Jackson GCUU Master’s Seminar Spring 2005.
Semantic Web 06 T 0006 YOSHIYUKI Osawa. Problem of current web  limits of search engines Most web pages are only groups of character strings. Most web.
Team members: Sen Yan Chiu (Team Lead) Frank Chou Chih Wei Lee Lulie Gaston Viet Nguyen Sumeet Chandra Ankur Singh April 13, 2009.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Author: Akiyoshi Matonoy, Toshiyuki Amagasay, Masatoshi Yoshikawaz, Shunsuke Uemuray.
SysML v2 Model Interoperability & Standard API Requirements Axel Reichwein Consultant, Koneksys December 10, 2015.
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
Components.
The Semantic Web By: Maulik Parikh.
Triple Stores.
Collaborative Vocabulary Management
RDF For Semantic Web Dhaval Patel 2nd Year Student School of IT
Ontology.
Zachary Cleaver Semantic Web.
Triple Stores.
PREMIS Tools and Services
Database Systems Instructor Name: Lecture-3.
ToolMatch Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Products Patrick West1 Nancy
Chaitali Gupta, Madhusudhan Govindaraju
Triple Stores.
Low-bandwidth Semantic Web
Semantic-Web, Triple-Strores, and SPARQL
Taxonomy of public services
Taxonomy of public services
A framework for ontology Learning FROM Big Data
Presentation transcript:

RDF and triplestores CMSC 461 Michael Wilson

Reasoning  Relational databases allow us to reason about data that is organized in a specific way  Data that models specific relationships  Data that is very cleanly structured  What other reasoning methods are available to us?

Metadata  “Data about data”  Data that describes other data  Gives context  Example metadata:  Image EXIT data (geolocation, rotation, etc.)  User statistics  Last saved information in a file

What’s so important?  The context that we gather from metadata often allows us to understand a much greater picture  Can correlate and tie metadata together  Calculate statistics on metadata  Understand trends  Infinite possibilities

The depth of metadata  Many systems have their own way of storing metadata  Database tables may be organized to house specific metadata  This does not lend itself well to discovering new types of metadata  Person may have age, DOB  Later want to add new types (friends, Facebook ID, Twitter ID, etc.)

Metadata structures  RDF  Resource Description Framework  OWL  Web Ontology Language  Ontology – established vocabulary to describe knowledge within a domain  RDF is more widely used

Schemas  RDF and other structured metadata formats allow us to establish a common language to describe different sorts of metadata  We can make schemas that describe  Social media  Physical location  Job details  Moreover, we can tie them all to one subject  Doesn’t require database reorganization

Why is that cool?  What this means is that we can tie any arbitrary sets of data together with very little work on our part  We make a schema that describes a new domain, and staple that information onto an existing subject

Triples  Within these schemas, data is conceptually organized as   Subject  The subject of the expression  Predicate  The relationship between the subject and object  Object  The direct object of the expression  These expressions are called “triples”

Triple examples  Examples?

Storing triples  Since we are often interesting in large amounts of data, we need to think on how to store these  Triplestores  Pretty obvious  What do these give us over doing something like storing the information in a database?

Triplestore querying  Triplestores can also be queried  SQL is more limited for the kinds of queries we’d like to be able to make  SPARQL  The acronym stands for:  SPARQL Protocol and RDF Query Language

SPARQL  SPARQL is a SQL-like query language  Allows us to query on the various schemas we have assigned to our subjects  SPARQL queries can look surprisingly readable

SPARQL example PREFIX abc: SELECT ?capital ?country WHERE { ?x abc:cityname ?capital ; abc:isCapitalOf ?y. ?y abc:countryname ?country ; abc:isInContinent abc:Africa.

Querying power  Using SPARQL, you can make extremely deep, powerful queries and reason very intuitively on the data present in a triplestore  Organizing data this way allows computers to actually be able to reason on data as well

Caveats  All this tech is SUPER new  All tied very heavily into the Semantic Web  Basically introduce a system like this into the web at large  Metadata stored about web pages, computers can reason about them  Much of this is a moving target  Not a whole lot of production applications using this stuff yet

Tools  There are a few triplestore servers and other tools you can use  Jena  Apache project  Framework that allows for Semantic Web concepts to be employed  Can query using SPARQL  Jena can use Postgres in the background

More tools  RDFLib   Python library for RDF  Can run entirely in memory  Good for experimentation purposes and more