Download presentation
Presentation is loading. Please wait.
Published byΖένια Παχής Modified over 6 years ago
1
DBpedia – A Crystallization Point for the Web of Data
Zheng Liang
2
DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently provides information about more than 3.77 million “things”, including at least: 764,000 persons 573,000 places(including 387,000 populated places) 333,000 creative works (including 112,000 music albums, 72,000 films and 18,000 video games) ……
3
Contributions of the DBpedia
An information extraction framework that converts Wikipedia content into a rich multi-domain knowledge base. Timely and automatically evolves as Wikipedia changes . A Web-dereferenceable identifier for each DBpedia entity .To overcome the problem of missing entity identifiers Publish RDF links pointing from DBpedia into other Web data sources and support data publishers in setting links from their data sources to DBpedia
4
Outline DBpedia Knowledge Extraction Framework DBpedia Knowledge Base
Accessing the DBpedia Knowledge Base Interlinking DBpedia with other Data Sets DBpedia Applications Summary
5
DBpedia Knowledge Extraction Framework
Open Archives Initiative Protocol for Metadata Harvesting
6
Extracting from Wikipedia Page
Label Abstract Interlanguage Links Images Redirects Disambiguates External Links Pagelinks Homepages Categories Geo-coordinates
7
Extracting Infobox Data
dbpedia-owl:country dbpedia:China dbpedia-owl:elevation (xsd:double) dbpedia-owl:governmentType dbpedia-owl:isPartOf dbpedia:Jiangsu dbpedia-owl:populationTotal (xsd:integer) dbpedia-owl:populationUrban (xsd:integer) ...
9
DBpedia常用URI及其含义 http://DBpedia.org/ontology/xxx 对应Wiki Infobox 类
Person 类 Book类 Wiki Infobox-specific property 外部资源链接地址 指向对应的Wiki文章 重定向信息 消除歧义属性 页面ID 资源的名称信息
10
DBpedia Knowledge Base
DBpedia Ontology is a shallow, cross-domain ontology, which has been manually created based on the most commonly used infoboxes within Wikipedia. The ontology currently covers 359 classes which form a subsumption hierarchy and are described by 1,775 different properties.
11
DBpedia Knowledge Base
DBpedia DataSet provides three different classification schemata. Wikipedia Categories; using the SKOS vocabulary and DCMI terms. YAGO Classification; is derived from the Wikipedia category system using WordNet WordNet ; should be more precise than the Wikipedia category system.
12
Accessing the DBpedia Knowledge Base
Querying DBpedia SPARQL Endpoint Public Faceted Web Service Interface DBpedia Linked Data Interface
13
Querying DBpedia SPARQL Endpoint SPARQL is a query language for RDF.
provided using OpenLink Virtuoso as the back-end database engine Leipzig query builder at OpenLink Interactive SPARQL Query Builder (iSPARQL) at SNORQL query explorer at not work with Internet Explorer); or any other SPARQL-aware client(s).
14
sparql http://DBpedia.org/sparql
PREFIX : < PREFIX dbpedia2: < PREFIX dbpedia: < SELECT ?name ?y WHERE { ?name dbpedia2:centre ?name dbpedia2:postalCode ?y. }
15
iSPARQL http://dbpedia.org/isparql/
PREFIX : < PREFIX dbpedia2: < PREFIX dbpedia: < SELECT ?name ?y WHERE { ?name dbpedia2:centre ?name dbpedia2:postalCode ?y. } /////// ?point Georess:point
16
SNORQL http://DBpedia.org/snorql SELECT ?game ?title WHERE {
SELECT ?game ?title WHERE { ?game < < . ?game foaf:name ?title . } ORDER by ?title
17
Public Faceted Web Service Interface
Querying DBpedia Public Faceted Web Service Interface There is a public Faceted Browser “search and find” user interface at Tim Berners-Lee founder
18
DBpedia Linked Data Interface
Linked Data is a method of publishing RDF data on the Web and of interlinking data between different data sources. The DBpedia data set is served as Linked Data, meaning that all DBpedia URIs are dereferenceable. Browse the DBpedia data set with Semantic Web browsers like DISCO, Marbles, the OpenLink Data Explorer,Tabulator, the Zitgist Data Viewer or the Fluidops Information Workbench.
19
DISCO a simple browser for navigating the Semantic Web as an unbound set of data sources. This resource description contains hyperlinks that allow you to navigate between resources. While you move from resource to resource, the browser dynamically retrieves information by dereferencing HTTP URIs and by following rdfs:seeAlso links.
20
Marbles Marbles is a server-side application that formats Semantic Web content for XHTML clients using Fresnel lenses and formats. Colored dots are used to correlate the origin of displayed data with a list of data sources, hence the name.
21
Tabulator Using outline and table modes, it provides a way to browse RDF data on the web. ?v0 < ?v2. ////////////////// SELECT ?v0 ?v1 ?v2 WHERE { < < ?v0 . ?v0 < ?v1 . ?v0 < ?v2 . }
22
Interlinking DBpedia with other Data Sets
The DBpedia data set is interlinked with various other data sources.
23
External Links The DBpedia data set contains HTML links to external web pages as well as RDF links into external data sources. Two types of links to HTML pages: dbpedia:reference links point; foaf:homepage links that point to web pages. RDF links are represented using the owl:sameAs property. Examples of External RDF Links # Two RDF links taken from DBpedia < owl:sameAs < . < owl:sameAs < . SPARQL: PREFIX owl: < PREFIX xsd: < PREFIX rdfs: < PREFIX link: < PREFIX foaf: < PREFIX rdf: < PREFIX dc: < PREFIX map: <file:///Users/richard/D2RQ/DBLP/dblp-mapping.n3#> PREFIX d2r: < PREFIX dblp: < SELECT * WHERE { ?z dc:creator < . ?z rdf:type < ?z dc:title ?name }
24
DBpedia Applications gFacet- Graph-based Faceted Exploration of RDF Data.
25
DBpedia Applications RelFinder –extracts and visualizes relationships between given objects in RDF data and makes these relationships interactively explorable.
26
DBpedia Applications SemLens – uses scatter plots for the analysis of Dependencies in DBpedia data and semantic lenses for further exploration.
27
DBpedia Applications DBpedia Mobile – is a location-centric DBpedia client application for mobile devices consisting of a map view annotated with DBpedia, the Marbles Linked Data Browser and a GPS-enabled launcher application.
28
Future Work Revolutionize Wikipedia Search
Include DBpedia Data in Your Web Page Mobile and Geographic Applications Document Classification, Annotation and Social Bookmarking Multi-Domain Ontology
29
Summary 对 DBpedia 知识抽取框架,知识库结构,如何访问知识库进行简要介绍,并对现有的查询浏览等工具的功能进行验证。
存在问题: 大多数工具有浏览,提供过滤及SPARQL查询,但大多数针对单个数据集,没有跨多数据源的查询。如何针对众多开放的SPARQL Endpoint进行集成查询? Interlinking 中如果对不同数据集中的相似实体进行匹配关联? 众多SPARQL Endpoint可否与SView系统集成?
30
Thanks!
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.