KIT – University of the State of Baden-Württemberg and National Large-scale Research Center of the Helmholtz Association Institut AIFB – Angewandte Informatik.

Presentation transcript:

KIT – University of the State of Baden-Württemberg and National Large-scale Research Center of the Helmholtz Association Institut AIFB – Angewandte Informatik und Formale Beschreibungsverfahren Towards a Semantic Wikipedia: WikiData Project proposal overview Denny Vrandečić, Daniel Kinzler SMWcon, Berlin, September 22, 2011

Institut AIFB WikiData Wikimania 2005

WIKIDATA

WikiData: What, Why, How

WHAT

Shortipedia: Second-hand facts. For free.


Mock-up of a Wikidata page for Seattle

Seattle (From Wikidata)
The biggest city in Washington state
Also known as: Seattle, WA

Navigation: Main page, Contents, Access the API, Random page, Donate to Wikidata
Interaction: Help, About Wikidata, Community portal, Recent changes
Languages: Català, Česky, Dansk, Deutsch, Eesti, Español, Esperanto, Français, Hrvatski, Italiano, Complete list

Facts shown:
- State: Washington [3 sources]
- Country: USA [2 sources]
- Population: 608,660 [1 source]; 600,000 [2 sources]; [other values]
- Area code: 206 [2 sources]
- Mayor: "Michael McGi" (being typed) [0 sources]
- Demonym: Seattleite [1 source]
- Area: 369.2 km² [2 sources]
- Coordinates: [3 sources]
- [new fact]

Autocomplete suggestions for the Mayor field:
- Michael McGillicutty, American professional wrestler
- Michael McGimpsey, Northern Irish politician
- Michael McGinn, US lawyer and politician
- Michael McGinlay, Irish footballer
- Michael McGinn, Scottish playwright

Project plan: 3 phases
- Phase 1: Interwiki links
- Phase 2: Infobox augmentation
- Phase 3: Inline queries

Phase 1: Interwiki links
- Current: every language links to every other
- In Wikidata: create one page for each entity and list its representations in each language
- Also have labels, aliases, and short descriptions
- Maybe external identifiers too?
- In Wikipedias: pull interwiki links from Wikidata and display them when a magic word is used
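The Phase 1 idea can be sketched in Python as follows. This is only an illustration under stated assumptions: the `Entity` record, its field names (`sitelinks`, `labels`, `aliases`, `descriptions`), the identifier, and the function `interwiki_links` are hypothetical and are not taken from the proposal.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    """Hypothetical central record: one page per entity (illustrative only)."""
    entity_id: str
    sitelinks: dict = field(default_factory=dict)     # language code -> article title
    labels: dict = field(default_factory=dict)        # language code -> label
    aliases: dict = field(default_factory=dict)       # language code -> list of aliases
    descriptions: dict = field(default_factory=dict)  # language code -> short description


def interwiki_links(entity, current_language):
    """Return sorted (language, title) pairs for every language except the current one."""
    return sorted(
        (lang, title)
        for lang, title in entity.sitelinks.items()
        if lang != current_language
    )


seattle = Entity(
    entity_id="example-seattle",  # made-up identifier
    sitelinks={"en": "Seattle", "de": "Seattle", "fr": "Seattle"},
    labels={"en": "Seattle"},
    aliases={"en": ["Seattle, WA"]},
    descriptions={"en": "The biggest city in Washington state"},
)

# The English article would pull its interwiki links from the central record.
print(interwiki_links(seattle, "en"))
```

Each Wikipedia only needs to know the entity its article represents; the list of other languages lives in one place.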

Phase 2: Infobox augmentation
- Current: each article calls an infobox with values
- In Wikidata: centralize the values
- In Wikipedias: just call the infobox and populate it with values from Wikidata
- For each value, give the possibility to add sources
- Just like in Shortipedia
- All still highly scalable (only lookups)
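A minimal sketch of the "only lookups" point, assuming a hypothetical in-memory store. The values and source counts come from the Seattle mock-up; the source names are placeholders, and `CENTRAL_FACTS` and `populate_infobox` are illustrative names, not part of the proposal.

```python
# Hypothetical central store: entity -> property -> list of (value, sources).
# Source names are placeholders; only the counts match the mock-up slide.
CENTRAL_FACTS = {
    "Seattle": {
        "State": [("Washington", ["source-1", "source-2", "source-3"])],
        "Population": [
            ("608,660", ["source-4"]),
            ("600,000", ["source-5", "source-6"]),
        ],
        "Area code": [("206", ["source-7", "source-8"])],
    }
}


def populate_infobox(entity, properties, facts=CENTRAL_FACTS):
    """Fill an infobox by plain dictionary lookups; no query engine is involved."""
    entity_facts = facts.get(entity, {})
    infobox = {}
    for prop in properties:
        values = entity_facts.get(prop)
        if values:
            value, sources = values[0]  # assumption: display the first listed value
            infobox[prop] = {"value": value, "source_count": len(sources)}
    return infobox


# Properties without a stored value (here: Mayor) are simply left out.
print(populate_infobox("Seattle", ["State", "Population", "Mayor"]))
```

Choosing which of several competing values to display is left open here; the sketch just takes the first one.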

Phase 3: Inline queries
- Enable inline queries in Wikipedias
- With several formats

WHY

WikiData: Goals
- Provide a database of the world’s knowledge that anyone can edit
- Collect references and quotes for millions of data items
- Engage a sustainable community that collects data from everywhere in a machine-readable way
- Increase the quality and lower the maintenance costs of Wikipedia and related projects
- Deliver software and community best practices enabling others to engage in projects of data collection and provisioning

Database of the world’s knowledge that anyone can edit
- Facts about millions of entities
- Collaboratively edited and maintained database
- Read-write access for humans and bots
- Data can be reused anywhere
- Common vocabulary of entities for the Web

Annotations of text with facts all over the Web
- Every single fact can be given a reference to text on the Web
- Incentive: maintaining the validity of the references
- Can be used for training and validating text understanding in several languages
- Can be automatically learned from reading the text and validated by humans
(Figure: example annotation with the labels "Starbucks", "Founded in", and "Seattle")

Sustainable community with clear incentives
- Additional extrinsic motivation through improving Wikipedia
- Build on interest of working Wikipedia communities
- Some tasks accessible to game mechanisms and ‘casual encyclopeding’
- Heterogeneous tasks available for contributors

Increase the quality and lower the maintenance costs of Wikipedia
- WikiData replaces a lot of manual or bot effort
- Centralizing interwiki links decreases current quadratic costs to linear
- Centralizing infobox maintenance decreases current linear costs to constant
- Centralizing infobox maintenance also decouples language capabilities from data maintenance
- Make Wikipedia more attractive by including more data and visualizations
- Removes the argument ‘who will maintain this visualization?’
- Enable automatic creation of millions of stubs in more than 100 languages
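The quadratic-to-linear claim for interwiki links can be checked with a small calculation. The sketch assumes a single entity that has an article in each of n language editions; the function names are illustrative.

```python
def links_without_hub(n_languages):
    """Every language edition links to every other one: n * (n - 1) links."""
    return n_languages * (n_languages - 1)


def links_with_hub(n_languages):
    """Each language edition is listed once on the central entity page: n entries."""
    return n_languages


# Compare both schemes for a few edition counts.
for n in (3, 10, 100):
    print(n, links_without_hub(n), links_with_hub(n))
# prints:
# 3 6 3
# 10 90 10
# 100 9900 100
```

For one entity covered in 100 languages, that is 9,900 links to maintain across articles versus 100 entries on one central page.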

Provide software, experience, and example for similar projects
- WikiData will not be the only data gathering community
- Provide software used on WikiData
- Share experience about managing such a project
- Encourage other communities to create new bold projects for knowledge acquisition: in research, in enterprises, in culture, in hobbies

HOW

Software architecture
(Diagram; labels shown: MediaWiki, Semantic MediaWiki, Data backend, WikiData extension, Wikimedia Foundation infrastructure, Browser, MediaWiki, WikiData client, External website (twice), Browser, App)

Technical differences to SMW
- Annotate statements
  - With sources
  - With context (most importantly, time)
- No free text
- Save directly as structure instead of wikitext
- Probably save JSON first instead of wikitext content
- Back end to store the data and query it scalably
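As a rough illustration of saving a statement as structure rather than wikitext, here is a sketch that serializes one annotated statement to JSON. The field names (`entity`, `property`, `value`, `sources`, `context`), the helper `make_statement`, the source URL, and the date are all assumptions made for the example; the slides do not define a schema.

```python
import json


def make_statement(entity, prop, value, sources=(), valid_at=None):
    """Build a structured statement; all field names are hypothetical."""
    statement = {
        "entity": entity,
        "property": prop,
        "value": value,
        "sources": list(sources),
    }
    if valid_at is not None:
        # Context annotation; the slide singles out time as the most important one.
        statement["context"] = {"time": valid_at}
    return statement


stmt = make_statement(
    "Seattle",
    "Population",
    608660,
    sources=["https://example.org/placeholder-source"],  # placeholder URL
    valid_at="2010",  # illustrative date, not taken from the slides
)

# Store as JSON and read it back; the structure survives the round trip.
serialized = json.dumps(stmt, sort_keys=True)
restored = json.loads(serialized)
print(serialized)
```

Because the stored form is already structured, a back end can index and query it without parsing wikitext first.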

Clear incentives structure per phase / task

Phase 1: Interwiki links
- Wikipedians are not creating abstract entities
- Replace the current quadratic-cost interwiki system with a linear-cost one

Phase 2: Infoboxes
- Wikipedians do not gather data aimlessly
- Replacing current (horrible!) templates in many articles
- Increase consistency, decrease maintenance costs
- Provide sources for all facts in order to ensure quality
- Informative stubs for 100,000s of articles in over 100 languages

Phase 3: Inline queries
- Enable attractive visualizations of data
- Not only in Wikipedia, but anywhere!
- Gather data for specific sets of interest

Thank you! Questions and discussions