AIFB Denny Vrandečić AIFB, Universität Karlsruhe (TH) Semantics 2006 Vienna, November 28th, 2006 Semantic Wikipedia The missing links.


1 Semantic Wikipedia: The missing links. Denny Vrandečić, AIFB, Universität Karlsruhe (TH). Semantics 2006, Vienna, November 28th, 2006

6 Universal Access to All Knowledge

11 Universal Access to All Knowledge

12 Marrying Wikipedia and the Semantic Web

13 Overview
- Background: Wikis, Wikipedia, and the Semantic Web
- Semantic Wikipedia: Idea, What it looks like, Advantages
- Challenges and Opportunities: Web ecosystem, Open issues, Next steps

14 Wikis

15 Long, long time ago… Using Pattern Languages for Object Oriented Programming, OOPSLA '87

16 Design Patterns

17 Design Patterns [Diagram labels: Web Page, HTML, Simple Syntax, HTML]

18 Wikis
- Everyone can edit
  - Technology allows editing
  - Syntax is easy to learn
- History of edits
  - Recent changes
  - Community building / attribution
- Easy to revert
  - Important for fighting vandalism
  - No fear of breaking the system

19 Wikipedia

20 A new encyclopedia

21
- Created March 2000
- Free, web-based encyclopedia
- Everyone can read
- Expert authors and editors
- Extensive formal peer review
- Until January 2001: 22 articles
- Side project: wiki-based Nupedia

22
- Created March 2000
- Free, web-based encyclopedia
- Everyone can read
- Expert authors and editors
- Extensive formal peer review
- Until January 2001: 22 articles
- Until January 2002: 20,342 articles in 17 languages, 17,307 in English
- January 2001 edit 31

23-28 Wikipedia growth (one row added per slide; digits lost in extraction are marked […])

Year      Articles     English      Languages
[…]       20,342       17,307       17
[…]       […],129      98,475       25
[…]       […],562      189,124      52
[…]       […],311,697  438,289      162
[…]       […],100,360  893,237      197
[…]/2006  5,565,830    1,462,910    250

29 Wikipedia users
- 2.7 million registered users
- About 70,000 contributors
- 2% (1,400) make 73.4% of all edits
- Most content comes from a wide user base
- Clean-up / gardening by a small group
[English Wikipedia, study by Aaron Swartz]

30 Wikipedia's problems

31 But it can't work!
- Everyone can edit
- Repairing is easier than breaking
- No special status for experts
- Community building
- Nature study on quality
  - 4 / 3 error rate (Wikipedia / EB)
  - Controversial

32 Quality
- Hard to discover factual errors
- Focusing on quality
- Repeated facts
- Large number of lists
- But still not all interesting lists

48 Coverage by language
- English: 1.5 million
- German: 0.5 million
- 10 more languages: […]
- […] more languages: […]
- But what about other languages?

49 Semantic Web

50-55 [Diagram, built up over several slides. Labels: Angola, Zambia, Africa, Country, Continent, located in, borders]

56 Semantic Wikipedia: Idea

57 Wikipedia today

Brač is a Croatian island in the Adriatic Sea. The island has a population of 13,000, living in numerous little towns, ranging from the 'main town' Supetar, with more than 2,500 inhabitants, to Novo Selo, where only a dozen people live.
Links: Croatian, Adriatic Sea, Supetar, Novo Selo

Today, Brač lives mostly on tourism, but fishing and agriculture (especially wine and olives) are very important too, as is selling its precious, white stone (which was used in building Diocletian's Palace in Split, and is built into the White House in Washington, DC, too).
Links: tourism, fishing, agriculture, wine, olives, Diocletian's Palace, Split, White House, Washington, DC

Category: Croatian Island

58 [Diagram labels: Brač, Croatia, Adriatic Sea, Italy, tourism, Zagreb, Split, Montenegro]

60-64 How are they linked? (slide 60 shows the untyped links; slides 61-64 add the link types step by step)

Brač  belongs to  Croatia
Brač  located in  Adriatic Sea
Brač  has town    Supetar
Brač  has town    Novo Selo
Brač  lives on    tourism
Brač  lives on    fishing
Brač  lives on    agriculture

65 Typed links
- Extend wiki with typed links
- So the computer understands it

Plain link:  Brač is a [[Croatia]]n island with an area of 396 km².
Typed link:  Brač is a [[belongs to::Croatia]]n island with an area of [[area:=396 km²]].

Resulting statements:
Brač  belongs to  Croatia
Brač  area        396 km²
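To make the typed-link idea concrete, here is a minimal Python sketch that pulls statements out of annotated wiki text. It only assumes the two forms shown on the slide, `[[relation::target]]` and `[[attribute:=value]]`; the function name `extract_statements` and the regular expression are illustrative and are not taken from the Semantic MediaWiki code base.

```python
import re

# Matches [[relation::target]] (typed link) and [[attribute:=value]] (attribute).
# Plain links such as [[Croatia]] and category tags are deliberately not matched.
ANNOTATION = re.compile(r"\[\[([^\[\]:|]+?)(::|:=)([^\[\]|]+?)\]\]")


def extract_statements(page_title, wikitext):
    """Return (subject, property, value, kind) tuples found in the wikitext."""
    statements = []
    for match in ANNOTATION.finditer(wikitext):
        prop, separator, value = match.groups()
        kind = "relation" if separator == "::" else "attribute"
        statements.append((page_title, prop.strip(), value.strip(), kind))
    return statements


text = ("Brač is a [[belongs to::Croatia]]n island "
        "with an area of [[area:=396 km²]].")
statements = extract_statements("Brač", text)
for statement in statements:
    print(statement)
```

The page title serves as the subject of every statement, which mirrors the slide: the annotation lives on the article about Brač, so Brač is what the statement is about.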

68 What does it look like?

74 Advantages

75 Many pages answer questions
- List of female tennis players
- Asteroids named after people
- Countries sorted by area, population, …
They can be generated automatically
- Fewer maintenance tasks
- Higher consistency
- Automatic tables and lists

76 Inline queries

[[Category:Country]]
[[located in::Africa]]
[[population:=>1,000,000]]
[[population:=<10,000,000]]
[[population:=*]]
[[area:=*km²]]
[[borders::*]]
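As a rough illustration of what such an inline query asks for, the following Python sketch evaluates the same conditions over a tiny in-memory set of annotated pages. The page names and all numbers are placeholders, not real data, and the reading that the `*` conditions only request values for display is my interpretation of the slide rather than a statement about the actual query engine.

```python
# Hypothetical annotated pages; names and numbers are placeholders, not real data.
PAGES = {
    "Country A": {"categories": {"Country"}, "located in": ["Africa"],
                  "population": 5_000_000, "area": 100_000,
                  "borders": ["Country B"]},
    "Country B": {"categories": {"Country"}, "located in": ["Africa"],
                  "population": 20_000_000, "area": 300_000,
                  "borders": ["Country A"]},
    "Country C": {"categories": {"Country"}, "located in": ["Europe"],
                  "population": 4_000_000, "area": 50_000, "borders": []},
    "City D": {"categories": {"City"}, "located in": ["Africa"],
               "population": 2_000_000, "area": 500, "borders": []},
}


def run_query(pages):
    """Select African countries with 1,000,000 < population < 10,000,000."""
    rows = []
    for title, page in sorted(pages.items()):
        if "Country" not in page["categories"]:
            continue
        if "Africa" not in page["located in"]:
            continue
        if not 1_000_000 < page["population"] < 10_000_000:
            continue
        # The '*' conditions filter nothing; they only ask for values to show.
        rows.append((title, page["population"], page["area"], page["borders"]))
    return rows


rows = run_query(PAGES)
for row in rows:
    print(row)
```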

77 Inline query results

78 Ontoworld

79 Ontoworld

80 Hand-crafted checks
- Does every country have one capital?
- Is there a person with more than one mother?
- Is every person born before dying?
- Does the population density fit the population and size?
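Checks like these become simple programs once the facts are available as data. The Python sketch below shows three of them; the data layout, the function names, and the 5% tolerance are assumptions made for illustration, and the sample facts are placeholders.

```python
from datetime import date

# Placeholder facts in a hypothetical layout; not real data.
CAPITALS = {
    "Country A": ["City X"],
    "Country B": ["City Y", "City Z"],
    "Country C": [],
}
PEOPLE = {
    "Person P": {"born": date(1900, 1, 1), "died": date(1980, 1, 1)},
    "Person Q": {"born": date(1950, 5, 5), "died": date(1940, 1, 1)},
    "Person R": {"born": date(1970, 3, 3), "died": None},
}


def countries_without_single_capital(capitals):
    """Countries whose number of recorded capitals is not exactly one."""
    return sorted(c for c, caps in capitals.items() if len(caps) != 1)


def people_dying_before_birth(people):
    """People whose recorded death date precedes their birth date."""
    return sorted(name for name, p in people.items()
                  if p["died"] is not None and p["died"] < p["born"])


def density_mismatch(population, area_km2, stated_density, tolerance=0.05):
    """True if the stated density deviates from population / area
    by more than the given relative tolerance."""
    expected = population / area_km2
    return abs(stated_density - expected) > tolerance * expected


print(countries_without_single_capital(CAPITALS))
print(people_dying_before_birth(PEOPLE))
print(density_mismatch(1000, 10, 150))
```

Each check returns the offending pages rather than a yes/no answer, so the result can be turned directly into a maintenance list.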

81 Multilinguality
- Automatic check of consistency across language boundaries
- Generating pages for smaller Wikipedias?
- Browsing information in different languages

84 Web ecosystem

85 Wikipedia as a vocabulary
- Semantic Wikipedia as a resource of URIs
- Maintaining good URIs is hard
- Documented
- Multilingual labels
- Enables easier mapping
- Reuse for common terms
- Helps in mapping the rest structurally

86 Scenario: looking for gifts
Till looks for a gift for Chrissie, for 20 [currency symbol lost]
[Diagram labels: Till, friend, Chrissie, fan, Robert Jordan, author of, Wheel of Time, price, 14,95, suggest]

87 Wikipedia as knowledge base
- Tools already integrate Wikipedia articles
- Bits of knowledge make more sense!
- Amarok with knowledge import instead of full articles?

88 Use Semantic Web tools
- Integration of data
- Querying: SPARQL endpoints
- Browsing: faceted browsers
- Visualization: Timeline

90 SMW installations Ontoworld sembase WWW2006 & ESWC2006 Wiki Semantic Karlsruhe KM Bible wiki Esoteric knowledge wiki Wikicompany JurisPedia …

91 Open issues

92 Convergence of vocabulary
- Consistency of vocabulary: author of, has written, creator…
- Documentation of all types
- Visual feedback: red and blue links
- Queries: consistent vocabulary needed
- Autocompletion? UI hints?

93 NLP and Semantic Wikipedia
- Suggestion / learning of relationships
  - Based on patterns
  - Statistical
  - Background knowledge
- Good playground for evaluations

94 Simpler user interface
- WYSIWYG interface for MediaWiki is in development
- How to integrate with the semantic extension?

95 Lack of expressivity
- How to say Ronald Reagan was US president from 1981 to 1989?
- What about relations between relations, like inverses?
- What about transitivity, symmetry?
- What about constraints like class disjointness?
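To show what is being asked for here, this Python sketch computes the statements that would follow if relations could be declared transitive, symmetric, or inverse to one another. It is a naive fixed-point loop written for illustration, not a description of how any existing system does it; the property name "town of" and the Luanda statement are additions of mine for the example.

```python
def close_under_axioms(triples, transitive=(), symmetric=(), inverses=None):
    """Return the triples plus everything entailed by the given
    transitivity, symmetry and inverse-property declarations."""
    inverses = inverses or {}
    closed = set(triples)
    changed = True
    while changed:
        new = set()
        for s, p, o in closed:
            if p in symmetric:
                new.add((o, p, s))
            if p in inverses:
                new.add((o, inverses[p], s))
            if p in transitive:
                # Chain s -p-> o -p-> o2 into s -p-> o2.
                for s2, p2, o2 in closed:
                    if p2 == p and s2 == o:
                        new.add((s, p, o2))
        changed = not new <= closed
        closed |= new
    return closed


facts = {
    ("Luanda", "located in", "Angola"),
    ("Angola", "located in", "Africa"),
    ("Angola", "borders", "Zambia"),
    ("Brač", "has town", "Supetar"),
}
closed = close_under_axioms(
    facts,
    transitive={"located in"},
    symmetric={"borders"},
    inverses={"has town": "town of"},  # "town of" is a hypothetical name
)
for triple in sorted(closed - facts):
    print(triple)
```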

96 Next steps

97 Next steps
- Marry Wikipedia and Semantic Web
- Need to run stress tests
  - Wikipedia at 12,000 hits per second
  - Scalability, scalability, scalability
- Show cool apps
- Multilinguality
- Tons of details

98 Conclusions
- Very flexible system for creating data
- Soft introduction
  - People are often scared of the Semantic Web
  - You can still use it as a standard wiki
- Immediate benefit
- The simplest database that could work
- Data is there – play with it!
- Kickstart the Semantic Web

99 Universal Access to All Knowledge

100 Thank you! ontoworld.org

101 Conclusions (repeat of slide 98)

102 Backup slides

103 Technicalities

104 MediaWiki
- Runs Wikipedia
- Active development
- Scalable
- Easy to use and powerful
- PHP / MySQL
- Not many SemWeb tools here

105 Mapping of OWL to SMW

OWL                         SMW
owl:Individual              Article
owl:Class                   Category
owl:ObjectProperty          Relation, link type
owl:DatatypeProperty        Attribute
Object property instance    Typed link [[property::object]]
Datatype property instance  [[property:=value]]
rdf:type class              Class instantiation: [[Category:class]] (on article page)
rdfs:subClassOf class       Subsumption: [[Category:class]] (on category page)
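The mapping can be read as a recipe for turning one wiki page into RDF-style triples. The Python sketch below follows it row by row. The prefixes `thing:` and `relation:` appear in the SPARQL example later in the deck; `category:` and `attribute:` are assumed by analogy, and the function names are hypothetical.

```python
def page_name(title):
    """Normalise a title the way MediaWiki page names look:
    first letter upper-cased, spaces replaced by underscores."""
    return (title[:1].upper() + title[1:]).replace(" ", "_")


def export_page(title, is_category_page, categories=(), relations=(),
                attributes=()):
    """Translate one page's annotations into (subject, predicate, object)
    triples, following the OWL-to-SMW mapping."""
    prefix = "category:" if is_category_page else "thing:"
    subject = prefix + page_name(title)
    triples = []
    for category in categories:
        # A category tag means subsumption on a category page and
        # class instantiation on an article page.
        predicate = "rdfs:subClassOf" if is_category_page else "rdf:type"
        triples.append((subject, predicate, "category:" + page_name(category)))
    for relation, target in relations:
        # Typed link [[relation::target]] -> object property instance.
        triples.append((subject, "relation:" + page_name(relation),
                        "thing:" + page_name(target)))
    for attribute, value in attributes:
        # [[attribute:=value]] -> datatype property instance with a literal.
        triples.append((subject, "attribute:" + page_name(attribute), value))
    return triples


article = export_page("Brač", False,
                      categories=["Croatian Island"],
                      relations=[("belongs to", "Croatia")],
                      attributes=[("area", "396 km²")])
category = export_page("City", True, categories=["Populated place"])
for triple in article + category:
    print(triple)
```

Attribute values are kept as plain strings here; a fuller export would parse the number and unit and emit a typed literal.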

106 Reuse vocabulary
- Existing vocabulary can be mapped
- Wiki directly usable as data source
- No external mapping required
- Define vocabulary and mapping
- But: no complex mappings

111 Reuse in a Webpage

114-118 [Diagram, built up over several slides. Labels: Angola, Zambia, Africa, Country, Continent, located in, borders]

120 SPARQL: RDF Query Language

PREFIX rdfs: <…>
PREFIX thing: <…>
PREFIX relation: <…>
SELECT ?label WHERE {
  thing:Angola relation:Located_in ?c .
  ?c rdfs:label ?label
}
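For readers unfamiliar with SPARQL, the query is a two-step pattern match: find every `?c` that Angola is located in, then look up the label of each `?c`. The Python sketch below performs the same join over a handful of in-memory triples. The triples are assumptions reconstructed from the labels of the Angola diagram (including the direction of the borders edge), and `match` is an illustrative helper, not part of any SPARQL library.

```python
# Triples assumed from the Angola diagram; prefixes follow the query above.
TRIPLES = [
    ("thing:Angola", "relation:Located_in", "thing:Africa"),
    ("thing:Zambia", "relation:Located_in", "thing:Africa"),
    ("thing:Angola", "relation:Borders", "thing:Zambia"),
    ("thing:Africa", "rdfs:label", "Africa"),
    ("thing:Angola", "rdfs:label", "Angola"),
]


def match(triples, subject=None, predicate=None, obj=None):
    """Yield triples matching a pattern; None plays the role of a variable."""
    for s, p, o in triples:
        if subject in (None, s) and predicate in (None, p) and obj in (None, o):
            yield (s, p, o)


def labels_of_locations(triples, thing):
    """Join the two patterns of the query: thing Located_in ?c, ?c label ?label."""
    labels = []
    for _, _, c in match(triples, subject=thing,
                         predicate="relation:Located_in"):
        for _, _, label in match(triples, subject=c, predicate="rdfs:label"):
            labels.append(label)
    return labels


labels = labels_of_locations(TRIPLES, "thing:Angola")
print("Angola is in " + labels[0] + ".")
```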

121 [Diagram labels: Angola, Zambia, Africa, Country, Continent, Located in, Borders]

122
Angola is in <?php
  […]
  $model->load("full URI/ExportRDF/Angola");
  $result = $model->sparqlQuery('SPARQL');
  $value = $result[0]['?label'];
  echo $value->getLabel();
?>

123 Angola is in Africa.

124 TBox engineering

125 Full TBox engineering?
- Not meant for it, but possible
- Does not capture semantics
- Does not propagate to categories

126 Reified ontologies [Diagram labels: SubclassOf Axiom1, meta: Populated place, meta: City, refers, subclass, superclass, subclass of, refers, Populated place, City, meta:subclassOf]

127 Reified ontologies [Diagram labels: SubclassOf Axiom1, meta: Populated place, meta: City, subclass, superclass, subclass of, Populated place, City, meta:subclassOf, owl tools]

128 Everything can be discussed
- All pages have a discussion page
- All individuals, classes, properties have a page
- With reification, even axioms may have a page
- Very fine-grained discussion possible
- Opinions can be formalized explicitly

129 Ontology import

130 Ontology import
- Reuse existing ontologies
- Upload mapped ontologies
- Kickstart a wiki
- Circumvent empty sheet problem
- Enrich an existing wiki
- Only the simple parts

OWL                         SMW
owl:Individual              Article
owl:Class                   Category
owl:ObjectProperty          Relation, link type
owl:DatatypeProperty        Attribute
Object property instance    Typed link [[property::object]]
Datatype property instance  [[property:=value]]
rdf:type class              Class instantiation: [[Category:class]] (on article page)
rdfs:subClassOf class       Subsumption: [[Category:class]] (on category page)

132 Reasoning wikis

133 Dynamics
[Diagram labels: User, Browser, Semantic MediaWiki, KAON2]
- edit article
- check for consistency
- consistent?
- warn if inconsistent
Example:
- "wife of" has domain Woman
- Woman and Man are disjoint
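The example on this slide can be traced by hand: a domain axiom adds a class to the subject of a statement, and a disjointness axiom then flags any individual that ends up in both classes. The Python sketch below does exactly that and nothing more. It stands in for the reasoner only for the purpose of illustration and makes no claim about how KAON2 works; the individuals are placeholders.

```python
# Axioms from the slide: "wife of" has domain Woman; Woman and Man are disjoint.
DOMAINS = {"wife of": "Woman"}
DISJOINT = [("Woman", "Man")]


def find_inconsistencies(types, statements):
    """types maps individuals to their asserted classes; statements are
    (subject, property, object) triples. Returns (individual, class, class)
    for every individual that falls into two disjoint classes."""
    inferred = {name: set(classes) for name, classes in types.items()}
    for subject, prop, _ in statements:
        if prop in DOMAINS:
            # Domain axiom: the subject must belong to the domain class.
            inferred.setdefault(subject, set()).add(DOMAINS[prop])
    problems = []
    for name, classes in sorted(inferred.items()):
        for first, second in DISJOINT:
            if first in classes and second in classes:
                problems.append((name, first, second))
    return problems


# Placeholder individuals: Person A is asserted to be a Man, then an edit
# adds the statement that Person A is the wife of Person B.
asserted = {"Person A": {"Man"}, "Person B": {"Man"}}
edit = [("Person A", "wife of", "Person B")]
problems = find_inconsistencies(asserted, edit)
if problems:
    print("Warning, inconsistent:", problems)
```

In the workflow on the slide, a non-empty result would trigger the warning to the user after the edit rather than block the edit.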

134 Mockup screen

135 Automatic classification
- Infer categories from statements
- Based on background ontology and wiki knowledge
- Automatic classification of articles
- Can be reused in queries

136 Mockup screen

137 URI crisis resolved?

138 URIs for everything
- Uniform Resource Identifiers
- Based on known protocols
- Linked data: resolve URI for description
- Maintaining URIs is hard
  - Clutter namespace
  - Setting up descriptions
  - Persistence
  - Reuse and mapping

139 Romeo and Juliet URI crisis
- Is Romeo and Juliet the book, or the article about the book?
- One URI for each
- Mind the gap!
- Redirects in browser
[Diagram labels: Shakespeare, author, JSmith 42, author, about]

140 Bug status: still open

142 Slides index
19 – Wikipedia
49 – Semantic Web
56 – Basic idea
74 – Advantages
91 – Open issues
101 – Conclusions
Backup slides
103 – Technicalities
111 – Reuse in webpages
124 – TBox engineering
129 – Ontology import
132 – Reasoning in wikis
137 – URI crisis resolved

