Presentation is loading. Please wait.

Presentation is loading. Please wait.

What is the Semantic Web? Ivan Herman, Semantic Web Activity Lead, W3C Zheijiang University, Hangzhou, China, 2007 November 22 Ivan Herman.

Similar presentations


Presentation on theme: "What is the Semantic Web? Ivan Herman, Semantic Web Activity Lead, W3C Zheijiang University, Hangzhou, China, 2007 November 22 Ivan Herman."— Presentation transcript:

1 What is the Semantic Web? Ivan Herman, Semantic Web Activity Lead, W3C Zheijiang University, Hangzhou, China, 2007 November 22 Ivan Herman

2 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 2 ) ‏ ( 2 )‏ > Towards a Semantic Web The current Web represents information using  natural language (English, Hungarian, Chinese,…)  graphics, multimedia, page layout Humans can process this easily  can deduce facts from partial information  can create mental associations  are used to various sensory information (well, sort of… people with disabilities may have serious problems on the Web with rich media!)

3 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 3 ) ‏ ( 3 )‏ > Towards a Semantic Web Tasks often require to combine data on the Web:  hotel and travel information may come from different sites  searches in different digital libraries  etc. Again, humans combine these information easily  even if different terminologies are used!

4 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 4 ) ‏ ( 4 )‏ > However… However: machines are ignorant!  partial information is unusable  difficult to make sense from, e.g., an image  drawing analogies automatically is difficult  difficult to combine information automatically is same as ? how to combine different XML hierarchies?  …

5 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 5 ) ‏ ( 5 )‏ > Example: automatic airline reservation Your automatic airline reservation  knows about your preferences  builds up knowledge base using your past  can combine the local knowledge with remote services: airline preferences dietary requirements calendaring etc It communicates with remote information (i.e., on the Web!)  (M. Dertouzos: The Unfinished Revolution)

6 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 6 ) ‏ ( 6 )‏ > Example: data(base) integration Databases are very different in structure, in content Lots of applications require managing several databases  after company mergers  combination of administrative data for e-Government  biochemical, genetic, pharmaceutical research  etc. Most of these data are accessible from the Web (though not necessarily public yet)

7 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 7 ) ‏ ( 7 )‏ > And the problem is real…

8 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 8 ) ‏ ( 8 )‏ > Example: “smart” portal Various types of “portals” are created (for a journal on line, for a specific area of knowledge, for specific communities, etc) The portals may:  integrate lots of different data sources  may have access to specialized domain knowledge Goal is to provide a better local access, search on the integrated data, reveal new relationships among the data

9 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 9 ) ‏ ( 9 )‏ > Example: Digital Libraries It means catalogs on the Web  librarians have known how to do that for centuries  goal is to have this on the Web, World-wide  extend it to multimedia data, too But it is more: software agents should also be librarians!  help you in finding the right publications

10 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 10 ) ‏ ( 10 )‏ > What is needed? (Some) data should be available for machines for further processing Data should be possibly combined, merged on a Web scale Sometimes, data may describe other data (like the library example, using metadata)… … but sometimes the data is to be exchanged by itself, like my calendar or my travel preferences Machines may also need to reason about that data

11 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 11 ) ‏ ( 11 )‏ > In what follows… We will use a simplistic example to introduce the main Semantic Web concepts We take, as an example area, data integration

12 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 12 ) ‏ ( 12 )‏ > The rough structure of data integration 1. Map the various data onto an abstract data representation  make the data independent of its internal representation… 2. Merge the resulting representations 3. Start making queries on the whole!  queries that could not have been done on the individual data sets

13 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 13 ) ‏ ( 13 )‏ > A simplified bookstore data (dataset “A”)

14 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 14 ) ‏ ( 14 )‏ > 1 st : export your data as a set of relations

15 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 15 ) ‏ ( 15 )‏ > Some notes on the exporting the data Relations form a graph  the nodes refer to the “real” data or contain some literal  how the graph is represented in machine is immaterial for now Data export does not necessarily mean physical conversion of the data  relations can be generated on-the-fly at query time via SQL “bridges” scraping HTML pages extracting data from Excel sheets etc. One can export part of the data

16 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 16 ) ‏ ( 16 )‏ > Another bookshop data (dataset “F”)

17 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 17 ) ‏ ( 17 )‏ > 2 nd : export your second set of data

18 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 18 ) ‏ ( 18 )‏ > 3 rd : start merging your data

19 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 19 ) ‏ ( 19 )‏ > 3 rd : start merging your data (cont.)

20 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 20 ) ‏ ( 20 )‏ > 3 rd : merge identical resources

21 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 21 ) ‏ ( 21 )‏ > Start making queries… User of data “F” can now ask queries like:  « donnes-moi le titre de l’original »  (ie: “give me the title of the original”) This information is not in the dataset “F”… …but can be retrieved by merging with dataset “A”!

22 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 22 ) ‏ ( 22 )‏ > However, more can be achieved… We “feel” that a:author and f:auteur should be the same But an automatic merge doest not know that! Let us add some extra information to the merged data:  a:author same as f:auteur  both identify a “Person”  a term that a community may have already defined: a “Person” is uniquely identified by his/her name and, say, homepage it can be used as a “category” for certain type of resources

23 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 23 ) ‏ ( 23 )‏ > 3 rd revisited: use the extra knowledge

24 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 24 ) ‏ ( 24 )‏ > Start making richer queries! User of dataset “F” can now query:  « donnes-moi la page d’accueil de l’auteur de l’original »  (ie, “give me the home page of the original’s author”) The information is not in datasets “F” or “A”… …but was made available by:  merging datasets “A” and datasets “F”  adding three simple extra statements as an extra “glue”

25 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 25 ) ‏ ( 25 )‏ > Combine with different datasets Using, e.g., the “Person”, the dataset can be combined with other sources For example, data in Wikipedia can be extracted using dedicated tools  there is an active development to add some simple semantic “tag” to wikipedia entries (so called “Semantic Wiki”-s)  the “dbpedia” project can extract the “infobox” information from Wikipedia already…dbpedia

26 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 26 ) ‏ ( 26 )‏ > Merge with Wikipedia data

27 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 27 ) ‏ ( 27 )‏ > Merge with Wikipedia data

28 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 28 ) ‏ ( 28 )‏ > Merge with Wikipedia data

29 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 29 ) ‏ ( 29 )‏ > Is that surprising? Maybe but, in fact, no… What happened via automatic means is done all the time, every day by the users of the Web! The difference: a bit of extra rigor (e.g., naming the relationships) is necessary so that machines could do this, too

30 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 30 ) ‏ ( 30 )‏ > What did we do? We combined different datasets  all may be of different origin somewhere on the web  all may have different formats (mysql, excel sheet, XHTML, etc)  all may have different names for relations (e.g., multilingual)  ombination was done independently of the local schemas We could combine the data because some URI-s were identical (the ISBN-s in this case) We could add some simple additional information (the “glue”), also using common terminologies that a community has produced As a result, new relations could be found and retrieved

31 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 31 ) ‏ ( 31 )‏ > It could become even more powerful We could add extra knowledge to the merged datasets  e.g., a full classification of various type of library data  geographical information  etc. This is where ontologies, extra rules, etc, may come in Even more powerful queries can be asked as a result

32 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 32 ) ‏ ( 32 )‏ > What did we do? (cont)

33 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 33 ) ‏ ( 33 )‏ > So where is the Semantic Web? The Semantic Web provides technologies to make such integration possible! For example:  an abstract model for the relational graphs: RDF  extract RDF information from XML (eg, XHTML) pages: GRDDL  add structured information to XHTML pages: RDFa  a query language adapted for the relational graphs: SPARQL  characterize the relationships, categorize resources: RDFS, OWL, SKOS, Rules applications may choose among the different technologies some of them may be relatively simple with simple tools (RDFS), whereas some require sophisticated systems (OWL, Rules)  reuse of existing “ontologies” that others have produced (FOAF in our case)

34 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 34 ) ‏ ( 34 )‏ > So where is the Semantic Web? (cont)

35 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 35 ) ‏ ( 35 )‏ > SW data begins to accumulate on the Web IgentaConnectIgentaConnect bibliographic metadata storage: over 200 million triplets Tracking the US CongressTracking the US Congress: data stored in RDF (around 25 million triplets) RDFS/OWL Representation of WordNetRDFS/OWL Representation of WordNet: also downloadable as 150MB of RDF/XML “Département/canton/commune” structure of France published by the French Statistical InstituteDépartement/canton/commune Geonames OntologyGeonames Ontology and associated RDF data: 6 million (and growing) geographical features RDF Book MashupRDF Book Mashup, integrating book data from, eg, Amazon “dbpedia”: get infobox data of Wikipedia into RDFdbpedia See, for example, the linked data indexlinked data index

36 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 36 ) ‏ ( 36 )‏ > And what about applications?

37 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 37 ) ‏ ( 37 )‏ > A number of projects in data integration Developments are under way at various companies, institutions  not always easy to find out the details… Data integration comes to the fore as one of the SW application areas

38 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 38 ) ‏ ( 38 )‏ > Integrate knowledge for Chinese Medicine Integration of a large number of relational databases (on traditional Chinese medicine) using a Semantic Layer  around 80 databases, around 200,000 records each A visual tool to map databases to the semantic layer using a specialized ontology Form based query interface for end users Courtesy of Huajun Chen, Zhejiang University, (SWEO Case Study)(SWEO Case Study)

39 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 39 ) ‏ ( 39 )‏ > Find the right experts at NASA Expertise locater for nearly 20,000 NASA civil servants using RDF integration techniques over 6 or 7 geographically distributed databases, data sources, and web services… Courtesy of Kendall Clark, Clark & Parsia, LLC

40 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 40 ) ‏ ( 40 )‏ > Ontology controlled annotation Annotation of different data formats all along the full drug discovery process… RDF Triple Store Web API Acroba t Chemical Series Compounds Assay Data Points Scientific Papers Any PDF Pathways Lab data Collaborations Targets BioMarkers Spreadsheets Powerpoints Word… Websites/Pages Views of exp data BrainStorming Meeting Notes Semantic Agents Automatic Email Alerts Project Portals Wikis Courtesy of Giles Day, Pfizer

41 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 41 ) ‏ ( 41 )‏ > Public health surveillance Integrated biosurveillance system (biohazards, bioterrorism, disease control, etc) Courtesy of Parsa Mirhaji, School of Health Information Sciences, University of Texas (SWEO Case Study)(SWEO Case Study)

42 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 42 ) ‏ ( 42 )‏ > Help in choosing the right drug regimen Help in finding the best drug regimen for a specific case  find the best trade-off for a patient Use an ontology for medical conditions, signs, symptoms Integrate data from various sources (patients, physicians, Pharma, researchers, etc) Courtesy of Erick Von Schweber, PharmaSURVEYOR Inc., (SWEO Use Case)(SWEO Use Case)

43 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 43 ) ‏ ( 43 )‏ > Some other names… Pfizer, NASA, Eli Lilly, MITRE Corp., Elsevier, Novartis, … UN FAO’s MeteoBroker, … Semantic Digital Library projects (JeromeDL, Simile, Fedora,…)

44 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 44 ) ‏ ( 44 )‏ > Web sites, portals, local site search Portal’s internal organization makes use of semantic data, ontologies  integration with external and internal data these are, often, extensions of data integration projects  better queries, often based on controlled vocabularies or ontologies…

45 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 45 ) ‏ ( 45 )‏ > Semantic portal for art collections Courtesy of Jacco van Ossenbruggen, CWI, and Guus Schreiber, VU Amsterdam

46 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 46 ) ‏ ( 46 )‏ > Semantic portal for cultural heritage Courtesy of Francisca Hernández, Fundación Marcelino Botín, and Richard Benjamins, iSOCO, (SWEO Case Study)(SWEO Case Study)

47 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 47 ) ‏ ( 47 )‏ > Help for deep sea drilling operations Integration of experience and data in the planning and operation of deep sea drilling processes Discover relevant experiences that could affect current or planned drilling operations  uses an ontology backed search engine Courtesy of David Norheim and Roar Fjellheim, Computas AS (SWEO Use Case)(SWEO Use Case)

48 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 48 ) ‏ ( 48 )‏ > Portal to Principality of Asturias’ documents Search through governmental documents A “bridge” is created between the users and the juridical jargon using SW vocabularies and tools Courtesy of Diego Berrueta and Luis Polo, CTIC, U. of Oviedo, and the Principality of Asturias, (SWEO Case Study)(SWEO Case Study)

49 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 49 ) ‏ ( 49 )‏ > Digital music asset portal at NRK Used by program production to find the right music in the archive for a specific show Courtesy of Robert Engels, ESIS, and Jon Roar Tønnesen, NRK (SWEO Case Study)(SWEO Case Study)

50 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 50 ) ‏ ( 50 )‏ > Elsevier’s DOPE browser Single interface to multiple data sources (in life sciences) Integration, search, etc, via thesauri and metadata in RDF(S) Courtesy of Anita de Waard, Elsevier, Christiaan Fluit, Aduna, and Frank van Harmelen, VU Amsterdam (SWEO Use Case)(SWEO Use Case)

51 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 51 ) ‏ ( 51 )‏ > Intelligent search for public services Semantic Web based search engine for public services at the municipality of Zaragoza (Spain) The search is based a local ontology, natural language processing and ontological reasoning Courtesy of Jesús Fernando Ruíz, Municipality of Zaragoza (SWEO Use Case)(SWEO Use Case)

52 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 52 ) ‏ ( 52 )‏ > Vodafone live! Integrate various vendors’ product descriptions via RDF  ring tones, games, wallpapers  manage complexity of handsets, binary formats A portal is created to offer appropriate content Significant increase in content download after the introduction Courtesy of Kevin Smith, Vodafone Group R&D (SWEO Case Study)(SWEO Case Study)

53 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 53 ) ‏ ( 53 )‏ > Other examples… Sun’s White Paper and System Handbook collectionscollections Nokia’s S60 support portalS60 support Harper’s Online Magazine Oracle’s virtual pressroomvirtual pressroom Opera’s community sitecommunity site Dow Jones’ SynapticaSynaptica

54 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 54 ) ‏ ( 54 )‏ > All kind of other types of applications…

55 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 55 ) ‏ ( 55 )‏ > Adobe’s XMP Metadata is added by, e.g., Photoshop into files in RDF XMPXMP is a way of embedding + vocabulary + a set of (public) tools (there are also 3 rd party tools to extract the RDF content) Used by a number of platform solutions

56 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 56 ) ‏ ( 56 )‏ > Natural interface to business applications Courtesy of C. Anantaram, Tata Consultancy Services Limited (SWEO Case Study)(SWEO Case Study) Users interact with a business application (eg, via email) in natural language; OWL helps in the retrieval of relevant concepts

57 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 57 ) ‏ ( 57 )‏ > Suggestions’ database… Employees of the bank can submit new ideas for innovation, improving the business process, reduce costs, etc The entry system analyses the entry, shows similar ideas already in the system based on the concepts (not words) User gets immediate feedback, system gets better search, analysis, etc Courtesy of José Luís Bas Uribe, Bankinter, and Richard Benjamins, iSOCO, (SWEO Case Study)(SWEO Case Study)

58 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 58 ) ‏ ( 58 )‏ > Other application areas come to the fore Content management Business intelligence Collaborative user interfaces Sensor-based services Linking virtual communities Grid infrastructure Multimedia data management Etc

59 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 59 ) ‏ ( 59 )‏ > Conclusions The Semantic Web is there to integrate data on the Web The goal is the creation of a Web of Data

60 , “What is the Semantic Web?”, Zheijiang University, 22.11.’07( 60 ) ‏ ( 60 )‏ > Thank you for your attention! These slides are publicly available on: http://www.w3.org/2007/Talks/1122-Hangzhou-IH/


Download ppt "What is the Semantic Web? Ivan Herman, Semantic Web Activity Lead, W3C Zheijiang University, Hangzhou, China, 2007 November 22 Ivan Herman."

Similar presentations


Ads by Google