Presentation is loading. Please wait.

Presentation is loading. Please wait.

Linked Library Data Modeling Metadata for the [Semantic] Web Presented 2010-11-19 Columbia University Digital Library Seminar Series Corey A Harper.

Similar presentations


Presentation on theme: "Linked Library Data Modeling Metadata for the [Semantic] Web Presented 2010-11-19 Columbia University Digital Library Seminar Series Corey A Harper."— Presentation transcript:

1 Linked Library Data Modeling Metadata for the [Semantic] Web Presented 2010-11-19 Columbia University Digital Library Seminar Series Corey A Harper

2 2010-11-19Harper - Linked Library Data - Columbia University2 Topical Overview Semantic Web Intro Linked Open Data –Graphs: Entity – Attribute – Value –A Few Examples Library Data

3 2010-11-19Harper - Linked Library Data - Columbia University3 Topical Overview (cont) Linked Library Data –SKOS and Authority Control –FRBR and Bibliographic Data –National Libraries Resource Description and Access (RDA) Dublin Core Metadata Initiative

4 2010-11-19Harper - Linked Library Data - Columbia University4 Semantic Web TBL’s original vision –“Weaving the Web” – 1999 Then: Focus on Machine Reasoning –Scientific American Article Now: Focus on things & links –Reasoning becoming lower level

5 2010-11-19Harper - Linked Library Data - Columbia University5 Semantic Web Originally: –Metadata standard built on XML –Metadata about “Web” things Eventually: –Metadata about all things –Metadata about relationships between things

6 2010-11-19Harper - Linked Library Data - Columbia University6 Semantic Web Terminology Resource: Any thing Class: Abstraction of a type of thing Individual: An instance of a class Property: An attribute of an individual Ontology: A domain specific collection of classes and properties Statement/Triple: –A Resource (subject) - Nodes –A Property (predicate) - Arcs –A Value (object) - Nodes

7 2010-11-19Harper - Linked Library Data - Columbia University7 Semantic Web Terminology Graphs: Representations of statements about resources Nodes: The Subjects and Objects in a Graph Arcs: The Predicates in a Graph Literals: “Objects” represented as strings (constant values) rather than things (URI References) Domains and Ranges: Constraints on Nodes For Example…

8 2010-11-19Harper - Linked Library Data - Columbia University8

9 2010-11-19Harper - Linked Library Data - Columbia University9 RDF Resource Description Framework Formally Begun in 1999 Ideas from 1995 Finalized in 2004 Frighteningly complex at times… –“Directed Labeled Graphs”

10 2010-11-19Harper - Linked Library Data - Columbia University10 SemWeb Value Proposition Formally Modeled (Meta) Data Formal Semantics Declaration Increased Granularity compared to record-based Metadata Improved Interoperability

11 2010-11-19Harper - Linked Library Data - Columbia University11 “The vast bulk of data to be on the Semantic Web is already sitting in databases … all that is needed [is] to write an adapter to convert a particular format into RDF and all the content in that format is available.” - Tim Berners-Lee in an interview with the Consortium Standards Bulletin

12 2010-11-19Harper - Linked Library Data - Columbia University12 Linked Open Data Use URIs as names for things Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information. Include links to other URIs. so that they can discover more things. http://www.w3.org/DesignIssues/LinkedData.html

13 2010-11-19Harper - Linked Library Data - Columbia University13

14 2010-11-19Harper - Linked Library Data - Columbia University14

15 2010-11-19Harper - Linked Library Data - Columbia University15

16 2010-11-19Harper - Linked Library Data - Columbia University16 Linked Data Cloud Automated generation –Comprehensive Knowledge Archive Network (CKAN)Comprehensive Knowledge Archive Network (CKAN) –Vocabulary of Interlinked Datasets (voiD)Vocabulary of Interlinked Datasets (voiD) –Basically, catalog your metadata! Recent criticism: data quality

17 2010-11-19Harper - Linked Library Data - Columbia University17 Data in the Cloud Hubs in the May 2008 Version: –FOAF –DBPedia Myriad Sources coming online: –Thompson Reuters –New York Times –British Broadcasting Corporation –Google and Facebook –More and More Library Data –Geonames –MusicBrains

18 2010-11-19Harper - Linked Library Data - Columbia University18 DBpedia Structured Wikipedia Data Genres, Influences, External Links Multi-lingual / Multi-script labels Rich Semantics Many linkages to other datasets

19 2010-11-19Harper - Linked Library Data - Columbia University19 DBpedia 3.4 Million “things” described Ontology based on “infoboxes”Ontology –1.5 million things classified Approx. 50,000 “Properties” –Approx. 1,200 defined in ontology Brief Example

20 2010-11-19Harper - Linked Library Data - Columbia University20 Domain Modeling Starting from application / goal / function “To guide and evaluate our designs, we need objective criteria that are founded on the purpose of the resulting artifact, rather than based on a priori notions of naturalness or Truth.” – Gruber, 1993 Does this apply to Libraries? FRBRer?

21 2010-11-19Harper - Linked Library Data - Columbia University21 DBPedia Model Partial basis in data entry conventions InfoBox’s, and InfoBox Templates Metadata Entry Format Partial source of Ontology –Class Structure –Vocabulary Design

22 2010-11-19Harper - Linked Library Data - Columbia University22 DBpedia 3.4 Million “things” described Ontology based on “infoboxes” –1.5 million things classified –http://wiki.dbpedia.org/Ontology Approx. 50,000 “Properties” –Approx. 1,200 defined in ontology

23 2010-11-19Harper - Linked Library Data - Columbia University23

24 2010-11-19Harper - Linked Library Data - Columbia University24

25

26 2010-11-19Harper - Linked Library Data - Columbia University26 More Examples British Broadcasting Corporation –Programmes, Music, Wildlife Google Refine Data.gov and data.gov.uk NY Times

27 2010-11-19Harper - Linked Library Data - Columbia University27 What *things* are in our data???

28 2010-11-19Harper - Linked Library Data - Columbia University28 …Library data is extremely complicated

29 2010-11-19Harper - Linked Library Data - Columbia University29 Bibliographic Data Rich stores of MARC, MODS, &c. Robust Controlled Vocabularies –Subject Heading lists –Code lists –Thesauri Emerging data model in FR*

30 2010-11-19Harper - Linked Library Data - Columbia University30 Bibliographic Vocabs Bibliographic Ontology –Zotero, Omeka, EPrints and Others FRBR – unofficial –And now Official (Thank you IFLA!) ISBD

31 2010-11-19Harper - Linked Library Data - Columbia University31 Library Authority Data “Include links to other URIs. so that they can discover more things.” Short of providing and linking to URIs, this *is* authority data. This is what our authority files are for.

32 2010-11-19Harper - Linked Library Data - Columbia University32 Library Controlled Vocabularies: Benefits Reputation - Trusted Tradition Mature - Time tested and carefully developed General & Comprehensive - Cover large knowledge spaces

33 2010-11-19Harper - Linked Library Data - Columbia University33 SKOS Simple Knowledge Organization System Properties and Classes for describing Controlled Vocabulary RDF Page skos:primaryTopic skos:person

34 2010-11-19Harper - Linked Library Data - Columbia University34 LCSH in Dublin Core Encoding Scheme for DC Subject No easy way to draw on equivelent terms and cross-references Abstract Model, RDF and SKOS could enable applications to make use of the whole vocabulary

35 2010-11-19Harper - Linked Library Data - Columbia University35 LCSH as a Web Service! Uses principles of linked data lcsh.info -> id.loc.gov People noticed when taken down Links to French Subject Headings URIs for Literal String lookup http://id.loc.gov/authorities/label/World Wide Web

36 2010-11-19Harper - Linked Library Data - Columbia University36

37 2010-11-19Harper - Linked Library Data - Columbia University37 Other Vocabularies Thesaurus for Economics French Subject Headings Swedish Subject Headings IconClass (not on web yet) OCLC Terminology Services Dewey Decimal Classification Virtual International Authority File

38 2010-11-19Harper - Linked Library Data - Columbia University38 Linked Library Data VIAF, LCSH, MARC Codes Open Library, XC, Kualli OLE Library of Congress, OCLC Hungarian, German, British, Swedish National Libraries Formalized Efforts: W3C, IFLA & RDA

39 2010-11-19Harper - Linked Library Data - Columbia University39 Kungliga Biblioteket Image courtesy of Martin Malmstemhttp://blog.libris.kb.se/semweb/?p=7

40 2010-11-19Harper - Linked Library Data - Columbia University40 National Széchényi Library “ Our RDFDC, FAOF and SKOS statements are linked together. Our name authority is matched with the DBPedia name files and URI aliases are handled as owl:sameAs statements.” - Adam Horvath

41 2010-11-19Harper - Linked Library Data - Columbia University41 W3C LLD XG “Incubator Group” Membership: –Researchers, Consultants, Librarians –National Libraries: Germany, France, LoC, Sweden –OCLC & IFLA

42 2010-11-19Harper - Linked Library Data - Columbia University42

43 2010-11-19Harper - Linked Library Data - Columbia University43 W3C LLD XG Goals Collecting, Curating and Clustering over 50 Use Cases Mining use cases for functional requirements and design patterns Recommendations to W3C –Should lead to Working Groups

44 2010-11-19Harper - Linked Library Data - Columbia University44 RDA Development RDA elements, roles and vocabularies have been provisionally registered IFLA FRBRer and ISBD elements and vocabularies have been officially registered Discussions about long term maintenance of both RDA and the vocabularies Effort to create multi-language RDA Vocabularies RDA Slides Adapted from Diane Hillmann

45 2010-11-19Harper - Linked Library Data - Columbia University45 RDA Elements Listing 334!

46 2010-11-19Harper - Linked Library Data - Columbia University46 RDA Elements Listing 334! Base material

47 2010-11-19Harper - Linked Library Data - Columbia University47 Detail: Base Material

48 2010-11-19Harper - Linked Library Data - Columbia University48 Detail: Base Material URI

49 2010-11-19Harper - Linked Library Data - Columbia University49 RDA Base Material Vocabulary

50 2010-11-19Harper - Linked Library Data - Columbia University50 RDA WEMI Relationships

51 2010-11-19Harper - Linked Library Data - Columbia University51 Detail: RDA WEMI Relationship

52 2010-11-19Harper - Linked Library Data - Columbia University52 Metadata Registries Formerly NSDL Registry –Now “Open Metadata Registry” –Managing Vocabularies –Providing Vocabulary Services DCMI Registry Community DCMI Architecture Forum

53 2010-11-19Harper - Linked Library Data - Columbia University53 DCMI and the Semantic Web Collaboration from the start Libraries (esp. OCLC) were at the table Perception of DCMI as DCMES –DCMI = Metedata Vocab / Framework –DCMES = Metadata Record Format

54 2010-11-19Harper - Linked Library Data - Columbia University54 DCMI and the Semantic Web Every example above had dcterms DCMI as Research Institute and Metadata Think Tank –Modeling Work –Metadata Registries –Application Profiles –Description Set Profiles –Singapore Framework

55 2010-11-19Harper - Linked Library Data - Columbia University55 Changing Role of DCMI Mike Bergman at DC2010: –Reference Metadata –Reference Concepts –Mapping Predicates “Mappings should be approximate” –Usage Guidelines Compliment to W3C Standards

56 2010-11-19Harper - Linked Library Data - Columbia University56 Why Does This Matter? Our descriptions no longer stand alone! Connect our data with the rest of the WEB Allow others to reuse more easily –FOAF –DBPedia –Geonames –MusicBrains –New York Times –Thompson Reuters –Government Data - data.gov –British Broadcasting Corporation

57 2010-11-19Harper - Linked Library Data - Columbia University57 Conclusions Distributed bibliographic control environment –Linking Data –Focus on identification over description “In short, by treating values as non- literal resources and assigning URIs to them we give ourselves (and others) the hooks on which to hang further descriptions.” - Andy Powell

58 2010-11-19Harper - Linked Library Data - Columbia University58 Endless possibilities This barely scratches the surface The Giant Global Graph!! With more soundly modeled bibliographic and authority data… –Terminology Services –Context sensitive interfaces –Customized Exhibits –Mashups –Web Services –User Profiling –Collaboration tools

59 2010-11-19Harper - Linked Library Data - Columbia University59 Continuing Challenges Emerging Technology Design Patterns Complexity (http-range14) Existing Technical Infrastructure Bootstrapping Business Cases

60 2010-11-19Harper - Linked Library Data - Columbia University60 More Information W3C LLD XG: http://www.w3.org/2005/Incubator/lld/wiki/Main_Page ALA LLD Interest Group: –http://kcoyle.net/lld-ala.htmlhttp://kcoyle.net/lld-ala.html IFLA Semantic Web SIG –https://wiki.d-nb.de/x/vA10Aghttps://wiki.d-nb.de/x/vA10Ag

61 2010-11-19Harper - Linked Library Data - Columbia University61 Thanks! corey.harper@nyu.edu 212.998.2479 Questions?


Download ppt "Linked Library Data Modeling Metadata for the [Semantic] Web Presented 2010-11-19 Columbia University Digital Library Seminar Series Corey A Harper."

Similar presentations


Ads by Google