EDUG 2012 Symposium 26 April 2012 Boston Spa, UK Michael Panzer Assistant Editor, DDC OCLC DDC metadata.


1 DDC metadata. EDUG 2012 Symposium, 26 April 2012, Boston Spa, UK. Michael Panzer, Assistant Editor, DDC, OCLC (panzerm@oclc.org)

2 Types of DDC data
-Usually, Dewey numbers provide metadata for describing other resources
  -DDC as value vocabulary for metadata element sets
-Instead, the following focuses on cases where Dewey numbers and DDC editions are the resources described
-Two levels of DDC metadata
  -Number-level metadata (focus on bibliographic records)
  -Edition-level metadata (focus on classification records)

3 DDC metadata
Metadata about
-Dewey numbers (082, 083, 085 fields in MARC Bibliographic)
  -Provenance of machine-generated classification data
  -Dewey number components in linked 085 fields
-Dewey editions (084, 686 fields in MARC Classification)
  -Interplay between class- and edition-level metadata rendered in MARC Classification format

4 Agenda
Scenario
1. Provenance of machine-generated data
2. Edition-level metadata
3. Metadata about Dewey number components
Context
-Proposal for MARBI; (metadata) provenance initiatives at W3C / DCMI
-Relationship between translations and other “versions”
-Enhancing Dewey numbers for retrieval

5 MARBI proposal
-Drafted over the last two months in cooperation with colleagues from DNB and LC
-To be presented at MARBI meeting at ALA Annual Conference 2012
-Two options
  -Option 1: Addresses the immediate needs of documenting information about machine generation of classification data
    -Defines additional subfields in 082, 083, 084
  -Option 2: Proposes a more general way of dealing with metadata provenance
    -Applicable to all MARC variable fields (in principle)
    -Heeds the distinction between provenance in general and metadata provenance in particular

6 Option 1
Defined for 082, 083, 084
$i - Method of assignment designator
  Fully machine-generated (m)
  Not fully machine-generated (x)
$u - Process of assignment
  May contain a URI, a process name, or some other description of the process designated in $i
$1 - Confidence value
  Confidence of the assigning agency in relation to the process described in $u. Contains a value from the interval [0,1]
$q - Assigning agency (already defined)

7 Examples
DDC 23 number assigned by LC using AutoDewey. The AutoDewey process involves machine assistance followed by intellectual review:
082 00 $a 829/.3 $2 23 $i x $u autodewey $1 1
Fictitious example of DDC 22 number assigned by OCLC in a fully automated way using information in Classify:
082 04 $a 394.12 $2 22 $i m $u classify $1 0.5 $q OCoLC
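The Option 1 subfields in these two examples can be read mechanically. The following is a minimal Python sketch, not part of the proposal: `parse_subfields` and `describe_assignment` are hypothetical helper names, and the sketch assumes that each subfield code is a single character and that values never contain a literal `$`. It accepts subfield strings written with or without spaces.

```python
# Hypothetical helpers for illustration; the names are not from the MARBI proposal.
METHODS = {"m": "fully machine-generated", "x": "not fully machine-generated"}


def parse_subfields(data):
    """Split a subfield string such as '$a829/.3$223' into (code, value) pairs.

    Assumes '$' only ever introduces a subfield and each code is one character.
    """
    return [(chunk[0], chunk[1:].strip()) for chunk in data.split("$")[1:] if chunk]


def describe_assignment(data):
    """Summarize the Option 1 subfields ($i, $u, $1, $q) of an 082/083/084 field."""
    subfields = dict(parse_subfields(data))
    confidence = subfields.get("1")
    if confidence is not None:
        confidence = float(confidence)
        if not 0.0 <= confidence <= 1.0:
            raise ValueError("confidence must lie in the interval [0,1]")
    return {
        "number": subfields.get("a"),
        "edition": subfields.get("2"),
        "method": METHODS.get(subfields.get("i")),
        "process": subfields.get("u"),
        "confidence": confidence,
        "agency": subfields.get("q"),
    }


# The two example fields from the slide, without tag and indicators.
lc = describe_assignment("$a829/.3$223$ix$uautodewey$11")
oclc = describe_assignment("$a394.12$222$im$uclassify$10.5$qOCoLC")
print(lc["method"], lc["confidence"])      # not fully machine-generated 1.0
print(oclc["method"], oclc["confidence"])  # fully machine-generated 0.5
```

Reading `$11` as subfield `$1` with value `1`, and `$10.5` as `$1` with value `0.5`, follows from the one-character code assumption.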

8 Option 2
883 - Data provenance (R)
First Indicator: Method of assignment
  # - No information provided
  0 - Fully machine-generated
  1 - Not fully machine-generated
$d - Date on which the linked field was generated
$u - Process used to generate linked field
$q - Agency using the process/activity to generate the linked field
$1 - Confidence value
$x - Ending date of validity
$0 - Authority record control number or standard number
$8 - Field link and sequence number (with new field link type “p - Data provenance”)

9 Examples
082 00 $8 1\p $a 829/.3 $2 23
883 1# $8 1\p $u autodewey $d 20120407 $q DLC $1 1
082 04 $8 1\p $a 394.12 $2 22 $q OCoLC
883 0# $8 1\p $u classify $d 20120407 $q OCoLC $1 0.5

10 Examples (2)
082 04 $8 1\p $a 004 $2 22/ger $q NO-OsNB
883 0# $8 1\p $u deweyclassifierv0.1 $d 20120101 $x 20141231 $q NO-OsNB $1 0.25 $0 (DE-101)040268942
082 04 $8 1\p $a 004 $2 22/ger $q DE-101
883 0# $8 1\p $u parallelrecordcopy $d 20120101 $x 20141231 $q NO-OsNB
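Under Option 2, the provenance statement lives in a separate 883 field and is tied to the classification field through a shared `$8` value of link type `p`. A minimal Python sketch of that pairing follows; `parse_subfields` and `attach_provenance` are hypothetical names, the record is represented as plain `(tag, subfield_string)` tuples rather than through any MARC library, and repeated subfields are not handled.

```python
def parse_subfields(data):
    """Split a subfield string into (code, value) pairs; codes are one character."""
    return [(chunk[0], chunk[1:].strip()) for chunk in data.split("$")[1:] if chunk]


def attach_provenance(fields):
    """Pair each described field with the 883 field sharing its $8 link number.

    `fields` is a list of (tag, subfield_string) tuples from ONE record.
    Only $8 values of link type 'p' (data provenance) are considered.
    """
    described = {}
    provenance = {}
    for tag, data in fields:
        subfields = dict(parse_subfields(data))
        # '1\p' -> linking number '1', field link type 'p'
        number, _, link_type = subfields.get("8", "").partition("\\")
        if link_type != "p":
            continue
        target = provenance if tag == "883" else described
        target[number] = (tag, subfields)
    return [(described[n], provenance[n]) for n in described if n in provenance]


# First example pair from the slide (indicators omitted).
record = [
    ("082", r"$81\p$a004$222/ger$qNO-OsNB"),
    ("883", r"$81\p$udeweyclassifierv0.1$d20120101$x20141231"
            r"$qNO-OsNB$10.25$0(DE-101)040268942"),
]
pairs = attach_provenance(record)
(_, classification), (_, prov) = pairs[0]
print(classification["a"], prov["u"], prov["1"])  # 004 deweyclassifierv0.1 0.25
```

Keeping provenance in its own field is what lets the same mechanism apply, in principle, to any variable field, which is the point of Option 2.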

11 Agenda
Scenario
1. Provenance of machine-generated data
2. Edition-level metadata
3. Metadata about Dewey number components
Context
-Proposal for MARBI; (metadata) provenance initiatives at W3C / DCMI
-Relationship between translations and other “versions”
-Enhancing Dewey numbers for retrieval

12 Edition-level metadata
-Edition registry: capturing information about editions and translations in a centralized manner outside of MARC records
-Storing additional metadata about editions/translations in MARC records
-Better management of translation data and other versions
-MARC does not offer edition-level records
-Information has to be carried in individual records, even when it applies to the whole edition
-Relevant fields:
  084 - Classification Scheme and Edition
  686 - Relationship to Source Note

13 DDC translations: Anatomy of an edition
[Diagram. Labels shown: DDC 22 translations (French, German, Italian, Swedish Mixed); A14 translations (Italian, Vietnamese, French, Spanish, Hebrew); 200 Religion Class Guide (French); DDC Sach-Gruppen (German); DDC Summaries, with language lists English, French, Italian, Rhaeto-Romansch and Afrikaans, Arabic, Chinese, French, German, Norwegian, Portuguese, Russian, Scots Gaelic, Spanish, Swedish.]

14 Types of editions
-Related to an edition, with relationships not captured at record level
  Examples: sdnb, DDC Summaries, Guide
versus
-Related to an edition, with relationships captured at record level
  Examples: 200 Religion, translations, A15engind

15 Tracking edition-to-edition relationships
Translation of standard edition:
084 1# $a ddc $c 15 $e ind
Source edition:
084 1# $a ddc $c 15 $e eng
Authorized derivative version of standard edition:
084 8# $a ddc $c 22sdnb $d 22 $e ger
Source edition:
084 0# $a ddc $c 22 $e eng
-Not explicitly full or abridged; “8” is used for value of first indicator
-$n should be automatically populated with relevant information about the changes regarding the source edition.

16 Tracking record-to-record relationships
1. Record has been modified
Translation of standard edition:
084 1# $a ddc $c 15 $e ind
686 3# $i modified
Source record:
084 1# $a ddc $c 15 $e eng

17 Tracking record-to-record relationships (2)
2. Record was created for translation
Translation of standard edition:
084 1# $a ddc $c 15 $e ind
686 1# $b 305.899
Source record:
[does not exist]

18 Tracking record-to-record relationships (3)
3. Unmodified record from different source edition
Translation of standard edition:
084 1# $a ddc $c 15 $e ind
686 0# $2 23
Source record:
084 0# $a ddc $c 23 $e eng

19 Agenda
Scenario
1. Provenance of machine-generated data
2. Edition-level metadata
3. Metadata about Dewey number components
Context
-Proposal for MARBI; (metadata) provenance initiatives at W3C / DCMI
-Relationship between translations and other “versions”
-Enhancing Dewey numbers for retrieval

20 085 - Synthesized Classification Number Components
-085 fields provide information about components of Dewey numbers in linked 082 or 083 fields
-Mirror 765 fields in MARC Classification format
-Vital for faceted retrieval driven by Dewey numbers
-Further enhancements possible by utilizing mappings of Dewey numbers that occur prominently as components, e.g., geographic data, time periods
-Definition of new indexes is a requirement for retrieval use for WorldCat data

21 Exploiting Dewey facets in WorldCat
Das Highlander-Kochbuch
082 04 $8 1\x $a 641.594115 $q DE-101 $2 22/ger
085 ## $8 1\x $b 641.59
085 ## $8 1\x $z 2 $s 4115
641.593-.599 Cooking characteristic of specific continents, countries, localities
T2—4115 Highland

22 Proposed new indexes (083 fields)
“Dewey additional” index
da index: Add $z and $c ($y) to elements already in dd index
Pattern: [z--]a[-c][:a[-c]]

23 Proposed new indexes (085 fields)
“Dewey components” index
dc index: Index $s and $t concatenated with full address
Pattern: [z--]rs|w[-c][:t]
“Dewey synthesized” index
ds index: Index all components
Pattern: [z--]a|b|rs|u|w[-c][:a|b|t|u|v[-c]]

24 Proposed new indexes (082/083/085 fields)
“Dewey general” index
dg index: Index all elements in Dewey numbers
Pattern: Combine dd, da, and ds indexes

25 Example: History of Cologne during WWII
Built number: 943.55140864
9 History & geography
+ T2—435514 Cologne
+ 943.0864 Period of World War II, 1939-1945
082 00 $8 1\x $a 943/.55140864 $2 22
085 0# $8 1\x $b 9 $a 930 $c 990 $z 2 $s 435514 $u 943.5514
085 0# $8 1\x $b 943.5514 $a 930 $c 990 $v 01 $c 09 $f 0 $r 943.0 $s 864 $u 943.55140864

26 Access points / findability
082 00 $8 1\x $a 943/.55140864 $2 22
085 0# $8 1\x $b 9 $a 930 $c 990 $z 2 $s 435514 $u 943.5514
085 0# $8 1\x $b 943.5514 $a 930 $c 990 $v 01 $c 09 $f 0 $r 943.0 $s 864 $u 943.55140864
Components in dc index: 2--435514, 943.0864
Synthesized components in ds index: 2--435514, 9, 930-990, 930-990:01-09, 943.0864, 943.5514, 943.55140864
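The two dc index entries listed here can be derived from the 085 fields of the Cologne example. The sketch below is an illustration in Python, not the indexing routine actually proposed: `parse_subfields` and `dc_entry` are hypothetical names, and only the `[z--]rs` part of the dc pattern is implemented; the `$w`, `$c` and `$t` alternatives are left out.

```python
def parse_subfields(data):
    """Split a subfield string into (code, value) pairs; codes are one character."""
    return [(chunk[0], chunk[1:].strip()) for chunk in data.split("$")[1:] if chunk]


def dc_entry(field_data):
    """Build a 'Dewey components' (dc) index entry from one 085 field.

    Covers only the '[z--]rs' part of the pattern: optional table number,
    optional root number, then the digits added. Returns None without $s.
    """
    first = {}
    for code, value in parse_subfields(field_data):
        first.setdefault(code, value)  # keep the first occurrence of each code
    if "s" not in first:
        return None
    table_prefix = first["z"] + "--" if "z" in first else ""
    return table_prefix + first.get("r", "") + first["s"]


# The two 085 fields for 943.55140864 (tag and indicators omitted).
fields_085 = [
    r"$8 1\x $b 9 $a 930 $c 990 $z 2 $s 435514 $u 943.5514",
    r"$8 1\x $b 943.5514 $a 930 $c 990 $v 01 $c 09 $f 0 $r 943.0 $s 864 $u 943.55140864",
]
print([dc_entry(f) for f in fields_085])  # ['2--435514', '943.0864']
```

The first entry is the Table 2 notation for Cologne and the second is the World War II period number, so a search on either component finds the record even though neither appears as a left-anchored prefix of the full built number.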

27 Scenarios / Use cases
-Components / facets can be varied independently of each other
  -Allows for expanding, but also "morphing" the query by changing individual components
-Integration of mapped vocabularies into Dewey-driven discovery process
  -Using terms that have been mapped to any number components
-Usage of local hierarchies of number components instead of just the hierarchical relationships of the base number

28 Example: Dewey-driven discovery
Number components + mapped GeoNames
394.120954: 394.12 + T2—54
Neighboring countries (gn:neighbour): China T2—51, Pakistan T2—5491, Bangladesh T2—5492, Nepal T2—5496, Bhutan T2—5498, Myanmar T2—591
Varied query: 394.13 + T2—51
Related classes: 394.13 Drinking of alcoholic beverages, 394.125 Meals, 395.54 Table manners
[Diagram. Relationship labels shown: notational, structural, gn:neighbour]
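Varying the geographic component of a query can be illustrated with plain string handling. This Python sketch is only an approximation under stated assumptions: `swap_component` and `neighbouring_queries` are hypothetical names, the neighbour table is copied from the slide instead of being fetched from GeoNames, and the code replaces a trailing component textually, whereas real number building follows the add instructions in the schedules.

```python
# Table 2 notations for the neighbouring countries listed on the slide.
NEIGHBOURS = {
    "China": "51",
    "Pakistan": "5491",
    "Bangladesh": "5492",
    "Nepal": "5496",
    "Bhutan": "5498",
    "Myanmar": "591",
}


def swap_component(number, old, new):
    """Replace the trailing component `old` of a built number with `new`.

    String-level only; it does not check that the result is a valid number.
    """
    if not number.endswith(old):
        raise ValueError(f"{number} does not end with component {old}")
    return number[: len(number) - len(old)] + new


def neighbouring_queries(number, area):
    """Return one candidate query per neighbouring country."""
    return {
        country: swap_component(number, area, notation)
        for country, notation in NEIGHBOURS.items()
    }


queries = neighbouring_queries("394.120954", "54")
print(queries["China"])     # 394.120951
print(queries["Pakistan"])  # 394.12095491
```

A discovery system built on the proposed indexes would more likely search the component entries directly instead of rebuilding full numbers; the sketch only shows how one facet can change while the base number stays fixed.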

29 Thank You! Questions? Comments? Ideas?

30 Some useful links
DDC 23: http://www.oclc.org/us/en/dewey/versions/print/default.htm
Abridged Edition 15: http://www.oclc.org/us/en/dewey/versions/abridged/default.htm
WebDewey 2.0: http://dewey.org/webdewey
dewey.info: http://dewey.info
Dewey webinars & presentations: http://www.oclc.org/us/en/dewey/news/conferences/default.htm
025.431: The Dewey blog: http://ddc.typepad.com
Classify: http://classify.oclc.org/classify2/
Questions?
dewey@loc.gov (Dewey Editorial Office)
dewey@oclc.org (Licensing, group purchases, LIS program)

