Presentation is loading. Please wait.

Presentation is loading. Please wait.

Copyright 2000 OCLC Worldwide Dewey Models 11: UKOLN/mda Terminology Workshop Diane Vizine-Goetz OCLC Online Computer Library Center.

Similar presentations

Presentation on theme: "Copyright 2000 OCLC Worldwide Dewey Models 11: UKOLN/mda Terminology Workshop Diane Vizine-Goetz OCLC Online Computer Library Center."— Presentation transcript:

1 Copyright 2000 OCLC Worldwide Dewey Models 11: UKOLN/mda Terminology Workshop Diane Vizine-Goetz OCLC Online Computer Library Center

2 Copyright 2000 OCLC Outline of talk zHow the DDC is evolving to meet the present & future needs of worldwide users zDewey in the Desire II project: the OCLC/NetLab collaboration zEnriching the content of the DDC database through vocabulary mapping projects

3 Copyright 2000 OCLC How Dewey evolves zRestructuring in new editions zContinuous updates zMultilingual use

4 Copyright 2000 OCLC New editions zRestructuring yTo reflect new views and fields of interest xCitation order changed in life sciences yTo remove bias xReduction in Christian bias in religion yTo provide for new topics xComputer science updated and expanded

5 Copyright 2000 OCLC Continuous updates zBiweekly yNew Library of Congress Subject Headings (LCSH) mapped to DDC numbers zMonthly yNew and changed entries zAnnual electronic edition yInclusion of frequently used synthesized numbers in Dewey for Windows and Dewey in CORC

6 Copyright 2000 OCLC New and Changed Entries for December 1999 The following new and changed entries are effective on December 1. Change is underlined. Edition 21 Changed entry: Rock climbing Class sport and indoor rock climbing in New entry: Sport climbing Class here indoor climbing

7 Copyright 2000 OCLC New Library of Congress Subject Headings (LCSH) mapped to DDC numbers

8 Inclusion of frequently used synthesized numbers in DFW and Dewey in CORC

9 Copyright 2000 OCLC Multilingual use zWorldwide use in WorldCat z59 national bibliographies use Dewey, including the BNB zTranslations underway or under discussion in 11 languages zTranslations often lead to expansions or adaptations to address cultural needs not met by standard arrangement

10 Copyright 2000 OCLC English/French/Spanish DDC Weather forecasts and reports for specific areas Prévisions et bulletins météorologiques pour des régions déterminées Pronósticos e informes del tiempo atmosférico para áreas especificas

11 Copyright 2000 OCLC Dewey in the Desire II project: the OCLC/NetLab collaboration zWhy use Dewey -- traditional schemes such as the DDC have many features that make them powerful tools in the electronic environment: yWell-defined categories yWell-developed hierarchies yMeaningful notation, and yA rich network of relationships

12 Copyright 2000 OCLC Further... zClassification systems have rich potential as switching languages in the polyglot Web environment--the controlled vocabulary (the notation) remains the same when the concept descriptions and indexes appear in another language

13 Copyright 2000 OCLC Engineering electronic library, Sweden (EELS) zDESIRE project partner zSubject-Based Information Gateway yService contains ~1,400 Internet resources about engineering yThe resources are selected according to agreed-upon quality criteria, intellectually described & assigned thesaurus terms and class codes from the Ei scheme

14 Copyright 2000 OCLC "All" Engineering resources on the Internet zAs a complement to EELS, the service provides access to 253,000 harvested engineering pages zThe pages are searchable by full text and browsable by domain/country, title, most cited URL, and most-cited engineering file directories

15 Copyright 2000 OCLC Common goals zTo test the effectiveness of a universal scheme v.s. a subject specific scheme for organizing a large domain specific collection zTo refine research methodologies for automated subject assignment

16 Copyright 2000 OCLC Research questions zShould we employ universal schemes, subject specific systems, or both for domain specific collections? zHow can reactive schemes deal with emerging topics? zWhat schemes provide effective bases for automated subject description?

17 Copyright 2000 OCLC Some characteristics of Ei and Dewey (engineering) Ei z700 classification categories z16,000 terms mapped to class structure z8,273 of these are preferred terms DDC y2,210 classification numbers y3,699 Dewey index terms y587 intellectually mapped terms y2,263 statistically mapped terms yTotal = 6,549

18 Copyright 2000 OCLC Automated classification in the DESIRE II project zMatched Ei terms to text in more than 132,000 harvested engineering pages zFor matched terms, associated Ei classes are assigned to the documents zThe matches are weighted based on yterm complexity, location of term in the document, frequency

19 Copyright 2000 OCLC NetLab Results z86,468 documents classified zAbout 6 class codes assigned per document z57% percent class codes automatically assigned to 923 EELS documents matched the intellectual classifications

20 Sample page for automated classification

21 Results

22 Copyright 2000 OCLC Automated classification with DDC using Scorpion zUses the DDC knowledge structure yEnhanced with supplemental terminology zEmploys an IR approach yTerms extracted from documents, formulated as a query, produces a ranked list of DDC classes

23 Copyright 2000 OCLC Enhanced DDC database--Terms for Semiconductors ( ) DDC Index Terms zCrystal devices--electronics zMiniaturization--electronics zModules--electronics zOptoelectronic devices zSemiconductors zThin-film technology zWafers (Electronics) Mapped Terms zQuantum dots zAmorphous semiconductors zCompound semiconductors zElectric contacts zFerroelectric thin films zGallium arsenide semiconductors zHot carriers zIon implantation zLithography, Electron beam

24 Scorpion Classifier

25 Scorpion results

26 Copyright 2000 OCLC Next Steps zCompare DDC intellectual & automated classifications for subset of EELS documents zProvide browsing access to document collection using both schemes zMap Ei & DDC class structures to enable cross-database browsing (EELS service to CORC, and the reverse)

27 Copyright 2000 OCLC Enrich Dewey by linking to other knowledge organization schemes zVocabulary Mapping yEnhances vocabulary & terminology yExpands the knowledge base yEnables the construction of custom views linked to or derived from the general scheme

28 Copyright 2000 OCLC Vocabulary Mapping Projects zLCSH & LC Children's Subject Headings zMedical Subject Headings (MeSH) zA Women's Thesaurus zEngineering Index Thesaurus (for research purposes only) zGEM subject terms zERIC Thesaurus (under discussion) zBioethics Thesaurus (under discussion)

29 Copyright 2000 OCLC Associated Vocabulary Profile

30 Copyright 2000 OCLC Statistics for terms associated with core Dewey framework Term Type Dewey Index Intellectually mapped (IM) Children's Headings (SHC) NetFirst (NF) Statistically Mapped(SM) Frequency Mapped (FM) Total Count 64,427 4,220 15,201 6,978 32,075 52, ,005








38 Copyright 2000 OCLC General Taxonomy Service zDescriptions of classes yBrief (class number and caption); Medium (brief plus DDC index terms); Full (all information associated with a DDC class) zCustom relationships yHierarchical; Interdisciplinary zAssociated Terminology yDDC Relative Index terms (Including terms from translated editions) yMapped Vocabularies (e.g., LCSH, MeSH, etc. zOther Services yClass number updating; Class number decomposition yMultilingual views based on translations of DDC yWeb site organizing tools (numbers and captions of 50,000 most frequently used categories)

39 Copyright 2000 OCLC Links zOCLC Office of Research zDewey Research Projects zOCLC Forest Press - zCORC - zEngineering electronic library, Sweden (EELS) -

Download ppt "Copyright 2000 OCLC Worldwide Dewey Models 11: UKOLN/mda Terminology Workshop Diane Vizine-Goetz OCLC Online Computer Library Center."

Similar presentations

Ads by Google