Presentation is loading. Please wait.

Presentation is loading. Please wait.

1.2 Content Management Catherine M. Jannik Georgia Institute of Technology MetaArchive Distributed Digital Preservation Workshop Emory University – Atlanta,

Similar presentations


Presentation on theme: "1.2 Content Management Catherine M. Jannik Georgia Institute of Technology MetaArchive Distributed Digital Preservation Workshop Emory University – Atlanta,"— Presentation transcript:

1 1.2 Content Management Catherine M. Jannik Georgia Institute of Technology MetaArchive Distributed Digital Preservation Workshop Emory University – Atlanta, Georgia May 30, 2007

2 Session Learning Objectives Scoping Content: Determining the scope of your network and what you will harvest. Describing Content: Determining how you will describe content and why you need a conspectus. Inventorying Collections: Completing the conspectus based on your scope and schema. Harvesting: Prioritizing and preparing for harvest.

3 Scoping Content Identify criteria for what is in and out of scope; what will be harvested Scoping decisions 1. Subject 2. Media formats 3. Risk 4. MetaArchive case study

4 Scoping Content 1. Subject a. Is there a subject area the members share in common that might provide a starting point? b. How to define that subject area and its boundaries. c. Establish or adopt a controlled vocabulary.

5 The partner institutions of this project are engaged in a three-year process to develop a cooperative for the preservation of at-risk digital content with a particular content focus: the culture and history of the American South.

6

7 A discussion of Southern culture and history must always begin with clarification of the terms. Southern is a term that, to most, brings to mind a particular region. However, upon closer inspection, the South and its boundaries are not so easily mapped. One could begin and end with the eleven former Confederate states, though that excludes the four other slave states that remained part of the Union. One could consider the “census south:” the Confederacy with the addition of Delaware, Maryland, West Virginia, Oklahoma, and the District of Columbia. There is also the Gallup organization’s South that includes the Confederate eleven plus Oklahoma and Kentucky. Then there are the areas of the country that serve as home to former Southerners who retain much of their culture and infuse their new locals with vestiges of their former homes. As the Encyclopedia’s editors and authors did, we will rely on a cultural definition of the South more inclusive than not, focusing largely on the former states of the Confederacy but without excluding the margins of the region where the culture of the South is evident. After careful contemplation of the meaning of “culture,” the editors of the Encyclopedia planned their work “to carry out [T.S.] Eliot’s belief that ‘culture is not merely the sum of several activities, but a way of life.’” History is the most easily defined of the terms and is evident in most of the collections in the MetaArchive project however, an historical component was not required for consideration. 10.13.2004

8 The definition of Southern culture and history used in this project is constructed with broad strokes. The Content Committee responsible for this definition owes a debt of gratitude to the editors of the Encyclopedia of Southern Culture on whose introduction we relied heavily. A discussion of Southern culture and history must always begin with clarification of the terms. Southern is a term that, to most, brings to mind a particular region. However, upon closer inspection, the South and its boundaries are not so easily mapped. One could begin and end with the eleven former Confederate states, though that excludes the four other slave states that remained part of the Union. One could consider the “census south:” the Confederacy with the addition of Delaware, Maryland, West Virginia, Oklahoma, and the District of Columbia. There is also the Gallup organization’s South that includes the Confederate eleven plus Oklahoma and Kentucky, and the National Endowment for the Humanities includes Puerto Rico and the Virgin Islands in its South Atlantic Humanities Center. The South is also an identity. Southerners who move outside of the region, however defined, retain much of their culture and infuse their new locales with vestiges of their former homes. Conversely, people born outside of the South who come to live within the region find that their work and lives are influenced by their adopted home and themselves become a part of the evolving South. As the Encyclopedia’s editors and authors did... 04.19.2005

9 MetaArchive Scope document https://www.metaarchive.org/metawiki/index. php?title=Main_Page https://www.metaarchive.org/metawiki/index. php?title=Main_Page

10 Scoping Content 2. Media formats a. What can the repository support? (LOCKSS is agnostic) b. Which of those formats does each member have c. Of that list, which formats do we want to include? d. Stance on master vs. derivative or compressed files (jpg vs. tiff)

11 MetaArchive and Formats Formats and Media The digital formats of the material considered do not affect the harvest because LOCKSS is format-agnostic. The LOCKSS system provides redundant replication of files in any format. Hence, formats were not a major consideration for risk ranking. Most of the candidate collections incorporate content in several formats and each institution will handle its own format migration outside the scope of this project. Collections stored only on off-line media should be considered at high risk and, therefore, become part of this preservation cache. The extent or size of the collection and the Internet Media [MIME] Types included will be noted. NOTE: This follows the Western States Dublin Core Metadata Best Practices Draft v 2.0 Draft August, 2004 http://www.cdpheritage.org/resource/metadata/documents/WSDCMBP_v 2-0.pdf http://www.cdpheritage.org/resource/metadata/documents/WSDCMBP_v 2-0.pdf Example Format [Extent] 3,000,000 bytes Format [Medium] DVD Format [IMT] image/jpeg

12 Master vs. derivative The MetaArchive of Southern Digital Culture understands that in the digital realm there are often times various versions of a digital object which comprise the complete copy of the object as a whole. In regards to this area we encourage our partners to select the version or versions of the digital object which best represent the original content of the object. We recognize that organizations such as DLF have created standards for elements of a digital master registry (http://www.diglib.org/collections/reg/DigRegGuide.htm). In such documents we note the attempt to distinguish two categories of digital objects which we will refer to as (i) preservation or digital masters and (ii) access, digital use or surrogate copies. Member institutions will decide whether they want to preserve their digital masters, their access copies, or all versions of a digital object.http://www.diglib.org/collections/reg/DigRegGuide.htm

13 http://www.diglib.org/collections/reg/DigRegGuide.pdf

14 Scoping Content 3. Risk a. Born digital vs. digitized b. Particular formats c. Items in use vs. dark items

15 Scoping Content 3. Risk

16 Scoping Content 4. MetaArchive Case Study

17 Describing Content How will the content be described? Cataloging decisions 1. Schemas 2. Conspectus database 3. MetaArchive case study

18 Describing Content 1.Schemas a. Adopt, adapt, or create a schema? b. Content, context, and format must be captured.

19 Western States Dublin Core Metadata Best Practices http://www.westernwater.org/pdf/Western%20waters%20Dublin%20Core%20Metadata_v2-0.pdf

20 Collaborative Digitization Program Dublin Core Metadata Best Practices http://www.cdpheritage.org/cdp/documents/CDPDCMBP.pdf

21 Dublin Core Collections Application Profile http://dublincore.org/groups/collections/collection-application-profile/

22 UKOLN Research Support Libraries Programme (RSLP) Collection Description Schema http://www.ukoln.ac.uk/metadata/rslp/schema/

23 IMLS DCC Collection Description Metadata Schema http://imlsdcc.grainger.uiuc.edu/CDschema_elements.asp

24 PREMIS Preservation Metadata: Implementation Strategies http://www.oclc.org/research/projects/pmwg/premis-final.pdf

25 MetaArchive Collection-Level Conspectus Metadata Specification http://metaarchive.org/pdfs/conspectus_md_2005.html

26 Subject [dc:subject] LOUISVILLE

27 Describing Content 2. Conspectus database a. Collection-management tool b. How it’s constructed c. MetaArchive’s framework

28

29 Describing Content

30 3. MetaArchive Case Study

31

32

33

34

35

36

37 Inventorying Collections Gather information based on descriptive schema Describing your collection 1. Inventory a. What to include b. Timeframe c. Preparation

38 Inventorying Collections a. What to include – network and local decision b. Timeframe – agree on one c. Preparation – get all the information together first!

39 Harvesting What to save/harvest first, later, eventually and how Preparing for harvest 1. Prioritization a. Institution level b. Network level

40 Harvesting 2. Preparation a. Inventory b. “Data wrangling” c. Plug-ins and manifest pages

41 HARVEST!!

42 Discussion jannik@gatech.edu Thanks!


Download ppt "1.2 Content Management Catherine M. Jannik Georgia Institute of Technology MetaArchive Distributed Digital Preservation Workshop Emory University – Atlanta,"

Similar presentations


Ads by Google