The Open Content Alliance Project Liz Bell & Charley Pennell.

1 The Open Content Alliance Project Liz Bell & Charley Pennell

2 What is OCA; Background; Relationship to Internet Archive; Nature of NCSU Libraries’ relationship; Selection process; Scanning process; Bibliographic description

3 The Open Content Alliance
- Announced in October 2005 as a reaction to the Google Books project
- Alliance of Yahoo with universities and non-profits, hosted by the Internet Archive (University of California, University of Toronto, British Library)
- Scans material outside U.S. copyright protection (published before 1923), or later material with permission of the copyright holder
- “Opt in” (seek permission of the copyright holder) vs. “opt out” (scan first, ask later)
- Funded by Microsoft until 2008 as part of Live Search Books

4 OCA members today: British Library; Columbia University; Emory University; European Archive; Getty Research Institute; Indiana University; Internet Archive; Johns Hopkins; McMaster University; Memorial University of Newfoundland; Missouri Botanical Garden; National Archives; National Library of Australia; Natural History Museum, London; New York Botanical Garden; Research Libraries Group (RLG); Rice University; San Francisco Public Library; Simon Fraser University; Smithsonian Institution; University of Alberta; University of British Columbia; University of California; University of Chicago; University of Georgia; University of Illinois, Urbana; University of North Carolina; University of Ottawa; University of Pittsburgh; University of Texas; University of Toronto; University of Virginia; Washington University; Xerox Corporation; Yahoo!; York University

5 The Internet Archive
- Founded in 1996 by Brewster Kahle, inventor of WAIS and founder of Alexa Internet
- Credo: “Universal access to all knowledge”
- Operates the Wayback Machine, an archive of past Web content
- Launched Open Library as a wiki alternative to OCLC
- Legally recognized in California as a library

7 OCA at NCSU Libraries
- Two-year contract expires 31 March 2010
- Focus on botanical and biological sciences
- Pre-1923 titles (though a few later titles have sneaked in accidentally)
- The Agromeck
- One scanning machine, two daily shifts

8 Selection criteria
- Books selected by Collection Managers and SCRC
- Pull list generated by Patrick in Preservation
- Book trucks delivered to scanners

9 Scanning operations
- Scanning takes place in the Collection Management office
- Scanning apparatus consists of one booth with two SLR cameras, a computer, and lighting
- Two part-time scanners work 8 AM to 10 PM daily
- Quota of 3,000 pages/day
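
As a rough check on what the daily quota implies, the arithmetic can be sketched as follows. The only inputs are the figures on the slide (3,000 pages/day, 8 AM to 10 PM); the assumption that the station runs continuously for the whole window, with no breaks or setup time, is ours and not the presenters'.

```python
from datetime import time

# Figures taken from the slide.
PAGES_PER_DAY = 3000
SHIFT_START = time(8, 0)   # 8 AM
SHIFT_END = time(22, 0)    # 10 PM


def hours_between(start: time, end: time) -> float:
    """Length of the scanning day in hours (both times on the same day)."""
    return (end.hour + end.minute / 60) - (start.hour + start.minute / 60)


# Assumption: the scanner is in use for the entire window.
scanning_hours = hours_between(SHIFT_START, SHIFT_END)
rate = PAGES_PER_DAY / scanning_hours          # pages per hour
seconds_per_page = 3600 / rate

print(f"{scanning_hours:.0f} h/day, about {rate:.0f} pages/hour, "
      f"one page every {seconds_per_page:.1f} s")
```

Under that assumption the quota works out to roughly 214 pages an hour, or one page about every 17 seconds.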

17 Output
- NCSU project has scanned over 3,100 volumes
- Most downloaded title: Standardized plant names; a catalogue of approved scientific and common names of plants in American commerce (3,247 downloads)
- Formats supported: JPEG 2000, PDF, DjVu, OCR text, Kindle, DAISY
- Metadata: MARC, MARCXML, technical

19 NCSU metadata generation
- Start with list of archived titles from Preservation
- Download XML files from OCA site
- Using MarcEdit (Terry Reese):
  1. Apply XSLT style sheet (Erin Stalberg) to convert OCA XML file to MARCXML
  2. Convert MARCXML to MARC
  3. Join MARC files for import into OCLC
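
To make steps 2 and 3 concrete, here is a minimal sketch in Python of what "convert MARCXML to MARC" and "join MARC files" involve at the byte level. It is not MarcEdit's implementation and not the NCSU workflow itself: it is a stand-in using only the standard library, and the sample record, the `marcxml_to_marc` and `join_marc` names, and the default leader are all hypothetical. It relies on the general ISO 2709 layout (24-character leader, 12-character directory entries, field and record terminators) and assumes UTF-8 data and records shorter than 99,999 bytes.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"
NS = {"m": MARC_NS}
FT, RT, SF = b"\x1e", b"\x1d", b"\x1f"   # field / record terminator, subfield delimiter
DEFAULT_LEADER = "00000nam a2200000 a 4500"  # hypothetical fallback leader


def marcxml_to_marc(record: ET.Element) -> bytes:
    """Serialize one MARCXML <record> as an ISO 2709 record."""
    fields = []  # (tag, field bytes including the field terminator)
    for cf in record.findall("m:controlfield", NS):
        fields.append((cf.get("tag"), (cf.text or "").encode("utf-8") + FT))
    for df in record.findall("m:datafield", NS):
        body = (df.get("ind1", " ") + df.get("ind2", " ")).encode("utf-8")
        for sub in df.findall("m:subfield", NS):
            body += SF + sub.get("code").encode("utf-8")
            body += (sub.text or "").encode("utf-8")
        fields.append((df.get("tag"), body + FT))

    # Directory: tag (3) + field length (4) + starting offset (5) per field.
    directory, data, offset = b"", b"", 0
    for tag, raw in fields:
        directory += f"{tag}{len(raw):04d}{offset:05d}".encode("ascii")
        data += raw
        offset += len(raw)

    base = 24 + len(directory) + 1        # leader + directory + terminator
    total = base + len(data) + 1          # plus the record terminator
    leader_el = record.find("m:leader", NS)
    text = leader_el.text if leader_el is not None and leader_el.text else DEFAULT_LEADER
    text = text.ljust(24)[:24]
    # Overwrite record length (0-4) and base address (12-16) with real values.
    leader = f"{total:05d}{text[5:12]}{base:05d}{text[17:20]}4500"
    return leader.encode("ascii") + directory + FT + data + RT


def join_marc(blobs: list[bytes]) -> bytes:
    """A MARC file is simply records laid end to end."""
    return b"".join(blobs)


# Hypothetical sample data, for illustration only.
SAMPLE = """<collection xmlns="http://www.loc.gov/MARC21/slim">
  <record>
    <leader>00000nam a2200000 a 4500</leader>
    <controlfield tag="001">example0001</controlfield>
    <datafield tag="245" ind1="0" ind2="0">
      <subfield code="a">An example botanical title</subfield>
    </datafield>
  </record>
</collection>"""

root = ET.fromstring(SAMPLE)
records = [marcxml_to_marc(r) for r in root.iter(f"{{{MARC_NS}}}record")]
joined = join_marc(records + records)     # pretend two files were merged
print(len(records[0]), "bytes per record;", joined.count(RT), "records after join")
```

Because each record carries its own length in the leader and ends with a record terminator, joining files needs no re-parsing, which is why step 3 is cheap once step 2 is done.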

24 XML file from OCA; after conversion to MARCXML
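
The before-and-after screenshots from this slide did not survive in the transcript, so the following is only an illustrative sketch of the kind of mapping such a conversion performs. The presenters used an XSLT style sheet inside MarcEdit; this stand-in uses Python's standard library instead. The input layout (a flat `<metadata>` element with `title`, `creator`, `publisher`, `date`, and `identifier` children), the sample values, and the choice of target fields (001, 100, 245, 260) are all assumptions, not the actual OCA file or the NCSU style sheet.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"
ET.register_namespace("", MARC_NS)       # serialize MARCXML without a prefix

# Hypothetical input; the real OCA XML layout may differ.
SAMPLE_META = """<metadata>
  <identifier>example0001</identifier>
  <title>An example botanical title</title>
  <creator>Example, Author</creator>
  <publisher>Example Press</publisher>
  <date>1901</date>
</metadata>"""


def add_datafield(record, tag, ind1, ind2, subfields):
    """Append a <datafield> with the given (code, value) subfields."""
    df = ET.SubElement(record, f"{{{MARC_NS}}}datafield",
                       {"tag": tag, "ind1": ind1, "ind2": ind2})
    for code, value in subfields:
        sf = ET.SubElement(df, f"{{{MARC_NS}}}subfield", {"code": code})
        sf.text = value


def meta_to_marcxml(meta_xml: str) -> ET.Element:
    """Map the assumed flat metadata file to a minimal MARCXML record."""
    meta = ET.fromstring(meta_xml)

    def get(name):
        return (meta.findtext(name) or "").strip()

    record = ET.Element(f"{{{MARC_NS}}}record")
    leader = ET.SubElement(record, f"{{{MARC_NS}}}leader")
    leader.text = "00000nam a2200000 a 4500"   # placeholder leader
    if get("identifier"):
        cf = ET.SubElement(record, f"{{{MARC_NS}}}controlfield", {"tag": "001"})
        cf.text = get("identifier")
    if get("creator"):
        add_datafield(record, "100", "1", " ", [("a", get("creator"))])
    if get("title"):
        # 245 first indicator: 1 when a main entry (100) is present, else 0.
        ind1 = "1" if get("creator") else "0"
        add_datafield(record, "245", ind1, "0", [("a", get("title"))])
    imprint = [(c, v) for c, v in (("b", get("publisher")), ("c", get("date"))) if v]
    if imprint:
        add_datafield(record, "260", " ", " ", imprint)
    return record


marc_record = meta_to_marcxml(SAMPLE_META)
print(ET.tostring(marc_record, encoding="unicode"))
```

A real style sheet would also handle punctuation, filing indicators, and fixed fields (008), which this sketch leaves out.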

