Presentation is loading. Please wait.

Presentation is loading. Please wait.

Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.

Similar presentations


Presentation on theme: "Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop."— Presentation transcript:

1 Web Archiving Service (WAS) Rosalie Lack rosalie.lack@ucop.edu Data Curation for Practitioners 2012 Workshop

2 Imagine a world …

3 This is our world …

4 WAS … is A service of the UC Curation Center to collect, manage, preserve and publish websites and documents.

5 WAS Snapshot 53 public archives 120+ archives total 7,500+ sites 50+ TB 23 institutions

6 WAS Institutions Institute of Governmental Studies Library, UCB UC Berkeley Office of Public Affairs UC Berkeley Libraries UC Davis Libraries UC Irvine Libraries UC Los Angeles Libraries UC Riverside Libraries UC San Diego Libraries UC San Francisco Libraries UC Santa Barbara UC Santa Cruz McHenry Library Emory University Library Institute for Research on Labor and Employment New York University Northwestern University Library Purdue University Stanford University Libraries Temple University University of Arkansas Libraries University of Illinois at Urbana Champaign Libraries University of Michigan, Bentley Historical Library USDA Economic Research Service Water Resources Collections and Archives

7 WAS Overview A) Curator Tools

8 Curator Workflow

9 1. Create Site Enter site name, URL and description Scope Capture frequency Robots.txt

10 2. Capture Sites

11 3. View Captures View captures QA Compare

12 4. Public Access Customize the archive Write description Create custom banner and icon

13 WAS Overview B) Public Archives

14 Web Archive ‘home page’

15 Browse: Site List + Tags

16 Search: All Sites in an Archive

17 Integration with your Systems

18 How are people using WAS?

19 Institution’s website Preserve intuitional history Capture university news and events

20 Geographically focused

21 Topical Support special research collections

22 Event Sudden action required May need many selectors Start date / end date

23 Researcher’s Perspective Building collections for research – Study the topic / event – Study site change or web-based communication – Websites are datasets for analysis and data mining Preservation of research – Archive grant-funded websites – Selected sites Create stable citations for publications

24 Get started! Each library has WAS administrator(s) Unlimited number of curators per account What’s the cost? – UC does not pay a service fee – Storage only: $1040/per TB (average site is $1.46/annually); storage costs to go down

25 Challenges Shared collection development Metadata issues Workflow and cost models for faculty projects Time! Limitations of web crawlers Websites are messy

26 Contact me! Rosalie Lack WAS Service Manager rosalie.lack@ucop.edu

27 www.votearriana.comwww.votearriana.com (2003)* *WAS 2013 California Recall Election Web Archive California http://webarchives.cdlib.org/a/carecall2003http://webarchives.cdlib.org/a/carecall2003

28 www.votearriana.comwww.votearriana.com (2012)


Download ppt "Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop."

Similar presentations


Ads by Google