Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.



Imagine a world …

This is our world …

WAS is …
A service of the UC Curation Center to collect, manage, preserve and publish websites and documents.

WAS Snapshot
53 public archives
120+ archives total
7,500+ sites
50+ TB
23 institutions

WAS Institutions
Institute of Governmental Studies Library, UCB
UC Berkeley Office of Public Affairs
UC Berkeley Libraries
UC Davis Libraries
UC Irvine Libraries
UC Los Angeles Libraries
UC Riverside Libraries
UC San Diego Libraries
UC San Francisco Libraries
UC Santa Barbara
UC Santa Cruz McHenry Library
Emory University Library
Institute for Research on Labor and Employment
New York University
Northwestern University Library
Purdue University
Stanford University Libraries
Temple University
University of Arkansas Libraries
University of Illinois at Urbana Champaign Libraries
University of Michigan, Bentley Historical Library
USDA Economic Research Service
Water Resources Collections and Archives

WAS Overview A) Curator Tools

Curator Workflow

1. Create Site
Enter site name, URL and description
Scope
Capture frequency
Robots.txt
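The Robots.txt setting above concerns whether a crawl honors a site's robots.txt rules. The slides do not describe how WAS applies this setting, so the following is only a minimal sketch of what honoring robots.txt means, using Python's standard library. The robots.txt content, the crawler name "example-crawler" and the example.edu URLs are all hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for an imaginary site (not from the slides).
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Allow: /
"""


def allowed_by_robots(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines directly, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# "example-crawler" is an illustrative name, not the crawler WAS uses.
news_ok = allowed_by_robots(
    SAMPLE_ROBOTS_TXT, "example-crawler", "https://www.example.edu/news/"
)
private_ok = allowed_by_robots(
    SAMPLE_ROBOTS_TXT, "example-crawler", "https://www.example.edu/private/report.pdf"
)
print("news page allowed:", news_ok)
print("private page allowed:", private_ok)
```

A crawl that honors robots.txt would skip the second URL; a crawl configured to ignore it would capture both.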

2. Capture Sites

3. View Captures
View captures
QA
Compare

4. Public Access
Customize the archive
Write description
Create custom banner and icon

WAS Overview B) Public Archives

Web Archive ‘home page’

Browse: Site List + Tags

Search: All Sites in an Archive

Integration with your Systems

How are people using WAS?

Institution’s website
Preserve institutional history
Capture university news and events

Geographically focused

Topical
Support special research collections

Event
Sudden action required
May need many selectors
Start date / end date

Researcher’s Perspective
Building collections for research
– Study the topic / event
– Study site change or web-based communication
– Websites are datasets for analysis and data mining
Preservation of research
– Archive grant-funded websites
– Selected sites
Create stable citations for publications

Get started!
Each library has WAS administrator(s)
Unlimited number of curators per account
What’s the cost?
– UC does not pay a service fee
– Storage only: $1,040 per TB (the average site costs $1.46 annually); storage costs are expected to go down
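The two storage figures above can be related with simple arithmetic. This is a rough sketch, assuming the per-TB rate is an annual rate and that 1 TB is 1,000 GB; the slide states neither. The 500 GB collection is a made-up example.

```python
# Figures quoted on the slide.
COST_PER_TB_USD = 1040.00   # storage rate per TB
AVG_SITE_COST_USD = 1.46    # quoted annual cost of an average site

# Assumption: decimal units (1 TB = 1,000 GB); the slide does not say.
GB_PER_TB = 1000


def annual_storage_cost(size_gb: float, cost_per_tb: float = COST_PER_TB_USD) -> float:
    """Estimate the yearly storage charge for a collection of size_gb gigabytes,
    assuming the per-TB rate is charged annually."""
    return size_gb / GB_PER_TB * cost_per_tb


# Average site size implied by the two quoted figures (about 1.4 GB).
implied_avg_site_gb = AVG_SITE_COST_USD / COST_PER_TB_USD * GB_PER_TB

# Hypothetical 500 GB collection, for illustration only.
example_cost = annual_storage_cost(500)

print(f"Implied average site size: {implied_avg_site_gb:.2f} GB")
print(f"Estimated annual cost for 500 GB: ${example_cost:.2f}")
```

Under these assumptions, the quoted $1.46 corresponds to an average site of roughly 1.4 GB.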

Challenges
Shared collection development
Metadata issues
Workflow and cost models for faculty projects
Time!
Limitations of web crawlers
Websites are messy

Contact me!
Rosalie Lack
WAS Service Manager

California (2003)*
*WAS 2003 California Recall Election Web Archive

(2012)