Creating Web Collections with Archive-It

Slides:



Advertisements
Similar presentations
1 Advanced Archive-It Application Training: Quality Assurance October 17, 2013.
Advertisements

1 What is the Internet Archive We are a Digital Library Mission Statement: Universal access to human knowledge Founded in 1996 by Brewster Kahle in San.
Archive What I See Now Mat Kelly, Michael L. Nelson, Michele C. Weigle Old Dominion University Web Science and Digital.
Looking Ahead Archive-It Partner Meeting November 18, 2014.
Looking Ahead Archive-It Partner Meeting November 12, 2013.
Facebook Presented by: Keystone Computer Concepts.
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
1 WEB ARCHIVING IN THE BRITISH LIBRARY John Tuck Head of British Collections February 2004.
1 Archive-It Training University of Maryland July 12, 2007.
1 Advanced Archive-It Application Training: Archiving Social Networking and Social Media Sites.
Web Archiving at the Innsbruck Newspaper Archive Innsbrucker Zeitungsarchiv / IZA Presentation by Renate Giacomuzzi, Elisabeth Sporer, Armin Schleicher.
Archive-It collection on “Occupy Movement 2011/2012” Archiving Web Content.
Joanne Archer University of Maryland Kate Odell Archive-It Abbie Grotke Library of Congress Tessa Fallon Columbia University Creating and Maintaining Web.
DuraCloud A service provided by Sandy Payette and Michele Kimpton.
Web The Internet Archive. Agenda Brief Introduction to IA Web Archiving Collection Policies and Strategies Key Challenges (opportunities for.
The Web is a Mess: or How I Learned to Stop Worrying and Love Web Archiving Lori Donovan, Internet Archive.
Web Capture team Office of strategic initiatives February 27, 2006 Selecting Content from the Web: Challenges and Experiences of the Library of Congress.
Finding and Using Information. Curating Content curation is the process of sorting through the vast amounts of content on the web and presenting it in.
The Invisible Web Cynthia Rooley Computer Research.
Medical Heritage Library. Mission Content-centered digital community Supporting research, education, dialog History of medicine contributing to understanding.
Office of Strategic Initiatives All Hands Meeting-March 2010 Challenges in Web Archiving: Library of Congress Edition Abbie Grotke, Web Archiving Team.
1 Archive-It: Archiving and Preserving Born Digital Content NDIIPP June 2009 Molly Bragg Partner Specialist Internet Archive.
Resource Curation and Automated Resource Discovery.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
Curator wishes for the roadmap november 2011 updates.
Archive What I See Now Mat Kelly, Michael L. Nelson, Michele C. Weigle Old Dominion University Web Science and Digital.
Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.
CyberCemetery Preserving At-Risk Government Web Content.
University of Texas Libraries Integrating Library Resources with Blackboard TBUG Conference, Fall 2006.
Metadata Extraction & Web Archives: Automating the Record Creation Process Abbie Grotke / Gina Jones /
1 Advanced Archive-It Application Training: Crawl Scoping.
The Boston TV News Digital Library: Partners WGBH Media Library and Archives (WGBH) Northeast Historic Film (NHF) Boston Public Library (BPL)
Building Collections on the Web BCWeb. What’s BCWeb ? BCWeb was developped entirely by the BnF for the content curators to replace its old selection tools.
WordPress for Beginners February 2, 2014 Facebook.
1 Advanced Archive-It Application Training: Reviewing Reports and Crawl Scoping.
What part of the URL tell the computer to find the server?
We will begin at 9 PM This is an Audio Seminar. Please be sure to adjust your audio. When reviewing the archived seminar this document will provide the.
HOW TO SET UP A WEBSITE. Why use WordPress? Nearly half of the websites on the Internet are running on the WordPress website platform It’s totally free.
Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.
Company Meeting Title Presenter.
Archiving & Preserving Digital Content
Workshop on Web Archiving
Joanne Archer University of Maryland Libraries
What's It Like To Work in the WS-DL Lab?
Michele C. Weigle and Michael L. Nelson
Challenges and Opportunities of Archiving the UK Web
Guided by: wpglobalsupportwpglobalsupport Embed a Facebook Video in WordPress Website.
Adding Post Type Archive in WordPress Navigation Menus Guided By: wpglobalsupportwpglobalsupport.
Build an Auction Site like eBay using WordPress Build an Auction Site like eBay using WordPress Guided By: wpglobalsupportwpglobalsupport.
Web 2.0 tools for your teaching and learning programme
Simple ways to create custom Facebook feeds in WordPress
Why Does Your Website Need a Sitemap?
OverDrive Digital Library Basics
Critical evaluation of websites
What Are Institutional Repositories?
We go Way Back: Libraries & Community Web Archiving
MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.
Latin American Government Documents Archive, LAGDA
Wisconsin County and Municipal Government Collections in Archive-It
Fast, free, fun Weebly web sites.
Web archive data and researchers’ needs: how might we meet them?
Is it a Good Web Site to use?
MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.
Márton Németh – László Drótos How to catalogue a web archive?
Internet Vocabulary Terms
Digital Resources.
Develop Your Web Presence Using WEEBLY
Metadata supported full-text search in a web archive
Brilliant. Sharp. Inspiring.
Presentation transcript:

Creating Web Collections with Archive-It Michele C. Weigle and Michael L. Nelson CS 891 – Web Archiving Seminar Fall 2017 @WebSciDL https://phonedude.github.io/cs891-f17/

Archive-It – archive-it.org Subscription service of the Internet Archive 400 partners 48 US states 16 countries Partners include College and University Libraries State Archives, Libraries, and Historical Societies Federal Institutions and NGOs Museums and Art Libraries Public Libraries, Cities and Counties Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar Collection Basics Collections are a group of URLs curated around a common theme, topic, or domain. Scope determines what the crawler will capture and what it won’t.  Scoping is the process of, and use of tools, to tell the crawler how to adjust the scope. This includes general scoping, as well as scoping for specific web platforms.   Crawling is the use of software, called crawlers, to visit websites and index the information included therein. Reviewing is the activity of evaluating completed captures.   Quality Assurance includes the use of tools and articles related to improving the quality of captures. Access is the step of sharing content, by either making is publicly available, or sharing the private collection link, if applicable. https://support.archive-it.org/hc/en-us/articles/115002187023-Archive-It-workflow Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar https://archive-it.org/collections/7760 Fall 2017 CS 891 - Web Archiving Seminar

Why Did I Want To Create a Collection? Fall 2017 CS 891 - Web Archiving Seminar

Baton Rouge Advocate / Aug 14, 2016 Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar Reuters.com / Aug 14, 2016 Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar New York Times / Aug 14, 2016 Much farther down the page… Fall 2017 CS 891 - Web Archiving Seminar

Capture Listing (old-style Wayback) Fall 2017 CS 891 - Web Archiving Seminar

Walkthrough of the Collection Pretty well-preserved - http://wayback.archive-it.org/7760/20160825224228/http://www.bbc.com/news/world-us-canada-37121404 Lots of damage, but text still available - http://wayback.archive-it.org/7760/20160818180320/http://www.actionnews17.com/headlines/sheriff-says-more-than-75-of-livingston-parish-is-total-loss-1673658 Captured more than just page-only - http://wayback.archive-it.org/7760/20160929184449/http://volunteerlouisiana.gov/ NY Times, list of recommended articles also preserved (provides context) - http://wayback.archive-it.org/7760/20160818180300/http://www.nytimes.com/2016/08/18/us/louisiana-flooding.html?_r=0 Flipagram (audio, pictures) not preserved - http://wayback.archive-it.org/7760/20160929184436/https://flipagram.com/f/mKQNY72zhI Storify - captured ok, but not all images/tweets replayed - http://wayback.archive-it.org/7760/20160831174031/https://storify.com/NOLANews/when-floodwaters-rise/ Facebook, but no video - http://wayback.archive-it.org/7760/20160929185158/https://www.facebook.com/Walker.Police/videos/10155108364813368/# Facebook – text-only post of first-hand account of what was going on during the flood - http://wayback.archive-it.org/7760/20160826124539/https://www.facebook.com/itsreininghorses/posts/10106070923387895/ Fall 2017 CS 891 - Web Archiving Seminar

My Collecting Experience http://ws-dl.blogspot.com/2016/12/2016-12-20-archiving-pages-with.html Fall 2017 CS 891 - Web Archiving Seminar

Backend – partner.archive-it.org Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar Seed List Group Status Frequency Type Access Last Crawl Captures Link to Wayback Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar Add a Seed Seed Types: Default scope (Standard): Embedded content captured Linked content to internal pages captured Linked content to external sites not captured https://support.archive-it.org/hc/en-us/articles/208332843-Assign-and-edit-a-seed-type- Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar Help Center https://support.archive-it.org/hc/en-us Fall 2017 CS 891 - Web Archiving Seminar

CS 891 - Web Archiving Seminar Lots of Good Resources User Guide - https://support.archive-it.org/hc/en-us/categories/201179946-Archive-It-User-Guide Resources - https://support.archive-it.org/hc/en-us/sections/202143863-Resources Archive-It Basics - https://support.archive-it.org/hc/en-us/articles/208111766-Archive-It-Trial-Basics Archive-It Crawling Technology - https://support.archive-it.org/hc/en-us/articles/115001081186-Archive-It-Crawling-Technology 5 Challenges of Web Archiving - https://support.archive-it.org/hc/en-us/articles/209637043-5-Challenges-of-Web-Archiving Archiving Facebook https://support.archive-it.org/hc/en-us/community/posts/115010479146-Important-Update- https://support.archive-it.org/hc/en-us/articles/208333113-Archiving-Facebook Fall 2017 CS 891 - Web Archiving Seminar