Harvesting and showing complicated sites using Archive-it: status for some of our tests from October 2014 to January 2015. January 2015. By Tue Hejlskov Larsen.

Presentation transcript:

Harvesting and showing complicated sites using Archive-it: status for some of our tests from October 2014 to January 2015
January 2015
By Tue Hejlskov Larsen, netarchive.dk

Archive-it (AIT) setup, January 2015
• Heritrix snapshot
• Umbra: all seed URLs in AIT are crawled using Umbra and Heritrix.
• Harvesting using "Only one page" from October 2014 to January 2015
• Following the help instructions here (apologies if I am missing some of them; AIT updates the instructions from time to time):
• Browser used with Wayback in proxy mode: Internet Explorer 9
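In proxy mode, the browser is configured to use the Wayback server as its HTTP proxy, so a request for the original URL is answered from the archive instead of the live web. A minimal sketch of the same idea outside a browser, using only the Python standard library. The host name, port and seed URL below are hypothetical placeholders, not values from these tests:

```python
import urllib.request


def build_proxy_opener(proxy_host: str, proxy_port: int) -> urllib.request.OpenerDirector:
    """Return an opener that sends every plain-HTTP request through the given proxy.

    The proxy is assumed (not confirmed by the slides) to be a Wayback
    instance running in proxy mode.
    """
    proxy_url = f"http://{proxy_host}:{proxy_port}"
    handler = urllib.request.ProxyHandler({"http": proxy_url})
    return urllib.request.build_opener(handler)


# Hypothetical usage (performs a network request, so it is left commented out):
# opener = build_proxy_opener("wayback-proxy.example.org", 8090)
# with opener.open("http://example.org/", timeout=30) as response:
#     archived_html = response.read()
```

The browser setting does the same thing: only the proxy address changes, while the URLs typed by the user stay the original ones.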

dumper-internationalt

dumper-internationalt
• AIT can harvest JavaScript includes (jsincludes) with articles

• AIT video player
• No comments
• Missing some images

• With video playback in place, but only with Firefox in proxy mode

• With tweets, images, video links

• No mouse-down paging

• Tiny URLs OK, e.g.

• Using AIT free-text search, I found posts/comments older than those shown on the page; there are some locale problems…

• With linked videos, not in place

• Images, posts and some comments
• Posts to page in full view
• History (mouse down)
• No view of comments
• No view of previous comments
• Using free-text search, I found comments that could not be shown on the page

it.org/4897/ /

• Images
• 2 times mouse-down paging
• No provenance top bar
• No full image
• No "show more" button

• Posts and images
• With big images
• No notes

• With video, not in place

ens-museum-for-kunst?projectId=art-project
• Images not in place
• No zoom
• No Street View

Comparison of display capabilities between Archive-it Wayback and NAS Wayback in proxy mode (AIT/NAS)

Complicated sites – Some Test Examples
• iframes/JS with articles
• video, comments, images, paging
• tweets, images, paging, video, short links
• posts/comments, images, paging
• images, paging
• posts/comments, images, videos, paging
• video
• for-kunst?projectId=art-project: street view, image list and zoom