Curator wishes for the roadmap november 2011 updates.

Slides:



Advertisements
Similar presentations
we present SLIDEPLAYER.US
Advertisements

we present SLIDEPLAYER.US
Polk County School Board Geographic Information Systems.
1 What is the Internet Archive We are a Digital Library Mission Statement: Universal access to human knowledge Founded in 1996 by Brewster Kahle in San.
Status and plans for the H3 release NetarchiveSuite 5.0.
SharePoint User Group Chicago: 1/24/2013 SharePoint 2013 Search Overview.
BnF projects and priorities On the collection side – Perform broad and focused crawls with a maximum of 100TB – Set up the legal deposit of ebooks.
How to Use LucidWorks Search
Looking Ahead Archive-It Partner Meeting November 12, 2013.
SEO Yearly Plan For 6 Keywords Basic SEO :10,000 per month Advanced: 15, 000 per month Super SEO: 20, 000 per month Complete SEO: 25, 000 per month *Prices.
CSC 101 Andrew Eng 03/28/06. Assingments Slide 1 - Slide show title, your name, class, and data Slide 2 - Podcast - Title, Very short definition, link.
1 ETT 429 Spring 2007 Microsoft Publisher II. 2 World Wide Web Terminology Internet Web pages Browsers Search Engines.
A field is a unit of information. Limit search by the title field.
Comparing Podcast, Blogs, Wiki Johana vallejo CSC101 3/28/2006.
Archive-It Architecture Introduction April 18, 2006 Dan Avery Internet Archive 1.
WELCOME TO THE AHIA CONNECTED COMMUNITY! HEALTHCARE INTERNAL AUDIT'S PROFESSIONAL THOUGHT LEADERSHIP COMMUNITY.
Recent approaches to capture web content, which Heritrix can’t harvest  Capturing Social Media  Screen filming of Rich Media  Project: Event crawl of.
The capture and preservation of websites at the National Library of New Zealand Gillian Lee Alexander Turnbull Library.
 SlideShare is the world's largest community for sharing presentations.  Besides presentations, SlideShare also supports documents, PDFs, videos & webinars.
1 Archive-It Training University of Maryland July 12, 2007.
1 © 2003 Cisco Systems, Inc. All rights reserved. Session Number Presentation_ID Cisco Technical Assistance Center (TAC) TAC Service Request Tool Overview.
By Jeerarat Boonyanit. As you can see I have chosen Cpanel for my server management tool. cPanel is a Linux based web hosting control panel that provides.
Annick Le Follic Bibliothèque nationale de France Tallinn,
Web Archiving at the Innsbruck Newspaper Archive Innsbrucker Zeitungsarchiv / IZA Presentation by Renate Giacomuzzi, Elisabeth Sporer, Armin Schleicher.
Meshups- embedding content from other websites, mostly maps: In netarchive: no map – just a ”black hole” – no solution netarkivet.
What is LinkedIn?  Launched in 2003  200 Million Users  Publically held company (LNKD)  December 2012 Q4 earnings $300 million  Most popular B2B Network.
Creating an Online Professional Presence Using Social Media.
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
Annick Le Follic Bibliothèque nationale de France Tallinn,
In addition to Word, Excel, PowerPoint, and Access, Microsoft Office® 2013 includes additional applications, including Outlook, OneNote, and Office Web.
INFO 344 Web Tools And Development CK Wang University of Washington Spring 2014.
Crawling Slides adapted from
NetarchiveSuite Sabine Schostag The Netarchive
WORDPRESS TECHNOLOGY BY AMEER. WELCOME INTRODUCTION WordPress is an Open Source software system used by millions of people around the world to create.
Recap for 2013 Virtual Fall GaIN Meeting Carolann Curry, MLIS, AHIP Reference & Document Delivery Librarian Mercer University Medical Library - Macon Anna.
New features Improved search capabilities If your search term matches an existing articles it will bring you directly.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
Iowa Head and Neck Protocols Managing Your Site: The Wiki.
What makes a good interactive resume? Click for detailed information Multimedia Navigation Communication.
Web Archiving: Avery Fisher Center for Music & Media Rhiannon Bettivia, Zack Lischer-Katz, Samantha Losben & Erica Wilson November 29, 2010 Digital Preservation.
McLean HIGHER COMPUTER NETWORKING Lesson 7 Search engines Description of search engine methods.
Harvesting and showing complicated sites using archive-it – status for some of our tests from October 2014 – January 2015 January 2015 By Tue Hejlskov.
NetarchiveSuite Meeting, BnF, Austria Updates and Plans for 2012 Michaela Mayr, Andreas P. Austrian National Library
DTCC Confidential DTCC Social Networking Policy Task Force January, 2008.
1 Video and flash harvesting. 2 Dailymotion, a special crawl Twice a year we crawl Dailymotion. But the model changes all the time… –The seed list contains.
January 2007Georgia Career Information CenterAdministrator’s Workbook Georgia Career Information System Administrative Training GCIS Portfolio Administration.
HTML5 Audio and Video. Slide 2 History Playing audio and video used to be something of a novelty You would embed a control (with the element) into your.
NetarchiveSuite Workshop, November 24, 2011, Paris 1 Austria Using Wayback for Access and QA Andreas P. Austrian National Library
A blog is a web log, a frequently updated website. Authors: Usually only one person - each post is one author's voice. Others can only leave comments.
1 Advanced Archive-It Application Training: Crawl Scoping.
Customer Hub Protect Your Content. What We’ll Be Talking About Customer Hub is a powerful content management system that is fully integrated with Infusionsoft.
© 2011 Delmar, Cengage Learning Chapter 10 Using ActionScript to Enhance User Experience.
Domain Names and Websites You Will Acquire You will be provided with these domain names and their websites: coolitv.comcoolitv.com, greatitv.com, internettvinstitute.com,
Building Collections on the Web BCWeb. What’s BCWeb ? BCWeb was developped entirely by the BnF for the content curators to replace its old selection tools.
2015 NetarchiveSuite Workshop Eesti Rahvusraamatukogu Tallinn, Estonia January
1 Advanced Archive-It Application Training: Reviewing Reports and Crawl Scoping.
Web Crawling and Automatic Discovery Donna Bergmark March 14, 2002.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
June 30, 2005 Public Web Site Search Project Update: 6/30/2005 Linda Busdiecker & Andy Nguyen Department of Information Technology.
+ Responsive Technology Performance, efficiency and elegance are the three key elements that make our platform unique. Each of the features in this presentation.
Best 20 jobs jobs sites.
Use cases for BnF broad crawls Annick Lorthios. 2 Step by step, the first in-house broad crawl The 2010 broad crawl has been performed in-house at the.
Types and purposes of online communities. Types of websites within online communities blogs chat rooms forums social networking wikis.
Google webmaster tools.  Webmaster is one or more person who is responsible to create one or more sites.  Google webmaster is now changed and called.
Pinterest Clone Features
AppDB current status and proposed extensions
Creating Web Collections with Archive-It
MSC photo:  It was taken some time in the late 1930s, but we don’t have an exact date.  The college was known as MSC from 1925 until 1955 when we became.
Cooperative & Experiential Education
Webarchive Austria NetarchiveSuite Meeting Madrid 2019
Presentation transcript:

Curator wishes for the roadmap november 2011 updates

Top Priorities regarding NAS Display seed list changes to replace SB wiki domain history pages Hide unused harvest definitions and seed lists by adapting the domain page Give the ability to search more than only domains names (comments, number of objects/bytes, crawler traps…)

Top priorities related complex harvesting Use the wiki and share solutions to better crawl : –Audio, videos, flash –Facebook, Twitter –Password protected sites

Priorities 2 Give the ability to easily exclude domains out the snapshot harvest, make it visible on the domain level Give the ability to associate crawler traps either to all or to one or several harvest definitions Further functionalities of the ViewerProxy –make it more informative, display the running processes on the UI –add a flag on the jobs which have been indexed

Priorities 2 Give acces (from a harvest definition) to a history page including a recap on harvested URL and bytes on all domains + alarms/colors to easily detect changes

Tests BCWeb nomination tool to see potential usage NetarchiveSuite as a single and integrated working tool => use of the extended fileds