Download presentation
Presentation is loading. Please wait.
Published byElmer Crowl Modified over 9 years ago
1
BnF projects and priorities
2
2014 On the collection side – Perform broad and focused crawls with a maximum of 100TB – Set up the legal deposit of ebooks (not the same IT team) – Design an organization for harvesting of news websites (PDFs) On the preservation / back office side – Move to WARC format – Replace othe Petabox architecture – Improve the performance of ingest in SPAR (BnF digital repository) On the access side – Give access in regional libraries (start with 3) – Launch a data mining project around WWI On the international side – Contribute to NAS and Wayback developement – Open source BCWeb
3
2015 On the collection side – Perform broad and focused crawls with a maximum of ? Tb – Legal deposit of ebooks (not the same IT team) – Extend the number of news websites (PDFs) / experiment with digital newspapers deposit? – Crawl YouTube and Vimeo? On the preservation – Ingest « historical » web archive collections in SPAR On the access side – Extend access to web archives in regional libraries – Redesign indexing processes : better search, FT search (???)
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.