DIY: Copyright – Show your work Document copyright research to justify your usage, and to show that you acted professionally in trying to locate rightsholders.
PERMISSION TO DIGITIZE
DIY: Copyright - Guidelines
Securing permission: consent forms Organizational leaders: may think they have to sign over copyright may be afraid to sign something will likely seek broader approval
Securing permission: consent forms Consent agreement should be clear on copyright Be clear how content will be used If you already have a consent form, make sure it applies to new projects For consent agreement questions, consult an attorney.
EBOOKS DISSECTED & DIGITIZED
ePub as zip file
ebook markup HTML & CSS
Everything has been digitized, right? Bad OCR: hours, fractions Scanned Digitized Corrected WPPL Epub page
Homer ebook project http://bookscanner.pbworks.com/w/page/40965440/FrontPage
Homer The following tools are installed as part of the Homer Project: ImageMagick (for manipulation images) Jpegtran (loseless jpeg transformation) JBIG2 encoder (compression tool for bi-level images) Tesseract-OCR (optical character recognition) RubyInstaller (installs the Ruby programming language) Hpricot (HTML parser) RMagick (interface between the Ruby programming language and ImageMagick) Pdfbeads (to create searchable PDF) Cmdow.exe (command-line utility used in Homer) ScanTailor (post-processing tool) Homer (command-line bash script)
Ebook Production Workflow
Homer: ScanTailor Preprocess tiff-format images of book pages Deskewing De-speckling Correcting warp Right-to-left language support Outputs images for Homer
HOMER BASH SCRIPT It looks like command-line…
HOMER BASH SCRIPT but its drag-and- drop!!!
Homer: tesseract-ocr Optical Character Recognition Multilingual support - From Afrikaans to Vietnamese
Homer: pdfbeads Outputs a searchable PDF
Homer & pdfbeads Outputs a searchable PDF
Epub Validator http://validator.idpf.org/
Drupal Open source content management system Widely used in libraries Drupal 7 Responsive layout drupal.org
Drupal Ability to create custom fields for metadata – can be hidden from users
Original content: We can help organizations produce their own cookbooks Work with organizations to produce ebook versions…but
The Community Cookbook – whats next? …with one more open-source tool, we can even help them design print versions: We can do everything but the printing.
Its an exciting possibility… for the future of libraries that there is value to be mined from content already in our communities.
Even more exciting is the thought that the most valuable content to libraries is content from our communities that hasnt been created yet.
Further Reading Jarret Buse - A Hands-on Guide to EPUB2 and EPUB3 Excellent guide to the guts of ebooks Features many of the open-source programs I have discussed
Further Reading Stanford University: Copyright & Fair Use – Charts and Tools http://fairuse.stanford.edu/charts-and-tools/
Image credits Open Source Sign Timothy Appnel - https://www.flickr.com/photos/tappnel/5798812875/ https://www.flickr.com/photos/tappnel/5798812875/ Librarian from Turn of the Century - http://www.moyak.com/researcher/Clients/male_librarians/ind ex.html?id=34 http://www.moyak.com/researcher/Clients/male_librarians/ind ex.html?id=34 Ereaders - Michael Porter https://www.flickr.com/photos/libraryman/5052936803/ https://www.flickr.com/photos/libraryman/5052936803/ Apples & oranges http://mrg.bz/n1xLHghttp://mrg.bz/n1xLHg
Image credits Techno_background2.jpg (ones and zeroes) http://www.morguefile.com/creative/Grafixar http://www.morguefile.com/creative/Grafixar Pile of books with lock: Librarian in Black - http://librarianinblack.net/librarianinblack/2011/12/overdrive.ht ml http://librarianinblack.net/librarianinblack/2011/12/overdrive.ht ml Ricoh Copier: http://www.itinstock.com/ekmps/shops/itinstock/images/ricoh- aficio-mp-4001-fast-photocopier-copier-printer-scan-fax-5598- p.jpg http://www.itinstock.com/ekmps/shops/itinstock/images/ricoh- aficio-mp-4001-fast-photocopier-copier-printer-scan-fax-5598- p.jpg