Translation of PDF and ODF documents to Braille Ian Ball 12413959 Supervisor: Iain Murray.

What is Braille? Braille is a tactile writing system used primarily by blind people. Each Braille character (cell) consists of up to six dots arranged in two columns of three. For example, the letter “h” is formed by raising dots 1, 2 and 5. There is also “Grade 2” Braille, a contracted form that is similar to shorthand.
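As a rough illustration of the six-dot cell, the sketch below maps dot numbers onto the Unicode Braille Patterns block, where dot n sets bit n-1 above U+2800. The function name `cell_from_dots` is hypothetical and is not part of the program presented here.

```python
def cell_from_dots(dots):
    """Return the Unicode Braille character for a set of raised dots (1-6)."""
    code = 0x2800  # U+2800 is the blank Braille cell
    for dot in dots:
        if not 1 <= dot <= 6:
            raise ValueError(f"dot number out of range: {dot}")
        code |= 1 << (dot - 1)  # dot n is stored in bit n-1
    return chr(code)


# The letter "h" is dots 1, 2 and 5.
print(cell_from_dots({1, 2, 5}))
```

Dots 1 to 3 run down the left column and dots 4 to 6 down the right, so the same helper covers every six-dot cell.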

What is PDF? Portable Document Format (PDF) is a document format developed by Adobe that is designed to represent documents in a platform-independent manner. It is a de facto standard for distributing documents. Since Mac OS X 10.4, Apple has provided a set of libraries called “PDF Kit” that allow PDF documents to be manipulated and their contents to be extracted.

What is ODF? OASIS Open Document Format (ODF) is a document format for representing text documents, spreadsheets, presentations, graphics and mathematical formulae. It is an XML-based format: a document consists of several XML files inside a ZIP archive (the same container that JAR files use). It is standardised by ISO as ISO/IEC 26300.
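To make the package layout concrete, here is a minimal sketch of pulling paragraph and heading text out of an ODF file. It assumes the body lives in `content.xml` inside the archive and looks only at `text:p` and `text:h` elements; the function name `extract_odf_text` is hypothetical, and the program described in these slides used the Cocoa XML classes rather than Python.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace of the ODF "text" vocabulary.
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"


def extract_odf_text(source):
    """Return the text of each paragraph and heading in an ODF package.

    `source` is a path or a binary file-like object. Elements such as
    text:s (repeated spaces) and text:tab are ignored in this sketch.
    """
    with zipfile.ZipFile(source) as package:
        root = ET.fromstring(package.read("content.xml"))
    wanted = {f"{{{TEXT_NS}}}p", f"{{{TEXT_NS}}}h"}
    # itertext() also collects text nested in child elements such as text:span.
    return ["".join(el.itertext()) for el in root.iter() if el.tag in wanted]
```

Recognising more elements would mean extending the `wanted` set and handling each new element's content rules.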

Computerised Braille Translation Many products exist for computerised Braille translation, but they are very expensive. There is a lack of software that translates PDF or ODF documents to Braille. There is no other software that allows Braille conversion on the Mac OS X platform.

Computerised Braille Translation There is an online service called RoboBraille that can convert text documents and Microsoft Word documents to Braille free of charge. However, it is not possible to edit the document after it has been converted. Most commercial packages do not allow “on-the-fly” conversion, where the text can be edited and the conversion performed immediately.

Requirements The program must translate the text contents of PDF and ODF documents to Braille. The user must be able to edit the text imported from the document, save it as a text file, and load the saved file again. The Braille must also be able to be output on a Braille embosser.

Translation Program The translation program used was developed by Daniel Lee (Braille Translation Program, 2006). It uses a translation algorithm developed by Shannon Thrower (Grade 2 Braille-Print Translation on the OMAP Microprocessor, 2005).

Screen shot

Design The user selects the file to open from the “Open File” dialogue; its text is then translated word by word into Braille. As each word is translated, it is inserted into the user interface. An object-oriented approach was taken, with each document type represented as a class.
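The class-per-document-type design could look roughly like the sketch below. All class and function names are hypothetical, the PDF and ODF extractors are left as stubs, and the original program was written in Objective-C, so this shows only the general shape rather than the actual code.

```python
from abc import ABC, abstractmethod
from pathlib import Path


class Document(ABC):
    """Common interface: every document type knows how to yield its text."""

    def __init__(self, path):
        self.path = Path(path)

    @abstractmethod
    def extract_text(self):
        """Return the document's text content as a single string."""

    def words(self):
        # Word-by-word translation starts from this whitespace split.
        return self.extract_text().split()


class PlainTextDocument(Document):
    def extract_text(self):
        return self.path.read_text(encoding="utf-8")


class PdfDocument(Document):
    def extract_text(self):
        raise NotImplementedError("PDF extraction is outside this sketch")


class OdfDocument(Document):
    def extract_text(self):
        raise NotImplementedError("ODF extraction is outside this sketch")


# Assumed mapping from file extension to document class.
DOCUMENT_TYPES = {
    ".pdf": PdfDocument,
    ".odt": OdfDocument,
    ".odp": OdfDocument,
    ".txt": PlainTextDocument,
}


def open_document(path):
    """Pick a document class from the extension; unknown types are read as plain text."""
    suffix = Path(path).suffix.lower()
    return DOCUMENT_TYPES.get(suffix, PlainTextDocument)(path)
```

With this layout, supporting a new format means adding one subclass and one entry in the mapping, while the translation and user-interface code stay unchanged.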

Use-Case Diagram

Opening and Saving When a file is opened, the text contents are extracted from the file. Each word is then translated one at a time to Braille, and then inserted into the user interface. When the text is saved, two files are created. One contains the plain text representation, and the other contains the ASCII Braille representation.
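The open-and-save flow can be sketched as follows. `translate_word` is a placeholder that only upper-cases its input; it is not the algorithm by Lee or Thrower and performs no Grade 2 contraction. The `on_word` callback stands in for inserting each word into the user interface, and the `.txt` and `.brl` extensions are assumptions made for illustration.

```python
from pathlib import Path


def translate_word(word):
    # Placeholder only: a real translator would apply Braille rules here.
    return word.upper()


def translate_text(text, on_word=None):
    """Translate one word at a time, reporting each result as it is produced."""
    braille_words = []
    for word in text.split():
        braille = translate_word(word)
        if on_word is not None:
            on_word(word, braille)  # e.g. append the word to a text view
        braille_words.append(braille)
    return " ".join(braille_words)


def save_translation(text, base_path):
    """Write the plain text and its ASCII Braille form to two sibling files."""
    base = Path(base_path)
    text_file = base.with_suffix(".txt")
    braille_file = base.with_suffix(".brl")
    text_file.write_text(text, encoding="utf-8")
    # Characters outside ASCII are replaced rather than raising an error.
    braille_file.write_text(translate_text(text), encoding="ascii", errors="replace")
    return text_file, braille_file
```

Keeping the plain text alongside the Braille output lets the saved file be reloaded and edited later, as the requirements ask.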

Supported Formats The translation program will convert files of the following types: .pdf (Portable Document Format), .odt (Open Document Text), .odp (Open Document Presentation) and .txt (Plain Text). In addition, other plain text files such as programming code (.c etc.) can be translated.

Document Formatting The translation works best on documents with a simple structure. Formatting of the document cannot be preserved (for example: bold, underline, centring, etc.). Headers and footers should be avoided, since their text will not be distinguished from the body text. Graphics are obviously not translated :)

Implementation The program was implemented in Objective-C using Xcode on Mac OS X. PDF text extraction was done using PDF Kit, supplied as part of the Cocoa libraries. XML parsing was done using the XML parsing classes supplied as part of Cocoa.

Future Developments Add support for additional document formats, such as Microsoft's Office Open XML. Improve support for Open Document Format documents by increasing the number of XML elements recognised.

Questions?