Requirements of End User Defined Characters & Some Frequently Used Solutions
Chinese Foundation for Digitization Technology (CMEX)
Phobos Chang

Agenda
The Need For End User Defined Characters
When to Use EUDC
Embedded Resources
Multi-Typefaces Support
Mapping Information for Further Use
Frequently Used Solutions

The Need For End User Defined Characters
The first widely accepted Traditional Chinese encoding defines only 13,053 hanzi characters.
  – Some characters used in addresses are not included.
  – Characters used in personal names are not fully covered.
      Even the Prime Minister's name (only three characters) is missing one character.
      Tax collection
      Fortune tellers' issue
  – Characters used in ancient books and in the names of historical figures are missing.
  – EUDC support has been needed from day one, starting in the MS-DOS era.

When to Use EUDC
When the fonts bundled with an EPUB device at purchase cannot display a character.
GNU Unifont 5.1
  – A project started in 1998 by Roman Czyborra
  – Covers the Basic Multilingual Plane (BMP) of the Unicode 5.1 standard
  – Began as a bitmap font (8x16 or 16x16), later converted to TrueType
  – The priority is a glyph for every BMP code point; aesthetics come second
  – Sample

Hardware Resource Limitation
Why does CMEX suggest the BMP as a minimum?
  – The BMP includes 27,484 normalized hanzi characters
  – Supplementary characters are too numerous for low-end devices
      CJK Unified Ideographs Extension B alone contains 42,711 characters
      Surrogate support is not yet widespread
  – Not every book uses code points beyond the BMP; books that need EUDC support are few
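To make the surrogate point concrete, here is a minimal Python sketch of how a character beyond the BMP is split into a UTF-16 surrogate pair. The function names are hypothetical and chosen only for illustration; the arithmetic is the standard UTF-16 encoding rule.

```python
BMP_MAX = 0xFFFF  # last code point of the Basic Multilingual Plane


def is_supplementary(ch: str) -> bool:
    """Return True if the single character lies beyond the BMP."""
    return ord(ch) > BMP_MAX


def utf16_surrogates(ch: str) -> tuple[int, int]:
    """Return the (high, low) UTF-16 surrogate pair for a supplementary character."""
    cp = ord(ch)
    if cp <= BMP_MAX:
        raise ValueError("BMP characters do not need surrogates")
    offset = cp - 0x10000          # 20-bit value to split in two
    high = 0xD800 + (offset >> 10)  # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)  # bottom 10 bits
    return high, low


# U+20000 is the first code point of CJK Unified Ideographs Extension B.
first_ext_b = "\U00020000"
pair = utf16_surrogates(first_ext_b)
```

A device that handles only 16-bit code units sees two separate units here, which is why characters beyond the BMP need explicit support.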

Requirement 1
Definition of EUDC
  – End User Defined Characters are characters whose interpretation is not specified by the current Unicode standard, plus characters whose interpretation is specified by the Unicode standard but whose assigned code points lie outside the BMP.
Requirement One
  – Any character that is not defined in the current Unicode standard, or that is defined there but has a code point outside the Unicode BMP, can be used in any EPUB document via EUDC support.
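The definition above can be approximated in a few lines of Python. This is only a sketch under stated assumptions: "the current Unicode standard" is stood in for by whatever Unicode database ships with the Python interpreter, and "interpretation not specified" is taken to mean the general categories Cn (unassigned) and Co (private use). The function name is hypothetical.

```python
import unicodedata

BMP_MAX = 0xFFFF


def is_eudc(ch: str) -> bool:
    """Approximate the slide's EUDC definition for a single character."""
    if ord(ch) > BMP_MAX:
        # Specified by Unicode or not, anything outside the BMP counts as EUDC.
        return True
    # Assumption: unassigned (Cn) and private use (Co) code points are the
    # ones whose interpretation Unicode does not specify.
    return unicodedata.category(ch) in {"Cn", "Co"}


def eudc_in_text(text: str) -> list[str]:
    """List the distinct EUDC characters in a text as U+XXXX strings, in order."""
    seen: list[str] = []
    for ch in text:
        label = f"U+{ord(ch):04X}"
        if is_eudc(ch) and label not in seen:
            seen.append(label)
    return seen
```

For example, `eudc_in_text("中\U00020000\ue000")` reports the Extension B character and the private use character but not the ordinary BMP hanzi.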

Embedded Resources & Requirement 2
Not every EPUB device has wireless connectivity.
Devices that do have wireless connectivity may be carried to a location without coverage, such as a basement.
EUDC support should work in such circumstances.
Requirement Two
  – For any EPUB document that contains EUDC, all resource files needed to display the EUDC can be embedded inside the EPUB zip file.
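As a rough illustration of Requirement Two, the sketch below packs a font resource into a zip container using Python's standard zipfile module. It is not a complete EPUB: the container and package files are omitted, and the path OEBPS/fonts/ and the file name eudc.ttf are assumptions made for the example, not names taken from the slides.

```python
import io
import zipfile


def build_container_with_font(font_bytes: bytes, font_name: str = "eudc.ttf") -> bytes:
    """Return the bytes of a zip archive carrying an embedded EUDC font."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        # EPUB expects the mimetype entry first and stored without compression.
        archive.writestr("mimetype", "application/epub+zip",
                         compress_type=zipfile.ZIP_STORED)
        # Hypothetical location for the embedded font resource.
        archive.writestr(f"OEBPS/fonts/{font_name}", font_bytes,
                         compress_type=zipfile.ZIP_DEFLATED)
    return buffer.getvalue()


# Placeholder bytes stand in for a real font file in this sketch.
package = build_container_with_font(b"placeholder font data")
```

Because the font travels inside the archive, a reader needs no network access to render the EUDC glyphs.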

Multi-Typefaces Support
Some EPUB devices let users choose which typeface to use for display.
For example, Song (細明體) and Kai (楷體) are the two most commonly used typefaces in Traditional Chinese.
  – Displaying EUDC in either typeface requires a separate resource for each.
Requirement Three
  – It would be better to provide a mechanism for assigning a corresponding resource that supports EUDC display for each font used in an EPUB document.

Mapping Information for Further Use
What if an EPUB device does not support EUDC? Provide useful information for later processing.
Requirement Four
  – It would be better to embed mapping information for all EUDC used inside an EPUB document.
  – When mapping information is embedded inside an EPUB document, for EUDC that are interpreted by the Unicode standard but lie beyond the BMP, the mapping information should contain the corresponding code point, such as U, for each character;
  – for EUDC that are not interpreted by the Unicode standard, the mapping information should contain a useful reference coding scheme, such as TF-2121 used in Taiwan's CNS11643 standard.
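One possible shape for such mapping information is sketched below in Python. The record layout, the field names, and the choice of private use code points U+E000 and U+E001 as keys are all assumptions made for illustration; the slides do not prescribe a format. The reference code TF-2121 is the example given above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EudcMapping:
    """Hypothetical mapping record for one EUDC used in a document."""
    document_code_point: int                   # code point used inside the document
    unicode_code_point: Optional[int] = None   # set when Unicode defines the character
    reference_scheme: Optional[str] = None     # e.g. "CNS11643" when Unicode does not
    reference_code: Optional[str] = None       # code within that scheme

    def describe(self) -> str:
        source = f"U+{self.document_code_point:04X}"
        if self.unicode_code_point is not None:
            return f"{source} -> U+{self.unicode_code_point:04X}"
        return f"{source} -> {self.reference_scheme}:{self.reference_code}"


mappings = [
    # A character defined by Unicode but located beyond the BMP.
    EudcMapping(0xE000, unicode_code_point=0x20000),
    # A character not defined by Unicode, identified by a reference scheme.
    EudcMapping(0xE001, reference_scheme="CNS11643", reference_code="TF-2121"),
]
descriptions = [m.describe() for m in mappings]
```

A later processing step could read these records and substitute the real code point once the device or a newer Unicode version supports it.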

Use of Private Use Area
Most EUDC solutions in Taiwan are PUA-centric.
  – Input within the Input Method Environment
  – Display in every application
  – Printing
Pros
  – Easy to use when authoring
  – Much more straightforward
Cons
  – Code point ranges must be checked when rendering
  – Unicode normalization
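The range check mentioned under Cons might look like the following Python sketch. The three ranges are the private use areas defined by Unicode (one in the BMP, plus planes 15 and 16); the function names are hypothetical.

```python
# Private use ranges defined by Unicode, as inclusive (start, end) pairs.
PUA_RANGES = (
    (0xE000, 0xF8FF),       # Private Use Area in the BMP
    (0xF0000, 0xFFFFD),     # Supplementary Private Use Area-A (plane 15)
    (0x100000, 0x10FFFD),   # Supplementary Private Use Area-B (plane 16)
)


def is_private_use(ch: str) -> bool:
    """Return True if the single character falls in a private use range."""
    cp = ord(ch)
    return any(start <= cp <= end for start, end in PUA_RANGES)


def find_private_use(text: str) -> list[tuple[int, str]]:
    """Return (index, U+XXXX) for every private use character in the text."""
    return [(i, f"U+{ord(ch):04X}")
            for i, ch in enumerate(text) if is_private_use(ch)]


hits = find_private_use("中\ue000文")
```

A renderer could run a check like this on each text run to decide when to switch to the EUDC font resource instead of the regular typeface.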

Frequently Used Solutions
In-line Images
Java Applet for EUDC Display and Input
EUDC Display using Ajax
Embedded OpenType Font (EOT)
sIFR
Web Open Font Format

Embedded OpenType Font
Designed by Microsoft
Submitted to W3C in 2007 as part of CSS3 and rejected
Re-submitted to W3C in 2008 as a standalone submission
IE only
Not widely accepted, even in Taiwan

sIFR
Scalable Inman Flash Replacement
Open Source
JavaScript and Adobe Flash

Web Open Font Format
Developed in 2009
A strong favorite for standardization by the W3C Web Fonts Working Group
Vendor Support
  – Firefox since 3.6
  – Microsoft IE 9
  – WebKit