Creating User Interfaces [Catch up presentations]. Language. Localization. Homework: Work on teaching projects. Post comments on source for localization, language

Localization Create web site for use in several specific 'local' places. Create multiple web sites, each for use in specific place. –in an efficient, effective manner so any underlying common content does not need to be duplicated (and commonality diluted). Develop tools (networking s/w, standards, etc.) that promote Web as "global, interoperable tool of communication"

International / Global / ? So-called localization is more than just language –Alphabets –Idiom Other issues –Dates –Layout (flow on page) –Money –Cultural issues ?? –More?
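The point about dates can be made concrete with a small sketch. The table below is a hypothetical, hand-written illustration of numeric date conventions for four locales (the names DATE_PATTERNS and format_date are made up for this example); a real site would normally take such patterns from a maintained locale database rather than hard-coding them.

```python
from datetime import date

# Hypothetical per-locale patterns, for illustration only.
DATE_PATTERNS = {
    "en-US": "{m:02d}/{d:02d}/{y}",   # month first
    "en-GB": "{d:02d}/{m:02d}/{y}",   # day first
    "de-DE": "{d:02d}.{m:02d}.{y}",   # day first, dots as separators
    "ja-JP": "{y}/{m:02d}/{d:02d}",   # year first
}


def format_date(day: date, locale_tag: str) -> str:
    """Render the same date using the convention of one locale."""
    pattern = DATE_PATTERNS[locale_tag]
    return pattern.format(y=day.year, m=day.month, d=day.day)


sample = date(2013, 3, 4)
for tag in DATE_PATTERNS:
    print(tag, format_date(sample, tag))
```

The same stored date reads as March 4 in one place and could be mistaken for April 3 in another, which is why the format has to follow the audience.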

Class exercise What is the word in that language for –Spanish Latin America, Spain, USA, … –Portuguese –Chinese (Mandarin? Hainanese?) –Korean –Japanese –Hebrew –Russian –French –Arabic (Classical, today) What is the direction of text? What is the format for dates? Time? Money? Relevant cultural issues?

Example There can be differences: Aymara example concerning time Spanish double negative –Double positive?

Globalization techniques (globalisation) Separate content from formatting and programming (behavior) –Content: separate text checklists
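One way to picture "separate content from programming" is a message catalog: the text lives in a per-language table and the code only refers to keys. This is a minimal sketch; MESSAGES, DEFAULT_LANG and translate are names invented for the example, not part of any particular framework.

```python
# Content: one table of text per language, kept apart from the logic.
MESSAGES = {
    "en": {"greeting": "Welcome, {name}!"},
    "es": {"greeting": "\u00a1Bienvenido, {name}!"},
    "fr": {"greeting": "Bienvenue, {name} !"},
}
DEFAULT_LANG = "en"


def translate(key: str, lang: str, **values: str) -> str:
    """Look up a message by key, falling back to the default language."""
    catalog = MESSAGES.get(lang, MESSAGES[DEFAULT_LANG])
    template = catalog.get(key, MESSAGES[DEFAULT_LANG][key])
    return template.format(**values)


# Behavior: the program never contains the displayed text itself.
for lang in ("en", "es", "fr", "de"):
    print(lang, translate("greeting", lang, name="Ana"))
```

Adding a language then means adding a table, not touching the program; the unsupported "de" request falls back to English here.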

Bits and bytes How many different patterns can be held in a bit string –4 bits long –7 –8 –16
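Each added bit doubles the number of patterns, so a bit string of length n can hold 2 to the power n different patterns. A short check of the four lengths on the slide (pattern_count is just a helper name for this sketch):

```python
def pattern_count(bits: int) -> int:
    """Number of distinct patterns in a bit string of the given length."""
    return 2 ** bits


for bits in (4, 7, 8, 16):
    print(f"{bits:2d} bits -> {pattern_count(bits)} patterns")
```

The 7-bit and 8-bit answers (128 and 256) are the sizes that matter for ASCII and for single-octet character codes.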

Character code Note: European languages plus several other 'small' alphabets easily handled. We/I (typical monolingual American) can hardly appreciate the challenge: –two Chinese character sets (hanzi; called kanji in Japanese): simplified (China) and traditional (Taiwan + most of the Chinese diaspora) –'ruby': symbols 'over' ideographs

character repertoire: A set of distinct characters. character code: A mapping, often presented in tabular form, which defines a one-to-one correspondence between characters in a character repertoire and a set of nonnegative integers.
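As a sketch of these two definitions, the snippet below takes a tiny repertoire of four characters and uses Python's built-in ord and chr, which expose the Unicode character code, to build the mapping in both directions. The variable names are chosen for this example only.

```python
# A small character repertoire: Latin A, e with acute, Cyrillic zhe, a Han character.
repertoire = ["A", "\u00e9", "\u0436", "\u4e2d"]

# The character code: each character maps to one nonnegative integer.
code = {ch: ord(ch) for ch in repertoire}

# One-to-one: no two characters share a number, so the mapping can be inverted.
inverse = {number: chr(number) for number in code.values()}

for ch, number in code.items():
    print(ch, number, inverse[number] == ch)
```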

Characters, more character encoding: A method (algorithm) for presenting characters in digital form by mapping sequences of code numbers of characters into sequences of octets. In the simplest case, each character is mapped to an integer in the range 0–255 according to a character code and these are used as such as octets. Naturally, this only works for character repertoires with at most 256 characters. For larger sets, more complicated encodings are needed. Encodings have names, which can be registered.
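To see the difference between the simplest case and a "more complicated" encoding, the sketch below encodes the same four characters three ways with Python's standard codecs. The sample word and variable names are chosen for illustration.

```python
text = "caf\u00e9"  # four characters; the last is e with acute (code 233)

# Simplest case: one octet per character, possible because every code is below 256.
latin1_octets = text.encode("iso-8859-1")

# Encodings for larger repertoires spend more than one octet on some characters.
utf8_octets = text.encode("utf-8")
utf16_octets = text.encode("utf-16-be")

print(len(latin1_octets), list(latin1_octets))
print(len(utf8_octets), list(utf8_octets))
print(len(utf16_octets), list(utf16_octets))

# A character outside the 256-character repertoire cannot use the simple encoding.
try:
    "\u4e2d".encode("iso-8859-1")
    fits_in_latin1 = True
except UnicodeEncodeError:
    fits_in_latin1 = False
print("Han character fits in ISO 8859-1:", fits_in_latin1)
```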

charset Using the terms just defined, the charset attribute in an HTML meta tag means encoding
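Why the declared charset matters can be sketched by decoding the same octets under two different labels, which is roughly what a browser does when the label is right or wrong. The sample text is an assumption made for this example.

```python
# Octets as a server might send them for a page saved in UTF-8.
octets = "caf\u00e9".encode("utf-8")

# Decoded with the encoding that was actually used: the text comes back intact.
declared_correctly = octets.decode("utf-8")

# Decoded under a wrong label: each of the two octets of the accented
# letter is treated as a separate ISO 8859-1 character.
declared_wrongly = octets.decode("iso-8859-1")

print(declared_correctly, len(declared_correctly))
print(declared_wrongly, len(declared_wrongly))
```

The garbled second result is the familiar symptom of a page whose charset declaration does not match how the file was saved.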

Language Attribute of html tag MAY be used by browsers (spell-check, hyphenation, speech synthesizers), search engines, other tools. See two-letter codes: m
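A tool that wants to use the language attribute first has to read it. The sketch below does that with Python's standard html.parser module; the class LangFinder, the function document_language and the sample page are made up for this example.

```python
from html.parser import HTMLParser
from typing import Optional


class LangFinder(HTMLParser):
    """Remember the lang attribute of the first html start tag seen."""

    def __init__(self) -> None:
        super().__init__()
        self.lang: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag.
        if tag == "html" and self.lang is None:
            self.lang = dict(attrs).get("lang")


def document_language(markup: str) -> Optional[str]:
    finder = LangFinder()
    finder.feed(markup)
    return finder.lang


page = (
    '<!DOCTYPE html><html lang="es"><head><meta charset="utf-8">'
    "<title>Hola</title></head><body><p>Hola</p></body></html>"
)
print(document_language(page))
print(document_language("<html><body></body></html>"))
```

A spell-checker or speech synthesizer could use the returned code to pick its dictionary or voice, and would have to guess when the attribute is absent.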

… more A glyph is a presentation of a particular shape which a character may have when rendered or displayed. –speak of same glyph in italic, bold, etc. A repertoire of glyphs comprises a font. In a more technical sense, as the implementation of a font, a font is a numbered set of glyphs. The numbers correspond to code positions of the characters (presented by the glyphs). Thus, a font in that sense is character code dependent. An expression like "Unicode font" refers to such issues and does not imply that the font contains glyphs for all Unicode characters.

Examples ASCII is a character repertoire, code and encoding. Note: confusion about 7 vs 8 bit ASCII. ISO Latin 1, alias ISO 8859-1, is a standard that defines a repertoire, code and encoding of which ASCII is a subset. ISO 8859 is a family of many encodings, indicated by the -n suffix. ISO 8859-5 handles Cyrillic.
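The subset relationship can be checked directly: the 128 ASCII octets decode to the same characters under ASCII and under members of the ISO 8859 family, while an octet above 127 means something different in each member. A minimal sketch using Python's standard codecs (variable names are for this example only):

```python
# All 128 seven-bit values.
ascii_octets = bytes(range(128))

as_ascii = ascii_octets.decode("ascii")
as_latin1 = ascii_octets.decode("iso-8859-1")
as_cyrillic = ascii_octets.decode("iso-8859-5")
print("lower half identical:", as_ascii == as_latin1 == as_cyrillic)

# One octet from the upper half, read under two members of the family.
high_octet = bytes([0xE4])
in_latin1 = high_octet.decode("iso-8859-1")
in_cyrillic = high_octet.decode("iso-8859-5")
print(in_latin1, in_cyrillic)
```

The same octet is a Latin letter in one encoding and a Cyrillic letter in the other, which is why the encoding name has to travel with the data.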

Unicode … provides a unique number for every character, no matter what the platform, no matter what the program, no matter what the language. This is the goal.
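The "unique number" is called a code point and is conventionally written as U+ followed by hexadecimal digits. A short sketch using Python's built-in ord and the standard unicodedata module (code_point is a helper name invented here):

```python
import unicodedata


def code_point(ch: str) -> str:
    """Format a character's Unicode number in the usual U+XXXX notation."""
    return f"U+{ord(ch):04X}"


# Latin, Spanish n with tilde, Hebrew shin, a Han character.
for ch in "A\u00f1\u05e9\u4e2d":
    print(ch, code_point(ch), unicodedata.name(ch))
```

The numbers are the same on every platform and in every program, which is the goal the slide describes.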

Unicode, continued The Unicode Standard has been adopted by such industry leaders as Apple, HP, IBM, JustSystem, Microsoft, Oracle, SAP, Sun, Sybase, Unisys and many others. Unicode is required by modern standards such as XML, Java, ECMAScript (JavaScript), LDAP, CORBA 3.0, WML, etc., and is the official way to implement ISO/IEC 10646. It is supported in many operating systems, all modern browsers, and many other products. The emergence of the Unicode Standard, and the availability of tools supporting it, are among the most significant recent global software technology trends.

Note Unicode goal is universal coverage… Unicode is product of a consortium of 'mostly US companies'. Some controversy in its treatment of things –Combining certain kanji characters

Unicode consortium Go to Unicode.html Examine the Translations on the left. See what language characters do not appear on your computer. –Select one and –Go to Display Problems and see if you can fix it.

Techniques One web site / screen provides options to go to different pages –use symbols/icons that are meaningful to the audience: tricky. Flags may not be appropriate. –use images containing text in the specific language –risky choice: hope that computer/platform/browser has character encoding and font to display language –poor choice: use English word for other language and/or require English –Other?

Class Exercise Go to these sites to see what they do? – – – – –Pick one (a site that would claim to have a global appeal)

Refrain: Accessibility Many organizations (including SUNY) require websites, other applications to be accessible to people with disabilities. There also can be situational blindness, other situations, in which attention/awareness helps a wider set of people. Note: complex situation example: ramps for wheelchairs versus curbs for the blind. Can you come up with a computer interface example? How to make websites be accessible for differently-abled in different places/languages?

Special requirements How to help visually impaired people understand mathematics? One approach (Murphy, et al, at Dublin City University): use a mixture of non-speech (e.g., rising and falling tones), modified speech, and binaural spatialization:

Discussion What would be other approaches? How would these types of techniques apply to something else?

Something to explore Speech synthesis (Text to speech) and Speech recognition I have not studied this: elements/?utm_source=html5weekly&utm_ medium=

Segue to privacy European countries (maybe the European Union as a whole) have stricter rules about privacy, such as rules on maintaining data on people

Business model … for many products involves using information for marketing For Facebook, Google, etc. "we are not the consumer; we are the product." –My source: Jeffrey Rosen, /who-do-online-advertisers-think-you-are.html?pagewanted=all –Alternate: if you aren't paying for it, you are the product. –What is your reaction?

Homework Find unique source on (one of) –techniques & checklists for localization/globalization –Unicode, –interactions with geolocation, –exact purpose for lang attribute, other html, similar things in meta tags –privacy issue (especially differences in different places) –Speech reco / speech synthesis package Write summary of what the site is/does; your assessment, and include working link. Keep working on teaching project