Characters & Fonts Digital Multimedia, 2nd edition

Slides:



Advertisements
Similar presentations
1 Professional Communications Working with Type Copyright © Texas Education Agency, All rights reserved. Images and other multimedia content used.
Advertisements

Graphics 2D 1 Subject:T0934 / Multimedia Programming Foundation Session:6 Tahun:2009 Versi:1/0.
Typography the art of using (text) fonts and font styles to produce professional looking publications the art of using (text) fonts and font styles to.
Tafseer Ahmed Department of Computer Science University of Karachi Urdu on Linux International Support.
Chapter Concepts Discuss Fonts Understand Fonts
Media: Text “Words and symbols in any form, spoken or written, are the most common system of communication.” ~ unknown.
Review1 What is multilingual computing? Bilingual, trilingual, vs. Multilingual What are the fundamental issues in multi-lingual computing? –Representation.
1.01 Investigate typefaces and fonts.
Unicode, character sets, and a a little history. Historical Perspective First came EBCIDIC (6 Bits?) Then in the early 1960s came ASCII – Most computers.
CHARACTERS Data Representation. Using binary to represent characters Computers can only process binary numbers (1’s and 0’s) so a system was developed.
Introduction to Human Language Technologies Tomaž Erjavec Karl-Franzens-Universität Graz Tomaž Erjavec Lecture: Character sets
Web Design, 5 th Edition 5 Typography and Images.
Typography 2.01 Investigate typefaces and fonts..
Chapter 9 Text. So, chapter 10 is about Photography, now Chapter 9 is about text What should it look like, how big should it be, how do colors contrast,
Text Text. Multimedia Elements u Text u Graphics u Animation u Sound u Video.
Text.
Chapter 7 Chapter 7 Characters and Fonts Multimedia Systems.
Text. Table of Content 1.Introduction of text. 2.Text elements. 3.Types of text. 4.Fonts and typefaces. 5.Font Terminology. 6.Classification of fonts.
Typography Terms. o.php?viewkey=d26eb03e91d5741a4a 3b.
Digital Interactive Media
Encoding and fonts Edward Garrett Software Developer, ELAR.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 10 This presentation © 2004, MacAvon Media Productions Characters & Fonts.
Week 4 Number Systems.
Dale Roberts Department of Computer and Information Science, School of Science, IUPUI CSCI N305 Information Representation: Characters and Images.
Chapter 1 Data Storage(2) Yonsei University 1 st Semester, 2014 Sanghyun Park.
B.Sc. Multimedia ComputingMedia Technologies Character Representation & Font Technology.
CHAPTER FIVE TEXT.
Character Encoding, F onts. Overview Why do character encoding and fonts matter to linguists? How can you identify problems? Why do these problems arise?
Introduction to Interactive Media Interactive Media Components: Text.
Typography (Fonts). Type type = any set of printed characters font = a set of printable or displayable characters with a specific style and size typeface.
Globalisation & Computer systems Week 5/6 Character representation ACII and code pages UNICODE.
Anlab ( ) Kim, Yangjung Characters & Fonts.
Lecture 8: Introduction to Text and Font COMP Digital Media Production: Lecture 8.
Typography Graphic Design Fundamentals
Chapter 3 - Text Revamped by Hamzah Asyrani Sulaiman.
Graphic Design  Graphic Design is a form of visual communication.  Communicate a message  Using typography and graphic elements.
Business and Computing Deanery Multimedia Week 7 Text.
© 2001, Penn State University Encoding on the Internet Elizabeth J. Pyatt CETS.
Representing Characters in a Computer System Representation of Data in Computer Systems.
UNIT 4 - TYPOGRAPHY Digital Communications I. 3-D  an effect that gives the illusion of depth.
Digital Interactive Media
Digital Interactive Media
Digital Interactive Media
Multimedia & Webpage Design 1.01 Investigate typefaces and fonts.
Digital Interactive Media
Understanding and Applying Typography in CSS
The art of using text to produce professional looking publications.
Chapter 3 Data Storage.
DFP 4113 MULTIMEDIA TECHNOLOGY
Basic Desktop Terminology
Multimedia Systems and Applications
Unit 2.1: Identifying design elements when preparing graphics
2.01 Investigate typefaces and fonts.
Text.
The Art of Type 9/17/2018.
Multimedia & Webpage Design 1.01 Investigate typefaces and fonts.
2.01 Investigate typefaces and fonts.
Ch2: Data Representation
Source: Digital Desktop Publishing--Font Basics
Desktop Publishing Terminology
Digital Interactive Media
Characters & Fonts Digital Multimedia, 2nd edition
Assist. Lecturer Safeen H. Rasool Collage of Science Department of IT
Multimedia & Webpage Design 1.01 Investigate typefaces and fonts.
Layout Terms Visual Hierarchy
Professional Communications
UNIT-2 Text.
Multimedia & Webpage Design 1.01 Investigate typefaces and fonts.
ASCII and Unicode.
Presentation transcript:

Characters & Fonts Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 10 This presentation © 2004, MacAvon Media Productions

Text Dual nature Visual representation of language (content) 314–315 Text Dual nature Visual representation of language (content) Need to relate bit patterns to symbols of a written language Graphic element (appearance) Precise shapes of characters, spacing and layout (typography) Each abstract character may have many different graphic representations

Character Sets Abstract characters are grouped into alphabets 315–316 Character Sets Abstract characters are grouped into alphabets Any set of distinct symbols, usually forming the basis of some written language A character set is a mapping between the characters of some alphabet (its character repertoire) and bit patterns For each character in its repertoire, a character set defines a code value, belonging to its set of code points

ASCII American Standard Code for Information Interchange 317 ASCII American Standard Code for Information Interchange 7 bits for each code value, hence 128 code points Character repertoire only comprises 95 characters (other 33 values are used for control characters) Character repertoire is only really adequate for American English ISO standard ISO 646 is ASCII with national variants (accented letters, currency symbols)

318 8-bit Character Sets Easy to double the number of code points by using the eighth bit of a byte Maintain backward compatibility by keeping lower half (0–127) identical with US-ASCII Use code points 128–255 for accented letters, math symbols, extra punctuation 256 code points still insufficient for all languages, so must still use different variants ('code pages')

319 ISO 8859 Incompatible 8-bit extensions to ASCII were originally developed by manufacturers Standardization required ISO 8859 is a multi-part standard which defines a collection of character sets, each designed to accommodate the needs of a group of related languages ISO 8859-1, known as ISO-Latin1, covers most Western European languages

Multi-byte Character Sets 320 Multi-byte Character Sets 256 code points is not sufficient for ideographically based alphabets or for using more than one language at a time 16-bit (2-byte) character set with 65,536 code points can accommodate 256 8-bit character sets simultaneously, and so on with 24- and 32-bit sets ISO 10646 is structured as a hypercube comprising 256 groups (cubes) each of 256 planes of 256 rows, each with 256 characters

320 Structure of ISO 10646 Each code point can be written as a quadruple (g, p, r, c) – group, plane, row, character Using * to mean all values from 0–255, can also use quadruples to identify subsets (0, 0, 0, *) is the subset with all but lowest- order byte set to 0 In ISO 10646, (0, 0, 0, *) is identical with ISO Latin1

Unicode 16-bit character set developed by an industry consortium 320–321 Unicode 16-bit character set developed by an industry consortium Unicode uses CJK consolidation to fit all characters required for Chinese, Korean and Japanese into 16 bits Characters that look the same have the same position, even if they are in fact different ISO 10646 Basic Multilingual Plane (0, 0, *, *) is identical to Unicode (even though ISO 10646 doesn't really need to use CJK consolidation)

Encodings Mapping from code values to a sequence of bytes 322–323 Encodings Mapping from code values to a sequence of bytes charset specification in a MIME type identifies encoding and character set e.g. text/html; charset = ISO-8859-1 Obvious encoding of ISO 10646 uses four bytes for each 32-bit value – UCS 4 For values on BMP drop zero bytes – UCS 2 UCS 2 is therefore identical to Unicode

323 UTFs UCS Transformation Formats can be applied to Unicode (UCS 2) values UTF-8: ASCII characters encoded as themselves, values > 127 encoded as a string of up to six bytes with highest bit set to 1 UTF-7 further encodes UTF-8 as 7-bit values to avoid problems with older protocols UTF-16 allows pairs of 16-bit values to be combined into a single 32-bit value, extending Unicode beyond BMP (additional 15 planes)

Fonts Visual representation of a character is called a glyph 324–326 Fonts Visual representation of a character is called a glyph Must replace characters with glyphs for display Glyphs are arranged into collections called fonts Fonts are stored in specified locations on a computer system, may be embedded in documents If font is not embedded, document may not display properly on systems where that font is not installed

Classification of Fonts 327–330 Classification of Fonts Spacing: monospaced (fixed width)/proportional Serifs: serifed/sans serif Serifs are the small strokes added to the ends of character shapes in conventional book fonts Shape: upright/italic/slanted Slant is a vertical shear effect, italic uses different glyph shapes with a slant Weight: bold/normal/light

331–332 Choice of Fonts Text fonts – suitable for continuous text (e.g. body of a book or article) Must be unobtrusive, easy to read Display fonts – suitable for isolated pieces of short text (e.g. headings, signs or slogans) Intention is to get a short message across, so eye-catching design that would be inappropriate for continuous text is OK

Fonts for Multimedia Text fonts may be problematical 332–333 Fonts for Multimedia Text fonts may be problematical Low resolution of computer displays leads to loss of details (e.g. fine serifs) and distortion of letter shapes Use larger sizes than in print, prefer sans serif, use fonts such as Arial and Verdana designed to be readable at low resolution Display fonts work better and may be suitable for small pieces of continuous text

Font Measurement Units Points: 1pt = 1/72" = 0.3528mm 334–335 Font Measurement Units Points: 1pt = 1/72" = 0.3528mm Exact size is not standard; 1/72" is invariably used by computer systems Picas: 1pc = 12pt Font's body size is not necessarily the size of any particular character e.g. 10pt Times Roman

337 Font Terminology Baseline – the line on which the bases of characters are arranged Leading – the distance between successive baselines x-height – the distance between the baseline and the top of a lower-case letter x Ascenders/descenders – strokes that rise above the x-height/drop below the baseline

336

Relative Units Used to express measurements relative to font size 336–337 Relative Units Used to express measurements relative to font size 1 ex = font's x-height 1 em = body size Traditionally the width of an upper-case M Long dashes — known as em-dashes (1em long) 1 en = 0.5em Shorter dashes – known as en-dashes (1en long)

337–338 Spacing Kerning – adjustment of space between certain pairs of letters (e.g. AV) to make them look more uniform Kerning pairs for a font are defined by its designer, stored with the font metrics Ligatures – single composite characters used to replace pairs of letters that don't look right next to each other (e.g. fi) Ligatures are stored as extra characters in the font

339 Digital Fonts Glyphs are just images, so we can have bitmapped or vector (outline) fonts Bitmapped fonts don't scale well or reproduce at different resolutions Outline font formats: PostScript Type 1 TrueType OpenType

Outline Fonts Type 1 Character shapes are based on Bézier curves 340–341 Outline Fonts Type 1 Character shapes are based on Bézier curves Fonts can contain hints used by rendering programs to improve appearance at low res TrueType Character shapes are based on quadratic curves Instructions specify how features of a character are rendered at different resolutions

340 OpenType Fonts New cross-platform format that unifies Type 1 and TrueType More than 256 characters in each font Type 1 and TrueType both limited to 256 Encoding based on Unicode Support for extended range of ligatures, old- style numerals, swash capitals, fractions