ICT Foundation 1 Copyright © 2010, IT Gatekeeper Project – Ohiwa Lab. All rights reserved. Character representation.

Slides:



Advertisements
Similar presentations
ICS312 Set 2 Representation of Numbers and Characters.
Advertisements

Text Representation within Computers CS208. The Binary Digit (Bit) One bit can encode a value set that contains two elements e.g. {black, white}, {up,
Overview Digital Systems and Computer Systems Number Systems [binary, octal and hexadecimal] Arithmetic Operations Base Conversion Decimal Codes [BCD (binary.
S. Barua – CPSC 240 CHAPTER 2 BITS, DATA TYPES, & OPERATIONS Topics to be covered are Number systems.
Data Representation 3 This week – Recap on Floating point number – ASCII – unicode.
Introduction to Computer Engineering ECE/CS 252, Fall 2010 Prof. Mikko Lipasti Department of Electrical and Computer Engineering University of Wisconsin.
CCE-EDUSAT SESSION FOR COMPUTER FUNDAMENTALS Date: Session III Topic: Number Systems Faculty: Anita Kanavalli Department of CSE M S Ramaiah.
Chapter 2 Bits, Data Types, and Operations. Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 2-2 How do we.
CIS 234: Character Codes Dr. Ralph D. Westfall April, 2011.
COMPUTER FUNDAMENTALS David Samuel Bhatti
2.1.4 BINARY ASCII CHARACTER SETS A451: COMPUTER SYSTEMS AND PROGRAMMING.
Computer Systems Nat 4/5 Computing Science Data Representation Lesson 3: Storing Text.
CHARACTERS Data Representation. Using binary to represent characters Computers can only process binary numbers (1’s and 0’s) so a system was developed.
Localizing OpenClinica Hiroaki Honshuku: SQA 1. © What is Character Encoding?  Morse Code (1840) → Latin Alphabet  ASCII (1963)  The American Standard.
ACOE1611 Data Representation and Numbering Systems Dr. Costas Kyriacou and Dr. Konstantinos Tatas.
UNICODE Character Sets and Coding Standards Han Unification and ISO10646 Encoding Evolution and Unicode Programming Unicode.
Chapter 3 Representing Numbers and Text in Binary Information Technology in Theory By Pelin Aksoy and Laura DeNardis.
Agenda Data Representation – Characters Encoding Schemes ASCII
Numbers and Number Systems
Computer System Basics 1 Number Systems & Text Representation Computer Forensics BACS 371.
Aloha Aloha What you see: What the computer sees: binary number columns binary number columns
Logic and data representation Revision. AND gate A B A B All inputs have to be true ( i.e 1) for the output of the gate to be high, in all other cases.
Information and Programs. Foundations of Computing Information –Binary numbers –Integers and Floating Point –Booleans (True, False) –Characters –Variables.
Chapter 2 - Data Types. BYU CS/ECEn 124Chapter 2 - Data Types2 Today… Unit 1 slides and homework in JFSB B115 Problems with CCE on Talmage Window’s machines…
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Idea 1: Bits Two states in a digital machine: 1.presence of.
Lec 3: Data Representation Computer Organization & Assembly Language Programming.
Binary, Decimal and Hexadecimal Numbers Svetlin Nakov Telerik Corporation
ICS312 Set 1 Representation of Numbers and Characters.
PART 2 Data Types 1. Integers 2. 2’s Complement 3. Conversion
Character Encoding, F onts. Overview Why do character encoding and fonts matter to linguists? How can you identify problems? Why do these problems arise?
Chapter 2 Bits, Data Types, and Operations. 2-2 Hexadecimal Notation It is often convenient to write binary (base-2) numbers as hexadecimal (base-16)
Bits, Data Types, and Operations Slides based on set prepared by Gregory T. Byrd, North Carolina State University.
Coding of Information Assign a unique string of binary digits to each piece of information There are many standard coding groups –Decimal Codes (BCD –
Computer System Basics 1 Number Systems & Text Representation Computer Forensics BACS 371.
CISC1100: Binary Numbers Fall 2014, Dr. Zhang 1. Numeral System 2  A way for expressing numbers, using symbols in a consistent manner.  " 11 " can be.
1 Data Representation Characters, Integers and Real Numbers Binary Number System Octal Number System Hexadecimal Number System Powered by DeSiaMore.
Number Systems and Digital Codes
Strings in MIPS. Chapter 2 — Instructions: Language of the Computer — 2 Character Data Byte-encoded character sets – ASCII: 128 characters 95 graphic,
Instructor:Po-Yu Kuo 教師:郭柏佑
Irvine, Kip R. Assembly Language for Intel-Based Computers 6/e, Signed Integers The highest bit indicates the sign. 1 = negative, 0 = positive.
ICT Foundation 1 Copyright © 2010, IT Gatekeeper Project – Ohiwa Lab. All rights reserved. Binary and Hexadecimal numbers.
Text Copyright © Software Carpentry 2010 This work is licensed under the Creative Commons Attribution License See
Characters CS240.
Representing Characters in a Computer System Representation of Data in Computer Systems.
Binary Arithmetic, ASCII, & Boolean Algebra
Binary a. express numbers in binary, binary-coded decimal (BCD), octal and hexadecimal;
1.4 Representation of data in computer systems Character.
1 Non-Numeric Data Representation V1.0 (22/10/2005)
Nat 4/5 Computing Science Data Representation Lesson 3: Storing Text
Data Representation COE 308 Computer Architecture
Unit 2.6 Data Representation Lesson 2 ‒ Characters
Machine level representation of data Character representation
Data Representation ICS 233
Lec 3: Data Representation
Lesson Objectives Aims You should be able to:
Data Representation.
INTERNATIONALIZATION
Binary, Decimal and Hexadecimal Numbers
Data Encoding Characters.
TOPICS Information Representation Characters and Images
Data Representation COE 301 Computer Organization
Lecture 9: Radix-64 Tutorial
Chapter 3 DataStorage Foundations of Computer Science ã Cengage Learning.
Text Encoding.
Data Representation ICS 233
Learning Intention I will learn how computers store text.
WJEC GCSE Computer Science
Lecture 36 – Unit 6 – Under the Hood Binary Encoding – Part 2
ASCII and Unicode.
Data Representation COE 308 Computer Architecture
Presentation transcript:

ICT Foundation 1 Copyright © 2010, IT Gatekeeper Project – Ohiwa Lab. All rights reserved. Character representation

2 Copyright © 2010, IT Gatekeeper Project – Ohiwa Lab. All rights reserved. Using binary numbers to represent characters Computer can handle character data ▪For example, mapping each binary number to each character and then handling the binary numbers instead of characters. ▪The process of converting characters to binary strings (bit sequences) which computer can handle is called encoding. ▪The table that consists characters and binary strings is called character code table. CharacterABCD Binary number ※ These mapping rules are not the real ones in computer.

3 Copyright © 2010, IT Gatekeeper Project – Ohiwa Lab. All rights reserved. Character Encoding ASCII code ASCII code ▪Characters are encoded in 7-digit binary numbers. The 4 left-most bits represent hexadecimal numbers from 0~F, the 3 right-most bits represent hexadecimal numbers from 0~7. ( E.g. : A = 41 ( 16 ) = ( 2 ) ) ▪Specially defined control characters such as CR, DEL, etc. do specific functions. BS ( Back Space ) : turn back by one character CR ( Carriage Return ) : start new line of text ▪Japanese uses many characters so that 7 bits can not represent sufficiently.

4 Copyright © 2010, IT Gatekeeper Project – Ohiwa Lab. All rights reserved. ASCII code table NullDLE Space 1 SOHDC1!1AQaq 2 STXDC2"2BRbr 3 ETXDC3#3CScs 4 EOTDC4$4DTdt 5 ENQNAK%5EUeu 6 ACKSYN&6FVfv 7 BElETB'7GWgw 8 BSCAN(8HXhx 9 HTEM)9IYiy A LFSUB*:JZjz B VTESC+;K[k{ C FFFS,<L\l¦ D CRGS-=M]m} E SORS.>N^n~ F SIUS/?O_oDEL

5 Copyright © 2010, IT Gatekeeper Project – Ohiwa Lab. All rights reserved. Japanese Encoding Multibyte Encoding (variable-width encoding) Japanese including Kanji with characters can be represented by 16-bit (2 bytes) binary numbers. ▪JIS X 0208 standard prescribes 6879 characters of Hiragana, Katakana, Kanji, … There are three types of encoding based on JIS X 0208 ▪ISO-2022-JP ( JIS )・・・ Mainly used in ▪Shift_JIS ・・・ First used in Windows and widely used in personal computer. ▪EUC-JP ・・・ Mainly used in Unix

6 Copyright © 2010, IT Gatekeeper Project – Ohiwa Lab. All rights reserved. Unicode Representation and handling of text expressed in most of the world’s writing system. ▪Shift-JIS or EUC-JP that is based on JIS X 0208 is only used in Japan ▪Started to express the characters in the whole world by 16-bit binary number. Two encoding types ▪UCS-2 , UCS-4 ▪UTF-7 , UTF-8 , UTF-16 , UTF-32 Official website : There’s also a problem of integrated working (be assumed as the same) in such languages using similar Kanji as Japanese, Chinese or Korean