Presentation is loading. Please wait.

Presentation is loading. Please wait.

ITEC 1011 Introduction to Information Technologies 2. Data Formats Chapt. 3.

Similar presentations


Presentation on theme: "ITEC 1011 Introduction to Information Technologies 2. Data Formats Chapt. 3."— Presentation transcript:

1 ITEC 1011 Introduction to Information Technologies 2. Data Formats Chapt. 3

2 ITEC 1011 Introduction to Information Technologies Introduction Examples pp. 59.-61 Real World Data Computer Data Input device Dear Mom: Keyboard 10110010… Digital camera 10110010…

3 ITEC 1011 Introduction to Information Technologies Format must be appropriate The internal representation must be appropriate for the type of processing to take place (e.g., text, images, sound)

4 ITEC 1011 Introduction to Information Technologies Rules/Conventions Proprietary formats –Unique to a product or company –E.g., Microsoft Word, Corel Word Perfect, IBM Lotus Notes Standards –Evolve two ways: Proprietary formats become de facto standards (e.g., Adobe PostScript, Apple Quick Time) Committee is struck to solve a problem (Motion Pictures Experts Group, MPEG) pp. 61-62

5 ITEC 1011 Introduction to Information Technologies Standards Organizations ISO – International Standards Organization CSA – Canadian Standards Association ANSI – American National Standards Institute IEEE – Institute for Electrical and Electronics Engineers Etc.

6 ITEC 1011 Introduction to Information Technologies Examples of Standards Type of DataStandards AlphanumericASCII, EBCDIC, Unicode ImageJPEG, GIF, PCX, TIFF Motion pictureMPEG-2, Quick Time SoundSound Blaster, WAV, AU Outline graphics/fontsPostScript, TrueType, PDF

7 ITEC 1011 Introduction to Information Technologies Why Standards? Standard are “arbitrary” They exist because they are –Convenient –Efficient –Flexible –Appropriate –Etc.

8 ITEC 1011 Introduction to Information Technologies Alphanumeric Data Problem: Distinguishing between the number 123 (one hundred and twenty-three) and the characters “123” (one, two, three) Four standards for representing letters (alpha) and numbers –BCD – Binary-coded decimal –ASCII – American standard code for information interchange –EBCDIC – Extended binary-coded decimal interchange code –Unicode pp. 63-69

9 ITEC 1011 Introduction to Information Technologies Next 2 slides Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode

10 ITEC 1011 Introduction to Information Technologies Binary-Coded Decimal (BCD) Four bits per digit DigitBit pattern 00000 10001 20010 30011 40100 50101 60110 70111 81000 91001 Note: the following bit patterns are not used: 1010 1011 1100 1101 1110 1111

11 ITEC 1011 Introduction to Information Technologies Example 7093 10 = ? (in BCD) 7 0 9 3 0111 0000 1001 0011

12 ITEC 1011 Introduction to Information Technologies Next 22 slides Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode

13 ITEC 1011 Introduction to Information Technologies The Problem Representing text strings, such as “ Hello, world”, in a computer

14 ITEC 1011 Introduction to Information Technologies Codes and Characters Each character is coded as a byte Most common coding system is ASCII (Pronounced ass-key) ASCII = American National Standard Code for Information Interchange Defined in ANSI document X3.4-1977

15 ITEC 1011 Introduction to Information Technologies ASCII Features 7-bit code 8 th bit is unused (or used for a parity bit) 2 7 = 128 codes Two general types of codes: –95 are “Graphic” codes (displayable on a console) –33 are “Control” codes (control features of the console or communications channel)

16 ITEC 1011 Introduction to Information Technologies ASCII Chart

17 ITEC 1011 Introduction to Information Technologies

18 ITEC 1011 Introduction to Information Technologies Most significant bit Least significant bit

19 ITEC 1011 Introduction to Information Technologies e.g., ‘a’ = 1100001

20 ITEC 1011 Introduction to Information Technologies 95 Graphic codes

21 ITEC 1011 Introduction to Information Technologies 33 Control codes

22 ITEC 1011 Introduction to Information Technologies Alphabetic codes

23 ITEC 1011 Introduction to Information Technologies Numeric codes

24 ITEC 1011 Introduction to Information Technologies Punctuation, etc.

25 ITEC 1011 Introduction to Information Technologies “Hello, world” Example ======================== Binary 01001000 01100101 01101100 01101111 00101100 00100000 01110111 01100111 01110010 01101100 01100100 Hexadecimal 48 65 6C 6F 2C 20 77 67 72 6C 64 Decimal 72 101 108 111 44 32 119 103 114 108 100 Hello, worldHello, world ======================== ========================

26 ITEC 1011 Introduction to Information Technologies Common Control Codes CR0Dcarriage return LF0Aline feed HT09horizontal tab DEL7Fdelete NULL00null Hexadecimal code

27 ITEC 1011 Introduction to Information Technologies

28 ITEC 1011 Introduction to Information Technologies Terminology Learn the names of the special symbols –[ ] brackets –{ }braces –( )parentheses –@commercial ‘at’ sign –& ampersand –~tilde

29 ITEC 1011 Introduction to Information Technologies

30 ITEC 1011 Introduction to Information Technologies Escape Sequences Extend the capability of the ASCII code set For controlling terminals and formatting output Defined by ANSI in documents X3.41-1974 and X3.64-1977 The escape code is ESC = 1B 16 An escape sequence begins with two codes: ESC [ 1B 16 5B 16

31 ITEC 1011 Introduction to Information Technologies Examples Erase display:ESC [ 2 J Erase line:ESC [ K

32 ITEC 1011 Introduction to Information Technologies Next 1 slides Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode

33 ITEC 1011 Introduction to Information Technologies EBCDIC Extended BCD Interchange Code (pronounced ebb’-se-dick) 8-bit code Developed by IBM Rarely used today IBM mainframes only

34 ITEC 1011 Introduction to Information Technologies Next 2 slides Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode

35 ITEC 1011 Introduction to Information Technologies Unicode 16-bit standard Developed by a consortia Intended to supercede older 7- and 8-bit codes

36 ITEC 1011 Introduction to Information Technologies Unicode Version 2.1 1998 Improves on version 2.0 Includes the Euro sign (20AC 16 = ) From the standard: …contains 38,887 distinct coded characters derived from the supported scripts. These characters cover the principal written languages of the Americas, Europe, the Middle East, Africa, India, Asia, and Pacifica. http://www.unicode.org

37 ITEC 1011 Introduction to Information Technologies Keyboard Input Key (“scan”) codes are converted to ASCII ASCII code sent to host computer Received by the host as a “stream” of data Stored in buffer Processed Etc. pp. 69

38 ITEC 1011 Introduction to Information Technologies Shift Key inhibits bit 5 in the ASCII code Key(s) ASCII code 6 5 4 3 2 1 0 Character 1 1 0 0 0 0 1 1 0 0 0 0 0 1 aAaA a aShift

39 ITEC 1011 Introduction to Information Technologies Control Key inhibits bits 5 & 6 in the ASCII code Key(s) ASCII code 6 5 4 3 2 1 0 Character 1 1 0 0 0 1 1 0 0 0 0 0 1 1 c ETX c cCtrl Control code

40 ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp. 69-86

41 ITEC 1011 Introduction to Information Technologies OCR Hello, world Page of text Optical scan 10110110… Computer file

42 ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp. 69-86

43 ITEC 1011 Introduction to Information Technologies Bar Codes An automatic identification (Auto ID) technology that streamlines identification and data collection See http://www.digital.net/barcoder/barcode.html

44 ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp. 69-86

45 ITEC 1011 Introduction to Information Technologies Voice/audio Input Input device: microphone Audio input is “digitized” and stored Processed in two ways –As is (no recognition) –Recognized and converted to alphanumeric data (ASCII) Digitize 10110010…

46 ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp. 69-86

47 ITEC 1011 Introduction to Information Technologies Punched Cards Invented by Herman Hollerith (founder of IBM) Each card holds 80 characters

48 ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp. 69-86

49 ITEC 1011 Introduction to Information Technologies Images Typically images are pictures that are optically scanned and saved as a “bit map” or in some other format Many formats –gif, jpeg, …

50 ITEC 1011 Introduction to Information Technologies Typical “Save As” Dialog

51 ITEC 1011 Introduction to Information Technologies Objects Images made of geometrically definable shapes Offer efficiency, flexibility, small size, etc.

52 ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp. 69-86

53 ITEC 1011 Introduction to Information Technologies Pointing Devices Originally used for specifying coordinates (x, y) for graphical input Today used as general purpose device for “graphical user interfaces” (GUIs)

54 ITEC 1011 Introduction to Information Technologies Thank you


Download ppt "ITEC 1011 Introduction to Information Technologies 2. Data Formats Chapt. 3."

Similar presentations


Ads by Google