Www.fakengineer.com Data compression. www.fakengineer.com INTRODUCTION If you download many programs and files off the Internet, we have probably encountered.

Slides:



Advertisements
Similar presentations
Compression techniques. Why we need compression. Types of compression –Lossy and lossless Concentrate on lossless techniques. Run Length coding. Entropy.
Advertisements

15 Data Compression Foundations of Computer Science ã Cengage Learning.
Data Compression CS 147 Minh Nguyen.
SIMS-201 Compressing Information. 2  Overview Chapter 7: Compression Introduction Entropy Huffman coding Universal coding.
Data Compression Michael J. Watts
Compression & Huffman Codes
CPSC 231 Organizing Files for Performance (D.H.) 1 LEARNING OBJECTIVES Data compression. Reclaiming space in files. Compaction. Searching. Sorting, Keysorting.
Compression Techniques. Digital Compression Concepts ● Compression techniques are used to replace a file with another that is smaller ● Decompression.
Compression & Huffman Codes Fawzi Emad Chau-Wen Tseng Department of Computer Science University of Maryland, College Park.
Lossless Data Compression Using run-length and Huffman Compression pages
Last time 3 main components to a computer system Types of computers Talked about software – task oriented What are some kinds of data that a computer works.
 Wisegeek.com defines Data Compression as:  “Data compression is a general term for a group of technologies that encode large files in order to shrink.
©Brooks/Cole, 2003 Chapter 15 Data Compression. ©Brooks/Cole, 2003 Realize the need for data compression. Differentiate between lossless and lossy compression.
Lossless Compression Multimedia Systems (Module 2 Lesson 3)
Spring 2015 Mathematics in Management Science Binary Linear Codes Two Examples.
Management Information Systems Lection 06 Archiving information CLARK UNIVERSITY College of Professional and Continuing Education (COPACE)
Media File Formats Jon Ivins, DMU. Text Files n Two types n 1. Plain text (unformatted) u ASCII Character set is most common u 7 bits are used u This.
Lecture 10 Data Compression.
Chapter 2 Source Coding (part 2)
CSC 335 Data Communications and Networking Lecture 4c: Communication and Multiplexing Dr. Cheer-Sun Yang.
MULTIMEDIA TECHNOLOGY SMM 3001 DATA COMPRESSION. In this chapter The basic principles for compressing data The basic principles for compressing data Data.
1 Perception, Illusion and VR HNRS 299, Spring 2008 Lecture 14 Introduction to Computer Graphics.
Data Compression. How File Compression Works If you download many programs and files off the Internet, you've probably encountered ZIP files before. This.
 The amount of data we deal with is getting larger  Not only do larger files require more disk space, they take longer to transmit  Many times files.
1 Lecture 17 – March 21, 2002 Content-delivery services. Multimedia services Reminder  next week individual meetings and project status report are due.
Survey on Improving Dynamic Web Performance Guide:- Dr. G. ShanmungaSundaram (M.Tech, Ph.D), Assistant Professor, Dept of IT, SMVEC. Aswini. S M.Tech CSE.
Multimedia Specification Design and Production 2012 / Semester 1 / L3 Lecturer: Dr. Nikos Gazepidis
Chapter 7 – End-to-End Data Two main topics Presentation formatting Compression We will go over the main issues in presentation formatting, but not much.
Image Compression (Chapter 8) CSC 446 Lecturer: Nada ALZaben.
Compression.  Compression ratio: how much is the size reduced?  Symmetric/asymmetric: time difference to compress, decompress?  Lossless; lossy: any.
The LZ family LZ77 LZ78 LZR LZSS LZB LZH – used by zip and unzip
How Analog and Digital Recording Works Analog converted to digital via an ADV (Analog to Digital Converter = stream of numbers) On playback: digital converted.
Addressing Image Compression Techniques on current Internet Technologies By: Eduardo J. Moreira & Onyeka Ezenwoye CIS-6931 Term Paper.
Lossless Compression CIS 465 Multimedia. Compression Compression: the process of coding that will effectively reduce the total number of bits needed to.
Comp 335 File Structures Data Compression. Why Study Data Compression? Conserves storage space Files can be transmitted faster because there are less.
Lempel-Ziv-Welch Compression
Multi-media Data compression
Lecture 12 Huffman Algorithm. In computer science and information theory, a Huffman code is a particular type of optimal prefix code that is commonly.
LZW (Lempel-Ziv-welch) compression method The LZW method to compress data is an evolution of the method originally created by Abraham Lempel and Jacob.
Computer Sciences Department1. 2 Data Compression and techniques.
Submitted To-: Submitted By-: Mrs.Sushma Rani (HOD) Aashish Kr. Goyal (IT-7th) Deepak Soni (IT-8 th )
Data Compression: Huffman Coding in Weiss (p.389)
Data Compression Michael J. Watts
File Formats Different applications (programs) store data in different formats. Applications support some file formats and not others. Open…, Save…, Save.
3.3 Fundamentals of data representation
Textbook does not really deal with compression.
Data Coding Run Length Coding
Compression & Huffman Codes
Data Compression.
4k… 4K format was named because it has 4000 pixels horizontal resolution approximately. Meanwhile, standard 1080p and 720p resolutions were named because.
CPSC 231 Organizing Files for Performance (D.H.)
Data Compression.
Multimedia Outline Compression RTP Scheduling Spring 2000 CS 461.
Don’t squeeze the Charnin (OK to squeeze data!)
Introduction to Computer Science - Lecture 4
Data Compression.
File Compression-overview
Data Compression CS 147 Minh Nguyen.
Why Compress? To reduce the volume of data to be transmitted (text, fax, images) To reduce the bandwidth required for transmission and to reduce storage.
Topic 3: Data Compression.
UNIT IV.
CSE 589 Applied Algorithms Spring 1999
COMS 161 Introduction to Computing
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Data Compression.
Chapter 8 – Compression Aims: Outline the objectives of compression.
Compression.
Thesis Presented By Mohammad Abul Kalam Azad C Shabbir Ahmad C Francis Palma Tony C Supervised by S. M. Kamruzzaman Assistant.
WJEC GCSE Computer Science
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Presentation transcript:

Data compression

INTRODUCTION If you download many programs and files off the Internet, we have probably encountered ZIP files before. This compression system is a very handy invention, especially for Web users, because it reduces the overall number of bits and bytes in a file so it can be transmitted faster over slower Internet connections, or take up less space on a disk. The technique behind these ZIP files is known asData compression".

How data compression works? It is based on the following processes: Finding Redundancy Let us take an example: In John F. Kennedy's 1961 inaugural address, he delivered this famous line: " Ask not what your country can do for you --ask what you can do for your country. " The quote has 17 words, made up of 61 letters, 16 spaces, one dash and one period. If each letter, space or punctuation mark takes up one unit of memory, we get a total file size of 79 units.

How data compression works? To get the file size down, we need to look for redundancies. In the above quote the words ask, what, your, country, can, do, for, you appear two times. That means nine words -- ask, not, what, your, country, can, do, for, you -- give us almost everything we need for the entire quote. To construct the second half of the phrase, we just point to the words in the first half and fill in the spaces and punctuation.

Looking it Up In this step we pick out the words that are repeated and put them into the numbered index. Our sentence now reads: "1 not " Words Numbered Index ask1 what2 your3 country4 can5 do6 for7 you8

Searching for Patterns The phrase "can do for" is repeated, one time followed by "your" and one time followed by "you," giving us a repeated pattern of "can do for you." This lets us write 15 characters (including spaces), while "your country" only lets us write 13 characters (with spaces), so the program would overwrite the "your country" entry as just r country," and then write a separate entry for "can do for you."

Searching for Patterns Using the patterns we picked out above, and adding -" for spaces, we come up with this larger dictionary: And the quote converted to this smaller sentence: Words with space Index Number ask___1 what___2 you3 r_country4 _can_do_for_ you 5 "1 not ___ 2345 ___ -- __12354

Data compression methods Data compression Lossless Methods Lossy Methods

Lossless compression The following are some of the techniques used in lossless compression. Run- Length Encoding: When data containing strings of repeated symbols (such as bits or characters), the strings can be replaced by a special marker, followed by the repeated symbol, followed by the number of occurrences.

Run- Length Encoding Figure2(a): Original data 5726#409321# #015 The symbol 4 is repeated 09 times. The symbol 3 is repeated 19 times. The symbol 0 is repeated 15 times. Figure2(b): Compressed data The symbol # is the marker

Statistical Compression The three common encoding system using this principle are Morse code, Huffman encoding and Lempel-Ziv-Welch encoding. Morse code: It uses variable length combination of mark (dash) space (dot) to encode data. One-symbol code represents the most frequent characters and five- symbol codes represent the least frequent characters. Example dot (.) represents the character E and four dashes and a dot ( --.--) represent the character Q.

Huffman Encoding E:0T:1 A:00I:01M:10 N:11 C:000D:001G:010 K:011 O:100 R:101 S:110 U:111 Figure3: Bit assignment based on frequency of characters Code sent E I G O T U E First interpretation A M R E I T S D G O U M Second interpretation Third interpretation Figure4: Multiple interpretations of transmitted data

Lempel-Ziv-Welch Encoding The LZW method of compressing data is an evolution of the method originally created by Abraham Lempel and Jacob Ziv The compression which takes place at the sender site, has the following components: a dictionary, a buffer, and an algorithm.

Lempel-Ziv-Welch 123 ABC Figure 5: Original dictionary for a three symbol text. Buffer Dictionary Buffer Strings to dictionary Symbols from the text Codes sent Figure 6: Buffer at the compression site

Compression algorithm Figure 7 shows the flow chart for the compression algorithm.

Decompression algorithm The decompression process, which takes place at the receiver site, uses the same components at the compression process. Dictionary A very interesting point is that the sender does not send dictionary created by the compression process; instead, the dictionary will be created at receiver site and, surprisingly, it is the exact replica of the dictionary created at the sender site. Buffers BufferTemporary buffer String to dictionary Symbols to be printed Figure8: Buffers at the decompression site Codes received

Decompression algorithm Figure9 shows the flowchart for the decompression algorithm.

Lossy compression If the decompressed data is not an exact replica of original information but something very close, we can use a lossy data compression method. Several methods have been developed using lossy compression techniques. Joint photographic experts group (JPEG) is used to compress pictures and graphics. Motion picture experts group (MPEG) is used to compress video.

Conclusion With technologies developing at a rapid rate new data compression methods are arising. One of them is JBIG (Joint Hi- Level Image Experts Group). It is made for image compressions and is a lossless method. Using artificial neural network the data compression techniques are also developing.

THANK YOU !!