CS 376b Introduction to Computer Vision 02 / 08 / 2008 Instructor: Michael Eckmann.

Slides:



Advertisements
Similar presentations
Information Representation
Advertisements

Computer Science 101 RGB Color System. Simplified Introduction to Color Vision Go to How We See: The First Steps of Human Vision or Color Vision for more.
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
Digital Image Fundamentals Selim Aksoy Department of Computer Engineering Bilkent University
Digital Image Fundamentals Selim Aksoy Department of Computer Engineering Bilkent University
Digital Image Fundamentals Selim Aksoy Department of Computer Engineering Bilkent University
Bit Depth and Spatial Resolution SIMG-201 Survey of Imaging Science © 2002 CIS/RIT.
CS 102 Computers In Context (Multimedia)‏ 04 / 20 / 2009 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 04 / 08 / 2008 Instructor: Michael Eckmann.
CS 376 Introduction to Computer Graphics 04 / 09 / 2007 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 03 / 26 / 2008 Instructor: Michael Eckmann.
1 Imaging and Image Representation  Sensing Process  Typical Sensing Devices  Problems with Digital Images  Image Formats  Relationship of 3D Scenes.
CS 376b Introduction to Computer Vision 04 / 01 / 2008 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 04 / 14 / 2008 Instructor: Michael Eckmann.
Digital Audio, Image and Video Hao Jiang Computer Science Department Sept. 6, 2007.
CS 376b Introduction to Computer Vision 02 / 05 / 2008 Instructor: Michael Eckmann.
Image Representation CS105. Data Representation Text representation – ASCII character set – Unicode – Data compression Images!
1/22/04© University of Wisconsin, CS559 Spring 2004 Last Time Course introduction Image basics.
CS559-Computer Graphics Copyright Stephen Chenney Image File Formats How big is the image? –All files in some way store width and height How is the image.
CS 376 Introduction to Computer Graphics 01 / 26 / 2007 Instructor: Michael Eckmann.
Digital Images The digital representation of visual information.
CS 376b Introduction to Computer Vision 02 / 26 / 2008 Instructor: Michael Eckmann.
Chpater 3 Resolution, File Formats and Storage. Introduction There are two factors that determine the quality of the picture you take; The resolution.
CS 376b Introduction to Computer Vision 04 / 02 / 2008 Instructor: Michael Eckmann.
Department of Physics and Astronomy DIGITAL IMAGE PROCESSING
COMP Bitmapped and Vector Graphics Pages Using Qwizdom.
Image processing Second lecture. Image Image Representation We have seen that the human visual system (HVS) receives an input image as a collection of.
CS 102 Computers In Context (Multimedia)‏ 01 / 28 / 2009 Instructor: Michael Eckmann.
Image Formation. Input - Digital Images Intensity Images – encoding of light intensity Range Images – encoding of shape and distance They are both a 2-D.
Lab #5-6 Follow-Up: More Python; Images Images ● A signal (e.g. sound, temperature infrared sensor reading) is a single (one- dimensional) quantity that.
TOPIC 4 INTRODUCTION TO MEDIA COMPUTATION: DIGITAL PICTURES Notes adapted from Introduction to Computing and Programming with Java: A Multimedia Approach.
Lawrence Snyder University of Washington, Seattle © Lawrence Snyder 2004 Adding some light to computing ….
How A Camera Works Image Sensor Shutter Mirror Lens.
Too much information running through my brain.. We live in the information age. Knowledge comes from careful investigation of information. Information.
Chapter 11 Fluency with Information Technology 4 th edition by Lawrence Snyder (slides by Deborah Woodall : 1.
1 Perception, Illusion and VR HNRS 299, Spring 2008 Lecture 14 Introduction to Computer Graphics.
Data Representation CS280 – 09/13/05. Binary (from a Hacker’s dictionary) A base-2 numbering system with only two digits, 0 and 1, which is perfectly.
CS 376 Introduction to Computer Graphics 04 / 11 / 2007 Instructor: Michael Eckmann.
CS 325 Introduction to Computer Graphics 02 / 01 / 2010 Instructor: Michael Eckmann.
Digital Image Fundamentals Selim Aksoy Department of Computer Engineering Bilkent University
FYS 100 Creative Discovery in Digital Art Forms Fall 2008 Burg Digital Photography Assignment.
© 1999 Rochester Institute of Technology Introduction to Digital Imaging.
CS 376b Introduction to Computer Vision 02 / 22 / 2008 Instructor: Michael Eckmann.
September 21, COMPUTER VISION WEB PAGE IS UP !! OR Simply go to computer science homepage.
Digital Cameras And Digital Information. How a Camera works Light passes through the lens Shutter opens for an instant Film is exposed to light Film is.
Introduction to Computer Vision Sebastian van Delden USC Upstate
Chapter 2 : Imaging and Image Representation Computer Vision Lab. Chonbuk National University.
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
CS 376b Introduction to Computer Vision 03 / 21 / 2008 Instructor: Michael Eckmann.
Multimedia Basics (1) Hongli Luo CEIT, IPFW. Topics r Image data type r Color Model : m RGB, CMY, CMYK, YUV, YIQ, YCbCr r Analog Video – NTSC, PAL r Digital.
Digital Photography File Formats (….and why use one or the other…) Instructor: David King.
Computer Vision Introduction to Digital Images.
Data Representation The storage of Text Numbers Graphics.
CS 376b Introduction to Computer Vision 02 / 11 / 2008 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 03 / 18 / 2008 Instructor: Michael Eckmann.
1 Computer Science 631 Multimedia Systems Prof. Ramin Zabih Computer Science Department CORNELL UNIVERSITY.
CS 325 Introduction to Computer Graphics 04 / 12 / 2010 Instructor: Michael Eckmann.
CS 101 – Sept. 14 Review Huffman code Image representation –B/W and color schemes –File size issues.
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
Image File Formats By Dr. Rajeev Srivastava 1. Image File Formats Header and Image data. A typical image file format contains two fields namely Dr. Rajeev.
CS 376b Introduction to Computer Vision 02 / 12 / 2008 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 03 / 31 / 2008 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 03 / 17 / 2008 Instructor: Michael Eckmann.
More Digital Representation Discrete information is represented in binary (PandA), and “continuous” information is made discrete.
COMP 9517 Computer Vision Digital Images 1/28/2018 COMP 9517 S2, 2009.
Graphics and image data representation
Binary Representation in Audio and Images
Digital Image Fundamentals
Representing Images 2.6 – Data Representation.
Basic Concepts of Digital Imaging
Presentation transcript:

CS 376b Introduction to Computer Vision 02 / 08 / 2008 Instructor: Michael Eckmann

Michael Eckmann - Skidmore College - CS 376b - Spring 2008 Today’s Topics PGM, PPM file formats (see webpages and examples)‏ RGB color model HW (required)‏ Image acquisition/formation and representation –problems with digital images –some definitions quantization problems compression comment frames of reference hole counting algorithm from ch. 1

Michael Eckmann - Skidmore College - CS Spring 2008 RGB Color Model Color = rR + gG + bB r, g and b range from 0.0 to 1.0 notice: white's (r,g,b) = (1,1,1)‏ black's (r,g,b) = (0,0,0)‏ complementary colors are those that add up to (1,1,1)‏ can anyone tell me 2? image from “Computer Graphics” by Hearn and Baker

Michael Eckmann - Skidmore College - CS 376b - Spring 2008 individual Homework write a C++ program to generate a semi random PPM image (named verticalStripes.ppm) with 400 columns and 300 rows –each stripe should be 10 pixels high and 400 pixels wide start with a random 3 byte RGB color (each R G and B should be a number )‏ –select the largest of the random R G B generated and gradually and regularly alter that channel from that number down to 0 along the row of 300 pixels –so, 30*3 random numbers are to be generated (3 at a time)‏ use one set of 3 as the R G and B value of of the leftmost pixel for 10 rows

Michael Eckmann - Skidmore College - CS 376b - Spring 2008 team Homework you should group into 2 teams of 2 and 1 team of 3 students –each team should have at least one student who took CG to help with 2.2 part c)‏ exercises from the text –2.2 (p. 25)‏ –2.4 (p. 34)‏ –and...

Michael Eckmann - Skidmore College - CS 376b - Spring 2008 team Homework 2.8 (p. 39)‏ find two images (a face and a landscape) on the web with a license that allows you to use them legally (e.g. creative commons) provide attribution regardless whether the license requires it (and the exact url where you found it)‏ –use gimp (or another program) to save it in PPM ascii, PPM raw, GIF, TIFF and 5 different quality levels (10, 25, 40, 55, 70 and original) of JPEG – make sure to change name of JPEG files when saving to diff. quality levels always start from the original to produce the 5 other quality levels –make a chart containing file size in bytes for each format and examine by eye the various images and make quality judgements (consider the overall picture and fine details) specify the #rows&cols

Michael Eckmann - Skidmore College - CS Spring 2008 CCD cameras image figure 2.2 in “Computer Vision” by Shapiro and Stockman discrete cells convert light energy into electrical charges and they integrate the light energy falling on them during the time the shutter is open

Michael Eckmann - Skidmore College - CS Spring 2008 Video 30 frames (images) per second for NTSC video –can be a different rate of images e.g. 15, 60 almost always compressed due to the large size –example: a 640 x 480 image with 3 bytes per pixel uncompressed is how big? a 90 minute video w/ 30 frames per second uncompressed with each frame as above is how big? COTS (commercial off the shelf) CCD video cameras are made for other purposes than for computer vision –e.g. pixels may not be square (4:3 ratio)‏ computer vision most likely prefers/assumes square pixels

Michael Eckmann - Skidmore College - CS Spring 2008 Problems in digital images geometric distortion due to an imperfect lens the light beams don't travel on the correct path e.g. barrel distortion for small focal lengths scattering medium that the light travels through is rarely a vacuum, hence the light bends (e.g. air, water, other liquids...)‏ blooming each CCD cell is close to their neighbors so, light can leak between neighboring cells CCD variations some cells might not be all equally sensitive due to variations in the manufacturing process

Michael Eckmann - Skidmore College - CS Spring 2008 Problems in digital images clipping or wrap-around when an analog value is converted to digital, high values may be “clipped” to the max digital value, wrap around is like a modulus operator quantization effects converting a continuous range to a discrete range (we'll see one type of problem with this shortly)‏ chromatic distortion lens has the index of refraction (bending) different for different wavelengths (colors) let's look back at the lightbulb picture, what's the effect?

Michael Eckmann - Skidmore College - CS Spring 2008 Problems in digital images images from “Computer Vision” by Shapiro and Stockman Blooming(left) & Barrel distortion (right)‏

Michael Eckmann - Skidmore College - CS Spring 2008 some definitions analog image –“infinite” precision in space and intensity value digital image –discrete 2D array of limited precision intensity values grey-scale image –one intensity per pixel (e.g. if one byte intensity range is 0-255)‏ multispectral image –a vector of values at each pixel (for color, usually 3 values representing values for red, green and blue)‏ binary image –each pixel has value 0 or 1

Michael Eckmann - Skidmore College - CS Spring 2008 some definitions labelled image –all pixel values are from some finite alphabet –usually generated from a digital image based on some way to decide which label a pixel gets –example on board picture function –f(x,y) where x and y are spatial variables and f(x,y) is the intensity at x,y

from Shapiro & Stockman figure 2.9 in “Computer Vision” quantization problems example

Michael Eckmann - Skidmore College - CS Spring 2008 compression comment run coding is used as part of some compression algorithms to give you a sense of how an image can be losslessly compressed using run length coding –count runs of 0's and runs of 1's example using 1 byte (8 bits) greyscale image –divide up your image into 8 “bit planes” each plane is a binary image run code each binary image note 127 -> 128 problem –solution --- first convert all to a “grey code” which has the property that successive greyscale differences differ in only 1 bit

Michael Eckmann - Skidmore College - CS Spring 2008 frames of reference world (W)‏ object (O)‏ camera (C)‏ real image (F)‏ pixel (I)‏

Michael Eckmann - Skidmore College - CS Spring 2008 frames of reference

Michael Eckmann - Skidmore College - CS Spring 2008 hole counting external corners internal corners these are sufficient as long as the holes are “4-connected” and simple (e.g. no donuts)‏

Michael Eckmann - Skidmore College - CS Spring 2008 hole counting algorithm

Michael Eckmann - Skidmore College - CS Spring 2008 hole counting algorithm