Ramakrishna Lanka Ramakrishna.lanka@mavs.uta.edu 1001126160 IMAGE ANALYSIS AND OBJECT DETECTION-RECOGNITION USING COMPUTER VISION ALGORITHMS Ramakrishna.

Slides:



Advertisements
Similar presentations
Presented by Xinyu Chang
Advertisements

Caroline Rougier, Jean Meunier, Alain St-Arnaud, and Jacqueline Rousseau IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 21, NO. 5,
EDGE DETECTION ARCHANA IYER AADHAR AUTHENTICATION.
APECE-505 Intelligent System Engineering Low-level Processing Md. Atiqur Rahman Ahad Reference books: - Computer Vision and Action Recognition, Md. Atiqur.
Image (and Video) Coding and Processing Lecture 5: Point Operations Wade Trappe.
Computer Vision Detecting the existence, pose and position of known objects within an image Michael Horne, Philip Sterne (Supervisor)
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
EE663 Image Processing Edge Detection 2 Dr. Samir H. Abdul-Jauwad Electrical Engineering Department King Fahd University of Petroleum & Minerals.
Traffic Sign Recognition Jacob Carlson Sean St. Onge Advisor: Dr. Thomas L. Stewart.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Highlights Lecture on the image part (10) Automatic Perception 16
Introduction to Computer Vision CS / ECE 181B Thursday, April 22, 2004  Edge detection (HO #5)  HW#3 due, next week  No office hours today.
The Segmentation Problem
Elements of Biomedical Image Processing BMI 731 Winter 2005 Kun Huang Department of Biomedical Informatics Ohio State University.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
California Car License Plate Recognition System ZhengHui Hu Advisor: Dr. Kang.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Digital Image Processing & Pattern Analysis (CSCE 563) Course Outline & Introduction Prof. Amr Goneid Department of Computer Science & Engineering The.
Overview Introduction to local features
Introduction to Image Processing Grass Sky Tree ? ? Review.
Neighborhood Operations
Overview Harris interest points Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points Evaluation and comparison of different.
CMPD273 Multimedia System Prepared by Nazrita Ibrahim © UNITEN2002 Multimedia System Characteristic Reference: F. Fluckiger: “Understanding networked multimedia,
Vehicle License Plate Detection Algorithm Based on Statistical Characteristics in HSI Color Model Instructor : Dr. K. R. Rao Presented by: Prasanna Venkatesh.
September 23, 2014Computer Vision Lecture 5: Binary Image Processing 1 Binary Images Binary images are grayscale images with only two possible levels of.
Digital Image Processing CCS331 Relationships of Pixel 1.
0 - 1 © 2007 Texas Instruments Inc, Content developed in partnership with Tel-Aviv University From MATLAB ® and Simulink ® to Real Time with TI DSPs Edge.
Chapter 10 Image Segmentation.
Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.
Digital Image Processing Lecture 1: Introduction February 21, 2005 Prof. Charlene Tsai Prof. Charlene Tsai
DIGITAL IMAGE. Basic Image Concepts An image is a spatial representation of an object An image can be thought of as a function with resulting values of.
Edge Detection and Geometric Primitive Extraction Jinxiang Chai.
Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.
Autonomous Robots Vision © Manfred Huber 2014.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison.
Sejong Univ. Edge Detection Introduction Simple Edge Detectors First Order Derivative based Edge Detectors Compass Gradient based Edge Detectors Second.
1 Machine Vision. 2 VISION the most powerful sense.
CSE 185 Introduction to Computer Vision Feature Matching.
Digital Image Processing
October 1, 2013Computer Vision Lecture 9: From Edges to Contours 1 Canny Edge Detector However, usually there will still be noise in the array E[i, j],
Augmented Reality and 3D modelling Done by Stafford Joemat Supervised by Mr James Connan.
Graphics II Image Processing I. Acknowledgement Most of this lecture note has been taken from the lecture note on Multimedia Technology course of University.
Lecture 04 Edge Detection Lecture 04 Edge Detection Mata kuliah: T Computer Vision Tahun: 2010.
Canny Edge Detection Using an NVIDIA GPU and CUDA Alex Wade CAP6938 Final Project.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
Edge Segmentation in Computer Images CSE350/ Sep 03.
Digital Image Processing
Digital Image Processing CSC331
1 Edge Operators a kind of filtering that leads to useful features.
Digital Image Processing CCS331 Relationships of Pixel 1.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
Computer Photography -Scene Fixed 陳立奇.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching Link: singhashwini.mesinghashwini.me.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
IMAGE PROCESSING INTRODUCTION TO DIGITAL IMAGE PROCESSING
IMAGE PROCESSING INTENSITY TRANSFORMATION AND SPATIAL FILTERING
Introduction Computer vision is the analysis of digital images
Image and Video Processing – An Introduction
Levi Smith REU Week 1.
Introduction Computer vision is the analysis of digital images
ECE 692 – Advanced Topics in Computer Vision
Digital Image Processing
a kind of filtering that leads to useful features
Brief Review of Recognition + Context
a kind of filtering that leads to useful features
CSE 185 Introduction to Computer Vision
Introduction Computer vision is the analysis of digital images
Lecture 5: Feature invariance
Presentation transcript:

Ramakrishna Lanka Ramakrishna.lanka@mavs.uta.edu 1001126160 IMAGE ANALYSIS AND OBJECT DETECTION-RECOGNITION USING COMPUTER VISION ALGORITHMS Ramakrishna Lanka Ramakrishna.lanka@mavs.uta.edu 1001126160

Contents Introduction to images and image processing Overview of computer vision Overview of image and video processing workflow Image enhancement Computer vision search algorithms and techniques Tools to be used References

PROPOSAL The objective of this project is to implement computer vision algorithms to analyze an image and detect objects in the image based on object reference images.

Acronyms 3D – Three dimensional CVIU – Computer vision and image understanding FAST – Features from accelerated segment test MSER – Maximally stable extremal regions PCB – Printed circuit board RANSAC – Random sample consensus RGB – Red, green and blue SURF – Speeded up robust features VTP – Virtual terrain Project

What is an Image ? Pictorial reproduction of an object in physical or digital space. A digital Image is made up of Pixels. More the pixels, much detailed is the Image as shown in figure(1). Resolution of an Image is measured by the number of pixels it consists of. Ex: 1366x768 Fig(1). Illustration of image pixels. [19]

Need for Image and video processing. Computer Vision: Object/Activity detection as shown in figure (15) and figure (16). Medical and biological image processing as shown in figure (10) and figure (11). After effects: Adding effects and image correction as shown in figure (2). Efficient use of media in Consumer multimedia products: Storage, display and transfer. Fig(2). Illustration of image after effects.

OVERVIEW OF COMPUTER VISION Vision has given humans the natural ability to picture the world around in a three-dimensional (3D) space. Computer vision gives the man-made device the ability to perceive the world. The goal of computer vision is to extract meaningful information from the images and videos that the image sensors capture.

OVERVIEW OF THE IMAGE AND VIDEO PROCESSING WORKFLOW Fig(2). Image and video processing workflow. [2]

OVERVIEW OF THE IMAGE AND VIDEO PROCESSING WORKFLOW The raw images from the source may contain unwanted image source characteristics or noise. Preprocessing to enhance the images will prepare it for further processing based on the requirement. Image enhancement can be achieved using various filtering techniques and transforms.

Fig(3). Original image of a PCB [21] Negative of an image Fig(3). Original image of a PCB [21] Fig(4). Histogram of the original PCB image N(r, c) = 255 – I(r, c) where 0 <= r <= R and 0 <= c <= C (1.1)   Where, N is the negative of image I, both with dimension R x C.

Fig(7). Negative image of the PCB Negative of an image Fig(7). Negative image of the PCB Fig(8). Histogram of the negative PCB image

Negative of an image Fig(10). Original image of a knee scan [22] Fig(11). Negative image of the knee scan figure (10) Fig(9). Histogram of the knee scan Fig(12). Histogram of the negative knee scan

Thresholding Transformation (1.2) Where s is the value of the resultant pixel intensity and r is the actual pixel intensity. Fig(13). Image of some coins [28] Fig(14). Threshold image of figure(13)

Thresholding Transformation Multilevel thresholding Fig(15). Image of obstacles [24] Fig(16). Multilevel threshold image of figure(15)

Thresholding Transformation Thresholding – color labelling Fig(17). Image of my cluttered desk Fig(18). Threshold segmentation of figure(17)

Powers-Law Transformations / Gamma correction s = crγ (1.3) Where, s is the corrected pixel intensity, r is the actual pixel intensity, c and γ are gamma correction constants. Fig(19). Image of the gamma palette [27] Fig(20). Gamma corrected image of figure(19) with γ=0.7 Fig(21). Gamma corrected image of figure(19) with γ=0.5

Histogram Equalization h(rk) = nk (1.4) Where, h(rk) is the histogram function, rk is the intensity value and nk is the number of pixels with the intensity rk.[4]

Computer Vision search algorithms and techniques Edge feature detection The Sobel operator The Prewitt operators The Robert operator Laplacian/Laplacian of Gaussian operator Canny edge detection Fig(24). Reference object image [25] Fig(25). Object in scene image [25]

Edge feature detection The Sobel operator (1.5) Where Gx is the vertical filter and Gy is the horizontal filter.[14][15] Figure(26) and figure(27) show the edge detections of figure(24) and figure(25). Fig(26). Sobel edge detection of figure(24) Fig(27). Sobel edge detection of figure(25)

Edge feature detection The Prewitt operators (1.6) Where Gx is the vertical filter and Gy is the horizontal filter.[14][15] Figure(28) and figure(29) show the edge detections of figure(24) and figure(25). Fig(28). Prewitt edge detection of figure(24) Fig(29). Prewitt edge detection of figure(25)

Edge feature detection The Robert operator (1.7) Where Gx is the vertical filter and Gy is the horizontal filter.[14][15] Figure(30) and figure(31) show the edge detections of figure(24) and figure (25).

Edge feature detection The Laplacian/Laplacian of Gaussian operator The Laplacian of an image highlights regions of rapid intensity change. The Laplacian is often applied to an image that has first been smoothed with something approximating a Gaussian smoothing filter in order to reduce its sensitivity to noise. [14][15]

Edge feature detection Canny edge detection Low error rate. Edge points should be well localized. Single edge point response.

Computer Vision search algorithms and techniques Local point/region detection Corner point detection – FAST (Features from accelerated segment test ) MSER(Maximally stable extremal regions) feature detection The Robert operator SURF(Speeded up robust features) feature detection. Fig(24). Reference object image [25] Fig(25). Object in scene image [25]

Local point/region feature detection Corner point feature detection - FAST corner detector uses a circle of 16 pixels (a Bresenham circle of radius 3) to classify whether a candidate point p is actually a corner. [31]

Local point/region feature detection Maximally stable extremal regions (MSER) MSER is a method for blob detection in images. The MSER algorithm extracts from an image a number of co-variant regions, called MSERs: an MSER is a stable connected component of some gray-level sets of the image. [29]

Local point/region feature detection Speeded up robust features (SURF) SURF is a fast and robust algorithm for local, similarity invariant representation and comparison. SURF selects interest points of an image from the salient features

Object recognition Recognition [9][10] Recognition is a process that follows detection and/or segmentation. Recognition approaches in this project, Object feature recognition [10] Recognition with point feature matching Recognition with edge feature matching Object Segmentation recognition [1] Recognition with segmentation

Object feature recognition Recognition with point feature matching [1][10] Match the extracted feature points of reference and scene images. The RANSAC (Random sample consensus ) algorithm works by estimating the best set of point matches between the images. This is shown in figure(43). Fig(43). Feature matching between figure(25) and figure(26)

Object feature recognition Recognition with point feature matching [1][10] Fig(44). Object in figure(25) recognized in figure(26)

Project status The aim of this project is to analyze and enhance images; and design a system to identify objects in an image. The identity of the objects will be achieved by using feature detection and recognition based on the features of the reference objects that the system would have learnt before recognition. Image analysis : Done Image Enhancement : Done Feature detection : Done Feature Extraction : Done Segmentation : Done Performance comparison : Ongoing Recognition with point feature matching : Done Recognition with edge feature matching : Done (yet to document) Recognition with segmentation : Done (yet to document) Multiple object recognition : Done (yet to document)

Tools being used Matlab – Image processing tool box [13] Matlab – Computer vision tool box [9] Matlab – R2013a OpenCV v2.4.11 [10] Microsoft Visual Studio v2010 Windows 8.1 64bit Central processor – Intel® Core™ i3-3120 2.50Ghz Graphics processor – NVidia GT 720M 2GB Memory – 8.00 GB

References [1] R. Szeliski, “Computer Vision: Algorithms and Applications”, 2010 Springer, Draft: 3, September 2010. [2] A. Thé, “Image Processing Made Easy”, MathWorks Videos and Webinars, Recorded: 15 Oct 2005. http://www.mathworks.com/videos/image-processing-made-easy-81718.html?form_seq=conf1050 [3] VTP software tutorials webpage (Source for hazy image): http://vterrain.org/Doc/Tut3/ [4] R. Maini and H. Aggarwal, “A Comprehensive Review of Image Enhancement Techniques”, Journal of Computing, Volume 2, Issue 3, pp 8-12 March 2010. : http://arxiv.org/ftp/arxiv/papers/1003/1003.4053.pdf [5] T. Gupta, “Seeing through Chaos: Noise Filtering”, ahumaninmachinesworld.blogspot, January 2014 (Internet Blog on Noise Filtering): http://ahumaninmachinesworld.blogspot.com/2014/01/seeing-through-chaos-noise-filtering.html [6] Computer Vision Algorithms, “COMPUTER VISION TUTORIAL”. Link: http://www.societyofrobots.com/programming_computer_vision_tutorial_pt3.shtml#middle_mass [7] H. Spont´on and J. Cardelino, “A Review of Classic Edge Detectors”, Image Processing On Line, Preprint Issue, pp 1-8, June 2012. Link: http://www.ipol.im/pub/pre/35/preprint.pdf

References [8] ‘Performance criteria’, Document in the Course website of EE5356 Digital Image processing (rate the performance of the filtered/enhanced image). Link: http://www.uta.edu/faculty/krrao/dip/Courses/EE5356/performance.pdf [9] Matlab computer vision toolbox: http://www.mathworks.com/help/vision/ [10] Open source Computer vision software: OpenCV v3.0 – download: http://opencv.org/downloads.html [11] Open source Computer vision software: OpenCV v3.0 – Reference Manual: http://docs.opencv.org/3.0-beta/modules/refman.html [12] R.C. Gonzalez, R.E.Woods and S.L. Eddins, “Digital Image Processing Using MATLAB”, Pearson Education, 2004. [13] Matlab image processing toolbox: http://www.mathworks.com/products/image/index-b.html [14] A. K. Jain, “Fundamentals of Digital Image Processing”. Englewood Cliffs, NJ: Prentice Hall, 1989.

References [15] R.C. Gonzalez and R.E. Woods, “Digital Image Processing”. 3nd edition, Prentice Hall, 2007. [16] Line detection image: http://www.camera-sdk.com/p_253-how-to-accomplish-line-detection-in-c-onvif.html [17] Open source Computer vision software: OpenCV v3.0 – User Guide: http://docs.opencv.org/3.0-beta/doc/user_guide/user_guide.html [18] Image segmentation example link: http://www.bioclinica.com/sites/default/files/u1/atlas-based-brain-segmentation-lg.jpg [19] Test images: http://www.imageprocessingplace.com/root_files_V3/image_databases.htm [20] Computer vision test image: http://www.cs.cmu.edu/afs/cs/project/cil/www/v-images.html [21] PCB image: http://images.fineartamerica.com/images-medium-large-5/printed-circuit-board-russell-shively.jpg

References [22] Medical image – knee: http://www.leutron.com/english/product/vip_d.htm [23] Geometric shapes image: http://www.maxton.com/gallery/con3-all2a.jpg [24] Need for image processing, computer vision path planning image: http://www.juliangough.com/photos/assorted-stolen-images-i-like/1107153 [25] Stop reference image: http://safety.fhwa.dot.gov/intersection/resources/casestudies/fhwasa09010/images/f1.jpg [26] Stop scene image: http://www.driversedguru.com/wp-content/gallery/jamie-gallery/Stop%20Sign.jpg [27] Gamma correction image: http://www.maximumpc.com/files/u16580/HowMonTest1.jpg [28] Coins image: http://upload.wikimedia.org/wikipedia/en/e/ec/Eagle_coins.jpg

References [29] Ppt on MSER : http://www.micc.unifi.it/delbimbo/wp-content/uploads/2011/03/slide_corso/A34%20MSER.pdf [30] H. Bay, A. Ess, T.Tuytelaars and L.V. Gool, "SURF: Speeded Up Robust Features", Computer Vision and Image Understanding (CVIU), Vol. 110, No. 3, pp. 346--359, June 2008. [31] PowerPoint presentation on FAST: http://web.eecs.umich.edu/~silvio/teaching/EECS598_2010/slides/11_16_Hao.pdf