Knowledge Systems Lab JN 9/10/2002 Computer Vision: Gesture Recognition from Images Joshua R. New Knowledge Systems Laboratory Jacksonville State University.



Outline
- Terminology
- Current Research and Uses
- Kjeldsen's PhD Thesis
- Implementation Overview
- Implementation Analysis
- Future Directions

Terminology
- Image Processing: computer manipulation of images. Some of the many algorithms used in image processing include convolution (on which many others are based), edge detection, and contrast enhancement.
- Computer Vision: a branch of artificial intelligence and image processing concerned with computer processing of images from the real world. Computer vision typically requires a combination of low-level image processing to enhance image quality (e.g. remove noise, increase contrast) and higher-level pattern recognition and image understanding to recognize features present in the image.

Current Research
- Capture images from a camera
- Process images to extract features
- Use those features to train a learning system to recognize the gesture
- Use the gesture as a meaningful input into a system
- More information located at:

Current Research Example
- Starner and Pentland
- 2 hands segmented
- Hand shape from a bounding ellipse
- Eight-element feature vector
- Recognition using Hidden Markov Models

Current Uses
- Sign Stream (released demo for MacOS)
- Database tool for analysis of linguistic data captured on video
- Developed at Boston University with funding from ASL Linguistic Research Project and NSF
- Stream/

Current Uses
- Recursive Models of Human Motion (Smart Desk, MIT)
- Models the constraints by which we move
- Visually-guided gestural interaction, animation, and face recognition
- Stereoscopic vision for 3D modeling
- edu/vismod/demos/smartdesk/


Kjeldsen's PhD thesis
- Application: gesture recognition as a system interface to augment that of the mouse
- Menu selection, window move, and resize
- Input: 200x300 image
- Calibration of user's hand

Kjeldsen's PhD thesis
- Image split into HSI channels (I = Intensity, Lightness, Value)
- Segmentation with largest connected component
- Eroded to get rid of edges
- Gray-scale values sent to learning system

Kjeldsen's PhD thesis
Learning System: Backprop network
- 1014 input nodes (one for each pixel)
- 20 hidden nodes
- 1 output node for each classification
- 40 images of each pose
- Results: correct classification 90-96% of the time on images
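As a rough illustration of the network shape described on this slide, the following is a minimal sketch of a forward pass through a 1014-20-N network. The sigmoid activation, the random untrained weights, the number of poses (`n_poses = 4`), and all function names are assumptions made for illustration; the slide does not state them, and backpropagation training is omitted.

```python
import math
import random

def make_layer(n_in, n_out, rng):
    """Random weights for one layer; the last entry of each row is the bias."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(n_in + 1)] for _ in range(n_out)]

def forward_layer(layer, inputs):
    """Weighted sum plus bias, followed by a sigmoid (assumed activation)."""
    outputs = []
    for weights in layer:
        z = weights[-1] + sum(w * x for w, x in zip(weights, inputs))
        outputs.append(1.0 / (1.0 + math.exp(-z)))
    return outputs

def classify(hidden, output, pixels):
    """Return the index of the strongest output node and all output scores."""
    scores = forward_layer(output, forward_layer(hidden, pixels))
    return scores.index(max(scores)), scores

rng = random.Random(0)
n_pixels, n_hidden = 1014, 20   # sizes taken from the slide
n_poses = 4                     # hypothetical number of hand poses
hidden = make_layer(n_pixels, n_hidden, rng)
output = make_layer(n_hidden, n_poses, rng)
pixels = [rng.random() for _ in range(n_pixels)]  # stand-in gray-scale input
pose, scores = classify(hidden, output, pixels)
```

With untrained weights the predicted pose is meaningless; the sketch only shows how one gray-scale value per input node flows to one score per classification.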

Implementation Overview
System:
- 1.33 GHz AMD Athlon
- OpenCV and IPL libraries (from Intel)
Input:
- Two 640x480 images, saturation channel
- Maximum hand size in the x and y directions, in pixels
Output:
- Rough estimate of movement
- Refined estimate of movement
- Number of fingers being held up
- Rough orientation

Implementation Overview
Chronological order of system:
1) Saturation channel extraction
2) Threshold saturation channel
3) Calculate Center of Mass (CoM)
4) Reduce noise
5) Remove arm from hand
6) Calculate refined-CoM
7) Calculate orientation
8) Count the number of fingers

Implementation Analysis
1. Saturation channel extraction:
- Digital camera, saved as JPGs
- JPGs converted to 640x480 PPMs
- Saturation channels extracted into PGMs
[Figure panels: Original Image, Hue, Lightness, Saturation]
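The saturation extraction step can be sketched as follows, assuming the image is already decoded into rows of 8-bit (r, g, b) tuples. The sketch uses the HLS saturation from Python's standard `colorsys` module; the slides do not say which saturation formula or file-handling code the original system used, and `saturation_channel` is a hypothetical name.

```python
import colorsys

def saturation_channel(rgb_image):
    """Return an 8-bit saturation plane for rows of (r, g, b) pixels (0-255)."""
    plane = []
    for row in rgb_image:
        out_row = []
        for r, g, b in row:
            # colorsys works on floats in [0, 1] and returns (hue, lightness, saturation)
            _h, _l, s = colorsys.rgb_to_hls(r / 255.0, g / 255.0, b / 255.0)
            out_row.append(round(s * 255))
        plane.append(out_row)
    return plane

image = [
    [(255, 0, 0), (128, 128, 128)],   # pure red, mid gray
    [(0, 0, 0), (200, 150, 100)],     # black, a skin-like tone
]
sat = saturation_channel(image)
```

Pure red maps to full saturation and gray or black to zero, which is why a saturated hand can stand out against a neutral background.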

Implementation Analysis
2. Threshold saturation channel:
a) Threshold value: 50 (values range from 0 to 255)
PixelValue = PixelValue ≥ 50 ? 128 : 0
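The thresholding rule on this slide translates directly into code. This is a minimal sketch on a nested-list image; the function name and the list representation are assumptions, while the cutoff of 50 and the foreground label of 128 come from the slide.

```python
def threshold(plane, cutoff=50, foreground=128):
    """Map every pixel >= cutoff to the foreground label and all others to 0."""
    return [[foreground if value >= cutoff else 0 for value in row] for row in plane]

saturation = [
    [0, 49, 50, 255],
    [12, 200, 51, 3],
]
binary = threshold(saturation)
```

Labeling foreground as 128 rather than 255 leaves room for the later steps to relabel the same pixels to 192, 254, and 255 as they are processed.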

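Implementation Analysis
3. Calculate Center of Mass (CoM):
a) 0th moment of an image: $M_{00} = \sum_x \sum_y I(x, y)$
b) 1st moments for x and y of an image, respectively: $M_{10} = \sum_x \sum_y x \, I(x, y)$ and $M_{01} = \sum_x \sum_y y \, I(x, y)$
c) Center of Mass (location of centroid): $(x_c, y_c)$, where $x_c = M_{10} / M_{00}$ and $y_c = M_{01} / M_{00}$
For the thresholded image, taking $I(x, y) = 1$ at each 128-valued pixel and 0 elsewhere, this reduces to:
a) Count the number of 128-valued pixels
b) Sum the x-values and y-values of those pixels
c) Divide each sum by the number of pixels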
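The count, sum, and divide procedure for the center of mass can be sketched as below. The nested-list image and the name `center_of_mass` are assumptions; returning `None` for an image with no foreground is a choice made here, not something the slides specify.

```python
def center_of_mass(img, label=128):
    """Return (x, y) of the centroid of all pixels equal to `label`, or None."""
    m00 = m10 = m01 = 0
    for y, row in enumerate(img):
        for x, value in enumerate(row):
            if value == label:
                m00 += 1      # 0th moment: pixel count
                m10 += x      # 1st moment in x
                m01 += y      # 1st moment in y
    if m00 == 0:
        return None           # no foreground pixels, centroid undefined
    return m10 / m00, m01 / m00

img = [
    [0, 128, 128, 0],
    [0, 128, 128, 0],
    [0, 0, 0, 0],
]
com = center_of_mass(img)   # (1.5, 0.5)
```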

Implementation Analysis
4. Reduce noise:
- FloodFill at the computed CoM (128-valued pixels become 192)
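The noise-reduction idea is that only the blob connected to the CoM is relabeled, so isolated specks keep the old value and can be ignored later. The sketch below is a plain breadth-first flood fill with 4-connectivity; the connectivity, the integer seed, and the function name are assumptions, and it is not the library routine the original system called.

```python
from collections import deque

def flood_fill(img, seed, old=128, new=192):
    """Relabel, in place, the 4-connected region of `old` pixels containing `seed`.

    Returns the number of pixels relabeled (0 if the seed is not an `old` pixel).
    """
    height, width = len(img), len(img[0])
    sx, sy = seed
    if not (0 <= sx < width and 0 <= sy < height) or img[sy][sx] != old:
        return 0
    img[sy][sx] = new
    queue = deque([(sx, sy)])
    filled = 1
    while queue:
        x, y = queue.popleft()
        for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
            if 0 <= nx < width and 0 <= ny < height and img[ny][nx] == old:
                img[ny][nx] = new
                filled += 1
                queue.append((nx, ny))
    return filled

img = [
    [128, 0, 0, 0, 0],       # isolated speck at (0, 0)
    [0, 0, 128, 128, 0],
    [0, 0, 128, 128, 0],
    [0, 0, 0, 0, 128],       # speck touching the blob only diagonally
]
count = flood_fill(img, (2, 1))
```

A real CoM has fractional coordinates and would need rounding to a pixel first; if it lands on a background pixel, this sketch fills nothing, a case the slides do not address.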

Implementation Analysis
5. Remove arm from hand:
a) Find top left of bounding box
b) Apply border for bounding box from calibration measure
c) FloodFill, 192 to 254
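One way to read the arm-removal step is sketched below: take the top-left corner of the 192-valued blob, extend a box of the calibrated hand size from that corner, and flood fill only inside the box so that pixels outside it (the arm) stay at 192. The seed choice, the 4-connectivity, and the names `remove_arm`, `hand_w`, and `hand_h` are assumptions for illustration.

```python
from collections import deque

def remove_arm(img, hand_w, hand_h, old=192, new=254):
    """Relabel blob pixels inside a hand-sized box; return the box's top-left corner."""
    height, width = len(img), len(img[0])
    coords = [(x, y) for y, row in enumerate(img)
              for x, value in enumerate(row) if value == old]
    if not coords:
        return None
    left = min(x for x, _ in coords)
    top = min(y for _, y in coords)
    right = min(left + hand_w, width)     # exclusive box limits, clipped to the image
    bottom = min(top + hand_h, height)
    # Assumed seed: first blob pixel (row-major order) that lies inside the box.
    seed = next(((x, y) for x, y in coords if x < right and y < bottom), None)
    if seed is None:
        return left, top
    img[seed[1]][seed[0]] = new
    queue = deque([seed])
    while queue:
        x, y = queue.popleft()
        for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
            if left <= nx < right and top <= ny < bottom and img[ny][nx] == old:
                img[ny][nx] = new
                queue.append((nx, ny))
    return left, top

# A vertical blob: hand in the top three rows, arm continuing below.
img = [[0, 192, 192, 0] for _ in range(6)]
corner = remove_arm(img, hand_w=3, hand_h=3)
```

The sketch assumes the hand is at the top of the blob with the arm leaving through the bottom of the box; other hand poses would need a different box placement.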

Implementation Analysis
6. Calculate refined-CoM (rCoM):
a) Threshold, 254 to 255
b) Compute CoM as before

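Implementation Analysis
7. Calculate orientation:
a) 0th moment of an image: $M_{00} = \sum_x \sum_y I(x, y)$
b) 1st moments for x and y of an image, respectively: $M_{10} = \sum_x \sum_y x \, I(x, y)$ and $M_{01} = \sum_x \sum_y y \, I(x, y)$
c) 2nd moments for x and y of an image, respectively: $M_{20} = \sum_x \sum_y x^2 \, I(x, y)$ and $M_{02} = \sum_x \sum_y y^2 \, I(x, y)$
d) Orientation of image major axis: $\theta = \frac{1}{2} \arctan\left( \frac{2 \left( M_{11}/M_{00} - x_c y_c \right)}{\left( M_{20}/M_{00} - x_c^2 \right) - \left( M_{02}/M_{00} - y_c^2 \right)} \right)$, where $M_{11} = \sum_x \sum_y x y \, I(x, y)$ is the mixed second moment, $x_c = M_{10} / M_{00}$, and $y_c = M_{01} / M_{00}$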
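The orientation of a blob's major axis is commonly computed from its second-order central moments as half the arctangent of twice the mixed central moment over the difference of the x and y central moments. The sketch below applies that standard formula to the 255-valued pixels; the function name is hypothetical, and `math.atan2` is used in place of a plain arctangent so that a zero denominator does not cause a division error.

```python
import math

def orientation(img, label=255):
    """Return the major-axis angle in radians (image coordinates), or None."""
    m00 = m10 = m01 = m11 = m20 = m02 = 0
    for y, row in enumerate(img):
        for x, value in enumerate(row):
            if value == label:
                m00 += 1
                m10 += x
                m01 += y
                m11 += x * y
                m20 += x * x
                m02 += y * y
    if m00 == 0:
        return None
    xc, yc = m10 / m00, m01 / m00
    a = m20 / m00 - xc * xc          # spread along x
    b = 2 * (m11 / m00 - xc * yc)    # twice the mixed central moment
    c = m02 / m00 - yc * yc          # spread along y
    return 0.5 * math.atan2(b, a - c)

horizontal = [[255, 255, 255, 255]]
diagonal = [
    [255, 0, 0],
    [0, 255, 0],
    [0, 0, 255],
]
angle_h = orientation(horizontal)   # 0.0
angle_d = orientation(diagonal)     # pi / 4
```

Because image rows grow downward, a positive angle here rotates from the x-axis toward increasing row index.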

Implementation Analysis
8. Count the number of fingers (via FingerCountGivenX)
Function inputs:
a) Pointer to image data
b) rCoM
c) Radius = 0.17*HandSizeX + 0.17*HandSizeY
d) Starting location (x or y, call appropriate function)
e) Ending location (x or y, call appropriate function)
f) White pixel counter
g) Black pixel counter
h) Finger counter

Implementation Analysis
8. Count the number of fingers:
- 2 similar functions: start/end location given in x or in y
- After all previous steps, the finger-finding function sweeps out an arc, counting the number of white and black pixels as it progresses
- A finger in the current system is defined as any run of 10 or more white pixels separated by 3 or more black pixels (salt/pepper tolerance); 1 is subtracted from the count for the hand itself
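The run-counting rule can be sketched in two parts: sampling pixels along an arc around the rCoM, and counting white runs in the sampled sequence. The sampling scheme below (one pixel per x column on the upper half of the circle) is only a guess at what a "given X" function might do, and both function names are hypothetical; the thresholds of 10 white and 3 black pixels, the 0.17 radius factors, and the subtraction of 1 come from the slides.

```python
import math

def arc_samples_given_x(img, cx, cy, radius, x_start, x_end, white=255):
    """Sample the upper half of a circle around (cx, cy), one pixel per x column."""
    samples = []
    for x in range(x_start, x_end + 1):
        dx = x - cx
        if abs(dx) > radius:
            continue                       # column lies outside the circle
        y = round(cy - math.sqrt(radius * radius - dx * dx))
        if 0 <= y < len(img) and 0 <= x < len(img[0]):
            samples.append(img[y][x] == white)
    return samples

def count_fingers(samples, min_white=10, min_black=3):
    """Count white runs of >= min_white, split by gaps of >= min_black, minus 1."""
    runs = 0
    white = 0   # white pixels in the current run (short black gaps are tolerated)
    black = 0   # length of the current black gap
    for is_white in samples:
        if is_white:
            white += 1
            black = 0
        else:
            black += 1
            if black >= min_black:         # gap is long enough to end the run
                if white >= min_white:
                    runs += 1
                white = 0
    if white >= min_white:                 # run still open at the end of the arc
        runs += 1
    return max(runs - 1, 0)                # minus 1 for the hand itself

W, B = True, False
samples = [W] * 12 + [B] * 5 + [W] * 11 + [B] * 1 + [W] * 3 + [B] * 4 + [W] * 10
fingers = count_fingers(samples)           # 3 runs - 1 = 2

hand_size_x, hand_size_y = 100, 120        # hypothetical calibration values
radius = 0.17 * hand_size_x + 0.17 * hand_size_y
```

In the synthetic sequence, the single black pixel inside the second run is treated as noise, so the 11 and 3 white pixels around it count as one run.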

Implementation Analysis
8. Count the number of fingers:
[Figure: illustration of noise tolerance]

Implementation Analysis
[Figures: system input and system output]

Implementation Analysis
System Runtime:
- Real time requires 30 fps
- Current time: 16.5 ms for one frame (without reading or writing)
- Current processing capability on 1.33 GHz Athlon: 60 fps
Process steps, time (ms), Athlon MP 1500 (1.33 GHz) vs. Pentium 850 MHz:
1) Reading Image ??
2) Reading Image
3) Threshold
4) Center of Mass
5) Flood Fill
6) Bounding Box Top-Left
7) Arm Removal
8) Refined CoM 419
9) Finger Counting .51
10) Write Image
Time w/o R&W
Time w/o Write
Total Time
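The relationship between the per-frame time and the frame rate quoted on this slide is simple arithmetic, sketched here. The variable names are illustrative; the 16.5 ms and 30 fps figures come from the slide.

```python
frame_ms = 16.5                      # processing time per frame, without read/write
real_time_fps = 30.0                 # frame rate required for real time

fps = 1000.0 / frame_ms              # frames per second at 16.5 ms per frame
budget_ms = 1000.0 / real_time_fps   # time available per frame at 30 fps
meets_real_time = fps >= real_time_fps

print(f"{fps:.1f} fps, budget {budget_ms:.1f} ms per frame")
```

This gives roughly 60 fps against a budget of about 33 ms per frame, so the processing steps alone leave around half the budget for reading and writing images.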

Future Directions
- Optimization
- Orientation for hand registration
- New finger counting approach
- Learning system
For additional information, please visit