Michele Merler Jacquilene Jacob.  Applications online are inherently insecure  Growing rate of hackers  Confidentiality of online systems should be.

Slides:



Advertisements
Similar presentations
Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.
Advertisements

Kapitel 14 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image categorization Bag-of-features.
Object Recognition from Local Scale-Invariant Features David G. Lowe Presented by Ashley L. Kapron.
Human Detection Phanindra Varma. Detection -- Overview  Human detection in static images is based on the HOG (Histogram of Oriented Gradients) encoding.
Thomas Berg and Peter Belhumeur
Random Forest Predrag Radenković 3237/10
Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.
Three things everyone should know to improve object retrieval
Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,
Content-Based Image Retrieval
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Addressing the Medical Image Annotation Task using visual words representation Uri Avni, Tel Aviv University, Israel Hayit GreenspanTel Aviv University,
A Low-cost Attack on a Microsoft CAPTCHA Yan Qiang,
Esmail Hadi Houssein ID/  „Motivation  „Problem Overview  „License plate segmentation  „Character segmentation  „Character Recognition.
Face Recognition & Biometric Systems, 2005/2006 Face recognition process.
Breaking an Animated CAPTCHA Scheme
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Distinguishing Photographic Images and Photorealistic Computer Graphics Using Visual Vocabulary on Local Image Edges Rong Zhang,Rand-Ding Wang, and Tian-Tsong.
Smart Traveller with Visual Translator. What is Smart Traveller? Mobile Device which is convenience for a traveller to carry Mobile Device which is convenience.
California Car License Plate Recognition System ZhengHui Hu Advisor: Dr. Kang.
Handwritten Thai Character Recognition Using Fourier Descriptors and Robust C-Prototype Olarik Surinta Supot Nitsuwat.
© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,
Combining Supervised and Unsupervised Learning for Zero-Day Malware Detection © 2013 Narus, Inc. Prakash Comar 1 Lei Liu 1 Sabyasachi (Saby) Saha 2 Pang-Ning.
“Hello! My name is... Buffy” Automatic Naming of Characters in TV Video Mark Everingham, Josef Sivic and Andrew Zisserman Arun Shyam.
1 Recognition of Multi-Fonts Character in Early-Modern Printed Books Chisato Ishikawa(1), Naomi Ashida(1)*, Yurie Enomoto(1), Masami Takata(1), Tsukasa.
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
Kaihua Zhang Lei Zhang (PolyU, Hong Kong) Ming-Hsuan Yang (UC Merced, California, U.S.A. ) Real-Time Compressive Tracking.
Avoiding Segmentation in Multi-digit Numeral String Recognition by Combining Single and Two-digit Classifiers Trained without Negative Examples Dan Ciresan.
COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Presented By: Abirami Poonkundran Authors: Jeff Yan, Ahmad El Ahmad.
Handwritten Hindi Numerals Recognition Kritika Singh Akarshan Sarkar Mentor- Prof. Amitabha Mukerjee.
CAPTCHA Processing CPRE 583 Fall 2010 Project CAPTCHA Processing Responsibilities Brian Washburn – Loading Image into RAM and Preprocessing and related.
PROJECT PROPOSAL DIGITAL IMAGE PROCESSING TITLE:- Automatic Machine Written Document Reader Project Partners:- Manohar Kuse(Y08UC073) Sunil Prasad Jaiswal(Y08UC124)
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Automated Solar Cavity Detection
USE RECIPE INGREDIENTS TO PREDICT THE CATEGORY OF CUISINE Group 7 – MEI, Yan & HUANG, Chenyu.
Counting How Many Words You Read
Hough Transform Procedure to find a shape in an image Shape can be described in parametric form Shapes in image correspond to a family of parametric solutions.
Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.
Demosaicking for Multispectral Filter Array (MSFA)
Color-Attributes-Related Image Retrieval Student: Kylie Gorman Mentor: Yang Zhang.
By Pushpita Biswas Under the guidance of Prof. S.Mukhopadhyay and Prof. P.K.Biswas.
A New Method for Crater Detection Heather Dunlop November 2, 2006.
Vehicle Detection in Aerial Surveillance Using Dynamic Bayesian Networks Hsu-Yung Cheng, Member, IEEE, Chih-Chia Weng, and Yi-Ying Chen IEEE TRANSACTIONS.
A Recognition Method of Restricted Hand Shapes in Still Image and Moving Image Hand Shapes in Still Image and Moving Image as a Man-Machine Interface Speaker.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #23.
Using Word Based Features for Word Clustering The Thirteenth Conference on Language Engineering 11-12, December 2013 Department of Electronics and Communications,
Classifying Covert Photographs CVPR 2012 POSTER. Outline  Introduction  Combine Image Features and Attributes  Experiment  Conclusion.
Goggle Gist on the Google Phone A Content-based image retrieval system for the Google phone Manu Viswanathan Chin-Kai Chang Ji Hyun Moon.
Preliminary Transformations Presented By: -Mona Saudagar Under Guidance of: - Prof. S. V. Jain Multi Oriented Text Recognition In Digital Images.
Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.
Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.
Optical Character Recognition
Another Example: Circle Detection
Efficient Image Classification on Vertically Decomposed Data
ABSTRACT FACE RECOGNITION RESULTS
Nonparametric Semantic Segmentation
Recognition using Nearest Neighbor (or kNN)
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Efficient Image Classification on Vertically Decomposed Data
Brief Review of Recognition + Context
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
A Novel Smoke Detection Method Using Support Vector Machine
Automatic Handwriting Generation
Presentation transcript:

Michele Merler Jacquilene Jacob

 Applications online are inherently insecure  Growing rate of hackers  Confidentiality of online systems should be guaranteed by Captchas  Image based Captchas propose to overcome issues of text based ones (user friendlyness, robustness to attacks) BUT… Are they really secure? Objective Verify effective security offered by image based Captchas

VidoopCaptcha.com Target System Verification Solution Challenge is combination of images from various categories User asked to report letters corresponding to requested categories

Process Flow Training Data Feature Extraction Train Classifier Test Data Feature Extraction Training data Feature extraction Train using kNN Results Preprocessing Character Recognizer Image Category Recognizer

Process Flow Training Data Feature Extraction Train Classifier Test Data Feature Extraction Training data Feature extraction Train using kNN Results Preprocessing Character Recognizer Image Category Recognizer

TRAINING DATA Images downloaded from Flickr with a Perl script ~500 images per category Data Acquisition TEST DATA 200 challenges downloaded from VidoopCaptcha with a Perl script 26 categories Manual ground truth annotation

Process Flow Training Data Feature Extraction Train Classifier Test Data Feature Extraction Training data Feature extraction Train using kNN Results Preprocessing Image Splitting Character region extraction Character Recognition Character Recognizer Image Category Recognizer

Test Data-Preprocessing Image Splitting Character region extraction Character Recognition LoG based edge extraction Horizontal and vertical dominant lines Generalized Hough transform Evaluate consistency among subimages Square (side = sqrt(2)*radius) character regions rescaled to 27x27 pixels Conversion to grayscale and binarization 1-NN classifier trained on 20 popular fonts images generated with GD library

Process Flow Training Data Feature Extraction Train Classifier Test Data Feature Extraction Training data Feature extraction Train using kNN Results Preprocessing Character Recognizer Image Category Recognizer

Character Training Data Character Feature Extraction Train using kNN classifier Character Classification Training data Feature extraction Train using 1-NN Character Recognizer 64 images generated with GD library for each upper case character, using 20 common fonts Simple binary vector with all pixels in image 1-NN classifier

Process Flow Training Data Feature Extraction Train Classifier Test Data Feature Extraction Training data Feature extraction Train using kNN Results Preprocessing Character Recognizer Image Category Recognizer

Features from all 26 categories  Edge Histograms (6x8 regions)  Color Moments (RGB, 3x3 regions)  Color Histograms (32+32 bins in CbCr)  GIST features (314 dims. vectors) Feature Extraction For each category, SVM classifier trained on all positive data, negative data randomly taken from other categories #positive data = #negative data

Results 200 test challenges Image split and character regions detection accuracy: 100% Character recognition accuracy: 96%

Average processing time per challenge: 12 sec. Best breaking rate: 3% We can break 9 image Captchas per hour (216/day) Results 200 test challenges Single image Pair images Triplet images # recognized images

Average processing time per challenge: 12 sec. Best breaking rate: 3% We can break 9 image Captchas per hour (216/day) Results 200 test challenges # passed challenges

Conclusions Breaking Image based Captchas is possible VidoopCaptcha is not 100% secure Future directions: - Try other features (SIFT + codebook) - Obtain cleaner training data (performances suggest poor training data) - Improve speed and efficiency using more powerful programming languages - Test online version of Captcha breaker

Questions?