Computer Vision Group University of California Berkeley Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA Greg Mori and Jitendra Malik.

Presentation transcript:

Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA
Greg Mori and Jitendra Malik
Computer Vision Group, University of California, Berkeley

Outline
• What is a CAPTCHA?
• Why are they good datasets for object recognition?
• Our recognition framework
• Results on EZ-Gimpy and Gimpy

What is a CAPTCHA?
• CAPTCHA: Completely Automated Public Turing test to Tell Computers and Humans Apart (Blum et al., CMU)
  – Generates and grades tests that most humans can pass but current computer programs can’t pass
• Different varieties
  – Word based (Gimpy, EZ-Gimpy)
  – Image based (Pix)
  – Sound based (Bongo)

EZ-Gimpy
• Word-based CAPTCHA
  – Task is to read a single word obscured in clutter
• Currently in use at Yahoo! and Ticketmaster
  – Keeps ‘bots’ from obtaining free accounts and buying blocks of tickets

CAPTCHAs as Object Recognition Datasets
• Pros:
  – Large number of objects (600 words)
  – Practically infinite set of test images
  – Quantitative results
  – “Adversarial”, intended to be difficult for computers
• Cons:
  – No variation due to pose, lighting
  – Synthetic objects

Object Recognition Framework
• Match objects using shape cues
  – Represented as a point set extracted using Canny edge detection
• A two-stage approach
  – Fast pruning: quick tests to construct a shortlist of candidate objects, since the database of known objects could be large
  – Detailed matching: perform computationally expensive comparisons on only the few shapes in the shortlist
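
The two-stage idea on this slide can be sketched in a few lines of Python. This is only an illustration, not the authors' code: the function name `two_stage_match`, the two cost callbacks, and the `shortlist_size` parameter are all hypothetical, and the slide does not say how large the shortlist is.

```python
from typing import Any, Callable, Dict, List, Tuple


def two_stage_match(
    query: Any,
    database: Dict[str, Any],
    cheap_cost: Callable[[Any, Any], float],
    detailed_cost: Callable[[Any, Any], float],
    shortlist_size: int = 3,
) -> Tuple[str, List[str]]:
    """Return the best-matching database key and the shortlist it came from.

    Both cost functions are assumed to return lower values for better matches.
    """
    # Stage 1 (fast pruning): rank every known object with the cheap test
    # and keep only the most promising few.
    ranked = sorted(database, key=lambda name: cheap_cost(query, database[name]))
    shortlist = ranked[:shortlist_size]

    # Stage 2 (detailed matching): run the expensive comparison only on
    # the shortlist.
    best = min(shortlist, key=lambda name: detailed_cost(query, database[name]))
    return best, shortlist


# Toy usage: "shapes" are lists of numbers, the cheap test compares lengths,
# and the detailed test compares element by element.
toy_db = {"a": [1, 2, 3], "b": [1, 2, 4], "c": [9, 9, 9, 9, 9]}
best_name, candidates = two_stage_match(
    [1, 2, 4],
    toy_db,
    cheap_cost=lambda q, s: abs(len(q) - len(s)),
    detailed_cost=lambda q, s: sum(abs(x - y) for x, y in zip(q, s)),
    shortlist_size=2,
)
```

The expensive comparison runs `shortlist_size` times instead of once per database entry, which is the point of the pruning stage.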

Shape contexts (Belongie et al. 2001)
• Count the number of points inside each bin, e.g. Count = 8, …, Count = 7
• Compact representation of distribution of points relative to each point
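
As a rough sketch of the bin counting described on this slide, the Python below builds a log-polar histogram around one reference point. The slide does not give the bin layout, so the 5 radial by 12 angular bins, the radius limits `r_min` and `r_max`, and the normalisation by mean pairwise distance are assumptions chosen for illustration, as are the function names.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def mean_pairwise_distance(points: Sequence[Point]) -> float:
    """Average distance over all point pairs (assumes at least two points)."""
    n = len(points)
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            total += math.hypot(points[i][0] - points[j][0],
                                points[i][1] - points[j][1])
    return total / (n * (n - 1) / 2)


def shape_context(
    points: Sequence[Point],
    index: int,
    n_radial: int = 5,
    n_angular: int = 12,
    r_min: float = 0.125,
    r_max: float = 2.0,
) -> List[int]:
    """Count the other points falling in each log-polar bin around points[index]."""
    scale = mean_pairwise_distance(points)  # assumed scale normalisation
    px, py = points[index]
    hist = [0] * (n_radial * n_angular)
    for i, (x, y) in enumerate(points):
        if i == index:
            continue
        dx, dy = x - px, y - py
        r = math.hypot(dx, dy) / scale
        if r == 0.0 or r > r_max:
            continue  # coincident or too far away: not counted
        if r <= r_min:
            r_bin = 0
        else:
            # Radial bins are uniform in log(r) between r_min and r_max.
            frac = math.log(r / r_min) / math.log(r_max / r_min)
            r_bin = min(n_radial - 1, int(n_radial * frac))
        theta = math.atan2(dy, dx) % (2 * math.pi)
        a_bin = int(theta / (2 * math.pi) * n_angular) % n_angular
        hist[r_bin * n_angular + a_bin] += 1
    return hist


cross = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1)]
hist_centre = shape_context(cross, 0)
shifted = [(x + 10, y + 5) for x, y in cross]
hist_shifted = shape_context(shifted, 0)
```

Because only offsets from the reference point are binned, translating the whole point set leaves the histogram unchanged.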

Features: Generalized Shape Contexts
• Can put more than just point counts in bins
  – Oriented energy
  – Colour info
  – Optical flow
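
One way to read "more than just point counts" is that each bin accumulates a feature vector attached to the points that fall into it. The sketch below makes that assumption explicit. The per-point `features` list, the externally supplied `scale`, and the bin layout are all hypothetical, and the slide does not say how the authors combine features within a bin (summation is used here).

```python
import math
from typing import List, Optional, Sequence, Tuple

Point = Tuple[float, float]


def bin_index(dx: float, dy: float, scale: float, n_radial: int = 5,
              n_angular: int = 12, r_min: float = 0.125,
              r_max: float = 2.0) -> Optional[int]:
    """Log-polar bin for an offset, or None if it falls outside the descriptor."""
    r = math.hypot(dx, dy) / scale
    if r == 0.0 or r > r_max:
        return None
    if r <= r_min:
        r_bin = 0
    else:
        frac = math.log(r / r_min) / math.log(r_max / r_min)
        r_bin = min(n_radial - 1, int(n_radial * frac))
    theta = math.atan2(dy, dx) % (2 * math.pi)
    a_bin = int(theta / (2 * math.pi) * n_angular) % n_angular
    return r_bin * n_angular + a_bin


def generalized_shape_context(
    points: Sequence[Point],
    features: Sequence[Sequence[float]],
    index: int,
    scale: float = 1.0,
    n_radial: int = 5,
    n_angular: int = 12,
) -> List[List[float]]:
    """Sum each point's feature vector into the bin it falls in."""
    dim = len(features[0])
    bins = [[0.0] * dim for _ in range(n_radial * n_angular)]
    px, py = points[index]
    for i, (x, y) in enumerate(points):
        if i == index:
            continue
        b = bin_index(x - px, y - py, scale, n_radial, n_angular)
        if b is None:
            continue
        for d in range(dim):
            bins[b][d] += features[i][d]
    return bins


# With the constant feature [1.0, 0.0] on every point, the first component
# of each bin is the plain point count.
gsc_points = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1)]
gsc = generalized_shape_context(gsc_points, [[1.0, 0.0]] * 5, index=0)
```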

Fast Pruning: Representative Shape Contexts
• Pick k points in the image at random
  – Compare to all shape contexts for all known letters
  – Vote for closely matching letters
• Keep all letters with scores under threshold
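
A minimal sketch of this pruning step follows. It assumes details the slide leaves open: descriptors are compared with a chi-squared histogram distance, a letter's score is the mean of its best-match distances over the k sampled points, and the names `chi2` and `representative_pruning` are hypothetical.

```python
import random
from typing import Dict, List, Sequence

Histogram = Sequence[float]


def chi2(a: Histogram, b: Histogram) -> float:
    """Chi-squared distance between two histograms after normalising each to sum 1."""
    sa, sb = float(sum(a)) or 1.0, float(sum(b)) or 1.0
    total = 0.0
    for x, y in zip(a, b):
        x, y = x / sa, y / sb
        if x + y > 0:
            total += (x - y) ** 2 / (x + y)
    return 0.5 * total


def representative_pruning(
    query_contexts: List[Histogram],
    letter_contexts: Dict[str, List[Histogram]],
    k: int,
    threshold: float,
    rng: random.Random,
) -> Dict[str, float]:
    """Score every known letter against k random query descriptors.

    Returns the letters whose score (lower is better) is under the threshold.
    """
    sample = rng.sample(query_contexts, min(k, len(query_contexts)))
    shortlist = {}
    for letter, contexts in letter_contexts.items():
        # For each sampled point, take its closest descriptor on this letter.
        best = [min(chi2(q, c) for c in contexts) for q in sample]
        score = sum(best) / len(best)
        if score < threshold:
            shortlist[letter] = score
    return shortlist


known = {"a": [[1, 0, 0], [0, 1, 0]], "b": [[0, 0, 1]]}
query = [[1, 0, 0], [0, 1, 0]]
kept = representative_pruning(query, known, k=2, threshold=0.5,
                              rng=random.Random(0))
```

Only k descriptors are compared rather than every point in the image, which is what makes this stage cheap.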

Two Instances
• Algorithm A
  – Bottom-up, parts-based approach
  – Find letters first, then form words
• Algorithm B
  – Top-down, holistic approach
  – Find entire words immediately

Algorithm A
• Look for letters
  – Representative Shape Contexts
• Find pairs of letters that are “consistent”
  – Letters nearby in space
• Search for valid words
• Give scores to the words
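
The steps on this slide can be sketched as a small dynamic program over letter detections. Everything specific here is an assumption made for illustration: a detection is reduced to a letter, a horizontal position, and a cost; two detections are "consistent" when the second lies to the right of the first within `max_gap`; and a word's score is its mean letter cost. The slide does not specify the authors' consistency test or scoring.

```python
from typing import Dict, Iterable, List, NamedTuple, Tuple


class Detection(NamedTuple):
    letter: str
    x: float      # horizontal position of the detected letter
    cost: float   # matching cost from the pruning stage (lower is better)


def score_words(
    detections: List[Detection],
    dictionary: Iterable[str],
    max_gap: float,
) -> List[Tuple[str, float]]:
    """Return (word, mean letter cost) for each spellable word, best first."""
    results: Dict[str, float] = {}
    for word in dictionary:
        if not word:
            continue
        # best[i]: lowest total cost of spelling the prefix so far,
        # ending at detection i.
        best = {i: d.cost for i, d in enumerate(detections)
                if d.letter == word[0]}
        for ch in word[1:]:
            nxt: Dict[int, float] = {}
            for j, d in enumerate(detections):
                if d.letter != ch:
                    continue
                # Consistent predecessors: to the left and close enough.
                preds = [c for i, c in best.items()
                         if 0 < d.x - detections[i].x <= max_gap]
                if preds:
                    nxt[j] = min(preds) + d.cost
            best = nxt
            if not best:
                break
        if best:
            results[word] = min(best.values()) / len(word)
    return sorted(results.items(), key=lambda kv: kv[1])


found = [
    Detection("j", 10, 0.25), Detection("o", 20, 0.5),
    Detection("i", 30, 0.25), Detection("n", 40, 0.25),
    Detection("a", 20, 0.5),
]
ranked_words = score_words(found, ["join", "jar"], max_gap=15)
```

In the toy data "jar" is rejected because no "r" was detected, while "join" can be spelled left to right from consistent neighbours.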

EZ-Gimpy Results with Algorithm A
• 158 of 191 images correctly identified: 83%
  – Running time: ~10 sec. per image (MATLAB, 1 GHz P3)
• Example words: horse, smile, canvas, spade, join, here

Gimpy
• Multiple words; task is to find 3 words in the image
• Clutter is other objects, not texture

Algorithm B: Letters are not enough
• Hard to distinguish single letters with so much clutter
• Find words instead of letters
  – Use long-range info over entire word
  – Stretch shape contexts into ellipses
• Search problem becomes huge
  – # of words (600) vs. # of letters (26)
  – Prune set of words using opening/closing bigrams
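
The bigram pruning in the last bullet might look like the sketch below. It assumes the sets of plausible opening and closing bigrams have already been produced by some earlier detection step, which the slide does not describe, and it requires a word to match both ends, which is also an assumption. The function name `prune_by_bigrams` is hypothetical.

```python
from typing import Iterable, List, Set


def prune_by_bigrams(
    words: Iterable[str],
    opening: Set[str],
    closing: Set[str],
) -> List[str]:
    """Keep words whose first two and last two letters are both plausible."""
    return [
        w for w in words
        if len(w) >= 2 and w[:2] in opening and w[-2:] in closing
    ]


# Toy usage with a few dictionary words and made-up candidate bigrams.
survivors = prune_by_bigrams(
    ["door", "dry", "farm", "card"],
    opening={"do", "dr"},
    closing={"or", "rm"},
)
```

Only the surviving words then need the expensive whole-word comparison, which keeps the 600-word search manageable.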

Results with Algorithm B

# Correct words    % tests (of 24)
1 or more          92%
2 or more          75%
3                  33%

EZ-Gimpy           92%

Example words: dry, clear, medical, door, farm, important, card, arch, plate

Conclusion
• CAPTCHAs useful as datasets for studying object recognition
• Two-stage approach
  – Fast pruning
  – Detailed matching
• 92% success rate on EZ-Gimpy
  – OCR (+hacks): 10-50%
• 33% success rate on Gimpy