Combining geometry and domain knowledge to interpret hand-drawn diagrams As Presented By: Andrew Campbell Christopher Dahlberg.

Slides:



Advertisements
Similar presentations
Active Appearance Models
Advertisements

Networked Digital Whiteboard with Handwritten-Symbol Interpreter and Dynamic-Display-Object Creator Atsuhide Kobashi Henry M. Gunn High School Palo Alto,
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
電腦視覺 Computer and Robot Vision I
Inpainting Assigment – Tips and Hints Outline how to design a good test plan selection of dimensions to test along selection of values for each dimension.
Hybrid hatching-shading method for depicting of relief in modern maps Marjan Sikora Faculty of Electrical Engineering, Mechanical Engineering and Naval.
Label Placement and graph drawing Imo Lieberwerth.
Teaching Courses in Scientific Computing 30 September 2010 Roger Bielefeld Director, Advanced Research Computing.
Performance Evaluation Measures for Face Detection Algorithms Prag Sharma, Richard B. Reilly DSP Research Group, Department of Electronic and Electrical.
Proportion Priors for Image Sequence Segmentation Claudia Nieuwenhuis, etc. ICCV 2013 Oral.
CSC321: Neural Networks Lecture 3: Perceptrons
1Ellen L. Walker Edges Humans easily understand “line drawings” as pictures.
Learn how to make your drawings come alive…  Lecture 3: SKETCH RECOGNITION Analysis, implementation, and comparison of sketch recognition algorithms,
Assuming normally distributed data! Naïve Bayes Classifier.
1 On the Statistical Analysis of Dirty Pictures Julian Besag.
Foundation Knowledge Identify The Problem Explore Interpretations & Connections Prioritize Alternatives Envision Strategic Innovation Confused Fact Finder.
Foundation Knowledge Identify The Problem Explore Interpretations & Connections Prioritize Alternatives Envision Strategic Innovation Confused Fact Finder.
Face Recognition Based on 3D Shape Estimation
Pattern Classification, Chapter 1 1 Basic Probability.
1 LM Approaches to Filtering Richard Schwartz, BBN LM/IR ARDA 2002 September 11-12, 2002 UMASS.
Learn how to make your drawings come alive…  NEW COURSE: SKETCH RECOGNITION Analysis, implementation, and comparison of sketch recognition algorithms,
Learn how to make your drawings come alive…  Lecture 2: SKETCH RECOGNITION Analysis, implementation, and comparison of sketch recognition algorithms,
Theoretical & Industrial Design of Aerofoils P M V Subbarao Professor Mechanical Engineering Department An Objective Invention ……
December 2, 2014Computer Vision Lecture 21: Image Understanding 1 Today’s topic is.. Image Understanding.
Object Tracking for Retrieval Application in MPEG-2 Lorenzo Favalli, Alessandro Mecocci, Fulvio Moschetti IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR.
Foundation Knowledge Identify The Problem Explore Interpretations & Connections Prioritize Alternatives Envision Strategic Innovation Confused Fact Finder.
Automated Drawing of 2D chemical structures Kees Visser.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
October 8, 2013Computer Vision Lecture 11: The Hough Transform 1 Fitting Curve Models to Edges Most contours can be well described by combining several.
Masquerade Detection Mark Stamp 1Masquerade Detection.
The Node Voltage Method
Irfan Essa, Alex Pentland Facial Expression Recognition using a Dynamic Model and Motion Energy (a review by Paul Fitzpatrick for 6.892)
06 - Boundary Models Overview Edge Tracking Active Contours Conclusion.
October 14, 2014Computer Vision Lecture 11: Image Segmentation I 1Contours How should we represent contours? A good contour representation should meet.
IBS-09-SL RM 501 – Ranjit Goswami 1 Basic Probability.
EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:
S EGMENTATION FOR H ANDWRITTEN D OCUMENTS Omar Alaql Fab. 20, 2014.
CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch
Summary Marie Yarbrough. Introduction History of Image Forgery Method Segmentation Classification Common-Sense Reasoning Conclusion.
Learning to perceive how hand-written digits were drawn Geoffrey Hinton Canadian Institute for Advanced Research and University of Toronto.
MIT 6.893; SMA 5508 Spring 2004 Larry Rudolph Lecture Introduction Sketching Interface.
Michael Isard and Andrew Blake, IJCV 1998 Presented by Wen Li Department of Computer Science & Engineering Texas A&M University.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Foundation Knowledge Identify The Problem Explore Interpretations & Connections Prioritize Alternatives Envision Strategic Innovation Confused Fact Finder.
Inferring High-Level Behavior from Low-Level Sensors Donald J. Patterson, Lin Liao, Dieter Fox, and Henry Kautz.
Course 8 Contours. Def: edge list ---- ordered set of edge point or fragments. Def: contour ---- an edge list or expression that is used to represent.
Feature Point Detection and Curve Approximation for Early Processing of Free-Hand Sketches Tevfik Metin Sezgin and Randall Davis MIT AI Laboratory.
Bayesian Networks for Sketch Understanding Christine Alvarado MIT Student Oxygen Workshop 12 September 2003.
A New Approach to Utterance Verification Based on Neighborhood Information in Model Space Author :Hui Jiang, Chin-Hui Lee Reporter : 陳燦輝.
Gesture recognition techniques. Definitions Gesture – some type of body movement –a hand movement –Head movement, lips, eyes Depending on the capture.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
MultiModality Registration Using Hilbert-Schmidt Estimators By: Srinivas Peddi Computer Integrated Surgery II April 6 th, 2001.
Lecture 1 INTRODUCTION TO ALGORITHMS Professor Uday Reddy
Hidden Markov Model Parameter Estimation BMI/CS 576 Colin Dewey Fall 2015.
Pen Based User Interface II CSE 481b January 25, 2005.
Making Research Tools Accessible for All AI Students Zach Dodds, Christine Alvarado, and Sara Sood Though a compelling area of research with many applications,
Real-Time Hierarchical Scene Segmentation and Classification Andre Uckermann, Christof Elbrechter, Robert Haschke and Helge Ritter John Grossmann.
Reading Working Drawing Packets and Section Views
Hidden Markov Models BMI/CS 576
Reading Working Drawing Packets and Section Views
Lecture 15: Text Classification & Naive Bayes
Data Mining Lecture 11.
Unit# 9: Computer Program Development
CSc4730/6730 Scientific Visualization
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Pattern Recognition and Image Analysis
Authors: Barry Smyth, Mark T. Keane, Padraig Cunningham
Visual Recognition of American Sign Language Using Hidden Markov Models 문현구 문현구.
Presentation transcript:

Combining geometry and domain knowledge to interpret hand-drawn diagrams As Presented By: Andrew Campbell Christopher Dahlberg

Lecture Overview ● About the Authors ● Introduction to Sketch Recognition ● History on the Topic ● Paper Overview ● Details ● Contributions and Conclusion

About the Authors ● Stahovich and Shimana were professors in the mechanical engineering department of CMU ● Kara is a post-doctoral researcher there. ● Gennari is student research assistant. ● Stahovich has since moved to UC Riverside ● Very little has changed due to the recency of the paper

Sketch Recognition ● The process of translating human drawn figures into computer readable information. ● How does this challenge differ from vision problems we have seen so far?

Differences ● The task requires parsing the image based on some grammar. ● Human handwriting results in high variability of symbol appearance. ● Additional data is available- stroke order, pen speed.

Recent Work in Sketch Recognition ● Predefined pen stroke sequences ● Shilman- hierarchal definitions based on low level objects ● Kara- “Marker Symbols” ● Costagliola & Deufemia- prerecognized shapes ● Narayanaswamy- fixed drawing order, pauses between symbols

Problems ● Most prior work creates constraints on the order or manner in which the user draws the symbols. ● Many methods also rely on hard-coded recognition of low-level building blocks. ● This leads to a system that is sensitive to any changes from the expected input (size, orientation, legibility)

Paper Overview ● Segmentation ● Symbol location ● Classification ● Culling ● Error correction

Segmentation ● Change pen strokes into a set of lines and arcs. ● Some single pen strokes need to be split into several lines. ● Example:

Segmentation ● The authors used an algorithm from an earlier paper. ● It uses pen speed and changes in curve inflection. ● Differs from the segmentation algorithms we've seen because it depends on getting extra information from how the sketch was drawn.

Locating the Symbols ● Two methods are used: – Locating symbols based on changes in hand coded features. This method is similar to that used in previous work. – A novel domain-specific method based on measuring ink density.

Locating the Symbols Common Features ● Assign a type (line or arc), length, orientation and intersection type (L, T, or X) to each segment. ● Group segments with similar characteristics as potential symbols.

Locating the Symbols Ink Density ● In electrical circuit diagrams, circuit components are identified with fairly dense symbols connected by long stretches of single lines for the wires. ● Ink density is high for the components and low for the wires.

Locating the Symbols Ink Density ● For each segment: – Consider the next segment. If adding it will cause a big drop in ink density, mark this segment as a possible symbol endpoint. – Repeat until a user-specified number of segments are considered. ● For each potential endpoint: – Consider the previous segment. If adding it will cause a big drop in density, mark the whole range as a potential symbol. – Repeat for a user-specified number of segments.

Locating the Symbols ● Why both methods?

Locating the Symbols ● Some symbols can be found easily with one method but not the other.

Classify the Symbols ● A set of features for each potential symbol is calculated. ● A gaussian distribution is fit to each feature and forced to have a standard deviation of at least.3 so that zero counts don't translate to zero probability.

Classify the Symbols ● A naive bayes classifier is used to find the most likely component for each symbol. ● This assumes that each feature is independent (they aren't) and that the gaussians are a good estimation of how the features are distributed.

Culling ● The symbol locater steps are designed to pick up all possible symbols along with many non-symbols. ● A set of domain-specific heuristics are applied to each possible symbol. Symbols with scores above a threshold are accepted. If some symbols overlap, the one with the greatest score is accepted.

Error Correction ● After all the symbols in a sketch have been located and classified, the whole sketch is reexamined to see if it makes sense. ● If there are any components with unattached connections, dangling wires, wires drawn with many segments, or other evidence of parse errors, the threshold values for those segments are modified and they are reprocessed.

Analysis- Paper Quality ● How effectively do the authors frame their presentation of their work? ● Do they adequately introduce the problem? ● Are their examples relevant and clear?

Analysis- Technique Quality ● How does the work presented in this paper advance the field of computer vision?

Contributions ● Ink density model ● Lack of stroke constraints ● Error correction step

Conclusion ● The work presented here advances the power of sketch recognition techniques ● The system still relies on unique sketch data (stroke speed, order) ● Difficult to judge overall effectiveness due to lack of standard corpus of test problems