Advanced Computer Vision Devi Parikh Electrical and Computer Engineering.

Slides:



Advertisements
Similar presentations
Critical Reading Strategies: Overview of Research Process
Advertisements

Research skills. OUTLINE Mission and Vision What is Research? Ten Steps for Good Research Resources of Research Types of research Skills (Top_5 Skills)
Welcome to the seminar course
Computational Biology Journal Club aka “Current Topics in Computational Biology” aka “02-701” William Cohen Organizational Meeting Sept 6, 2007.
213: User Interface Design & Development Professor: Tapan Parikh TA: Eun Kyoung Choe
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
CEP Welcome September 1, Matthew J. Koehler September 1, 2005CEP Cognition and Technology Who’s Who?  Team up with someone you don’t.
UMass Lowell Computer Science Advanced Algorithms Computational Geometry Prof. Karen Daniels Spring, 2004 Project.
Research and Writing Seminar Thursday, – 16 35, room C To find an up-to-date version of the schedule and to read the papers check the website
Visual Object Recognition Rob Fergus Courant Institute, New York University
EECE 571R (Spring 2010) Autonomic Computing (Building Self* Systems) Matei Ripeanu matei at ece.ubc.ca.
Chem 122 Molecular Spectroscopy Spring 2008 Professor Ronald Cohen GSI Drew Rollins.
Computer Vision and the Web Images on the Web Movies, news, sports 916,271 titles 10 mil. videos, 65,000 added daily 350 mil. photos, 1 mil. added daily.
Presentations As you know from the syllabus, for this class you need to read a book by a Latin American writer and write a formal paper. This is the information.
Approaches to Representing and Recognizing Objects Visual Classification CMSC 828J – David Jacobs.
Welcome to CS 395/495 Measurement and Analysis of Online Social Networks.
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
SBU Digital Media CSE 690 Internet Vision Organizational Meeting Tamara Berg Assistant Professor SUNY Stony Brook.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
Advanced Computer Vision Devi Parikh Electrical and Computer Engineering.
CS 1114: Introduction to Computing Using MATLAB and Robotics Prof. Noah Snavely CS1114
COMP 875 Machine Learning Methods in Image Analysis.
Chemistry B.S. Degree Program Assessment Plan Dr. Glenn Cunningham Professor and Chair University of Central Florida April 21, 2004.
The content of these slides by John Galeotti, © Carnegie Mellon University (CMU), was made possible in part by NIH NLM contract# HHSN P,
ENG3050 Embedded Reconfigurable Computing Systems General Information Handout Winter 2015, January 5 th.
Put the Lesson Title Here A webquest for xth grade Designed by Put your You may include graphics, a movie, or sound to any of the slides. Introduction.
ITCS 6157/8157 Visual Database Fall 2015
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
04/30/13 Last class: summary, goggles, ices Discrete Structures (CS 173) Derek Hoiem, University of Illinois 1 Image: wordpress.com/2011/11/22/lig.
Information Retrieval and Web Search Lecture 1. Course overview Instructor: Rada Mihalcea Class web page:
Mini-Project on Web Data Analysis DANIEL DEUTCH. Data Management “Data management is the development, execution and supervision of plans, policies, programs.
1 HEINZ NIXDORF INSTITUT University of Paderborn Algorithms und Complexity Seminar Designing Peer-to-Peer- Networks Christian Schindelhauer
Course Organization & Format Visualization II MSIM 842, CS 795/895 Instructor: Jessica Crouch.
Dana Nau: CMSC 722, AI Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Welcome to CS 221! First Course in Computer Science for Engineers.
CSci8211: Logistics1 CSci8211: Advanced Computer Networks and Their Applications aka Basic Architecture, Mechanisms and Research Issues in Emerging Software-
Visual Scene Understanding (CS 598) Derek Hoiem Course Number: Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday.
Overviews of ITCS 6161/8161: Advanced Topics on Database Systems Dr. Jianping Fan Department of Computer Science UNC-Charlotte
1Thu D. NguyenCS 545: Distributed Systems CS 545: Distributed Systems Spring 2006 Department of Computer Science Rutgers University Thu D. Nguyen.
CS 6961: Structured Prediction Fall 2014 Course Information.
EECE 571e (Fall 2015) (Massively) Parallel Computing Platforms Matei Ripeanu ece.ubc.ca.
Why is computer vision difficult?
Internet Economics כלכלת האינטרנט Class 12 – Preparing for the second semester. 1.
The content of these slides by John Galeotti, © Carnegie Mellon University (CMU), was made possible in part by NIH NLM contract# HHSN P,
1 How to Give a Good Presentation? Cliff C. Zou CAP /2010.
Web Fundamentals (HTML and CSS) Course Introduction SoftUni Team Technical Trainers Software University
Information Retrieval and Web Search Course overview Instructor: Rada Mihalcea.
ITCS 6265 Details on Project & Paper Presentation.
Web Fundamentals (HTML and CSS)
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
Local features: detection and description
Computer/Human Interaction Spring 2013 Northeastern University1 Name of Interface Tagline if you have one (80 chars max, including spaces) Team member.
INFOMGP Student names and numbers Papers’ references Title.
Lecture 1CS 380C 1 CS 380C Advanced Compiler Techniques Kathryn S McKinley The University of Texas at Austin.
Introduction to Recognition CS4670/5670: Intro to Computer Vision Noah Snavely mountain building tree banner vendor people street lamp.
Water and people in a changing world Yhd ; Spring
CSE6339 DATA MANAGEMENT AND ANALYSIS FOR COMPUTATIONAL JOURNALISM CSE6339, Spring 2012 Department of Computer Science and Engineering, University of Texas.
Machine learning & object recognition Cordelia Schmid Jakob Verbeek.
CS 8803 CVL: Vision and Language
Writing a Critical Summary of an Article or Paper
Water and people in a changing world
Research Methods Dr. X.
Lecture 25: Introduction to Recognition
Lecture 25: Introduction to Recognition
Crowd Simulation (INFOMCRWS) - Course Introduction
PHYS 202 Intro Physics II Catalog description: A continuation of PHYS 201 covering the topics of electricity and magnetism, light, and modern physics.
CAP 6412: Advanced Computer Vision
Presentation and project
Advanced Topics in Computer Vision
Generic object recognition
Presentation transcript:

Advanced Computer Vision Devi Parikh Electrical and Computer Engineering

Plan for today Topic overview Introductions Course overview: –Logistics –Requirements Placing this course in context of others –Plan for next lecture Please interrupt at any time with questions or comments

Computer Vision Automatic understanding of images and video –Computing properties of the 3D world from visual data (measurement) –Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation) –Algorithms to mine, search, and interact with visual data (search and organization) Kristen Grauman

What does recognition involve? Fei-Fei Li

Detection: are there people?

Activity: What are they doing?

Object categorization mountain building tree banner vendor people street lamp

Instance recognition Potala Palace A particular sign

Scene and context categorization outdoor city …

Attribute recognition flat gray made of fabric crowded

Why recognition? Recognition a fundamental part of perception – e.g., robots, autonomous agents Organize and give access to visual content –Connect to information –Detect trends and themes Where are we now? Kristen Grauman

We’ve come a long way…

Posing visual queries Kooaba, Bay & Quack et al. Yeh et al., MIT Belhumeur et al. Kristen Grauman

Exploring community photo collections Snavely et al. Simon & Seitz Kristen Grauman

Dubrovnik 17 Slide credit: Devi Parikh AutoTagger: Yunpeng Li, Noah Snavely, Dan Huttenlocher and Pascal Fua

Autonomous agents able to detect objects Kristen Grauman

We’ve come a long way… Fischler and Elschlager, 1973

We’ve come a long way…

Challenges

Challenges: robustness IlluminationObject pose Viewpoint Intra-class appearance Occlusions Clutter Kristen Grauman

Challenges: context and human experience Context cues Kristen Grauman

Challenges: context and human experience Context cues FunctionDynamics Video credit: J. Davis Kristen Grauman

Challenges: scale, efficiency Half of the cerebral cortex in primates is devoted to processing visual information ~20 hours of video added to YouTube per minute ~5,000 new tagged photos added to Flickr per minute Thousands to millions of pixels in an image 30+ degrees of freedom in the pose of articulated objects (humans) 3,000-30,000 human recognizable object categories Kristen Grauman

Challenges: learning with minimal supervision More Less Cropped to object, parts and classes labeled Classes labeled, some clutter Unlabeled, multiple objects Kristen Grauman

Slide from Pietro Perona, 2004 Object Recognition workshop

Inputs in 1963… L. G. Roberts, Machine Perception of Three Dimensional Solids, Ph.D. thesis, MIT Department of Electrical Engineering, 1963.Machine Perception of Three Dimensional Solids Kristen Grauman

Personal photo albums Surveillance and security Movies, news, sports Medical and scientific images Slide credit; L. Lazebnik … and inputs today

Images on the Web Movies, news, sports 916,271 titles 10 mil. videos, 65,000 added daily 350 mil. photos, 1 mil. added daily 1.6 bil. images indexed as of summer 2005 Satellite imageryCity streets Slide credit; L. Lazebnik Understand and organize and index all this data!!

Introductions Devi Parikh Ph.D., Carnegie Mellon University, 2009 Research Assistant Professor, TTI-Chicago, 2013 Assistant Professor, ECE, Virginia Tech

Introductions Which program are you in? How far along? Have you taken a computer vision course before? Have you taken a machine learning course before? What are you hoping to get out of this class?

This course ECE 6554 TR 5:00 pm to 6:15 pm Lavery Hall Room 345 Course webpage:

This course Focus on more advanced techniques and ideas in computer vision Presented in research papers High-level recognition problems, innovative applications.

Goals Understand state-of-the-art approaches Analyze and critique current approaches Identify interesting open questions Present clearly and methodically

Official Learning Objectives Describe state-of-the-art approaches in object recognition and scene understanding Discuss tools from other fields (e.g., machine learning) that are frequently used to solve computer vision problems Implement two approaches to address important problems in computer vision Discuss and critique research papers in computer vision Identify open research questions in computer vision

Expectations Paper reviews each class [25%] Leading discussion (~ twice) on papers [15%] Presentations (~ once) [25%] –Present topic –Papers and background reading Project [35%] No “Assignments”, Exams, etc.

Prerequisites Course in computer vision Courses in machine learning is a plus

Paper reviews For each class –Review one paper me reviews by noon (12:00 pm) the day of the class –firstname_lastname_MM_DD.pdf –I will grade a random subset in detail Skip reviews the classes you are presenting or leading discussion Late reviews will not be accepted Will drop three lowest grades on reviews

Paper review guidelines One page Detailed review: –Brief (2-3 sentences) summary –Main contribution –Strengths? Weaknesses? –How convincing are the experiments? Suggestions to improve them? –Extensions? Applications? –Additional comments, unclear points –Relationships observed between the papers we are reading Most interesting thought Write in your own words Write well, proof read

Leading Discussion ~ One of you will be assigned to argue for the paper ~ One of you will be assigned to argue against the paper Come prepared with 5 points

Presentation guidelines IMPORTANT: Don’t present papers – present the topic! Do a lit review and look at background papers (e.g. “seeds / pointers for presenters”), and also more recent work. Well-organized talk, 30 minutes

Presentation guidelines What to cover? –Topic overview, motivation –One or two papers in details Problem overview, motivation Algorithm explanation, technical details Experimental set up, results Strengths, weaknesses, extensions NOT the paper the class has read. –Any commonalities, important differences between techniques covered in the papers. A demo / experiment would be great See class webpage for more details.

Experiments Implement/download code for a main idea in the paper and evaluate it: –Experiment with different types of training/testing data sets –Evaluate sensitivity to important parameter settings –Show an example to analyze a strength/weakness of the approach –Show qualitative and quantitative results

Tips Look up papers and authors. Their webpage may have data, code, slides, videos, etc. –Make sure talk flows well and makes sense as a whole. –Cite ALL sources. Don’t forget the high-level picture. Give a very clear and well-organized and thought out talk. Will interrupt if something is not clear

Tips Make sure you are saying everything we need to know to understand what you are saying. Make sure you know what you are talking about. Think about your audience. Make your talks visual, animated (images, video, not lots of text).

Projects Possibilities: –Extension of a technique studied in class –Analysis and empirical evaluation of an existing technique –Comparison between two approaches –Design and evaluate a novel approach –Be creative! Can work with a partner

Project timeline Project proposals (1 page) [25%] –March 1 st Final presentations [40%] –April 19 th to 26 th Project video (1 minute) [35%] –April 28th th

Implementation Use any language / platform you like No support for code / implementation issues will be provided

Miscellaneous Best presentation, best project and best discussion prizes! –We will vote –Dinner Feedback welcome and useful

Coming up Read the class webpage –Schedule is up –Tentative, especially because of likely snow days Select 6 dates (topics) you would like to present –Sign up sheet shows how many people have already signed up for a topic –Select those that have fewer selections –“Bonus” for presenters next week. –Probability of dropping class? I will send pointers to good presentations, reviews, etc. Already on class webpage.

Context Intro to computer vision –Intro to machine learning Course last semester on Deep Learning for Perception: – –This course is complementary to it Ram’s presentation on Thursday –Individual presenters will touch on state-of- the-art in each

Each Lecture ~ 20 minute discussion on paper we read –Led by two students: “for” and “against” ~ 30 minute presentation on topic ~ 25 minutes for questions, interruptions, unplanned discussions

Questions? See you Thursday!