16-899A Pixels to Percepts Instructors: Alexei (Alyosha) Efros, 225 Smith Hall, CMU.Lavanya Sharan, Disney Research Pittsburgh Web Page:

Slides:



Advertisements
Similar presentations
Finding a Research Topic Janie Irwin CSE, Penn State with credits to Kathy Yelick, EECS, UC Berkeley.
Advertisements

16-824: Learning-based Methods in Vision Instructors: Alexei (Alyosha) Efros 225 Smith Hall Leon Sigal
Welcome to the seminar course
Study What’s that? Why? How?. School does not “do stuff” to you You do stuff to school – it is active You do stuff to school – it is active This is about.
Learning Styles Maximizing the Way We Learn. Agenda Today you will… –Find out what type of learner you are –Discover strategies that work best for your.
16-721: Advanced Machine Perception Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: David Bradley 2216 NSH Web Page:
Internet Vision - Lecture 3 Tamara Berg Sept 10. New Lecture Time Mondays 10:00am-12:30pm in 2311 Monday (9/15) we will have a general Computer Vision.
(15-862): Computational Photography. Staff Prof: Alexei Efros 225 Smith TA: Natasha Kholgade Web.
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
Advanced Computer Graphics CSE 190 [Spring 2015], Lecture 14 Ravi Ramamoorthi
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
CORNELL UNIVERSITY CS 764 Seminar in Computer Vision Ramin Zabih Fall 1998.
Evaluating your ideas and Reading the Literature Psych 231: Research Methods in Psychology.
CSCD 555 Research Methods for Computer Science
CEP Welcome September 1, Matthew J. Koehler September 1, 2005CEP Cognition and Technology Who’s Who?  Team up with someone you don’t.
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Tomasz Malisiewicz Smith Hall.
Color, lightness & brightness Lavanya Sharan February 7, 2011.
15-463: Rendering and Image Processing Staff Prof: Alexei Efros TA: James Hays Web Page
(15-862): Computational Photography Staff Prof: Alexei Efros 4207 TA: Mohit Gupta Web Page
(15-862): Computational Photography. Staff Prof: Alexei Efros 4207 NSH (for TA: Ronit Slyper Web Page
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Jean-Francois Lalonde A521.
Computational Photography Introduction Jinxiang Chai Computer Science and Engineering Texas A&M University.
(15-862): Computational Photography. Staff Prof: Alexei Efros 4207 TA: Jim McCann Graphics Lab Web Page
Jochen Triesch, UC San Diego, 1 COGS Visual Modeling Jochen Triesch & Martin Sereno Dept. of Cognitive Science UC.
(15-862): Computational Photography Staff Prof: Alexei Efros 4207 Web Page Newsgroup:
Evaluating your ideas and Reading the Literature Psych 231: Research Methods in Psychology.
Welcome to CS 395/495 Measurement and Analysis of Online Social Networks.
Welcome to CS 395/495 Internet Architectures. What is this class about? (1) Goal: to help you understand what the future Internet will look like –What.
Animation Theory.
Introduction to the Research Paper
Paul Cezanne By Denise Jackson. Paul Cezanne was born in a French town in To show his love for nature, he painted things in a way that had never.
How to Research for an Essay and Avoid Plagiarism
SBU Digital Media CSE 690 Internet Vision Organizational Meeting Tamara Berg Assistant Professor SUNY Stony Brook.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
Introduction to Graphics and Virtual Environments.
thinking hats Six of Prepared by Eman A. Al Abdullah ©
The EZ way to write a prospectus, thesis, publishable paper John M. Hoenig, Ph.D. Department of Fisheries Science.
Definition of Computer Graphics
Class Info Website for materials available at: – We’ll use Moodle for turning in assignments, grades, s, blog.
3D COMPUTER GRAPHICS IMD Chapter 1: 3D Computer Graphics Chapter 1: 1 Lecturer: Norhayati Mohd Amin.
Sight Words.
Dana Nau: CMSC 722, AI Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Visual Scene Understanding (CS 598) Derek Hoiem Course Number: Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday.
Science Fair How To Get Started… (
How to Read Research Papers? Xiao Qin Department of Computer Science and Software Engineering Auburn University
2009 ML Project: Goal: Do some real machine learning… A project you are interested in works better Data is often the hard part (get it in plenty of time)
Why is computer vision difficult?
Class 13 LBSC 690 Information Technology More Multimedia Compression and Recognition, and Social Issues.
Welcome to EECS 395/495 Networking Problems in Cloud Computing.
CPSC : Data-driven Computer Graphics Jinxiang Chai.
Images.  Images include graphics, such as backgrounds, color schemes and navigation bars, and photos and other illustrations  An essential part of a.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
CORNELL UNIVERSITY CS 764 Seminar in Computer Vision Ramin Zabih Fall 1998.
(15-862): Computational Photography. Staff Prof: Alexei Efros 4207 TAs: Alvaro Collet and Laura Trutoiu
High Frequency Words.
Click on the running man to start the experiment (this will take longer than the Stroop task you ran last week)
CAP6135: Malware and Software Vulnerability Analysis Paper Presentation and Summary Cliff Zou Spring 2013.
Welcome to EECS 395/495 Online Advertising: A Systems Approach.
Prototyping Creation of concrete but partial implementations of a system design to explore usability issues.
Dana Nau: CMSC 722, AI Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
3D Animation 1. Introduction Dr. Ashraf Y. Maghari Information Technology Islamic University of Gaza Ref. Book: The Art of Maya.
Created By Sherri Desseau Click to begin TACOMA SCREENING INSTRUMENT FIRST GRADE.
Activity: “Artists Observe” Mind Map
SPECIALIZED APPLICATION SOFTWARE
Introduction and Overview
Advanced Computer Graphics: Introduction
Today.
This week Conference papers due May 8..
“man, how you define computer
Presentation transcript:

16-899A Pixels to Percepts Instructors: Alexei (Alyosha) Efros, 225 Smith Hall, CMU.Lavanya Sharan, Disney Research Pittsburgh Web Page:

Today Instructors Why This Course? Why Perception? Administrative stuff Overview of the course Old Lady or Young Girl?

Lavanya Sharan Postdoctoral Researcher Visual perception, both behavioral and computational aspects. Computer Vision Human Visual Perception Computer Graphics

Light or dark? Real or fake Oreo? Material category? Emotion? Where does he land? GRADUATE WORK RECENT WORK

Alexei “Alyosha” Efros Associate Professor, Robotics & CSD Ph.D 2003, from UC Berkeley (signed by Arnie!) Research Fellow, University of Oxford, ’03-’04 Research Interests Computer Vision, Computer Graphics, Data-driven Approaches

PhD Thesis on Texture and Action Synthesis Smart Erase button in Microsoft Digital Image Pro:

Why This Course? A perfect storm: I wanted to do this for a long time but felt I wasn’t qualified Lavanya happened to be in Pittsburgh and she is more than qualified The first-ever textbook on Perception for CS types (Thompson et al) is coming out We will be giving out printouts soon There are great perception courses in Psych dept (e.g. by Klatzky), but our aims are different: The aim to be much more applied There are real problems in vision/graphics that we can solve Exposure for these who might never take perception otherwise

Why study Perception? It’s fascinating! Ok, but so is astronomy It’s still in the “early days” Relatively easy to get up to speed Still lots of low-hanging fruit OK, OK, but why should I care? I just want to create a seeing robot and/or create the next “Avatars” Two answers: 1) a classic one and 2) a real one

Classic Answer Understanding Human Visual Perception should help us design better vision algorithms After all, human is the only vision system known to work Just like understanding how birds fly should help us design airplanes …WAIT A MINUTE Ok, so there is a fine line between “helping” and “mindless copying” Beware of papers claiming to be “biologically inspired”

The Real Answer Human Perception is an integral part of most vision and graphics endeavors i.e. you can’t get away from it!

“What does it mean, to see? The plain man's answer (and Aristotle's, too). would be, to know what is where by looking.” -- David Marr, Vision (1982)

Vision: a split personality “What does it mean, to see? The plain man's answer (and Aristotle's, too). would be, to know what is where by looking. In other words, vision is the process of discovering from images what is present in the world, and where it is.” Answer #1: pixel of brightness 243 at position (124,54) …and depth.7 meters Answer #2: looks like bottom edge of whiteboard showing at the top of the image Which answer is better? Is the difference just a matter of scale? depth map

Measurement vs. Perception

Proof!

Measurement vs. Perception Pablo Picasso The Guitar Player

Vision as Measurement Real-time stereo on Mars Structure from Motion Physics-based Vision Virtualized Reality

Vision as Understanding Object / Scene Recognition Action / Activity Recognition even scene geometry

Perception in Graphics One of central goals of graphics is to simulate visual experience as realistically as possible So, if we could simulate the physics of light perfectly, why bother with perception? 1) Practical Reasons 2) Artistic Reasons

Complexity Amazingly real But so sterile, lifeless… Why?

Device Limitations Low Dynamic Range Limited modalities (visual + audio) No smell, taste, temperature Limited field of view

Artistic Reasons Reality is boring Visual content creators (film directors, game designers) want hyper-reality

How to study Perception? Ok, you convinced me: Perception is useful. But why not just “open up the brain” and figure out how it works? Two most popular methods in physiology: Cell recordings Recordings from small number of cells (1-100, out of millions) fMRI Overall activity in the brain as function of time Very low spatial resolution

Tale of Martians with an old PC Cell recordings fMRI

Instead, in this course…

Course Goals Read some interesting papers together Learn something new: both you and us! Get up to speed on big chunk of perception research Use perception / human studies in your own research Try your hand at a perception study Learn how to speak Learn how think critically about papers

Course Motto: 50/50 50% Lectures / 50% student presentations 50% textbook / 50% research papers 50% from perception side / 50% from vision/graphics side 50% Alyosha / 50% Lavanya

Course Organization Requirements: 1.Class Participation (25%) Keep annotated bibliography Post on the Class Blog before each class Ask questions / debate / flight / be involved! 2.Three Projects (75%) Two Analysis Projects (25% + 25%) Implement / Evaluate a paper and present it in class 1 CS paper, 1 perception paper Synthesis (Final) Project (25%) Produce a publishable result (study or implementation) Can be continuation of analysis project or something new Can be done solo or in groups of 2

Class Participation Keep annotated bibliography of readings (always a good idea!). The format is up to you. At least, it needs to have: Summary of key points A few Interesting insights, “aha moments”, keen observations, etc. Weaknesses. Unanswered questions. Areas of further investigation, improvement. Before each class: Submit your summary for current readings in hard copy (printout/xerox) Submit a comment on the Class Blog ask a question, answer a question, post your thoughts, praise, criticism, start a discussion, etc.

Analysis Project 1.Pick a paper / set of papers from the list 2.Understand it as if you were the author e.g. Re-implement it; try to replicate the study If there is code, understand the code completely Run it on data the same data (you can contact authors for data and even code sometimes) 3.Understand it better than the author Run it on LOTS of new data (e.g. LabelMe dataset, Flickr dataset, etc, etc), or try to replicate the study with new data Figure out how it succeeds, how it fails, where it fails, and, most importantly WHY it fails Look at which parts of the code do the real work, and which parts are just window-dressing Maybe suggest directions for improvement. 4.Prepare an amazing 45 min presentation Discuss with us twice – once when you start the project, and 3 days before the presentation

Synthesis Project Can grow out of analysis project, or your own research 1-2 people per project. Project proposals mid-semester Project presentations at the end of semester. Results presented as a CVPR-format paper. Hopefully, a few papers may be submitted to CVPR / VSS etc

End of Semester Awards We will vote for: Best Analysis Project Best Synthesis Project Best Blog Comment Prize: dinner in a French restaurant in Paris (transportation not included!) or some other worthy prizes

Course Outline The Big Picture Week 1: Physiology Week 2: Philosophies of Vision Week 3: Psychophysical Methods Illumination and Shadows Week 4: Color, lightness, brightness perception Week 5: Shadows and Shading Shape Perception (week 6) Texture and Materials (week 7) Objects and Scenes Categorization and Recognition (week 8) Geometry and Perspective (week 9) Spatial Cognition (week 10)

Course Outline (cont.) Faces (week 11) Visual Attention (week 12) Motion Perception (week 13) Visual Realism (week 14) Perception and Art (week 15) Final Project Presentations Subject to Change

Perception & Art Livingstone