1 Adaptive Submodularity: A New Approach to Active Learning and Stochastic Optimization Joint work with Andreas Krause 1 Daniel Golovin.

Slides:



Advertisements
Similar presentations
You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Advertisements

Fundamentals of Probability
Advanced Piloting Cruise Plot.
Kapitel S3 Astronomie Autor: Bennett et al. Raumzeit und Gravitation Kapitel S3 Raumzeit und Gravitation © Pearson Studium 2010 Folie: 1.
Kapitel 21 Astronomie Autor: Bennett et al. Galaxienentwicklung Kapitel 21 Galaxienentwicklung © Pearson Studium 2010 Folie: 1.
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Thursday, March 7 Duality 2 – The dual problem, in general – illustrating duality with 2-person 0-sum game theory Handouts: Lecture Notes.
and 6.855J Spanning Tree Algorithms. 2 The Greedy Algorithm in Action
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
My Alphabet Book abcdefghijklm nopqrstuvwxyz.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Mean-Field Theory and Its Applications In Computer Vision1 1.
ZMQS ZMQS
Richmond House, Liverpool (1) 26 th January 2004.
Chapter 4: Informed Heuristic Search
Cost-effective Outbreak Detection in Networks Jure Leskovec, Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen, Natalie Glance.
1 Column Generation. 2 Outline trim loss problem different formulations column generation the trim loss problem master problem and subproblem in column.
Randomized Algorithms Randomized Algorithms CS648 1.
Tutorial 8, STAT1301 Fall 2010, 16NOV2010, By Joseph Dong.
ABC Technology Project
The Weighted Proportional Resource Allocation Milan Vojnović Microsoft Research Joint work with Thành Nguyen Microsoft Research Asia, Beijing, April, 2011.
Gate Sizing for Cell Library Based Designs Shiyan Hu*, Mahesh Ketkar**, Jiang Hu* *Dept of ECE, Texas A&M University **Intel Corporation.
VOORBLAD.
15. Oktober Oktober Oktober 2012.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
Chapter 4 Inference About Process Quality
Submodularity for Distributed Sensing Problems Zeyn Saigol IR Lab, School of Computer Science University of Birmingham 6 th July 2010.
RecMax – Can we combine the power of Social Networks and Recommender Systems? Amit Goyal and L. RecMax: Exploting Recommender Systems for Fun and Profit.
Routing and Congestion Problems in General Networks Presented by Jun Zou CAS 744.
© 2012 National Heart Foundation of Australia. Slide 2.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Submodular Set Function Maximization via the Multilinear Relaxation & Dependent Rounding Chandra Chekuri Univ. of Illinois, Urbana-Champaign.
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Addition 1’s to 20.
25 seconds left…...
Week 1.
Analyzing Genes and Genomes
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
Local Search Jim Little UBC CS 322 – CSP October 3, 2014 Textbook §4.8
Intracellular Compartments and Transport
A SMALL TRUTH TO MAKE LIFE 100%
PSSA Preparation.
IP, IST, José Bioucas, Probability The mathematical language to quantify uncertainty  Observation mechanism:  Priors:  Parameters Role in inverse.
Immunobiology: The Immune System in Health & Disease Sixth Edition
Essential Cell Biology
How Cells Obtain Energy from Food
Immunobiology: The Immune System in Health & Disease Sixth Edition
Energy Generation in Mitochondria and Chlorplasts
Online Node-weighted Steiner Connectivity Problems Vahid Liaghat University of Maryland MohammadTaghi Hajiaghayi (UMD) Debmalya Panigrahi (Duke) 1.
CpSc 3220 Designing a Database
Minimum Vertex Cover in Rectangle Graphs
1 Adaptive Submodularity: A New Approach to Active Learning and Stochastic Optimization Daniel Golovin and Andreas Krause.
Parallel Double Greedy Submodular Maxmization Xinghao Pan, Stefanie Jegelka, Joseph Gonzalez, Joseph Bradley, Michael I. Jordan.
Maximizing the Spread of Influence through a Social Network
Online Distributed Sensor Selection Daniel Golovin, Matthew Faulkner, Andreas Krause theory and practice collide 1.
Maximizing the Spread of Influence through a Social Network Authors: David Kempe, Jon Kleinberg, É va Tardos KDD 2003.
Presentation transcript:

1 Adaptive Submodularity: A New Approach to Active Learning and Stochastic Optimization Joint work with Andreas Krause 1 Daniel Golovin

2 Max K-Cover (Oil Spill Edition)

3 Submodularity Time Discrete diminishing returns property for set functions. ``Playing an action at an earlier stage only increases its marginal benefit''

4 The Greedy Algorithm Theorem [Nemhauser et al 78]

5 Stochastic Max K-Cover Asadpour et al. (`08): (1-1/e)-approx if sensors (independently) either work perfectly or fail completely. Bayesian: Known failure distribution. Adaptive: Deploy a sensor and see what you get. Repeat K times At 1 st location

6 Adaptive Submodularity Time Playing an action at an earlier stage only increases its marginal benefit expected (taken over its outcome) Gain more Gain less (i.e., at an ancestor) Select Item Stochastic Outcome Adaptive Monotonicity: Δ(a | obs) 0, always Δ(action | observations) [G & Krause, 2010]

7 Whats it good for? Allows us to generalize results to the adaptive realm, including: (1-1/e)-approximation for Max K-Cover, submodular maximization (ln(n)+1)-approximation for Set Cover Accelerated implementation Data-Dependent Upper Bounds on OPT

8 Recall the Greedy Algorithm Theorem [Nemhauser et al 78]

9 The Adaptive-Greedy Algorithm Theorem [G & Krause, COLT 10]

10 [Adapt-monotonicity] - - ( ) - [Adapt-submodularity]

11 … The world-state dictates which path in the tree well take. 1.For each node at layer i+1, 2.Sample path to layer j, 3.Play the resulting layer j action at layer i+1. How to play layer j at layer i+1 By adapt. submod., playing a layer earlier only increases its marginal benefit

12 [Adapt-monotonicity] - - ( ) - ( ) - [Def. of adapt-greedy] ( ) - [Adapt-submodularity]

13

Stochastic Max Cover is Adapt-Submod 1 3 Gain more Gain less adapt-greedy is a (1-1/e) 63% approximation to the adaptive optimal solution. Random sets distributed independently.

15 Influence in Social Networks Who should get free cell phones? V = {Alice, Bob, Charlie, Daria, Eric, Fiona} F(A) = Expected # of people influenced when targeting A Prob. of influencing Alice Bob Charlie Daria Eric Fiona [Kempe, Kleinberg, & Tardos, KDD `03]

16 Alice Bob Charlie Daria Eric Fiona Key idea: Flip coins c in advance live edges F c (A) = People influenced under outcome c (set cover!) F(A) = c P(c) F c (A) is submodular as well!

Daria Prob. of influencing Eric Fiona Alice Bob Charlie Adaptively select promotion targets, see which of their friends are influenced. Adaptive Viral Marketing

18 Adaptive Viral Marketing Alice Bob Charlie Daria Eric Fiona Objective adapt monotone & submodular. Hence, adapt-greedy is a (1-1/e) 63% approximation to the adaptive optimal solution.

19 Stochastic Min Cost Cover Adaptively get a threshold amount of value. Minimize expected number of actions. If objective is adapt-submod and monotone, we get a logarithmic approximation. [Goemans & Vondrak, LATIN 06] [Liu et al., SIGMOD 08] [Feige, JACM 98] [Guillory & Bilmes, ICML 10] c.f., Interactive Submodular Set Cover

20 Optimal Decision Trees x1x1 x2x2 x3x = = = Garey & Graham, 1974; Loveland, 1985; Arkin et al., 1993; Kosaraju et al., 1999; Dasgupta, 2004; Guillory & Bilmes, 2009; Nowak, 2009; Gupta et al., 2010 Diagnose the patient as cheaply as possible (w.r.t. expected cost) 1 1 0

21 Objective = probability mass of hypotheses you have ruled out. Its Adaptive Submodular. Outcome = 1 Outcome = 0 Test x Test w Test v

22 Generate upper bounds on Use them to avoid some evaluations. Accelerated Greedy time Saved evaluations

23 Generate upper bounds on Use then to avoid some evaluations. Accelerated Greedy Empirical Speedups we obtained: - Temperature Monitoring: 2 - 7x - Traffic Monitoring: x - Speedup often increases with instance size. Empirical Speedups we obtained: - Temperature Monitoring: 2 - 7x - Traffic Monitoring: x - Speedup often increases with instance size.

24 Ongoing work Active learning with noise With Andreas Krause & Debajyoti Ray, to appear NIPS 10 Edges between any two diseases in distinct groups

25 Active Learning of Groups via Edge Cutting Edge Cutting Objective is Adaptive Submodular First approx-result for noisy observations

26 Conclusions New structural property useful for design & analysis of adaptive algorithms Recovers and generalizes many known results in a unified manner. (We can also handle costs) Tight analyses & optimal-approx factors in many cases. Accelerated implementation yields significant speedups x1x1 x2x2 x3x

x1x1 x2x2 x3x