CS121 Heuristic Search Planning CSPs Adversarial Search Probabilistic Reasoning Probabilistic Belief Learning.

Slides:



Advertisements
Similar presentations
Big Ideas in Cmput366. Search Blind Search State space representation Iterative deepening Heuristic Search A*, f(n)=g(n)+h(n), admissible heuristics Local.
Advertisements

Local Search Jim Little UBC CS 322 – CSP October 3, 2014 Textbook §4.8
Constraint Satisfaction taking advantage of internal structure of states when paths don’t matter.
1 Constraint Satisfaction Problems A Quick Overview (based on AIMA book slides)
Search I: Chapter 3 Aim: achieving generality Q: how to formulate a problem as a search problem?
Markov Decision Process
Plan Generation & Causal-Link Planning 1 José Luis Ambite.
Constraint Satisfaction problems (CSP)
Review: Constraint Satisfaction Problems How is a CSP defined? How do we solve CSPs?
Problem Solving Agents A problem solving agent is one which decides what actions and states to consider in completing a goal Examples: Finding the shortest.
Search in AI.
Adversarial Search: Game Playing Reading: Chapter next time.
Best-First Search: Agendas
Lecture 13 Last time: Games, minimax, alpha-beta Today: Finish off games, summary.
Markov Decision Processes
Planning under Uncertainty
Lecture 5: Constraint Satisfaction
Midterm Review CMSC421 – Fall CH1 Summary: Intro AI Definitions: dimensions human/rational think/act Three Major Components of AI Algorithms Representation.
4 Feb 2004CS Constraint Satisfaction1 Constraint Satisfaction Problems Chapter 5 Section 1 – 3.
Min-Max Trees Based on slides by: Rob Powers Ian Gent Yishay Mansour.
Constraint Satisfaction Problems
Problem Spaces & Search CSE 473. © Daniel S. Weld Topics Agents & Environments Problem Spaces Search & Constraint Satisfaction Knowledge Repr’n.
Max Min Max Min Starting node and labels Figure 4.20 (pp. 153) Alpha-Beta Prune.
Uninformed Search Reading: Chapter 3 by today, Chapter by Wednesday, 9/12 Homework #2 will be given out on Wednesday DID YOU TURN IN YOUR SURVEY?
Machine Learning and Review Reading: C Bayesian Approach  Each observed training example can incrementally decrease or increase probability of.
Chapter 5 Outline Formal definition of CSP CSP Examples
Reinforcement Learning Game playing: So far, we have told the agent the value of a given board position. How can agent learn which positions are important?
1 Midterm Review cmsc421 Fall Outline Review the material covered by the midterm Questions?
Constraint Satisfaction Problems
1 Constraint Satisfaction Problems Slides by Prof WELLING.
Artificial Intelligence CS482, CS682, MW 1 – 2:15, SEM 201, MS 227 Prerequisites: 302, 365 Instructor: Sushil Louis,
Constraint Satisfaction Read Chapter 5. Model Finite set of variables: X1,…Xn Variable Xi has values in domain Di. Constraints C1…Cm. A constraint specifies.
Artificial Intelligence Recap & Expectation Maximization CSE 473 Dan Weld.
Constraint Satisfaction CPSC 386 Artificial Intelligence Ellen Walker Hiram College.
Searching by Constraint CMSC Artificial Intelligence January 24, 2008.
Hande ÇAKIN IES 503 TERM PROJECT CONSTRAINT SATISFACTION PROBLEMS.
Artificial Intelligence CS482, CS682, MW 1 – 2:15, SEM 201, MS 227 Prerequisites: 302, 365 Instructor: Sushil Louis,
CSCI 5582 Fall 2006 CSCI 5582 Artificial Intelligence Fall 2006 Jim Martin.
1 Chapter 5 Constraint Satisfaction Problems. 2 Outlines  Constraint Satisfaction Problems  Backtracking Search for CSPs  Local Search for CSP  The.
Search CPSC 386 Artificial Intelligence Ellen Walker Hiram College.
Chapter 5 Constraint Satisfaction Problems
MDPs (cont) & Reinforcement Learning
AI Lecture 17 Planning Noémie Elhadad (substituting for Prof. McKeown)
Review Test1. Robotics & Future Technology Future of Intelligent Systems / Ray Kurzweil futurist Ray Kurzweil / A Long Bet A Long Bet / Robot Soccer.
An Introduction to Artificial Intelligence Lecture 5: Constraint Satisfaction Problems Ramin Halavati In which we see how treating.
Problem Reduction So far we have considered search strategies for OR graph. In OR graph, several arcs indicate a variety of ways in which the original.
1 Constraint Satisfaction Problems Chapter 5 Section 1 – 3 Grand Challenge:
CHAPTER 5 SECTION 1 – 3 4 Feb 2004 CS Constraint Satisfaction 1 Constraint Satisfaction Problems.
Constraint Satisfaction Problems University of Berkeley, USA
Arc Consistency CPSC 322 – CSP 3 Textbook § 4.5 February 2, 2011.
1. 2 Outline of Ch 4 Best-first search Greedy best-first search A * search Heuristics Functions Local search algorithms Hill-climbing search Simulated.
Chapter 5 Team Teaching AI (created by Dewi Liliana) PTIIK Constraint Satisfaction Problems.
Domain Splitting, Local Search CPSC 322 – CSP 4 Textbook §4.6, §4.8 February 4, 2011.
CPSC 322, Lecture 10Slide 1 Finish Search Computer Science cpsc322, Lecture 10 (Textbook Chpt 3.6) Sep, 26, 2010.
CMPT 463. What will be covered A* search Local search Game tree Constraint satisfaction problems (CSP)
Alternative Search Formulations and Applications
Constraint Satisfaction Problems (CSPs)
Instructor: Vincent Conitzer
Constraint Propagation
Constraint Satisfaction Problems
Introduction to Artificial Intelligence Lecture 9: Two-Player Games I
Constraints and Search
Problem Spaces & Search
Chapter 5: General search strategies: Look-ahead
Constraint Satisfaction Problems. A Quick Overview
Problem Spaces & Search
Search.
Search.
Constraint Satisfaction Problems
Normal Form (Matrix) Games
Presentation transcript:

CS121 Heuristic Search Planning CSPs Adversarial Search Probabilistic Reasoning Probabilistic Belief Learning

Heuristic Search First, you need to formulate your situation as a Search Problem  What is a state?  From one state, what other states can you get to (successor function)?  For each of those transitions, what is the cost?  Where is the start? What is the goal?

Heuristic Search

Easy to formulate for problems that are inherently discrete  Solve a rubik's cube  Given all the flights of the airlines, figure out the best way (time/distance/cost) to get from city A to city B What about problems that have continuous spaces?  Maneuvering a robot through a building  Controlling a robot arm to do a task

Heuristic Search

No Heuristic  DFS, BFS, Iterative Deepening, Uniform Cost Heuristic  Have fringe sorted by f = g + h  Admissibility  Consistency

Planning Just a search problem! Use STRIPS to formulate the problem  A state is a set of propositions which are true IN(Robot, R1), HOLDING(Apple)  Successor function given by Actions Preconditions (which are allowed) Add/Delete (what is the new state)  How do we get a heuristic?

Planning Given some state s, how many actions will it take to get to a state satisfying g? Planning Graph  Initialize to S 0 all the proposition in s.  Add the add lists of actions that apply to get S 1  Repeat until convergence Find the first S i where the g is met

Planning Forward Planning  Start initial node as initial state  Find all successors by applying actions  For each successor, build a planning graph to determine heuristic value  Add to fringe, pop, repeat Problems  branching factor,  multiple planning graphs

Planning Backward Planning  Construct planning graph from initial state  Start initial node as goal  Find successors by regressing through relevant actions  Look up heuristic values in planning graph  Add to fringe, pop, repeat

Constraint Satisfaction Formulation  Variables, each with some domain  Constraints between variables and their values  Problem: assign values to everything without violating any constraint Again, just a search problem (Backtracking)  State: Partial assignment to variables  Successor: Assign a value to next variable without violating anything  Goal: All variables assigned

Constraint Satisfaction No sense of “optimal” path.. we just want to cut down on search time. How to choose variable to assign next?  Most constrained variable  Most constraining variable How to choose the next value?  Least constraining value

Constraint Satisfaction To benefit from these heuristics, should update domains  Forward Checking After assigning a value to a variable, remove all conflicting values from other variables  AC3 Given a set of variables, look at pairs X,Y  If for a value of X, there is no value of Y that works, remove that value from X

Adversarial Search Game tree from moves performed successively by MAX and MIN player Values at “bottom” of the tree – end of game, or use evaluation function. Propagate values up according to MIN/MAX Tells you which move to take Alpha-Beta pruning  Order of evaluation does matter

Probabilistic Reasoning Assume there is some state space Now actions are probabilistic  If I do action A, there are several different possible states I may end up in  There is a probability associated with going into each state (they must sum to 1) Some states have rewards (positive or negative) We would like to calculate utility for each state, and use that to determine what action to take.

Probabilistic Reasoning

How do you calculate the Utilities?  If no cycles, can back values up the tree  Otherwise, can use Value Iteration Start all utilities as 0, calculate new utilities, repeat until convergence  Or, Policy Iteration Pick a random policy, solve utilities for it, calculate new policy until convergence

Probabilistic Belief Say N variables, each with 2 values, joint probability table has 2^n entries.

Probabilistic Belief If variables are independent, can represent this table more compactly

(Supervised) Learning We are given a bunch of examples, where each example has values X1.. XN and Y We want to create some function H(X), that will take all the X's and output a single value The goal is that given some partial example X1... XN, we can use H(X) to guess Y This should work well for X's from the training set, but also for X's never seen before!

(Supervised) Learning

Some types of functions we can use: Data Cache Linear Regression Decision Tree Neural Net

(Supervised) Learning Decision Tree  At each non-terminal node in tree, branch according to the value of one of the Xi's  A leaf node should output a value for Y Building the Tree (Greedy)  Look at all examples at current node  Choose Xi to split on that will allow you to classify the most number of examples correctly

(Supervised) Learning Neural Net

(Supervised) Learning Neural Net