SEARCH TECHNIQUES
Search techniques
- Blind: Depth-First Search (DFS), Breadth-First Search (BFS)
- Heuristic: Hill-climbing search, A* search, Best-first search, Greedy search
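BLIND SEARCH ALGORITHM (depth-first: children go on the left end of open)
  begin
    open := [Start]; closed := [];
    while open is not empty do
    begin
      remove the leftmost state from open, call it X;
      if X is a goal then return success
      else begin
        generate children of X;
        put X on closed;
        eliminate children of X if already in open or closed;
        put other children in order on left end of open;
      end
    end while
    return failure;
  end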
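The open/closed loop above can be sketched in Python as follows. This is a minimal sketch, not the slide author's code: the `GRAPH` dictionary is an assumed reconstruction of the example graph (the figure is not available in the transcript), chosen so that it reproduces the visiting order given in the depth-first example, and the `children_on_left` flag is a hypothetical parameter that switches between the left end of open (depth-first) and the right end (breadth-first).

```python
from collections import deque

# Assumed reconstruction of the example graph; not given in the text.
GRAPH = {
    "A": ["B", "C", "D"], "B": ["E", "F"], "C": ["G", "H"], "D": ["I", "J"],
    "E": ["K", "L"], "F": ["L", "M"], "G": ["N"], "H": ["O", "P"],
    "I": ["P", "Q"], "J": ["R"], "K": ["S"], "L": ["T"], "P": ["U"],
}

def blind_search(start, goal, graph, children_on_left=True):
    """Return (found, examined): success flag and the order of examined states."""
    open_list = deque([start])
    closed = []
    examined = []
    while open_list:
        x = open_list.popleft()              # remove leftmost state from open
        examined.append(x)
        if x == goal:
            return True, examined
        closed.append(x)                     # put X on closed
        # generate children, dropping those already on open or closed
        fresh = [c for c in graph.get(x, [])
                 if c not in open_list and c not in closed]
        if children_on_left:
            open_list.extendleft(reversed(fresh))   # left end, order preserved
        else:
            open_list.extend(fresh)                 # right end
    return False, examined

found, order = blind_search("A", "U", GRAPH)
print(found, " ".join(order))
```

Passing `children_on_left=False` turns the same loop into the breadth-first variant, since the only difference between the two blind strategies is the end of open that receives the children.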
EXAMPLE (Depth-First Search - DFS)
[Figure: search graph with nodes A to U; the start and goal nodes are marked.]
Depth-first search examines the nodes in the following order: A, B, E, K, S, L, T, F, M, C, G, N, H, O, P, U, D, I, Q, J, R
Depth-First Search (DFS)
This simple search algorithm uses Prolog's unification routine to find the first link from the current node and then follows it. It always follows the left-most branch of the search tree first, following it down until it either finds the goal state or hits a dead end. It then backtracks to find another branch to follow.
    go(X,X,[X]).
    go(X,Y,[X|T]) :- link(X,Z), go(Z,Y,T).

    | ?- go(a,c,X).
    X = [a,e,f,c] ? ;
    X = [a,b,f,c] ? ;
    X = [a,b,c] ? ;
    no
[Figure: graph over the nodes A to F and the corresponding search tree.]
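BLIND SEARCH ALGORITHM (breadth-first: children go on the right end of open)
  begin
    open := [Start]; closed := [];
    while open is not empty do
    begin
      remove the leftmost state from open, call it X;
      if X is a goal then return success
      else begin
        generate children of X;
        put X on closed;
        eliminate children of X if already in open or closed;
        put other children in order on right end of open;
      end
    end while
    return failure;
  end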
Example (Breadth-First Search - BFS)
[Figure: search tree with nodes A to Z. Initial state: A. Goal state: L.]
We begin with our initial state: the node labeled A. This node is expanded to reveal further (unexpanded) nodes and is then removed from the queue. Each revealed node is added to the END of the queue. The search then moves to the first node in the queue: node B is expanded and removed from the queue, and the nodes it reveals are added to the END of the queue. The search then returns to level 1 to expand node C, and the process continues.

Node expanded   Queue after expansion            Size of queue   Nodes expanded
(start)         A                                1               0
A               B, C, D, E, F                    5               1
B               C, D, E, F, G, H                 6               2
C               D, E, F, G, H, I, J              7               3
D               E, F, G, H, I, J, K, L           8               4
E               F, G, H, I, J, K, L, M, N        9               5
F               G, H, I, J, K, L, M, N, O, P     10              6
G               H, I, J, K, L, M, N, O, P, Q     10              7
H               I, J, K, L, M, N, O, P, Q, R     10              8
I               J, K, L, M, N, O, P, Q, R, S     10              9
J               K, L, M, N, O, P, Q, R, S, T     10              10
K               L, M, N, O, P, Q, R, S, T, U     10              11

Node L is then located at the front of the queue and the search returns a solution.
Another EXAMPLE (Breadth-First Search - BFS)
[Figure: search graph with nodes A to U; the initial and goal states are marked.]

Step   OPEN       CLOSED
1      A          (empty)
2      BCD        A
3      CDEF       BA
4      DEFGH      CBA
5      EFGHIJ     DCBA
6      FGHIJKL    EDCBA
7      GHIJKLM    FEDCBA
8      HIJKLMN    GFEDCBA
9      IJKLMNOP   HGFEDCBA
10     JKLMNOPQ   IHGFEDCBA
11     KLMNOPQR   JIHGFEDCBA
12     LMNOPQRS   KJIHGFEDCBA
13     MNOPQRST   LKJIHGFEDCBA
14     NOPQRST    MLKJIHGFEDCBA
15     OPQRST     NMLKJIHGFEDCBA
16     PQRST      ONMLKJIHGFEDCBA
17     QRSTU      PONMLKJIHGFEDCBA
18     RSTU       QPONMLKJIHGFEDCBA
19     STU        RQPONMLKJIHGFEDCBA
20     TU         SRQPONMLKJIHGFEDCBA
21     U          TSRQPONMLKJIHGFEDCBA

Breadth-first search examines the nodes in the following order: A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, T, U
Breadth-First Search (BFS)
A simple, common alternative to depth-first search is breadth-first search. This checks every node at one level of the space before moving on to the next level.
    | ?- go(a,c,X).
    X = [a,b,c] ? ;
    X = [a,e,f,c] ? ;
    X = [a,b,f,c] ? ;
    no
[Figure: graph over the nodes A to F and its search tree, with the three occurrences of C marked 1st, 2nd and 3rd in the order they are found.]
Depth-first = A, ED, FC, BFC, C
Breadth-first = A, EB, DFFC, CC
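The two answer orders can be reproduced with a short Python sketch. The `LINKS` table is an assumption: the `link/2` facts are not shown in the transcript, so they are inferred here from the query answers and the node sequences on the slides. `dfs_paths` mirrors the recursive `go/3` clauses, while `bfs_paths` is a hypothetical breadth-first counterpart that keeps a queue of partial paths. Both assume an acyclic graph.

```python
from collections import deque

# Assumed link facts (inferred from the answers, not given in the text).
LINKS = {"a": ["e", "b"], "e": ["d", "f"], "b": ["f", "c"], "f": ["c"]}

def dfs_paths(node, goal):
    """Yield paths from node to goal in depth-first (Prolog-like) order."""
    if node == goal:                      # go(X,X,[X]).
        yield [node]
    for nxt in LINKS.get(node, []):       # go(X,Y,[X|T]) :- link(X,Z), go(Z,Y,T).
        for rest in dfs_paths(nxt, goal):
            yield [node] + rest

def bfs_paths(start, goal):
    """Yield paths from start to goal level by level (shortest first)."""
    queue = deque([[start]])
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            yield path
        for nxt in LINKS.get(path[-1], []):
            queue.append(path + [nxt])    # new partial paths join the END of the queue

print(list(dfs_paths("a", "c")))
print(list(bfs_paths("a", "c")))
```

The only structural difference is where unexplored work is kept: the call stack for depth-first, a first-in-first-out queue for breadth-first.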
Blind Search Strategies
- Breadth-first search: expand all the nodes of one level first.
- Depth-first search: expand one of the nodes at the deepest level.
What is the Complexity of Breadth-First Search?
Time complexity
- Assume (worst case) that there is one goal leaf at the right-hand side of the tree.
- BFS will then expand all nodes: $1 + b + b^2 + \dots + b^d = O(b^d)$.
Space complexity
- How many nodes can be in the queue (worst case)?
- Once the nodes at depth $d-1$ have been expanded, there are $b^d$ unexpanded nodes in the queue: $O(b^d)$.
[Figure: two search trees with levels d = 0, 1, 2 and goal node G.]
Examples of Time and Memory Requirements for Breadth-First Search
Assuming b = 10, 1000 nodes/sec, 100 bytes/node:

Depth of solution   Nodes expanded   Time            Memory
0                   1                1 millisecond   100 bytes
2                   111              0.1 seconds     11 kilobytes
4                   11,111           11 seconds      1 megabyte
8                   10^8             31 hours        11 gigabytes
12                  10^12            35 years        111 terabytes
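The figures in this table follow directly from the node count $1 + b + b^2 + \dots + b^d$ and the stated assumptions. A small sketch (the function names are illustrative, not from the slides) that recomputes them:

```python
def bfs_nodes(b, d):
    """Nodes expanded by BFS down to depth d: 1 + b + b^2 + ... + b^d."""
    return sum(b ** i for i in range(d + 1))

def bfs_cost(b, d, nodes_per_sec=1000, bytes_per_node=100):
    """Return (nodes, seconds, bytes) under the slide's assumptions."""
    n = bfs_nodes(b, d)
    return n, n / nodes_per_sec, n * bytes_per_node

for depth in (0, 2, 4, 8, 12):
    nodes, seconds, memory = bfs_cost(10, depth)
    print(f"depth {depth:2d}: {nodes:>17,} nodes, {seconds:,.3f} s, {memory:,} bytes")
```

For depth 8 this gives about 111,111 seconds (roughly 31 hours), and for depth 12 about 1.1 x 10^9 seconds (roughly 35 years).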
What is the Complexity of Depth-First Search?
Time complexity
- Assume (worst case) that there is one goal leaf at the right-hand side of the tree.
- DFS will then expand all nodes: $1 + b + b^2 + \dots + b^d = O(b^d)$, that is, $O(b^m)$ for a tree of maximum depth $m$.
Space complexity
- How many nodes can be in the queue (worst case)?
- At each depth shallower than $d$ we have $b-1$ nodes.
- At the maximum depth $d$ we have $b$ nodes.
- Total: $(d-1)(b-1) + b = O(bd)$, that is, $O(bm)$.
[Figure: two search trees, one with levels d = 0 to 2 and goal node G, one with levels d = 0 to 4.]
Blind Search Strategies (cont.)
Complexity

Criterion    Breadth-First   Depth-First
Time         b^d             b^m
Space        b^d             bm
Optimal?     Yes             No
Complete?    Yes             No

b: branching factor, d: solution depth, m: maximum depth
Depth-first vs. Breadth-first
Advantages of depth-first:
- Simple to implement.
- Needs relatively little memory for storing the state space.
Disadvantages of depth-first:
- Sometimes fails to find a solution (it may get stuck in an infinitely long branch), so it is not complete.
- Not guaranteed to find an optimal solution (it may not find the shortest-path solution).
- Can take a lot longer to find a solution.
Advantages of breadth-first:
- Guaranteed to find a solution if one exists, so it is complete.
- Depending on the problem, can be guaranteed to find an optimal solution.
Disadvantages of breadth-first:
- More complex to implement.
- Needs a lot of memory for storing the state space if the search space has a high branching factor.
HEURISTIC SEARCH ALGORITHM
Heuristic: we use a heuristic function or domain knowledge in order to explore the most promising state first. [Heuristics are formalized as rules for choosing those branches in a state space that are most likely to lead to an acceptable solution.]
This may lead to a sub-optimal solution or fail to find any solution (i.e., heuristics do not guarantee the best solution or even a solution).
AI problem solvers employ heuristics in two basic situations:
1. The problem may not have an exact solution because of inherent ambiguities in the problem statement or the available data (examples: medical diagnosis, vision).
2. An exact method exists but is too inefficient, because it is not feasible to examine every state (examples: theorem proving, chess).
EXAMPLE: Heuristic functions for the 8-tile puzzle
1. The number of tiles out of place. [The state that has the fewest tiles out of place is probably closer to the desired goal and would be the best to examine next.]
2. The sum of the distances between each tile and its correct position in the goal state.
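Both heuristics are easy to state in code. The sketch below makes several assumptions that are not in the slides: boards are 9-tuples read row by row, `0` marks the blank, the `GOAL` layout is a commonly used one chosen for illustration, and distance is measured as horizontal plus vertical moves (Manhattan distance).

```python
# Assumed goal layout (row by row); 0 is the blank.
GOAL = (1, 2, 3,
        8, 0, 4,
        7, 6, 5)

def tiles_out_of_place(state, goal=GOAL):
    """Heuristic 1: count tiles (not the blank) that are not in their goal cell."""
    return sum(1 for s, g in zip(state, goal) if s != 0 and s != g)

def sum_of_distances(state, goal=GOAL):
    """Heuristic 2: sum over tiles of row distance plus column distance to the goal cell."""
    total = 0
    for idx, tile in enumerate(state):
        if tile == 0:
            continue
        gidx = goal.index(tile)
        total += abs(idx // 3 - gidx // 3) + abs(idx % 3 - gidx % 3)
    return total

start = (2, 8, 3,
         1, 6, 4,
         7, 0, 5)
print(tiles_out_of_place(start), sum_of_distances(start))
```

The second heuristic is never smaller than the first, because every misplaced tile contributes at least one move.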
EXAMPLE (cont.)
[Figure: three heuristics applied to states in the 8-puzzle, with the goal state shown: tiles out of place, sum of distances out of place, and the number of direct tile reversals.]
HILL-CLIMBING SEARCH
- Expand the current state in the search and evaluate its children.
- The best child is selected for further expansion.
- Neither its siblings nor its parent are retained.
- Search halts when it reaches a state that is better than any of its children (i.e., the process ends when all operators have been applied and none of the resulting states is better than the current state).
Disadvantages:
1. Local maximum.
2. Flat area.
3. Can't backtrack. (WHY?)
Advantage: low memory requirement.
EXAMPLE on HILL-CLIMBING
[Figure: search tree rooted at S with nodes A, B, C, D, E, F, H, I and goal nodes G1, G2.]
S → B → E → G1
Cost = 1 + 2 + 3 = 6
Search path = [S(0), B(1), E(2), G1(3)]
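The trace in this example can be reproduced with a few lines of Python. Everything about the tree is an assumption read off the flattened figure: the parent/child structure, the numeric scores, and the convention that a lower score is better. The sketch follows the example trace (always move to the best child, keep nothing else) and stops at a goal or a dead end; it does not apply the "halt when no child is better" test, since the trace itself does not.

```python
# Assumed tree: node -> list of (child, score); lower score is better.
TREE = {
    "S": [("A", 4), ("B", 1)],
    "A": [("C", 1), ("D", 2)],
    "B": [("E", 2), ("F", 3)],
    "E": [("H", 4), ("G1", 3)],
    "F": [("I", 2), ("G2", 1)],
}
GOALS = {"G1", "G2"}

def hill_climb_tree(start):
    """Return (path, cost, reached_goal) for greedy descent without backtracking."""
    path, cost, node = [start], 0, start
    while node not in GOALS:
        children = TREE.get(node, [])
        if not children:
            return path, cost, False      # dead end: siblings and parent were discarded
        node, score = min(children, key=lambda c: c[1])   # keep only the best child
        path.append(node)
        cost += score
    return path, cost, True

print(hill_climb_tree("S"))
```

Because only the current node is kept, memory use is constant, which is also why the search cannot recover from a poor early choice.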
HILL-CLIMBING SEARCH (cont.) Hill climbing is an optimization technique which belongs to the family of local search. It is best used in problems with “the property that the state description itself contains all the information needed for a solution” The algorithm is memory efficient since it does not maintain a search tree: It looks only at the current state and immediate future states. Hill climbing attempts to iteratively improve the current state by means of an evaluation function. “Consider all the [possible] states laid out on the surface of a landscape. The height of any point on the landscape corresponds to the evaluation function of the state at that point”.
HILL-CLIMBING SEARCH (cont.) In contrast with other iterative improvement algorithms, hill-climbing always attempts to make changes that improve the current state. In other words, hill-climbing can only advance if there is a higher point in the adjacent landscape. For example, hill climbing can be applied to the travelling salesman problem. It is easy to find an initial solution that visits all the cities but will be very poor compared to the optimal solution. The algorithm starts with such a solution and makes small improvements to it, such as switching the order in which two cities are visited. Eventually, a much shorter route is likely to be obtained.
TSP Example
[Figure: four cities A, B, C and D.]
A, B, C and D are cities. Visit all cities using the shortest route.
HILL-CLIMBING SEARCH (cont.)
Hill climbing can get stuck at a local maximum. Consider the following tree, in which a is the initial state and h and k are final states. The numbers near the states are the heuristic values. When hill climbing is run on the tree, we get a -> f -> g, and here we get stuck at the local maximum g. Hill climbing can't go back and make a new choice (for example j or e) because it keeps no history. So how can we avoid getting stuck and reach the global maximum?
HILL-CLIMBING SEARCH (cont.)
A common way to avoid getting stuck at a local maximum with hill climbing is to use random restarts. In our example, if g is a local maximum, the algorithm stops there and then picks another random node to restart from. So if j or c were picked (or possibly a, b, or d), you would find the global maximum in h or k. If you once again get stuck at some local maximum, you restart again from some other random node.
Generally there is a limit on the number of times this process is repeated. After you reach this limit, you select the best among all the local maxima you reached during the process. Rinse and repeat enough times and you will find the global maximum or something close to it.
Hill climbing is NOT complete and can NOT guarantee finding the global maximum. The benefit is that it requires only a fraction of the resources, which makes it a very effective optimization technique in practice.
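Random restarts can be sketched on a toy problem. The example below is an illustration under stated assumptions, not the tree from the slides: the landscape is a hypothetical list of heights, the neighbours of a position are the positions immediately left and right of it, and higher is better. `hill_climb` halts when no neighbour is better; `random_restart` repeats it from random positions and keeps the best peak found.

```python
import random

# Hypothetical 1-D landscape: a local maximum (5) and a global maximum (9).
LANDSCAPE = [1, 3, 5, 4, 2, 6, 9, 7, 3]

def hill_climb(values, start):
    """Climb from start until no neighbour is higher; return the final index."""
    current = start
    while True:
        neighbours = [i for i in (current - 1, current + 1) if i in range(len(values))]
        if not neighbours:
            return current
        best = max(neighbours, key=lambda i: values[i])
        if values[current] >= values[best]:
            return current                # local maximum (or flat area): halt
        current = best

def random_restart(values, restarts, seed=0):
    """Run hill_climb from random starts; return the index of the best peak seen."""
    rng = random.Random(seed)
    best = None
    for _ in range(restarts):
        peak = hill_climb(values, rng.randrange(len(values)))
        if best is None or values[peak] > values[best]:
            best = peak
    return best

print(hill_climb(LANDSCAPE, 0), random_restart(LANDSCAPE, 50))
```

Starting at index 0 the climb stops at the local maximum of height 5; with enough restarts some run starts in the basin of the global maximum and the best peak is kept.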
BEST FIRST SEARCH
Definition
Best-first search is another, more informed heuristic algorithm. In its most general form it is a simple heuristic search algorithm. "Heuristic" here refers to a general problem-solving rule or set of rules that does not guarantee the best solution, or even any solution, but serves as a useful guide for problem solving. Best-first search is a graph-based search algorithm, meaning that the search space can be represented as a series of nodes connected by paths.
BEST FIRST SEARCH (cont.)
How it works
The name "best-first" refers to the method of exploring the node with the best "score" first. An evaluation function is used to assign a score to each candidate node. The algorithm maintains two lists, one containing the candidates yet to be explored (OPEN) and one containing the nodes already visited (CLOSED). States in OPEN are ordered according to some heuristic estimate of their "closeness" to a goal. This ordered OPEN list is referred to as a priority queue.
Since all unvisited successor nodes of every visited node are included in the OPEN list, the algorithm is not restricted to exploring only the successors of the most recently visited node. In other words, the algorithm always chooses the best of all unvisited nodes that have been generated, rather than being restricted to a small subset such as immediate neighbors. Other search strategies, such as depth-first and breadth-first, have this restriction. The advantage of this strategy is that if the algorithm reaches a dead-end node, it will continue to try other nodes.
BEST FIRST SEARCH (cont.)
Algorithm
Best-first search in its most basic form consists of the following steps:
1. Define the OPEN list with a single node, the starting node.
2. Check whether OPEN is empty. If it is empty, the algorithm returns failure and exits.
3. Remove the node with the best score, n, from OPEN and place it in CLOSED.
4. "Expand" the node n, where expansion is the identification of the successor nodes of n.
5. Check each of the successor nodes to see whether one of them is the goal node. If any successor is the goal node, the algorithm returns success and the solution, which consists of a path traced backwards from the goal to the start node. Otherwise, it proceeds to the sixth step.
6. For every successor node, apply the evaluation function, f, to it, then check whether the node has been in either OPEN or CLOSED. If the node has not been in either, it is added to OPEN.
7. Loop back to the 2nd step. This loop is only broken if the algorithm returns success in step 5 or failure in step 2.
BEST FIRST SEARCH (cont.)
Algorithm (cont.)
The algorithm is represented here in pseudo-code:
1. Define a list, OPEN, consisting solely of a single node, the start node, s.
2. IF the list is empty, return failure.
3. Remove from the list the node n with the best score (the node where f is the minimum), and move it to a list, CLOSED.
4. Expand node n.
5. IF any successor to n is the goal node, return success and the solution (by tracing the path from the goal node to s).
6. FOR each successor node:
   a) apply the evaluation function, f, to the node.
   b) IF the node has not been in either list, add it to OPEN.
7. Go back to the 2nd step.
EXAMPLE on Best-First Search
[Figure: search tree rooted at S with nodes A, B, C, D, E, F, H, I and goal nodes G1, G2.]
open = [S(0)]; closed = []
open = [B(1), A(4)]; closed = [S(0)]
open = [E(2), F(3), A(4)]; closed = [S(0), B(1)]
open = [F(3), G1(3), A(4), H(4)]; closed = [S(0), B(1), E(2)]
open = [G2(1), I(2), G1(3), A(4), H(4)]; closed = [S(0), B(1), E(2), F(3)]
Cost = 1 + 3 + 1 = 5
Search path = [S(0), B(1), E(2), F(3), G2(1)]
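The open/closed trace above can be reproduced with a priority queue. This is a sketch under assumptions: the tree structure and scores are read off the flattened figure, ties are broken in favour of the node that entered OPEN first, and the goal test is applied when a node is removed from OPEN, which is what the trace does (a goal-on-generation test, as in step 5 of the pseudo-code, would stop earlier at G1).

```python
import heapq
from itertools import count

# Assumed tree: node -> list of (child, score); lower score is better.
TREE = {
    "S": [("A", 4), ("B", 1)],
    "A": [("C", 1), ("D", 2)],
    "B": [("E", 2), ("F", 3)],
    "E": [("H", 4), ("G1", 3)],
    "F": [("I", 2), ("G2", 1)],
}
GOALS = {"G1", "G2"}

def best_first(start):
    """Return (solution_path, path_cost, closed_order); path is None on failure."""
    tie = count()                                  # first-in wins among equal scores
    open_heap = [(0, next(tie), start, [start], 0)]
    seen = {start}                                 # nodes ever placed on OPEN
    closed = []
    while open_heap:
        _, _, node, path, cost = heapq.heappop(open_heap)   # best score first
        closed.append(node)
        if node in GOALS:
            return path, cost, closed
        for child, score in TREE.get(node, []):
            if child not in seen:                  # not on OPEN or CLOSED before
                seen.add(child)
                heapq.heappush(
                    open_heap, (score, next(tie), child, path + [child], cost + score))
    return None, 0, closed

print(best_first("S"))
```

The third returned value is the order in which nodes were moved to CLOSED, which corresponds to the search path listed in the example.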
BEST FIRST SEARCH (cont.)
Will the best-first algorithm always find the shortest path?
[Figure: example graph with goal node G.]
BEST FIRST SEARCH (cont.)
1. It may get stuck in an infinite branch that doesn't contain the goal.
2. It is not guaranteed to find the shortest-path solution.
Memory requirement:
- In the best case: the same as depth-first search.
- In the average case: between depth-first and breadth-first search.
- In the worst case: the same as breadth-first search.
GREEDY BEST FIRST SEARCH
Greedy best-first search uses the heuristic estimate h(n).
EXAMPLE: S is the initial state; G1 and G2 are goals.
[Figure: search tree rooted at S with nodes A, B, C, D, E, F, H, I and goal nodes G1, G2.]
The table shows the heuristic estimates:

node   h(n)     node   h(n)     node   h(n)
A      11       D      8        H      7
B      5        E      4        I      3
C      9        F      2
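GREEDY BEST FIRST SEARCH (cont.)
Expanding S: A (h(n) = 11), B (h(n) = 5); B is chosen.
Expanding B: E (h(n) = 4), F (h(n) = 2); F is chosen.
Expanding F: I (h(n) = 3), G2 (h(n) = 0); G2 is chosen.
Search path: S, B, F, G2
Solution: S, B, F, G2
Cost = 1 + 3 + 1 = 5
Greedy search reaches the solution along a shorter search path than the best-first example (S, B, F, G2 instead of S, B, E, F, G2), but it is not guaranteed to find the optimal solution.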
SOLUTIONS AND A ALGORITHM
Solution for the worst-case memory requirement: use the n-beam search algorithm, which keeps only the best n states in memory.
Solution for not finding the goal or the shortest path: use a heuristic function that also takes into account how far a state is from the initial state, so that for any state n:
f(n) = g(n) + h(n)   (A algorithm)
where:
- g(n) measures the distance between the initial state and state n (the actual length of the path from the start to n);
- h(n) is a heuristic estimate of the distance from state n to a goal.
SOLUTIONS (CONT.)
Do we get the shortest path with these solutions?
[Figure: example graph with goal node G; nodes are labeled with h(n) values 7, 5, 4, 1, 2, g(n) values 1, 2, 3 and f(n) values 8, 6, 6, 3, 5.]
No, because sometimes h(n) overestimates the actual distance to the goal.
A AND A* ALGORITHM
f(n) is used to avoid getting stuck in an infinitely long branch.
When the best-first search algorithm uses the evaluation function f(n) = h(n) + g(n), it is called the A algorithm.
But the A algorithm doesn't always give us the shortest path, because h(n) may be an overestimate.
Example: in the last graph, h(n) = 7 means that we estimate 7 moves are needed to reach the goal.
A* ALGORITHM
If we guarantee that h(n) never overestimates, that is, h(n) ≤ h*(n) for all n, then f(n) = h(n) + g(n) gives us the shortest path.
If h(n) ≤ h*(n) for all n, then h(n) is called an admissible heuristic, where h(n) is the heuristic value and h*(n) is the actual cost from n to the goal.
The A algorithm with an admissible heuristic is called the A* algorithm.
A* algorithm: f(n) = h(n) + g(n)
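EXAMPLE on A* ALGORITHM (cont.)
Use the A* search algorithm to find the solution. Initial state: S. Goal state: G1 or G2.
[Figure: search tree rooted at S; S has children A and B, A has children C and D, B has children E and F, E has children H and G1, F has children I and G2.]
Edge costs (the g labels in the figure): S-A 4, S-B 1, A-C 1, A-D 2, B-E 2, B-F 3, E-H 4, E-G1 3, F-I 2, F-G2 1.

node   h(n)     node   h(n)     node   h(n)
A      11       D      8        H      7
B      5        E      4        I      3
C      9        F      2

Solution, with g(n) the cost of the path from S to n:
f(A) = h(A) + g(A) = 11 + 4 = 15
f(B) = h(B) + g(B) = 5 + 1 = 6
f(E) = h(E) + g(E) = 4 + 3 = 7
f(F) = h(F) + g(F) = 2 + 4 = 6
f(I) = h(I) + g(I) = 3 + 6 = 9
f(G2) = h(G2) + g(G2) = 0 + 5 = 5
Solution path: S, B, F, G2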
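A compact sketch of the search with f(n) = g(n) + h(n) on this example follows. The assumptions are: the tree structure and edge costs are read off the flattened figure; g(n) accumulates edge costs along the path from S, following the definition of g(n) as the actual path length from the start; and h is taken as 0 for S and for both goals, since the table does not list them. The sketch does not check whether h is admissible on this tree.

```python
import heapq
from itertools import count

# Assumed tree: node -> list of (child, edge cost).
EDGES = {
    "S": [("A", 4), ("B", 1)],
    "A": [("C", 1), ("D", 2)],
    "B": [("E", 2), ("F", 3)],
    "E": [("H", 4), ("G1", 3)],
    "F": [("I", 2), ("G2", 1)],
}
# Heuristic table from the example; nodes not listed default to 0 (assumption).
H = {"A": 11, "B": 5, "C": 9, "D": 8, "E": 4, "F": 2, "H": 7, "I": 3}

def a_star(start, goals):
    """Return (path, cost) of the first goal removed from OPEN, or (None, None)."""
    tie = count()
    open_heap = [(H.get(start, 0), next(tie), 0, start, [start])]
    best_g = {start: 0}                       # cheapest known path cost per node
    while open_heap:
        _, _, g, node, path = heapq.heappop(open_heap)      # smallest f first
        if node in goals:
            return path, g
        if g > best_g[node]:
            continue                          # stale entry: a cheaper path was found
        for child, cost in EDGES.get(node, []):
            g_child = g + cost                # g accumulates along the path
            if best_g.get(child, float("inf")) > g_child:
                best_g[child] = g_child
                f_child = g_child + H.get(child, 0)
                heapq.heappush(
                    open_heap, (f_child, next(tie), g_child, child, path + [child]))
    return None, None

print(a_star("S", {"G1", "G2"}))
```

With these assumptions the nodes leave OPEN in the order S, B, F, G2, and the returned cost is the accumulated g of the goal.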