Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Update on Learning By Observation Learning from Positive Examples Only Tolga Konik University of Michigan.

Similar presentations


Presentation on theme: "1 Update on Learning By Observation Learning from Positive Examples Only Tolga Konik University of Michigan."— Presentation transcript:

1

2 1 Update on Learning By Observation Learning from Positive Examples Only Tolga Konik University of Michigan

3 2 GOAL Generate AI agents by observing expert task execution Engineering Goal Reduce the cost of agent development Reduce the expertise required to develop agent development. AI Goal Agents that improve themselves observing experts

4 3 Learning Framework Episodic Database Behavior trace rules Annotations Agent Architecture Agent Program Background Knowledge examples Expert Annotated Behavior trace Behavior Recorder Environmental Interface Training Set Generator Concept Learner (ILP) Knowledge Generator Environment external Internal

5 4 Learning with Redux Episodic Database Behavior trace rules Annotations Agent Architecture Agent Program Background Knowledge examples Expert Annotated Behavior trace Behavior Recorder Environmental Interface Training Set Generator Concept Learner (ILP) Knowledge Generator Environment external Internal Redux

6 5 Current Experiments Episodic Database Behavior trace rules Annotations Agent Architecture Agent Program Background Knowledge examples Expert Annotated Behavior trace Behavior Recorder Environmental Interface Training Set Generator Concept Learner (ILP) Knowledge Generator Environment external Internal Expert Soar Agent

7 6 Episodic Database Behavior trace rules New Agent Program Annotations Agent Architecture Agent Program Background Knowledge examples Annotated Behavior trace Behavior Recorder Environmental Interface Training Set Generator Concept Learner (ILP) Knowledge Generator Environment external Internal Expert Learning Framework: Mode2

8 7 Experiments in Haunt 2 Domain

9 8 d1d2d3d4 Move-to example move-to-via-node move-to-connected-node r1 r2 r3 r4 d1 d2 d3d4 d5 d6 i4 i3 d5b d6b r3 move-to-area

10 9 move-to-via-node(Node) move-to-area(Area) An Example in Haunt Domain r1 r2 r3 r4 d1 d2 d3d4 d5 d6 move-to-connected-node(Node)

11 10 r1 r2 r3 r4 d1 d2 d3d4 d5 d6 move-to-via-node(Node) move-to-area(Area) move-to-connected-node(Node) An Example in Haunt Domain

12 11 r1 r3 d1 Correct selection condition for move-to-via-node move-to-via-node(Node) move-to-area(Area) move-to-connected-node(Node) An Example in Haunt Domain

13 12

14 13 Termination(A) A positivenegative Example Generation Operator Concepts

15 14 Selection(A) AB positive negative Example Generation Operator Concepts

16 15 A Positive Example: selection(Sit 20, move-to-via-node(d 1 ) ) r1 r2 r3 r4 d1 d2 d3d4 d5 d6 i4 i3 d5b d6b Learning Examples

17 16 General to Special Search with positive and negative examples

18 17 General to Special Search with positive and negative examples

19 18 General to Special Search with positive and negative examples

20 19 General to Special Search with positive and negative examples

21 20 General to Special Search with positive and negative examples

22 21 move-to-via-node Selection(move-to-via-node) r1 r2 r3 r4 d1 d2 d3d4 d5 d6 i4 i3 d5b d6b move-to-connected-node Problem in Choosing Parameters

23 22 move-to-via-node Positive Negative Selection(move-to-via-node) r1 r2 r3 r4 d1 d2 d3d4 d5 d6 i4 i3 d5b d6b move-to-connected-node Problem in Choosing Parameters

24 23 Specific to General Learning with Positive Examples only d1 Positive Difficult to deal with inconsistent examples

25 24 General to Specific Learning with Positive Examples Only Positive

26 25 General to Specific Learning with Positive Examples Only d1 Positive

27 26 A Positive Example of move-to-via-node: r1 r2 r3 r4 d1 d2 d3d4 d5 d6 i4 i3 d5b d6b Learning Examples

28 27 Random Examples of move-to-via-node r1 r2 r3 r4 d1 d2 d3d4 d5 d6 i4 i3 d5b d6b For each positive example, use the same situation with parameters selected in other situations Learning Examples

29 28 Nuggets Move-to operators are learned in Haunt domain ~ 3 mins of trace ~ 35000 situations ~ 10 min to prepare examples ~20 min for learning.

30 29 Coals Missing Components It is still research not a tool


Download ppt "1 Update on Learning By Observation Learning from Positive Examples Only Tolga Konik University of Michigan."

Similar presentations


Ads by Google