Presentation is loading. Please wait.

Presentation is loading. Please wait.

Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Myopic Unfalsified Control: A.

Similar presentations


Presentation on theme: "Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Myopic Unfalsified Control: A."— Presentation transcript:

1 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Myopic Unfalsified Control: A Gradient Approach to Adaptation Michael G. Safonov University of Southern California

2 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Background: Unfalsified Control ONLY 3 elements candidate controller hypotheses FALSIFIED COMPUTER SIEVE Unfalsified K M. G. Safonov. In Control Using Logic-Based Switching, Spring-Verlag, 1996. goals data

3 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 An Achilles heel of modern system theory has been the habit of ‘proof by assumption’ Theorists typically give insufficient attention to the possibility of future observations which may be at odds with assumptions. Motivation for Unfalsification:

4 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 “data-driven” Galileo: open-eyed “data-driven” Reality is what we observe. Two ways to learn & adapt: Observation vs. Introspection vs. ‘Curve-Fitting’ Plato: introspective “assumption driven” Reality is an ideal, observable only through noisy sensors. ‘Probabilistic Estimation’ MODELS approximate Observed DataData approximates Unobserved TRUTH

5 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Count the teeth… Bertrand Russell 1949 Aristotle (y 1,u 1 ) (y 2,u 2 ) (y 3,u 3 ) (y 4,u 4 ) (y 5,u 5 ) (y 6,u 6 ) (y 7,u 7 ) LOOK IN THE MOUTH height 2  {

6 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Observation vs. Introspection in adaptive control “Unfalsified: Data Driven” goals are modest: –no guaranteed predictions of future stability –just consistency of goals, decisions and data no troublesome assumptions, parsimonious formulation : –DATA –GOALS –DECISIONS Fast, reliable, exact “Traditional: Assumption Driven” goals are ambitious: –guaranteed future stability –Cost & estimation convergence many troublesome assumptions –“the ‘true’ plant is in model set”, –“noise I.I.D.” –“bounds on parameters, probabilities” –…, “linear time-invariant, minimum- phase plant, order < N” Slow, quasi-static, approximate Remarkably, some leading “assumption driven” control theorists have held that observed DATA inconsistent with ASSUMPTIONS should be ignored (cf. M. Gevers et al., "Model Validation in Closed-Loop", ACC, San Diego, 1999)

7 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Unfalsified Adaptive Control candidate controllers FALSIFIED COMPUTER SIEVE LEARNING FEEDBACK LOOPS K Unfalsified Controllers K given M. G. Safonov. In Control Using Logic-Based Switching, Spring-Verlag, 1996. GOALS CONTROLLER SELECTOR DATA DECISIONS

8 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 3 elements of unfalsified control: DATA:CONTROLLERS:GOALS candidate controller hypotheses FALSIFIED COMPUTER SIEVE LEARNING FEEDBACK LOOPS Unfalsified Controllers K M. G. Safonov. In Control Using Logic-Based Switching, Spring-Verlag, 1996. goals CONTROLLER SELECTOR data

9 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Unfalsification Algorithm Brugarolas & Safonov, CCA/CACSD ‘99 UNFALSIFIED ADAPTIVE CONTROL: The ability of each candidate controller to meet the performance goal is treated as a hypothesis to be tested directly against evolving real-time measurement data. A candidate controller need not be in the loop to test the hypothesis.

10 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Unfalsified Controllers Unfalsification A candidate controller K is FALSIFIED when the data (y,u) proves existence of a ‘fictitious’ that would violate performance spec’s if the K were in the loop. Controllers may remain UNFALSIFIED until the data proves otherwise. Or, fading memory may be used to reinstate previously falsified controllers Falsified Controllers Key points: ● A controller need not be in the loop to be falsified. ● Even a single data point can falsify many controllers. Unknown Plant K gain  u y e - + y r

11 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Plant Data:at time t=0, (u,y)=(1,1) Candidate K’s: u=Ke real gain Goal:|e(t)/r(t)| K is unfalsified if |1/(1+K)| < 0.1 => unfalsified K’s: K>9 or K<-11 Trivial Example Unknown Plant K gain  r uye - +

12 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 DATA-DRIVEN Behavioral Problem Formulation Observations operator maps input-output signals to measurement signals. Truncated Space results from applying the observations operator to a signal space. Typically is the experimental observation time sampling operator, so returns values of z(t) only for past time intervals over which experimental observations of z(t) have been recorded. is a projection operator.

13 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 BACKGROUND: Willems’ Behavioral System Theory

14 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 BEHAVIORAL ADAPTIVE CONTROL

15 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Application: Behavioral MRAC Behavioral Representation of Standard Class of Candidate MRAC Controllers

16 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Behavioral MRAC (cont.)

17 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Behavioral MRAC (cont.) A matrix pencil e-value computation gives optimal.

18 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Data Driven Behavioral MRAC Computation

19 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Noisy Plant Simulation Example

20 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Data Driven Behavioral MRAC Simulation Results u(t) y(t) w m *r(t) r(t) cost opt (t) u(t)  (t)

21 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Discussion: Data Driven Behavioral MRAC Unstable plant stabilized in real time Computation effort does not grow with  No need for controller parameter gridding Exponential forgetting factor in cost – no controller ‘gain windup’ – behaves like classical  - modification Noise tolerant

22 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Jun & Safonov, CCA/CACSD ‘99 control input u(t) proportional gain k P (t) integral gain k I (t) derivative gain k D (t), plant output y(t) Evolution of unfalsified setTime responses Tsao & Safonov, IEEE Trans, AC-42, 1997. time-history of unfalsified candidate control gains time-history of plant response and optimal unfalsified control gain Simulation Example: ACC Benchmark

23 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Simulation Example: Missile Learns control gains Adapts quickly to compensate for damage & failures Superior performance Specified target response bound Actual response Commanded response Brugarolas, Fromion & Safonov, ACC ’98 Unfalsified adaptive missile autopilot: discovers stabilizing control gains as it flies, nearly instantaneously maintains precise sure-footed control Brugarolas, Fromion and Safonov, ACC98

24 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Simulation Example: Unfalsified ‘PID Universal Controller’ ————— Jun & Safonov, CCA/CACSD ‘99 Example: Adaptive PID Goal: Unfalsified adaptive control loop stabilizes in real-time Unstable Plant 30 Candidate PID Controllers: K I =[2, 50, 100] K D = [.5,.6] K P = [5, 10, 25, 80, 110]

25 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Other People’s Successes with Unfalsified Control Emmanuel Collins et al. (Weigh Belt Feeder adaptive PID tuning, CDC99) Kosut (Semiconductor Mfg. Process run-to- run tuning, CDC98) Woodley, How & Kosut (ECP Torsional disk control, adaptive tuning, ACC99 Razavi & Kurfess, Int. J ACSP, Aug. 2001

26 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 How Unfalsified Works ftp://routh.usc.edu/pub/safonov/PID_Demo_for_MATLAB6.zip

27 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Inside the Unfalsification Algorithm Block Data In Controllers Out Bank of Filters (one for each candidate K)

28 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Why doesn’t everybody use unfalsified control? Unfalsified Adaptive Control is better: –fast –reliable –exact Full and precise use of all available evidence –meets Russell’s criterion: “Intimate association between observation and hypothesis” But, it’s not yet widely used. Why?

29 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Unfalsified Adaptive Control: Some Invalid Concerns Strange noise-free, model-free formulation You can’t prove convergence or stability (unless you assert the usual adaptive control assumptions) Claims seem to good to be true: –Fast, reliable –Handles noise without noise models –Works for non-minimum phase plants –No need for plant degree bounds –No need for plant models

30 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Unfalsified Adaptive Control: A Valid Concern Unfalsified Adaptive Control can be computationally intensive: –Computationally intensive when there are many candidate controllers –Requires global minimization of unfalsified cost levels –May require ‘gridding’ (like multi-model adaptive)

31 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Solutions: How to address concerns over unfalsified? 1.Direct comparision with traditional methods? 2.Make unfalsified more like traditional methods? –make it slow, quasi-static (break its legs): differential equation update, or run-to-run controller update –make it myopic, near-sighted (break its spectacles): do local optimization only, not global use steepest descent, gradient-based approaches to limit computational complexity

32 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Traditional assumptions: –LTI minimum phase plant No –Known upper bound for plant orderNo –Known relative degree of the plantNo –Known sign of the high frequency gainNo Alternative ‘universal controllers’ (Martensson, 1985; Fu et al., 1986) : –Require less prior information YES –Search a set of controllers and switch to the one satisfying a given performanceYES –Assume the plant to be LTINo Traditional & Alternative Unfalsified Solution #1: Direct Comparison

33 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 EXAMPLE: Direct Comparision Narendra MRAC vs. Unfalsified MRAC Cost Function J( ,t ):Unfalsified if J( ,t) < threshold = 200 J( ,t) = α e( ,t) 2 + β s 0 t exp{-σ (t- τ)} (e ( , τ) 2 ) d τ < 200 ; weights  =  =1 where e( ,t) = fict ( ,t) * W m (t) - y p (t) σ=.001 is ‘fading-memory forgetting-factor (sigma-modification) ’ is the ‘fictitious reference signal’ Controller K(  ) upup ypyp ypyp REFERENCE MODEL W m (s) + r - - + Error e(t) ymym Unknown Plant Candidate Controllers K(  ) : where  1 =[-2;-1;0;2;4] ;  2 =[2;5.5;6;6.5;7;8] and  3 =[-10;-6;-5;-4;-3;2] A. Paul and M. Safonov, 2002

34 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Example: SimuLink Simulation of Unfalsified Control Narendra MRAC vs. Unfalsified Adaptive A. Paul and M. Safonov, 2002

35 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Simulation Results: Narendra MRAC vs. UNFALSIFIED Simulation Parameters: Reference signal: 5 cost(t) + 10 cos (5t) Reference model: W m = 1/ (s+1). Unknown plant, used in simulation: W p = (s+1) / (s 2 -3s+2) Narendra MRAC vs. Unfalsified MRAC Switching sequence: Unfalsification process Narendra & Annaswamy, 1988 Convergence Of Controller parameters : Error Signal : A. Paul and M. Safonov, 2002

36 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Direct Comparison: MRAC vs. Unfalsified Tsao & Safonov, CCA/CACSC’99 T. C. Tsao and M. G. Safonov. Int. J. Adaptive Control and Signal Processing, 15:319–334, 2001.

37 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Solution #2: Simplified Gradiant-Based ‘Myopic’ Unfalsified Controller ( IFAC 2002 ) Jun and Safonov, IFAC 2002

38 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Similar in most ways to traditional adaptive –Simple to implement, low computational load –Uses only local (myopic) analysis of data –Slow, quasi-static adaptation –Same as ‘MIT Rule’ if MRAC cost and no s dt But even myopic unfalsified is better –Always fewer assumptions, often slightly faster –Data-driven scientific logic of unfalsification is inherently more reliable Properties of ‘Myopic’ Unfalsified Adaptive Control Jun and Safonov, IFAC 2002

39 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Myopic Unfalsified Simulation (Example 1) Jun and Safonov, IFAC 2002

40 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Myopic Simulation Result Jun and Safonov, IFAC 2002

41 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Myopic Simulation (Example 2) Jun and Safonov, IFAC 2002

42 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Simulation Result (Example 2) Jun and Safonov, IFAC 2002

43 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Direct comparison with Hjalmarsson’s gradient-based IFT: Plant Data: at time t = 0, K ( 0 ) = 8; T = 10s; Sample time = 0.1 Unknown Plant: 1/s(s+2) with white noise; Candidate Controllers: K > 0; Cost Function: IFT run-to-run Unfalsified finite-memory Unknown Plant K gain  r uye - + Wang and Safonov, 2002 Hjalmarsson et al., IEEE CDC, 1994 vs.

44 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 IFT vs. Unfalsified: Simulation Setup IFT Unfalsified Wang and Safonov, 2002

45 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 IFT vs. Unfalsified: Comparison Wang and Safonov, 2002 Experiment #1 Experiment #2 Experiment #3 Experiment #1 Experiment #2 Experiment #3 IFT: Error Signal Squared Unfalsified: Error Signal Squared

46 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 IFT vs. Myopic Unfalsified: Discussion Both are ‘myopic’ gradient-based techniques Run-to-run vs. Continuous Update –IFT: ‘infinite’ data required before each update –Unfalsified: fast, online update as data evolves Complicated vs. Simple Structure –IFT: Requires time-invariant system; Need two or more separate experiments & quadratic cost –Myopic Unfalsified: Can be used for any plant and any smooth cost. Special Probing Signal vs. Any Measurements –IFT: Special test signal must probe system to compute gradients –Unfalsified: Computes cost gradient with any test signal

47 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Conclusions Introspective models are not adequate for analyzing adaptation To explain learning and adaptation, we need the open-eyed, data-driven scientific logic of unfalsification Unfalsified control is better than traditional assumption-driven adaptive control, even when crippled and myopic

48 Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Selected References http://routh.usc.edu


Download ppt "Michael G. Safonov University of Southern California AFOSR Dynamics and Control Workshop Pasadena,CA. 12 -14 August 2002 Myopic Unfalsified Control: A."

Similar presentations


Ads by Google