Computational Intelligence, Winter Term 2013/14, Lecture 10. Prof. Dr. Günter Rudolph, Lehrstuhl für Algorithm Engineering (LS 11), Fakultät für Informatik, TU Dortmund.

Design of Evolutionary Algorithms

Three tasks:
1. Choice of an appropriate problem representation.
2. Choice / design of variation operators acting in the problem representation.
3. Choice of strategy parameters (includes initialization).

ad 1) Different "schools":

(a) Operate on a binary representation and define a genotype/phenotype mapping.
    + can use standard algorithms
    – the mapping may induce an unintentional bias in the search

(b) No doctrine: use the "most natural" representation.
    – must design variation operators for the specific representation
    + if the design is done properly, there is no bias in the search

ad 1a) Genotype-phenotype mapping

Scenario: no standard algorithm for the search space X is available; the original problem is f: X → R^d.

A standard EA performs variation on binary strings b ∈ B^n; the fitness of an individual b is evaluated via (f ∘ g)(b) = f(g(b)), where g: B^n → X is the genotype-phenotype mapping. The selection operation is independent of the representation.

[Diagram: B^n —g→ X —f→ R^d]

Genotype-phenotype mapping B^n → [L, R] ⊂ R

● Standard encoding for b ∈ B^n: read the bit string as a binary number. Problem: Hamming cliffs.

Example with L = 0, R = 7, n = 3:

    genotype   000  001  010  011  100  101  110  111
    phenotype    0    1    2    3    4    5    6    7
    bits flipped between neighbors: 1, 2, 1, 3, 1, 2, 1

The step from phenotype 3 to 4 (011 → 100) changes all 3 bits at once: a Hamming cliff.
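
A minimal sketch of this decoding in Python (the function names and the affine scaling to [L, R] are illustrative assumptions, not from the slides):

    def decode_standard(bits, L=0.0, R=7.0):
        """Read a bit tuple as a binary integer, then scale affinely to [L, R]."""
        n = len(bits)
        k = int("".join(map(str, bits)), 2)    # value in {0, ..., 2^n - 1}
        return L + k * (R - L) / (2 ** n - 1)  # phenotype in [L, R]

    def hamming(x, y):
        """Number of differing bit positions."""
        return sum(a != b for a, b in zip(x, y))

    # The Hamming cliff: adjacent phenotypes 3 and 4 need all 3 bits flipped.
    print(decode_standard((0, 1, 1)), decode_standard((1, 0, 0)))  # 3.0 4.0
    print(hamming((0, 1, 1), (1, 0, 0)))                           # 3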

● Gray encoding for b ∈ B^n

Let a ∈ B^n be standard encoded. Then

    b_i = a_i            if i = 1
    b_i = a_{i-1} ⊕ a_i  if i > 1       (⊕ = XOR)

OK, no Hamming cliffs any longer: small changes in the phenotype "lead to" small changes in the genotype. But since we consider evolution in terms of Darwin (not Lamarck), we need the converse: small changes in the genotype must lead to small changes in the phenotype! And that fails here: the 1-bit change 000 → 100 in the genotype moves the phenotype from 0 to 7.
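
A sketch of the two directions of this mapping (the list-of-bits representation is an illustrative choice):

    def gray_encode(a):
        """b_1 = a_1 and b_i = a_{i-1} XOR a_i for i > 1."""
        return [a[0]] + [a[i - 1] ^ a[i] for i in range(1, len(a))]

    def gray_decode(b):
        """Invert the encoding: a_1 = b_1 and a_i = a_{i-1} XOR b_i for i > 1."""
        a = [b[0]]
        for bit in b[1:]:
            a.append(a[-1] ^ bit)
        return a

    # Adjacent phenotypes now differ in exactly one genotype bit (no Hamming cliff) ...
    print(gray_encode([0, 1, 1]), gray_encode([1, 0, 0]))  # [0, 1, 0] [1, 1, 0]
    # ... but one genotype bit flip can still jump far in the phenotype:
    print(gray_decode([0, 0, 0]), gray_decode([1, 0, 0]))  # [0, 0, 0] [1, 1, 1], i.e. 0 and 7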

Genotype-phenotype mapping B^{n·log(n)} → P_n (permutations of { 1, …, n })

e.g. standard encoding for b ∈ B^n: consider each index and its associated genotype entry as a unit / record / struct; sort the units with respect to their genotype values; the old indices, read off in sorted order, yield a permutation.

Example only: genotype values (5, 2, 9, 1) at indices (1, 2, 3, 4); sorting the values gives (1, 2, 5, 9), and the old indices (4, 2, 1, 3) form the permutation.
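
A sketch of this decoding (essentially the "random keys" idea; the names and example values are illustrative):

    def keys_to_permutation(values):
        """Sort the units (index, value) by genotype value; the old indices
        read off in sorted order form the permutation (1-based here)."""
        order = sorted(range(len(values)), key=lambda i: values[i])
        return [i + 1 for i in order]

    # Example only: units with values 5, 2, 9, 1 at indices 1..4.
    print(keys_to_permutation([5, 2, 9, 1]))  # [4, 2, 1, 3]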

ad 1a) Genotype-phenotype mapping

Typically required: strong causality
→ small changes in an individual lead to small changes in its fitness
→ small changes in the genotype should lead to small changes in the phenotype

But: how to find a genotype-phenotype mapping with that property?

Necessary conditions:
1) g: B^n → X can be computed efficiently (otherwise it is senseless);
2) g: B^n → X is surjective (otherwise we might miss the optimal solution);
3) g: B^n → X preserves closeness (otherwise strong causality is endangered): let d(·, ·) be a metric on B^n and d_X(·, ·) a metric on X; then

    ∀ x, y, z ∈ B^n: d(x, y) ≤ d(x, z) ⇒ d_X(g(x), g(y)) ≤ d_X(g(x), g(z)).

ad 1b) Use the "most natural" representation

Typically required: strong causality
→ small changes in an individual lead to small changes in its fitness
→ we need variation operators that obey that requirement.

But: how to find variation operators with that property?
⇒ we need design guidelines ...

ad 2) Design guidelines for variation operators

a) Reachability: every x ∈ X should be reachable from an arbitrary x_0 ∈ X after a finite number of repeated variations, with probability bounded away from 0.

b) Unbiasedness: unless knowledge about the problem has been gathered, the variation operator should not favor particular subsets of solutions; formally: the maximum entropy principle.

c) Control: the variation operator should have parameters affecting the shape of its distribution; known from theory: weaken the variation strength when approaching the optimum.

ad 2) Design guidelines in practice: binary search space X = B^n

Variation by k-point or uniform crossover and subsequent mutation.

a) Reachability: regardless of the output of crossover, bitwise mutation with mutation rate p_m ∈ (0, 1) can move from x ∈ B^n to y ∈ B^n in one step with probability

    p(x, y) = p_m^{H(x,y)} · (1 − p_m)^{n − H(x,y)},

where H(x, y) is the Hamming distance between x and y. Since min{ p(x, y) : x, y ∈ B^n } = ε > 0, we are done.
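
A small numeric check of this argument (the mutation rate and the strings are illustrative):

    def transition_prob(x, y, pm):
        """P(mutate x into exactly y) under independent bit flips with rate pm."""
        n = len(x)
        h = sum(a != b for a, b in zip(x, y))     # Hamming distance H(x, y)
        return pm ** h * (1 - pm) ** (n - h)

    # Even the worst case (all n bits must flip) has positive probability:
    print(transition_prob((0, 0, 0), (1, 1, 1), pm=1 / 3))  # (1/3)^3 ≈ 0.037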

b) Unbiasedness: don't prefer any direction or subset of points without reason ⇒ use a maximum entropy distribution for sampling!

Properties:
- distributes the probability mass as uniformly as possible;
- additional knowledge can be included as constraints: under the given constraints, sample as uniformly as possible.

Formally:

Definition: Let X be a discrete random variable (r.v.) with p_k = P{ X = x_k } for some index set K. The quantity

    H(X) = − Σ_{k ∈ K} p_k · log p_k

is called the entropy of the distribution of X. If X is a continuous r.v. with p.d.f. f_X(·), then the entropy is given by

    H(X) = − ∫ f_X(x) · log f_X(x) dx.

A distribution of a random variable X for which H(X) is maximal is termed a maximum entropy distribution. ■
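
A tiny sketch of the discrete case (with the usual convention 0 · log 0 = 0):

    import math

    def entropy(p):
        """H(X) = -sum_k p_k log p_k for a discrete distribution p."""
        return -sum(pk * math.log(pk) for pk in p if pk > 0)

    # Over a fixed support, the uniform distribution has the largest entropy:
    print(entropy([0.25] * 4))             # log 4 ≈ 1.386
    print(entropy([0.7, 0.1, 0.1, 0.1]))   # ≈ 0.940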

Excursion: Maximum Entropy Distributions

Knowledge available: a discrete distribution with support { x_1, x_2, …, x_n }, x_1 < x_2 < … < x_n < ∞, subject to Σ p_k = 1.

⇒ leads to the nonlinear constrained optimization problem

    max! H(p) = − Σ_{k=1}^n p_k log p_k   s.t.   Σ_{k=1}^n p_k = 1

Solution: via Lagrange (find a stationary point of the Lagrangian function)

    L(p; λ) = − Σ_{k=1}^n p_k log p_k + λ · ( Σ_{k=1}^n p_k − 1 ).

Partial derivatives:

    ∂L/∂p_k = − log p_k − 1 + λ = 0   ⇒   p_k = e^{λ−1}, identical for all k
    ∂L/∂λ = Σ_{k=1}^n p_k − 1 = 0     ⇒   n · e^{λ−1} = 1

⇒ p_k = 1/n for all k: the uniform distribution.

Knowledge available: a discrete distribution with support { 1, 2, …, n }, p_k = P{ X = k }, and E[X] = ν.

⇒ leads to the nonlinear constrained optimization problem

    max! H(p) = − Σ_{k=1}^n p_k log p_k
    s.t. Σ_{k=1}^n p_k = 1   and   Σ_{k=1}^n k · p_k = ν

Solution: via Lagrange (find a stationary point of the Lagrangian function)

    L(p; λ, η) = − Σ p_k log p_k + λ · (Σ p_k − 1) + η · (Σ k · p_k − ν).

Partial derivatives:

    ∂L/∂p_k = − log p_k − 1 + λ + η·k = 0   ⇒   p_k = e^{λ−1} · (e^η)^k     (*)
    ∂L/∂λ = Σ p_k − 1 = 0   ⇒   e^{λ−1} · Σ_{k=1}^n (e^η)^k = 1

(continued on the next slide)

Setting q = e^η and inserting the normalization into (*) yields

    p_k = q^k / Σ_{i=1}^n q^i   for k = 1, …, n

⇒ the discrete Boltzmann distribution. The value of q depends on ν via the third condition Σ k · p_k = ν; in general it must be determined numerically, as sketched below.

[Figure: Boltzmann distributions for n = 9 and ν = 2, 3, 4, 5, 6, 7, 8.]

The distribution specializes to the uniform distribution if ν = 5 (as expected, since 5 is the mean of the uniform distribution on { 1, …, 9 }).
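
A sketch of how q can be determined numerically for a given mean ν (the bisection bracket is an assumption; it works because the mean is increasing in q):

    def boltzmann_pmf(q, n):
        """p_k = q^k / sum_{i=1..n} q^i on the support {1, ..., n}."""
        w = [q ** k for k in range(1, n + 1)]
        s = sum(w)
        return [x / s for x in w]

    def mean(p):
        return sum(k * pk for k, pk in enumerate(p, start=1))

    def solve_q(nu, n):
        """Bisect for the q > 0 whose Boltzmann distribution has mean nu."""
        lo, hi = 1e-9, 1e9
        for _ in range(200):
            q = (lo * hi) ** 0.5   # geometric midpoint for the wide bracket
            lo, hi = (q, hi) if mean(boltzmann_pmf(q, n)) < nu else (lo, q)
        return (lo * hi) ** 0.5

    # nu = 5 on {1, ..., 9} recovers the uniform distribution, i.e. q = 1:
    print(round(solve_q(5, 9), 6))  # 1.0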

Knowledge available: a discrete distribution with support { 1, 2, …, n }, E[X] = ν, and V[X] = σ².

⇒ leads to the nonlinear constrained optimization problem

    max! H(p) = − Σ_{k=1}^n p_k log p_k
    s.t. Σ p_k = 1,   Σ k · p_k = ν,   Σ (k − ν)² · p_k = σ²

Solution: in principle via Lagrange (find a stationary point of the Lagrangian function), but analytically very complicated, if possible at all ⇒ consider special cases only. Note: the constraints are linear equations in the p_k.

Special case: n = 3, E[X] = 2, and V[X] = σ². The linear constraints uniquely determine the distribution:

    I.   p_1 +   p_2 +   p_3 = 1
    II.  p_1 + 2·p_2 + 3·p_3 = 2
    III. p_1 +   p_3 = σ²        (variance around the mean 2)

II − I gives p_2 + 2·p_3 = 1; insertion together with I and III yields

    p_1 = p_3 = σ²/2,   p_2 = 1 − σ².

The shape depends on σ²: unimodal for σ² < 2/3, uniform for σ² = 2/3, bimodal for σ² > 2/3.
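
A quick check of the three regimes (the σ² values are chosen for illustration):

    import math

    def n3_pmf(var):
        """The unique distribution on {1, 2, 3} with mean 2 and variance var."""
        return [var / 2, 1 - var, var / 2]

    for var in (0.5, 2 / 3, 1.0):
        p1, p2, p3 = n3_pmf(var)
        if math.isclose(p1, p2):
            shape = "uniform"
        elif p2 > p1:
            shape = "unimodal"
        else:
            shape = "bimodal"
        print(round(var, 3), [round(x, 3) for x in (p1, p2, p3)], shape)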

Knowledge available: a discrete distribution with unbounded support { 0, 1, 2, … } and E[X] = μ.

⇒ leads to an infinite-dimensional nonlinear constrained optimization problem

    max! H(p) = − Σ_{k=0}^∞ p_k log p_k
    s.t. Σ_{k=0}^∞ p_k = 1   and   Σ_{k=0}^∞ k · p_k = μ

Solution: via Lagrange (find a stationary point of the Lagrangian function).

Partial derivatives:

    ∂L/∂p_k = − log p_k − 1 + λ + η·k = 0   ⇒   p_k = e^{λ−1} · (e^η)^k     (*)

(continued on the next slide)

Set q = e^η. Normalization insists that q ∈ (0, 1), since Σ_{k=0}^∞ q^k = 1/(1 − q) must be finite. Inserting this yields

    p_k = (1 − q) · q^k   for k = 0, 1, 2, …

⇒ the geometric distribution. It remains to specify q; to proceed, recall that Σ_{k=0}^∞ k · q^k = q/(1 − q)².

The value of q depends on μ via the third condition:

    E[X] = Σ_{k=0}^∞ k · p_k = (1 − q) · q/(1 − q)² = q/(1 − q) = μ
    ⇒ q = μ/(1 + μ)
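
A sketch of sampling from this distribution with prescribed mean μ via inverse transform (plain Python; a library geometric sampler would do just as well):

    import random

    def sample_geometric(mu):
        """Sample P{X = k} = (1 - q) q^k with q = mu / (1 + mu), so E[X] = mu."""
        q = mu / (1.0 + mu)
        u, k = random.random(), 0
        cdf = term = 1.0 - q          # P{X = 0}
        while cdf < u:                # smallest k with CDF(k) >= u
            term *= q
            cdf += term
            k += 1
        return k

    xs = [sample_geometric(3.0) for _ in range(100_000)]
    print(sum(xs) / len(xs))  # ≈ 3.0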

[Figure: geometric distributions p_k, shown for k = 0, 1, …, 8, with means μ = 1, 2, …, 7.]

Overview (discrete maximum entropy distributions):

support { 1, 2, …, n }          → discrete uniform distribution
  and require E[X] = ν          → Boltzmann distribution
  and require V[X] = σ²         → N.N. (not the binomial distribution)

support N                       → not defined! (entropy unbounded without constraints)
  and require E[X] = μ          → geometric distribution
  and require V[X] = σ²         → ?

support Z                       → not defined!
  and require E[|X|] = μ        → bi-geometric distribution (discrete Laplace distribution)
  and require E[|X|²] = σ²      → N.N. (discrete Gaussian distribution)

Overview (continuous maximum entropy distributions):

support [a, b] ⊂ R                          → uniform distribution
support R⁺ with E[X] = μ                    → exponential distribution
support R with E[X] = μ and V[X] = σ²       → normal / Gaussian distribution N(μ, σ²)
support R^n with E[X] = μ and Cov[X] = C    → multinormal distribution N(μ, C)

with expectation vector μ ∈ R^n and covariance matrix C ∈ R^{n×n}, positive definite: ∀ x ≠ 0: x′Cx > 0.

What about maximum entropy distributions for permutations?

Guideline: only if you know something about the problem a priori, or if you have learnt something about the problem during the search, include that knowledge in the search / mutation distribution (via constraints!). Otherwise → the uniform distribution on all possible permutations, which can be sampled as shown below:

    set v[j] = j for j = 1, 2, ..., n
    for i = n down to 1:
        draw k uniformly at random from { 1, 2, ..., i }
        swap v[i] and v[k]

This generates a permutation uniformly at random in Θ(n) time.
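
The loop above is the classic Fisher–Yates shuffle; a direct Python transcription with 0-based indices (random.shuffle implements the same algorithm):

    import random

    def random_permutation(n):
        """Uniform random permutation of 0..n-1 in Theta(n) time (Fisher-Yates)."""
        v = list(range(n))
        for i in range(n - 1, 0, -1):
            k = random.randint(0, i)   # uniform on {0, ..., i}
            v[i], v[k] = v[k], v[i]
        return v

    print(random_permutation(8))  # e.g. [3, 0, 7, 2, 5, 1, 6, 4]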

Design of Evolutionary Algorithms

ad 2) Design guidelines in practice: integer search space X = Z^n

a) Reachability: every recombination results in some z ∈ Z^n, and a mutation of z must be able to reach any z* ∈ Z^n with positive probability in one step ⇒ the support of the mutation distribution should be Z^n.

b) Unbiasedness: we need a maximum entropy distribution over the support Z^n.

c) Control: control the variability by a parameter → formulate it as a constraint of the maximum entropy distribution.

Task: find a (symmetric) maximum entropy distribution over Z with E[|Z|] = μ > 0:

    max! H(p) = − Σ_{k ∈ Z} p_k log p_k
    s.t. p_k = p_{−k} for all k        (symmetry w.r.t. 0)
         Σ_{k ∈ Z} p_k = 1             (normalization)
         Σ_{k ∈ Z} |k| · p_k = μ       (control of "spread")
         p_k ≥ 0                       (nonnegativity)

⇒ we need the analytic solution of this nonlinear constrained optimization problem for the one-dimensional distribution of Z!

Result: a random variable Z with support Z and probability distribution

    P{ Z = k } = (1 − q)/(1 + q) · q^{|k|},   q ∈ (0, 1),

symmetric w.r.t. 0, unimodal, with its spread manageable by q, and of maximum entropy. ■

Generation of pseudo random numbers: Z = G_1 − G_2, where G_1 and G_2 are stochastically independent, geometrically distributed random variables with P{ G = k } = (1 − q) · q^k.
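
A sketch of this generator (note that numpy's geometric lives on {1, 2, …}, so the draws are shifted by 1 to get support {0, 1, …}):

    import numpy as np

    rng = np.random.default_rng(1)

    def sample_Z(q, size=1):
        """Z = G1 - G2 with G1, G2 i.i.d., P{G = k} = (1 - q) q^k on {0, 1, ...}."""
        g1 = rng.geometric(1 - q, size) - 1   # shift: numpy counts trials from 1
        g2 = rng.geometric(1 - q, size) - 1
        return g1 - g2

    z = sample_Z(q=0.5, size=100_000)
    print(np.mean(np.abs(z)))   # ≈ E|Z| = 2q / (1 - q^2) = 4/3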

[Figures: the probability distribution of Z for different mean step sizes E|Z| = μ.]

How to control the spread? Make the mean step size adjustable!

We must be able to adapt q ∈ (0, 1) for generating Z with variable E|Z| = μ. Self-adaptation of q in the open interval (0, 1)? Instead: adapt μ ∈ R⁺ by mutative self-adaptation, like the mutative step size control of σ in EAs with search space R^n, and then get q from μ. Solving E|Z| = 2q/(1 − q²) = μ for q gives

    q = (√(1 + μ²) − 1) / μ.

n-dimensional generalization: random vector Z = (Z_1, Z_2, …, Z_n) with Z_i = G_{1,i} − G_{2,i} (stochastically independent) and the same parameter q for all G_{1,i}, G_{2,i}.

[Figure: the resulting distribution for n = 2.]

n-dimensional generalization (continued): by independence of the components,

    P{ Z = z } ∝ q^{|z_1| + … + |z_n|} = q^{||z||_1}

⇒ the n-dimensional distribution is symmetric w.r.t. the ℓ_1 norm! ⇒ All random vectors with the same step length ||z||_1 have the same probability!

How to control E[ ||Z||_1 ]? By definition, the linearity of E[·], and the identical distributions of the Z_i:

    E[ ||Z||_1 ] = E[ |Z_1| + … + |Z_n| ] = Σ_{i=1}^n E[ |Z_i| ] = n · E[ |Z_1| ]

⇒ self-adapt the total mean step size μ = E[ ||Z||_1 ], then calculate q from the per-component mean μ/n.

Algorithm (Rudolph, PPSN 1994): an EA whose individuals are elements of Z^n × R⁺, i.e. an integer vector together with its self-adapted mean step size.
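
A sketch of the resulting mutation operator under the reconstruction above; the lognormal update of μ and the learning rate τ = 1/√n are the usual ES choices, assumed here rather than taken verbatim from the slide:

    import math
    import numpy as np

    rng = np.random.default_rng(1)

    def mutate(z, mu):
        """One mutation on Z^n x R+: self-adapt the total mean step size mu,
        then add a maximum entropy integer vector with E[||Z||_1] = mu."""
        n = len(z)
        tau = 1.0 / math.sqrt(n)
        mu_new = mu * math.exp(tau * rng.standard_normal())   # mutative self-adaptation
        m = mu_new / n                                        # per-component E|Z_i|
        q = (math.sqrt(1.0 + m * m) - 1.0) / m                # from E|Z_i| = 2q/(1 - q^2)
        g1 = rng.geometric(1.0 - q, n) - 1                    # geometric on {0, 1, ...}
        g2 = rng.geometric(1.0 - q, n) - 1
        return z + (g1 - g2), mu_new

    offspring, step = mutate(np.zeros(5, dtype=int), mu=10.0)
    print(offspring, round(step, 3))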

Continuous search space X = R^n

ad 2) Design guidelines for variation operators in practice:
a) reachability
b) unbiasedness
c) control

Following the same guidelines on R^n (maximum entropy mutations with adaptable expectation vector and covariance matrix, i.e. the multinormal distribution N(μ, C)) leads to the CMA-ES.