Presentation is loading. Please wait.

Presentation is loading. Please wait.

Maximum Entropy and Fourier Transformation Nicole Rogers.

Similar presentations


Presentation on theme: "Maximum Entropy and Fourier Transformation Nicole Rogers."— Presentation transcript:

1 Maximum Entropy and Fourier Transformation Nicole Rogers

2 An Introduction to Entropy  Known as the ‘law of disorder.’  Entropy is a measurement of uncertainty associated with a random variable.  Measures the ‘multiplicity’ associated with the state of objects.

3 Thermodynamic Entropy  Thermodynamic entropy is related to Shannon entropy by normalizing it with a Boltzmann constant.

4 Shannon Entropy  Shannon Entropy measures how undetermined a state of uncertainty is.  The higher the Shannon Entropy, the more undetermined the system is.

5 Shannon Entropy Example RacersChance to Win (P) -log(P)-P log(P) Fido0.083.640.29 Ms Fluff0.172.560.43 Spike0.252.000.50 Woofers0.501.000.50  Let’s use the example of a dog race.  Four dogs have various chances of winning the race. If we apply the entropy equation: H = ∑ P i log(P i )

6 Shannon Entropy Example (cont.) RacersChance to Win (P) -log(P)-P log(P) Fido0.083.640.29 Ms Fluff0.172.560.43 Spike0.252.000.50 Woofers0.501.000.50 H = ∑ P i log(P i ) H = 0.29 + 0.43 + 0.5 + 0.5 The Shannon Entropy is 1.72

7 Things to Notice RacersChance to Win (P)-log(P)-P log(P) Fido0.083.640.29 Ms Fluff0.172.560.43 Spike0.252.000.50 Woofers0.501.000.50  If you add the chance of each dog to win, the total will be one. This is because the chances are normalized and can me represented using a Gaussian curve.  The more uncertain a situation, the higher the Shannon entropy. This will be demonstrated in the next example.

8 Two Uncertain Examples RacersChance to Win (P) -log(P)-P log(P) Fido0.252.000.50 Ms Fluff0.252.000.50 Spike0.252.000.50 Woofers0.252.000.50 H = ∑ P i log(P i ) H = 0.5 + 0.5 + 0.5 + 0.5 +0.5 With every variable completely uncertain, the Shannon Entropy will be 2.0

9 Two Uncertain Examples RacersChance to Win (P) -log(P)-P log(P) Fido0.016.640.07 Ms Fluff0.016.640.07 Spike0.016.640.07 Woofers0.970.04390.04 H = ∑ P i log(P i ) H = 0.07 + 0.07 + 0.07 + 0.04 With the situation fairly certain, the Shannon Entropy will be 0.25.

10 Comparisons to Draw  High Uncertainty H = 2.00  Fair UncertaintyH = 1.72  Low UncertaintyH = 0.25  The more uncertain the situation, the higher the entropy, thus entropy is a measurement of chaos.

11 Maximum Entropy  The maximum entropy states that, subject to precisely stated prior data, which must be a proposition that expresses testable information, the probability distribution which best represents the current state of knowledge is one with the largest information theoretical entropy.  In most practical cases, the stated prior data or testable information is given by a set of conserved quantities associated with the probability distribution is question. We use Lagrange method to help us solve this.

12 Lagrange Multiplier  In mathematical optimization, the method of Lagrange multipliers provides a strategy for finding the local maxima and minima of a function subject to equality constraints.  Lagrange Method assumes maximum entropy. The first of these equations are a normalization constraint. All of the probabilities must equal 1. The second equation is a general constraint. We will see more of what this is in the next example.

13 Lagrange Multiplier  Since Lagrange Method assumes maximum entropy, we can say:  Maximizing L with respect to each of the p(A i ) is done by differentiating L with respect to one of the p(A i ) while keeping α, β, and all other p(A i ) constant. The result is:

14 Lagrange Multiplier  Rearranging the equation, we can get: Where f( β )=0 because. Using this method, we can solve equations with minimum constraints.

15 Fast Food Frenzy BurgerChickenFishTofu $1.00$2.00$3.00$8.00 A fast food restaurant sells four types of product. They find that the average amount of money made for each purchase is $2.50. The products are chosen by the consumer based on price alone, and not preference. What is the percentage of purchase for each of these four foods?

16 Fast Food Frenzy  We know that: Applying Lagrange Method:

17 Fast Food Frenzy Entropy is the largest, subject to the constraints, if Where

18 Fast Food Frenzy  A zero-finding program was used to find the variables in these equations. The results were: 0.3546+0.2964+0.2477+0.1011 = 0.9998 This rounds to one, and therefore is normalized. Lagrange method and maximum entropy can determine probabilities using only a small set of constraints. This answer makes sense because the probabilities of each food being chosen are consistent with the price constraint given to them. FoodProbability of Purchase Burger0.3546 Chicken0.2964 Fish0.2477 Tofu0.1011

19 Remarks  Only by assuming maximum entropy are we able to evaluate these equations.  Since this example is evaluated on price alone, then the burger would have been chosen with the most frequency because of the cheaper price. The probabilities are lower for the more expensive prices, as indicated by the results.  When the number of randomness increases, so does the entropy. Because we only had four variables, the entropy at maximum would have been lower than if there were five variables. BurgerChickenFishTofu $1.00$2.00$3.00$8.00

20 Fourier Transformation  Fourier transform is a mathematical operation with many applications in physics and engineering that expresses a mathematical function of time as a function of frequency. The frequency can be approximated with sine and cosine functions.  Fourier transforms and maximum entropy can both be utilized to find the specific frequencies of a sine/cosine wave.

21 Fourier vs. Max Entropy  Num=30  Num=90  Num=150 x(i)=dsin(twopi*2.d0*t) x(i)=x(i)+dsin(twopi*3.d0*t) x(i)=5.d0+x(i)+dsin(twopi*3.2d0*t)

22 Fourier vs. Max Entropy  Since we were looking for 2.0 π, 3.0 π, and 3.2 π in our sine and cosine waves, maximum entropy was consistently better at determining these numbers on the graphs  Maximum entropy works better than Fourier from the range of 30 to 150 data sets. This is because it calculates an average using a small amount of data. If the data were dramatically increased, Fourier Method would work better.

23 Sources  http://en.wikipedia.org/wiki/Entropy http://en.wikipedia.org/wiki/Entropy  http://www.eoht.info/page/High+entropy+state http://www.eoht.info/page/High+entropy+state  http://en.wikipedia.org/wiki/Second_law_of_ther modynamics http://en.wikipedia.org/wiki/Second_law_of_ther modynamics  http://www.entropylaw.com/ http://www.entropylaw.com/


Download ppt "Maximum Entropy and Fourier Transformation Nicole Rogers."

Similar presentations


Ads by Google