Presentation is loading. Please wait.

Presentation is loading. Please wait.

CSC401 – Analysis of Algorithms Lecture Notes 2 Asymptotic Analysis Objectives: Mathematics foundation for algorithm analysis Amortization analysis techniques.

Similar presentations


Presentation on theme: "CSC401 – Analysis of Algorithms Lecture Notes 2 Asymptotic Analysis Objectives: Mathematics foundation for algorithm analysis Amortization analysis techniques."— Presentation transcript:

1 CSC401 – Analysis of Algorithms Lecture Notes 2 Asymptotic Analysis Objectives: Mathematics foundation for algorithm analysis Amortization analysis techniques Case studies Asymptotic notions: Big-Oh, Big-Omega, Little-oh, little-omega, Big-Theta

2 2 Summation: Geometric summation: Natural summation: Harmonic number: Split summation: Summations

3 3 Logarithms –properties of logarithms: log b (xy) = log b x + log b y log b (x/y) = log b x - log b y log b xa = alog b x log b a = log x a/log x b Exponents –properties of exponentials: a (b+c) = a b a c a bc = (a b ) c a b /a c = a (b-c) b = a log a b b c = a c*log a b Floor and ceiling functions – – the largest integer less than or equal to x – the smallest integer greater than or equal to x Logarithms and Exponents

4 4 By Example -- counterexample Contrapositive –Principle: To justify “if p is true, then q is true”, show “if q is not true, then p is not true”. –Example: To justify “if ab is odd, a is odd or b is even”, assume “’a is odd or b is odd’ is not true”, then “a is even and b is odd”, then “ab is even”, then “’ab is odd’ is not true” Contradiction –Principle: To justify “p is true”, show “if p is not true, then there exists a contradiction”. –Example: To justify “if ab is odd, then a is odd or b is even”, let “ab is odd”, and assume “’a is odd or b is even’ is not true”, then “a is even and b is odd”, thus “ab is even” which is a contradiction to “ab is odd”. Proof Techniques

5 5 Induction -- To justify S(n) for n>=n0 –Principle Base cases: Justify S(n) is true for n0<=n<=n1 Assumption: Assume S(n) is true for n=N>=n1; or Assume S(n) is true for n1<=n<=N Induction: Justify S(n) is true for n=N+1 –Example: Question: Fibonacci sequence is defined as F(1)=1, F(2)=1, and F(n)=F(n-1)+F(n-2) for n>2. Justify F(n) =1 Proof by induction where n0=1. Let n1=2. –Base cases: For n0<=n<=n1, n=1 or n=2. If n=1, F(1)=1<2=2^1. If n=2, F(2)=2<4=2^2. It holds. –Assumption: Assume F(n) =2. –Induction: For n=N+1, F(N+1)=F(N)+F(N- 1)<2^N+2^(N-1)<2 2^N=2^(N+1). It holds. Proof by Induction

6 6 Principle: –To prove a statement S about a loop is correct, define S in terms of a series of smaller statements S0, S1, …, Sk, where The initial claim S0 is true before the loop begins If Si-1 is true before iteration i begins, then show that Si is true after iteration i is over The final statement Sk is true Example: Consider the algorithm arrayMax –Statement S: max is the maximum number when finished –A series of smaller statements: Si: max is the maximum in the first i+1 elements in the first i+1 elements of the array of the array S0 is true before the loop If Si is true, then easy to show Si+1 is true show Si+1 is true S=Sn-1 is also true Proof by Loop Invariants Algorithm arrayMax(A, n) max  A[0] for i  1 to n  1 do if A[i]  max then max  A[i] return max

7 7 Sample space: the set of all possible outcomes from some experiment Probability space: a sample space S together with a probability function that maps subsets of S to real numbers between 0 and 1 Event: Each subset of A of S called an event Properties of the probability function Pr –Pr(Ø)=0 –Pr(S)=1 –0<=Pr(A)<=1 for any subset A of S –If A and B are subsets of S and AB=, then Pr(AB)=Pr(A)+Pr(B) Independence –A and B are independent if Pr(AB)=Pr(A)Pr(B) –A 1, A 2, …, A n are mutually independent if Pr(A 1 A 2 … A n )=Pr(A 1 )Pr(A 2 )…Pr(A n ) Basic Probability

8 8 Conditional probability –The conditional probability that A occurs, given B, is defined as: Pr(A|B)=Pr(AB)/Pr(B), assuming Pr(B)>0 Random variables –Intuitively, Variables whose values depend on the outcomes of some experiment –Formally, a function X that maps outcomes from some sample space S to real numbers –Indicator random variable: a variable that maps outcomes to either 0 or 1 Expectation:a random variable has random values –Intuitively, average value of a random variable –Formally, expected value of a random variable X is defined as E(X)= x xPr(X=x) –Properties: Linearity: E(X+Y) = E(X) + E(Y) Independence: If X and Y are independent, that is, Pr(X=x|Y=y)=Pr(X=x), then E(XY)=E(X)E(Y) Basic Probability

9 9 Computing Prefix Averages We further illustrate asymptotic analysis with two algorithms for prefix averages The i -th prefix average of an array X is average of the first (i  1) elements of X: A[i]  X[0]  X[1]  …  X[i])/(i+1) Computing the array A of prefix averages of another array X has applications to financial analysis

10 10 Prefix Averages (Quadratic) The following algorithm computes prefix averages in quadratic time by applying the definition Algorithm prefixAverages1(X, n) Input array X of n integers Output array A of prefix averages of X #operations A  new array of n integers n A  new array of n integers n for i  0 to n  1 do n s  X[0] n for j  1 to i do 1  2  …  (n  1) s  s  X[j] 1  2  …  (n  1) A[i]  s  (i  1) n return A 1

11 11 Arithmetic Progression The running time of prefixAverages1 is O(1  2  …  n) The sum of the first n integers is n(n  1)  2 –There is a simple visual proof of this fact Thus, algorithm prefixAverages1 runs in O(n 2 ) time

12 12 Prefix Averages (Linear) The following algorithm computes prefix averages in linear time by keeping a running sum Algorithm prefixAverages2(X, n) Input array X of n integers Output array A of prefix averages of X #operations A  new array of n integersn s  0 1 for i  0 to n  1 don s  s  X[i]n A[i]  s  (i  1) n return A 1 Algorithm prefixAverages2 runs in O(n) time

13 13 Amortization Amortization –Typical data structure supports a wide variety of operations for accessing and updating the elements –Each operation takes a varying amount of running time –Rather than focusing on each operation –Consider the interactions between all the operations by studying the running time of a series of these operations –Average the operations’ running time Amortized running time –The amortized running time of an operation within a series of operations is defined as the worst-case running time of the series of operations divided by the number of operations –Some operations may have much higher actual running time than its amortized running time, while some others have much lower

14 14 The Clearable Table Data Structure The clearable table –An ADT Storing a table of elements Being accessing by their index in the table –Two methods: add(e) -- add an element e to the next available cell in the table clear() -- empty the table by removing all elements Consider a series of operations (add and clear) performed on a clearable table S –Each add takes O(1) –Each clear takes O(n) –Thus, a series of operations takes O(n 2 ), because it may consist of only clears

15 15 Amortization Analysis Theorem: –A series of n operations on an initially empty clearable table implemented with an array takes O(n) time Proof: –Let M 0, M 1, …, M n-1 be the series of operations performed on S, where k operations are clear –Let M i0, M i1, …, M i k-1 be the k clear operations within the series, and others be the (n-k) add operations –Define i -1 =-1,Mij takes i j -i j-1, because at most i j -i j-1 -1 elements are added by add operations between M i j-1 and M i j –The total time of the series is: (n-k) + ∑ k-1 j=0 (i j -i j-1 ) = n-k + (i k-1 - i -1 ) <= 2n-k Total time is O(n) Amortized time is O(1)

16 16 Accounting Method The method –Use a scheme of credits and debits: each operation pays an amount of cyber-dollar –Some operations overpay --> credits –Some operations underpay --> debits –Keep the balance at any time at least 0 Example: the clearable table –Each operation pays two cyber-dollars –add always overpays one dollar -- one credit –clear may underpay a variety of dollars the underpaid amount equals the number of add operations since last clear - 2 –Thus, the balance is at least 0 –So the total cost is 2n -- may have credits

17 17 Potential Functions Based on energy model –associate a value with the structure, Ø, representing the current energy state –each operation contributes to Ø a certain amount of energy t’ and consumes a varying amount of energy t –Ø 0 -- the initial energy, Ø i -- the energy after the i-th operation –for the i-th operation t i -- the actual running time t’ i -- the amortized running time t i = t’ i + Ø i-1 - Ø i –Overall: T=∑(t i ), T’=∑(t’ i ) –The total actual time T = T’ + Ø 0 - Ø n –As long as Ø 0 <= Ø n, T<=T’

18 18 Potential Functions Example -- The clearable table –the current energy state Ø is defined as the number of elements in the table, thus Ø>=0 –each operation contributes to Ø t’=2 –Ø 0 = 0 –Ø i = Ø i-1 + 1, if the i-th operation is add add: t = t’+ Ø i-1 - Ø i = 2 - 1 = 1 –Ø i = 0, if the i-th operation is clear clear: t =t’+ Ø i -1 = 2 + Ø i -1 –Overall: T=∑ (t), T’=∑(t’) –The total actual time T = T’ + Ø 0 - Ø n –Because Ø 0 = 0 <= Ø n, T<=T’ =2n

19 19 Extendable Array ADT: Extendable array is an array with extendable size. One of its methods is add to add an element to the array. If the array is not full, the element is added to the first available cell. When the array is full, the add method performs the following –Allocate a new array with double size –Copy elements from the old array to the new array –Add the element to the first available cell in the new array –Replace the old array with the new array Question: What is the amortization time of add? –Two situations of add: add -- O(1) Extend and add -- (n) –Amortization analysis: Each add deposits 3 dollars and spends 1 dollar Each extension spends k dollars from the size k to 2k (copy) Amortization time of add is O(1)

20 20 Relatives of Big-Oh big-Omega –f(n) is (g(n)) if there is a constant c > 0 and an integer constant n 0  1 such that f(n)  cg(n) for n  n 0 big-Theta –f(n) is (g(n)) if there are constants c’ > 0 and c’’ > 0 and an integer constant n 0  1 such that c’g(n)  f(n)  c’’g(n) for n  n 0 little-oh –f(n) is o(g(n)) if, for any constant c > 0, there is an integer constant n 0  0 such that f(n)  cg(n) for n  n 0 little-omega –f(n) is (g(n)) if, for any constant c > 0, there is an integer constant n 0  0 such that f(n)  cg(n) for n  n 0

21 21 Intuition for Asymptotic Notation Big-Oh –f(n) is O(g(n)) if f(n) is asymptotically less than or equal to g(n) big-Omega –f(n) is (g(n)) if f(n) is asymptotically greater than or equal to g(n) big-Theta –f(n) is (g(n)) if f(n) is asymptotically equal to g(n) little-oh –f(n) is o(g(n)) if f(n) is asymptotically strictly less than g(n) little-omega –f(n) is (g(n)) if is asymptotically strictly greater than g(n)

22 22 Example Uses of the Relatives of Big-Oh 5n 2 is  (n 2 ) f(n) is (g(n)) if there is a constant c > 0 and an integer constant n 0  1 such that f(n)  cg(n) for n  n 0 let c = 5 and n 0 = 1 5n2 is (n) f(n) is (g(n)) if there is a constant c > 0 and an integer constant n 0  1 such that f(n)  cg(n) for n  n 0 let c = 1 and n 0 = 1 5n2 is (n) f(n) is (g(n)) if, for any constant c > 0, there is an integer constant n 0  0 such that f(n)  cg(n) for n  n 0 need 5n02  cn 0  given c, the n 0 that satisfies this is n 0  c/5  0


Download ppt "CSC401 – Analysis of Algorithms Lecture Notes 2 Asymptotic Analysis Objectives: Mathematics foundation for algorithm analysis Amortization analysis techniques."

Similar presentations


Ads by Google