Presentation on theme: "On the Competitiveness of Self Organizing Linear Search J. Ian Munro (University of Waterloo) Competitiveness: How well does an on line algorithm do in."— Presentation transcript:
On the Competitiveness of Self Organizing Linear Search J. Ian Munro (University of Waterloo) Competitiveness: How well does an on line algorithm do in comparison to one that is given the sequence of operations in advance? Self Organizing Search: Based on requests received, continually reorganize the data structure to make frequently accessed elements cheaper to get. Model of Computation: What operations can we do? What do we count? Amortized cost in our case.
Self Organizing Linear Search Elements in a linear list (array or linked) On access can reorder the elements inspected, count number records inspected Likely approaches: lMove to Front S E L F O R G A N I Z I N G L I N E A R.. I S E L F O R G A N Z I N G L I N E A R.. lSimple exchange S E L F O R G A N I Z I N G L I N E A R.. S E L F O R G A I N Z I N G L I N E A R..
Known Bounds Expected case (long sequence, elements have fixed independent probabilities of access): Cost(SE) Cost(MtF) 2 Opt = 2 i p i (McCabe; Rivest) Replace p i by f i /m, then on an amortized basis Cost(MtF) 2 i f i /m (Bentley and McGeough) Move to Front is 2-competitive (Sleator and Tarjan): i.e. within a factor of 2 of the cost of the off line optimal…which can move elements about as if it knows what will be accessed but model is crucial.
That Model Scan to the element requested, and no further (charge 1 per element inspected) Put requested element anywhere among scanned values (free) Swap as many pairs of consecutive elements as you like (1 per exchange) so swapping front and back halves of list costs (n 2 ) n/2 (n/2+1)..n (n/2+1).. n n/2
A Realistic Change in the Model Scan as far as you like. Charge one per element inspected KEY POINT Can rearrange portion inspected arbitrarily at no charge (in fact we will perform very simple rearrangements) so so swapping front and back halves of list costs (n) n/2 (n/2+1).. n (n/2+1).. n n/2
Order by Next Request lScan for element i, continue to the 2 lg i lReorder the element requested by NEXT REQUEST CostNew Ordering
Cost of a “Cycle” for Order by Next Request Cost (under our model) is n lg n, amortized lg n Lower bound Theorem: Under our model, n ln n inspections are required to individually scan for each of n elements. Proof idea: “Crossing sequence argument” Access to elements in position i or later must occur at least n/i times … for a total cost of at least n/i n ln n
More General Bound on Amortized Cost of OBNR Main Theorem: Let r denote the number of distinct elements requested since the last for a given value, then the amortized cost for a search can be charged as at most lg r Idea of Proof: “Soak the Middle Class” For any request, entire cost is borne by penultimate block of size 2 lg i - 2, so costs 4 for each of them. Lemma: An element can be in the penultimate block at most once between its accesses.
Tweaking the Constant Blocks of size 2 k are not optimal. Make them of size powers of And the preceding bound becomes lg r
Corollaries Bounds look much like Splay Tree bounds Request Gap View: Total cost of sequence in which element i occurs r i times is at most n lg (r i + 1) . or Entropy View: Let f i denote the frequency of accesses to i in a sequence of length m, then the total cost is at most n f i lg (m/f i ) , or writing f i /m= p i, the amortized cost is O(1 + p i lg 1/p i )
Linear Search Versus Binary Trees.. Off Line Bounds in terms of frequencies are same order. Splay trees can take advantage of key order (e.g. accessing values sequentially). Order by Next Request on trees.. a bad case: Order by Next Request takes amortized time lg n and makes no changes in the tree!