Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Load Distance Balancing Problem Eddie Bortnikov (Yahoo!) Samir Khuller (Maryland) Yishay Mansour (Google) Seffi Naor (Technion)

Similar presentations


Presentation on theme: "The Load Distance Balancing Problem Eddie Bortnikov (Yahoo!) Samir Khuller (Maryland) Yishay Mansour (Google) Seffi Naor (Technion)"— Presentation transcript:

1 The Load Distance Balancing Problem Eddie Bortnikov (Yahoo!) Samir Khuller (Maryland) Yishay Mansour (Google) Seffi Naor (Technion)

2 The Load-Distance Balancing Problem Given n clients and k servers s 1, s 2,…s k we need to assign each client to a server. Cost for client i assigned to server s j is as follows: Cost(i) = Distance(i,s j ) + Delay(j,L j ) Delay(j,L j ) is a FUNCTION of the number of clients L j assigned to s j. OBJECTIVE: Min Max Cost(i)

3 An Example Each server has its own delay function (can be arbitrary, we just assume its non-decreasing). Note that C is closer to s 1, but prefers to attach to s 2 since Dist(C,s 2 )+Delay(s 2,2)<Dist(C,s 1 )+Delay(s 1,3) Objective is to Minimize Max Cost for any client s1s1 s2s2 s3s3 AB C D E

4 Related Work Lots of research on locating facilities. Here the facilities are all given; we just have to compute assignment of clients to facilities. Notion of capacities has been used for various covering problems such as Vertex Cover, K Centers, Facility Location etc.

5 Main Results The problem is NP-hard. We develop a polynomial time 2 approx. We show that the bound 2 cannot be improved to 2-ε unless NP=P. With triangle inequality in the distance function the hardness reduces to 5/3-ε. When all clients and servers are on a line its solvable in polynomial time. For Min Sum Cost(i), we can solve in polynomial time using Min-Weight Matching.

6 NP-hardness by Exact Set Cover Given N elements and a collection S of K sets (each set has size m). Does there exist a subset S’ of S, such that each element belongs to exactly one set in S’? In other words, we need to pick exactly N/m subsets from S, to cover each element once.

7 Example of Exact Cover Here we have N=16 elements and 9 sets (m=4). The FOUR blue sets form an EXACT COVER, and we discard the FIVE orange sets.

8 Reduction from Exact Cover Each element is a client. In addition we create a collection of M(K-N/m) dummy clients. Subset Sj in S corresponds to server s j. Dist(dummy,server)=d 1 Dist(i,s j ) = d 2 if i ε Sj, o.w. ∞ d 2 >> d 1 d1d1 d2d2 i sjsj Clients (elements) Dummy clients

9 Reduction from Exact Cover Delay functions for servers are basically a step function. Delay(j,L j ) = Δ-d 2, when load is at most m. Delay(j,L j )= Δ-d 1, when load exceeds m, but is at most M. m Δ-d 2 Δ-d 1 M

10 Reduction from Exact Cover Suppose there is a solution to exact cover, then there is a solution to the LDB problem with delay at most Δ. For each chosen subset Sj, the corresp. server s j gets m clients each at distance d 2.Since the delay is Δ-d 2, the total cost is at most Δ. For the remaining subsets, those dummies are all assigned to the remaining servers (K-N/m), each gets M dummies. The proof in the other direction requires some work!

11 Proof (cont.) Each server can support at most M dummy clients if the total cost does not exceed Δ, and no more than m real clients. Suppose a server supports both real and dummy clients; then the total number of servers with real clients is k’ > N/m. These serve at most (mk’-N) dummy clients, while the rest can serve only M(k-k’) dummy clients. Adding the two shows (some algebra needed) that we can only assign m.

12 Hardness Results follow…. We can set d 1 =ε and d 2 =Δ-ε. A solution to EXACT COVER exists if and only if a solution with cost Δ exists for LDB. If there is no solution to EXACT COVER, then every solution to LDB has cost 2(Δ-ε). However, here we do violate triangle inequality in the distance function.

13 Hardness results with triangle inequality We need to set d 1 =⅓ Δ, and d 2 = Δ-ε. With these parameters, the distance between a (real) client i and a server s j such that i is not in Sj, is at least 5/3Δ- ε. A solution to EXACT COVER exists if and only if a solution with cost Δ exists for LDB. If there is no solution to EXACT COVER, then every solution to LDB has cost 5/3Δ-ε.

14 Approximation Algorithm with factor 2 Suppose a solution exists with maximum cost Δ. For each server sj, we can compute an upper bound on the number of clients that can be served with a delay of at most Δ (say L*j). For each client i we can compute the subset of servers that are within distance Δ (S*i). Now its just a flow problem to check if an assignment exists where each client i is assigned to a server from S*i and each s j has load at most L*j. Minimizing Δ gives a trivial 2 approximation.

15 All servers and clients on a line Use dynamic programming!

16

17 Minimizing the Sum of Costs We reduce this to min cost matching in a bipartite graph. Let G=(X,Y,E) where nodes in X correspond to n clients and there are nk nodes in Y. We have n nodes corresp. to each server. We ask for a min cost matching to find a solution.

18 Capacitated K Centers The related problem of choosing K facilities has been considered (each client should be assigned to a closeby facility and the load on the facility should not be too high): [Khuller & Sussman] (K,L,5R) or ((2/c)K, cL, 2R) related to K-centers clustering.

19 Conclusions Can we improve the 2 approximation when triangle inequality holds? Can we improve the 2 approximation when Delay functions satisfy specific properties? What is a natural delay function? Are there other special cases that can be solved in polynomial time?


Download ppt "The Load Distance Balancing Problem Eddie Bortnikov (Yahoo!) Samir Khuller (Maryland) Yishay Mansour (Google) Seffi Naor (Technion)"

Similar presentations


Ads by Google