Computations have to be distributed!

Presentation on theme: "Computations have to be distributed!" Presentation transcript:

4 Computations have to be distributed!

10 Example: key/value pair
Distributed Grep
  map -> matched line
  reduce -> just pass
Count of URL Access Frequency
  map ->
  reduce ->
Reverse Web-Link Graph
  map ->
  reduce ->
Term-Vector per Host
  map ->
  reduce ->
  # term vector = a list of
Inverted Index
  map -> a sequence of
  reduce ->
Distributed Sort
  map ->
  reduce -> just pass
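A minimal single-process sketch of how two of the examples above can be written as a map function and a reduce function. The driver `run_mapreduce`, the sample data, and the key/value shapes are assumptions made for illustration and are not taken from the slide: URL counting is assumed to emit (URL, 1) and sum per URL, and the inverted index is assumed to emit (word, document ID) and collect a sorted list per word.

```python
from collections import defaultdict


def run_mapreduce(inputs, map_fn, reduce_fn):
    """Hypothetical in-memory driver: map, group by key, then reduce."""
    groups = defaultdict(list)
    for key, value in inputs:
        for out_key, out_value in map_fn(key, value):
            groups[out_key].append(out_value)
    # Keys are visited in sorted order, mimicking the sort before reduce.
    return {k: reduce_fn(k, groups[k]) for k in sorted(groups)}


# Count of URL access frequency (assumed shape: map emits (url, 1)).
def url_map(_log_name, log_lines):
    for line in log_lines:
        # Assumes the URL is the first whitespace-separated field.
        yield line.split()[0], 1


def url_reduce(_url, counts):
    return sum(counts)


# Inverted index (assumed shape: map emits (word, doc_id)).
def index_map(doc_id, text):
    for word in set(text.lower().split()):
        yield word, doc_id


def index_reduce(_word, doc_ids):
    return sorted(doc_ids)


logs = [("log1", ["/a 200", "/b 200"]), ("log2", ["/a 404"])]
url_counts = run_mapreduce(logs, url_map, url_reduce)

docs = [("d1", "map reduce"), ("d2", "reduce tasks")]
index = run_mapreduce(docs, index_map, index_reduce)

print(url_counts)
print(index)
```

In a real distributed run the grouping step would be a shuffle across machines; here it is a dictionary so that the two user-supplied functions stay in focus.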

20 [Figure labels: Master, Worker, Master]

24 Reduce and Combiner
Similarity: The same code is used to implement both the combiner and the reduce functions.
Difference:
  Reduce: Output is written to the final output file.
  Combiner: Output is written to an intermediate file that will be sent to a reduce task.
[Figure labels: Mapper, Combiner, Reducer]
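A small sketch of the combiner idea, assuming a word-count job. All names here (`word_map`, `sum_counts`, `map_task`, `reduce_phase`) are hypothetical, and lists stand in for the intermediate files. The same function `sum_counts` is used as combiner and as reducer, which is safe here because addition is associative and commutative.

```python
from collections import defaultdict


def word_map(text):
    for word in text.split():
        yield word, 1


def sum_counts(_word, counts):
    """Serves as both combiner and reducer in this sketch."""
    return sum(counts)


def group(pairs):
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def map_task(text, combiner=None):
    """Returns the pairs that would go into one intermediate file."""
    pairs = list(word_map(text))
    if combiner is None:
        return pairs
    # Combiner: partial merge on the map side, before data is sent on.
    return [(w, combiner(w, c)) for w, c in group(pairs).items()]


def reduce_phase(intermediate_files, reducer):
    """Merges all intermediate pairs and produces the final output."""
    all_pairs = [pair for f in intermediate_files for pair in f]
    return {w: reducer(w, c) for w, c in group(all_pairs).items()}


splits = ["the cat the dog", "the end"]
plain = [map_task(s) for s in splits]
combined = [map_task(s, combiner=sum_counts) for s in splits]

result_plain = reduce_phase(plain, sum_counts)
result_combined = reduce_phase(combined, sum_counts)

pairs_sent_plain = sum(len(f) for f in plain)
pairs_sent_combined = sum(len(f) for f in combined)
print(result_combined, pairs_sent_plain, pairs_sent_combined)
```

Both runs give the same final counts; the combined run only sends fewer intermediate pairs to the reduce phase.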

25 [Figure labels: Worker, Master]

42 The model is easy to use.
A large variety of problems are easily expressible.
An implementation was developed that scales to large clusters of machines.
