Distributed Systems. CS422/522 Lecture 17, 17 November 2014

1 Distributed Systems CS422/522 Lecture 17, 17 November 2014

2 Lecture Outline Introduction Hadoop Chord

3 What's a distributed system?

4 What's a distributed system? A distributed system is a collection of loosely coupled nodes interconnected by a communication network. In a distributed system, users access remote resources in the same way they access local resources.

6 Why distributed systems?

7 PC Scalability

8 Scalability PC What if one computer is not enough? - Buy a bigger (server-class) computer

9 Scalability PC Server What if one computer is not enough? - Buy a bigger (server-class) computer

10 Scalability PC Server What if one computer is not enough? - Buy a bigger (server-class) computer What if the biggest computer is not enough? - Buy many computers

11 Scalability PC Server Cluster What if one computer is not enough? - Buy a bigger (server-class) computer What if the biggest computer is not enough? - Buy many computers

12 Scalability

13 Rack Scalability

14 Rack Scalability Network switches (connect nodes with each other and with other racks)

15 Rack Scalability Network switches (connect nodes with each other and with other racks) Many nodes/blades (often identical)

16 Rack Scalability Network switches (connect nodes with each other and with other racks) Many nodes/blades (often identical) Storage device(s)

17 Scalability PC Server Cluster What if the cluster is too big to fit into the machine room? - Build a separate building for the cluster - Building can have lots of cooling and power - Result: Data center

19 Scalability PC Server Cluster Data center What if the cluster is too big to fit into the machine room? - Build a separate building for the cluster - Building can have lots of cooling and power - Result: Data center

20 Google Data Center in Oregon

21 Google Data Center in Oregon Data centers (size of a football field)

22 Google Data Center in Oregon Data centers (size of a football field) A warehouse-sized computer - A single data center can easily contain 10,000 racks with 100 cores in each rack (1,000,000 cores total)

23 Google Data Centers World Wide

24 Why distributed systems? Resource sharing - E.g., Napster Computation speedup - E.g., Hadoop Reliability - E.g., S3

26 Why distributed systems? Resource sharing - E.g., Napster Computation speedup - E.g., Hadoop Reliability - E.g., Amazon S3

27 Why distributed systems? Resource sharing - E.g., Napster Computation speedup - E.g., Hadoop Reliability - E.g., S3 Issues?

28 Lecture Outline Introduction Hadoop Chord

30 What is Hadoop? A project based on GFS (the Google File System) and MapReduce - Distributes data across machines - Aims for reliability and scalability An open-source software framework for big data - Distributed storage - Distributed processing

31 What is Hadoop? A project based on GFS and MapReduce - Distributes data across machines - Aims for reliability and scalability An open-source software framework for big data - Distributed storage - Distributed processing

32 Why Hadoop?

33 Hadoop Components

34 Hadoop Components NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

35 Hadoop Components Two core components - The Hadoop distributed file system (HDFS) HDFS NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

36 Hadoop Components Two core components - The Hadoop distributed file system (HDFS) Data HDFS NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

37 Hadoop Components Two core components - The Hadoop distributed file system (HDFS) Metadata HDFS NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

38 Hadoop Components Two core components - The Hadoop distributed file system (HDFS) Data Metadata HDFS NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

40 Hadoop Components Two core components - The Hadoop distributed file system (HDFS) - MapReduce software framework Metadata HDFS NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

41 Hadoop Components Two core components - The Hadoop distributed file system (HDFS) - MapReduce software framework MapReduce Metadata HDFS NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

42 Hadoop Components Two core components - The Hadoop distributed file system (HDFS) - MapReduce software framework MapReduce JobTracker TaskTracker TaskTracker TaskTracker TaskTracker TaskTracker Metadata HDFS NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

45 HDFS HDFS is responsible for storing data - Data is split into blocks and distributed across nodes - Each block is replicated Implementation of HDFS: - Based on Google's GFS - Offers redundant storage for massive amounts of data
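
The two storage ideas on this slide (split data into blocks, replicate each block) can be illustrated with a minimal sketch. It is not HDFS code: the 4-byte block size, the round-robin placement and the function names are assumptions chosen for readability, and the replication factor of three is taken from the HDFS Example slide.

```python
BLOCK_SIZE = 4      # bytes; deliberately tiny so the example is easy to follow
REPLICATION = 3     # the HDFS Example slide states three replicas per block


def split_into_blocks(data, block_size=BLOCK_SIZE):
    """Cut a byte string into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks, nodes, replication=REPLICATION):
    """Map each block index to `replication` distinct nodes, round-robin.

    Round-robin is an assumption made for illustration; it is not the
    placement policy HDFS actually uses.
    """
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    return {
        b: [nodes[(b + r) % len(nodes)] for r in range(replication)]
        for b in range(num_blocks)
    }


data = b"hello distributed world"
nodes = ["DataNode1", "DataNode2", "DataNode3", "DataNode4", "DataNode5"]
blocks = split_into_blocks(data)
placement = place_replicas(len(blocks), nodes)

for index, replicas in placement.items():
    print(index, blocks[index], replicas)
```

Because every block lives on three different nodes in this sketch, losing any single node still leaves two copies of each block.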

46 Getting Data in/out of HDFS Hadoop API: - Use hadoop fs to work with data in HDFS - hadoop fs -copyFromLocal local_dir /hdfs_dir - hadoop fs -copyToLocal /hdfs_dir local_dir

47 HDFS Example NameNode: Stores metadata only Metadata: /usr/data/foo -> 1, 2, 4 /usr/data/bar -> 3, NameNode holds metadata for the data files DataNodes hold the actual blocks - Each block is replicated three times When a client wants to read file 5 3 DataNodes: Store Blocks

48 HDFS Example NameNode: Stores metadata only Metadata: /usr/data/foo -> 1, 2, 4 /usr/data/bar -> 3, bar? Client NameNode holds metadata for the data files DataNodes hold the actual blocks - Each block is replicated three times When a client wants to read file 5 3 DataNodes: Store Blocks

49 HDFS Example NameNode: Stores metadata only Metadata: /usr/data/foo -> 1, 2, 4 /usr/data/bar -> 3, bar? < DataNode IPs, 3, 5 > Client NameNode holds metadata for the data files DataNodes hold the actual blocks - Each block is replicated three times When a client wants to read file 5 3 DataNodes: Store Blocks

50 HDFS Example NameNode: Stores metadata only Metadata: /usr/data/foo -> 1, 2, 4 /usr/data/bar -> 3, < DataNode IPs, 3, 5 > Client NameNode holds metadata for the data files DataNodes hold the actual blocks - Each block is replicated three times When a client wants to read file 5 3 DataNodes: Store Blocks
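
The read path in this example (the client asks the NameNode where the blocks are, then fetches the blocks from DataNodes) can be sketched in a few lines. The class and method names below are hypothetical, the block contents are made up, and the sketch uses /usr/data/foo -> 1, 2, 4 because that mapping is complete on the slide. It is an in-memory toy under those assumptions, not the HDFS client protocol.

```python
class DataNode:
    """Holds block payloads; knows nothing about file names."""

    def __init__(self, name):
        self.name = name
        self.blocks = {}            # block id -> bytes

    def read_block(self, block_id):
        return self.blocks[block_id]


class NameNode:
    """Holds metadata only: which blocks form a file, and where they live."""

    def __init__(self):
        self.file_to_blocks = {}    # path -> ordered list of block ids
        self.block_to_nodes = {}    # block id -> list of DataNode objects

    def register(self, path, block_ids):
        self.file_to_blocks[path] = list(block_ids)

    def report_block(self, block_id, node):
        self.block_to_nodes.setdefault(block_id, []).append(node)

    def locate(self, path):
        return [(b, self.block_to_nodes[b]) for b in self.file_to_blocks[path]]


def read_file(namenode, path):
    """Client side: one metadata lookup, then block reads from DataNodes."""
    chunks = []
    for block_id, replicas in namenode.locate(path):
        chunks.append(replicas[0].read_block(block_id))  # any replica would do
    return b"".join(chunks)


namenode = NameNode()
datanodes = [DataNode(f"dn{i}") for i in range(1, 6)]
namenode.register("/usr/data/foo", [1, 2, 4])

contents = {1: b"the cat ", 2: b"sat on ", 4: b"the mat"}   # made-up payloads
for block_id, payload in contents.items():
    for k in range(3):                                       # three replicas
        node = datanodes[(block_id + k) % len(datanodes)]
        node.blocks[block_id] = payload
        namenode.report_block(block_id, node)

print(read_file(namenode, "/usr/data/foo"))
```

Note that file data never passes through the NameNode object here, which mirrors the slide's point that the NameNode stores metadata only.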

51 Case Study Policy: Assigning blocks based on disk space tiger frog lion monkey python

52 Case Study Policy: Assigning blocks based on disk space Hadoop tiger frog lion monkey python

53 Case Study Policy: Assigning blocks based on disk space Metadata Hadoop tiger frog lion monkey python

56 Case Study Policy: Assigning blocks based on disk space Metadata Hadoop tiger frog lion monkey python gator
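
One way to read the policy named in this case study is "put each new block on the nodes with the most free disk space". The sketch below implements that reading only; the free-space figures, the block size and the function name are invented for illustration, and the slides do not say this is how Hadoop chooses nodes.

```python
import heapq


def assign_block(free_space, block_size, replication=3):
    """Pick the `replication` nodes with the most free space and charge them.

    `free_space` maps node name -> free units and is updated in place.
    """
    candidates = [node for node, free in free_space.items() if free >= block_size]
    if len(candidates) < replication:
        raise RuntimeError("not enough nodes with room for this block")
    chosen = heapq.nlargest(replication, candidates, key=lambda node: free_space[node])
    for node in chosen:
        free_space[node] -= block_size
    return chosen


# Hypothetical free space per host (host names are the ones on the slide).
free_space = {"tiger": 500, "frog": 300, "lion": 800, "monkey": 200, "python": 400}
first = assign_block(free_space, block_size=64)

# A new, empty node joins the cluster, as on the last build of the slide.
free_space["gator"] = 1000
second = assign_block(free_space, block_size=64)

print(first, second)
```

In this sketch the newly added node receives a replica of every new block until its free space falls to the level of the other nodes, which is worth keeping in mind when discussing the policy.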

57 Recall: Hadoop Components Two core components - The Hadoop distributed file system (HDFS) - MapReduce software framework MapReduce JobTracker TaskTracker TaskTracker TaskTracker TaskTracker TaskTracker Metadata HDFS NameNode DataNode1 DataNode2 DataNode3 DataNode4 DataNode5

58 MapReduce A method for distributing a task across nodes: - Each node processes data stored on that node Consists of two phases: - Map - Reduce

59 MapReduce A method for distributing a task across nodes: - Each node processes data stored on that node Consists of two phases: - Map - Reduce Data (Mapper Input) Map Reduce Result

60 Features of MapReduce Automatic parallelization and distribution A clean abstraction for programmers - MapReduce programs are usually written in Java MapReduce abstracts all the housekeeping away from the developer: - Developers concentrate on Map and Reduce functions

61 MapReduce Example: Word Count Count the # of occurrences of each word in a large amount of input data Map(input_key, input_value) { foreach word w in input_value: emit(w, 1); }

62 MapReduce Example: Word Count Count the # of occurrences of each word in a large amount of input data Map(input_key, input_value) { foreach word w in input_value: emit(w, 1); } Input to the Mapper: (3414, "the cat sat on the mat"), (3437, "the aardvark sat on the sofa")

63 MapReduce Example: Word Count Count the # of occurrences of each word in a large amount of input data Map(input_key, input_value) { foreach word w in input_value: emit(w, 1); } Input to the Mapper: (3414, "the cat sat on the mat"), (3437, "the aardvark sat on the sofa") Output from the Mapper: ("the", 1), ("cat", 1), ("sat", 1), ("on", 1), ("the", 1), ("mat", 1), ("the", 1), ("aardvark", 1), ("sat", 1), ("on", 1), ("the", 1), ("sofa", 1)

64 Map Phase Mapper Input the cat sat on the mat the aardvark sat on the sofa

65 Map Phase Mapper Input: "the cat sat on the mat", "the aardvark sat on the sofa" Mapper Output: ("the", 1), ("cat", 1), ("sat", 1), ("on", 1), ("the", 1), ("mat", 1), ("the", 1), ("aardvark", 1), ("sat", 1), ("on", 1), ("the", 1), ("sofa", 1)

66 Reducer After the Map, all the intermediate values for a given intermediate key are combined together into a list

67 Reducer After the Map, all the intermediate values for a given intermediate key are combined together into a list Mapper Output: ("the", 1), ("cat", 1), ("sat", 1), ("on", 1), ("the", 1), ("mat", 1), ("the", 1), ("aardvark", 1), ("sat", 1), ("on", 1), ("the", 1), ("sofa", 1) Reducer Input: ("aardvark", [1]), ("cat", [1]), ("mat", [1]), ("on", [1, 1]), ("sat", [1, 1]), ("sofa", [1]), ("the", [1, 1, 1, 1])

68 Reducer Add up all the values associated with each intermediate key: Reduce(output_key, intermediate_vals) { set count = 0; foreach v in intermediate_vals: count += v; emit(output_key, count); }

69 Reducer Add up all the values associated with each intermediate key: Reduce(output_key, intermediate_vals) { set count = 0; foreach v in intermediate_vals: count += v; emit(output_key, count); } Output from the Reducer: ("the", 4), ("sat", 2), ("on", 2), ("sofa", 1), ("mat", 1), ("cat", 1), ("aardvark", 1)

70 Map + Reduce Mapper Input: "the cat sat on the mat", "the aardvark sat on the sofa" Mapping: ("the", 1), ("cat", 1), ("sat", 1), ("on", 1), ("the", 1), ("mat", 1), ("the", 1), ("aardvark", 1), ("sat", 1), ("on", 1), ("the", 1), ("sofa", 1) Shuffling: ("aardvark", [1]), ("cat", [1]), ("mat", [1]), ("on", [1, 1]), ("sat", [1, 1]), ("sofa", [1]), ("the", [1, 1, 1, 1]) Reducing: ("aardvark", 1), ("cat", 1), ("mat", 1), ("on", 2), ("sat", 2), ("sofa", 1), ("the", 4)
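
The three stages on this slide (mapping, shuffling, reducing) can be run end to end on the lecture's two input records. This is a single-process sketch in plain Python rather than a Hadoop job; `map_fn`, `shuffle` and `reduce_fn` are names chosen here to mirror the Map and Reduce pseudocode from the earlier slides.

```python
from collections import defaultdict


def map_fn(input_key, input_value):
    """Emit (word, 1) for every word in the record; the key is unused."""
    for word in input_value.split():
        yield (word, 1)


def shuffle(pairs):
    """Group all intermediate values by key, sorted by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(sorted(groups.items()))


def reduce_fn(output_key, intermediate_vals):
    """Add up all the values associated with one intermediate key."""
    count = 0
    for v in intermediate_vals:
        count += v
    return (output_key, count)


records = [
    (3414, "the cat sat on the mat"),
    (3437, "the aardvark sat on the sofa"),
]

mapped = [pair for key, value in records for pair in map_fn(key, value)]
grouped = shuffle(mapped)
result = dict(reduce_fn(key, values) for key, values in grouped.items())

print(grouped)
print(result)
```

In a real cluster the calls to `map_fn` would run on the nodes that hold the input blocks and each key's `reduce_fn` call could run on a different node; the sketch keeps only the data flow.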

71 Why we care about counting words Word count is challenging over massive amounts of data Fundamental statistics are often aggregate functions Most aggregate functions are distributive in nature MapReduce breaks complex tasks into smaller pieces which can be executed in parallel
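
"Distributive" here means the aggregate of the whole data set can be computed from aggregates of its parts. A small sketch, with a made-up data set and helper names chosen for illustration, shows sum, count and max combined from four independent chunks, and the mean derived from the combined sum and count.

```python
from functools import reduce


def partial_stats(chunk):
    """Per-chunk aggregate: (sum, count, max). Could run on any node."""
    return (sum(chunk), len(chunk), max(chunk))


def merge(a, b):
    """Combine two partial aggregates into one; order does not matter."""
    return (a[0] + b[0], a[1] + b[1], max(a[2], b[2]))


data = list(range(1, 101))                            # made-up input: 1..100
chunks = [data[i:i + 25] for i in range(0, len(data), 25)]

partials = [partial_stats(chunk) for chunk in chunks]  # the "map" side
total, count, maximum = reduce(merge, partials)        # the "reduce" side
mean = total / count                                   # derived from two distributive parts

print(total, count, maximum, mean)
```

Not every statistic decomposes this way: the median, for example, cannot be recovered from per-chunk medians alone, which is why the slide says "most" rather than "all".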

72 Deployment NameNode JobTracker [Determines execution plan] Client TaskTracker Map Reduce DataNode TaskTracker Map Reduce DataNode TaskTracker Map Reduce DataNode

73 Discussions

74 Lecture Outline Introduction Hadoop Chord

76 Why Chord?

77 Chord

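81 Chord Finger table: start 2, int. [2,3), succ. 3; start 3, int. [3,5), succ. 3; start 5, int. [5,1).

82 Chord Finger table: start 2, int. [2,3), succ. 3; start 3, int. [3,5), succ. 3; start 5, int. [5,1). Finger table: start 4, int. [4,5), succ. 0; start 5, int. [5,7), succ. 0; start 7, int. [7,3), succ. 0.

83 Chord Finger table: start 1, int. [1,2), succ. 1; start 2, int. [2,4), succ. 3; start 4, int. [4,0). Finger table: start 2, int. [2,3), succ. 3; start 3, int. [3,5), succ. 3; start 5, int. [5,1). Finger table: start 4, int. [4,5), succ. 0; start 5, int. [5,7), succ. 0; start 7, int. [7,3), succ. 0.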
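
The finger tables on these slides follow the usual Chord definition on a ring of 2^m identifiers: for node n, row i has start = (n + 2^i) mod 2^m, the interval runs from that start to the next row's start, and succ. is the first node at or after start. The sketch below assumes m = 3 and node identifiers 0, 1 and 3; those identifiers are inferred from the start columns (1, 2, 4; 2, 3, 5; 4, 5, 7) and are not stated in the slide text. Where a succ. value is missing from the slide text, the sketch simply reports what the definition gives.

```python
M = 3                 # identifier bits (assumption inferred from the tables)
RING = 2 ** M         # identifiers 0..7


def successor(ident, nodes):
    """First node whose id is >= ident, wrapping around the ring."""
    ident %= RING
    for n in sorted(nodes):
        if n >= ident:
            return n
    return min(nodes)


def finger_table(n, nodes):
    """Rows of (start, (interval_start, interval_end), successor) for node n."""
    rows = []
    for i in range(M):
        start = (n + 2 ** i) % RING
        end = (n + 2 ** (i + 1)) % RING
        rows.append((start, (start, end), successor(start, nodes)))
    return rows


nodes = {0, 1, 3}
for n in sorted(nodes):
    print("node", n)
    for start, (lo, hi), succ in finger_table(n, nodes):
        print(f"  start {start}  int. [{lo},{hi})  succ. {succ}")
```

Each node keeps only m rows, yet the starts double in distance around the ring, which is what lets a lookup roughly halve the remaining distance at each hop.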

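85 Importance: Complexity! Finger table: start 1, int. [1,2), succ. 1; start 2, int. [2,4); int. [4,0). Finger table: start 2, int. [2,3); int. [3,5), succ. 3; start 5, int. [5,1). Finger table: start 4, int. [4,5), succ. 0; start 5, int. [5,7), succ. 0; start 7, int. [7,3), succ. 0.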

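86 Node Joins Finger table: start 1, int. [1,2), succ. 1; start 2, int. [2,4); int. [4,0). Finger table: start 2, int. [2,3), succ. 3; start 3, int. [3,5), succ. 3; start 5, int. [5,1). Finger table: start 4, int. [4,5), succ. 0; start 5, int. [5,7), succ. 0; start 7, int. [7,3), succ. 0.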

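88 Node Joins Finger table: start 1, int. [1,2), succ. 1; start 2, int. [2,4), succ. 3; start 4, int. [4,0). Finger table: start 2, int. [2,3), succ. 3; start 3, int. [3,5), succ. 3; start 5, int. [5,1). Finger table: start 4, int. [4,5), succ. 0; start 5, int. [5,7), succ. 0; start 7, int. [7,3), succ. 0.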

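91 Node Joins Finger table: start 1, int. [1,2), succ. 1; start 2, int. [2,4), succ. 3; start 4, int. [4,0). Finger table: start 2, int. [2,3), succ. 3; start 3, int. [3,5), succ. 3; start 5, int. [5,1). Finger table: start 4, int. [4,5), succ. 6; start 5, int. [5,7), succ. 6; start 7, int. [7,3), succ. 0.

92 Node Joins Finger table: start 1, int. [1,2), succ. 1; start 2, int. [2,4), succ. 3; start 4, int. [4,0). Finger table: start 2, int. [2,3), succ. 3; start 3, int. [3,5), succ. 3; start 5, int. [5,1). Finger table: start 7, int. [7,0), succ. 0; start 0, int. [0,2), succ. 0; start 2, int. [2,6). Finger table: start 4, int. [4,5), succ. 6; start 5, int. [5,7), succ. 6; start 7, int. [7,3), succ. 0.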

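93 Case Study: Node Leaves Finger table: start 1, int. [1,2), succ. 1; start 2, int. [2,4), succ. 3; start 4, int. [4,0). Finger table: int. [2,3), succ. 3; start 3, int. [3,5), succ. 3; start 5, int. [5,1). Finger table: start 7, int. [7,0), succ. 0; start 0, int. [0,2), succ. 0; start 2, int. [2,6). Finger table: start 4, int. [4,5), succ. 6; start 5, int. [5,7), succ. 6; start 7, int. [7,3), succ. 0.

94 Case Study: Node Leaves Finger table: start 1, int. [1,2), succ. 3; start 2, int. [2,4), succ. 3; start 4, int. [4,0). Finger table: start 7, int. [7,0), succ. 0; start 0, int. [0,2), succ. 0; start 2, int. [2,6). Finger table: start 4, int. [4,5), succ. 6; start 5, int. [5,7), succ. 6; start 7, int. [7,3), succ. 0.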
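
The join and leave slides can be cross-checked by recomputing every finger table from the current set of nodes and listing which entries changed. The sketch assumes a 3-bit ring, that the starting nodes are 0, 1 and 3, that the joining node is 6 (its table starts at 7, 0, 2) and that the leaving node is 1 (the table with starts 2, 3, 5 disappears); these identifiers are inferred from the tables, not stated in the slide text. Recomputing from global knowledge is a shortcut for illustration only and is not the join or leave protocol a running Chord node would execute.

```python
M = 3
RING = 2 ** M


def successor(ident, nodes):
    """First node at or after ident on the ring, wrapping to the smallest id."""
    ident %= RING
    return min((n for n in nodes if n >= ident), default=min(nodes))


def finger_successors(n, nodes):
    """Map each finger start of node n to its successor node."""
    return {(n + 2 ** i) % RING: successor(n + 2 ** i, nodes) for i in range(M)}


def all_tables(nodes):
    return {n: finger_successors(n, nodes) for n in sorted(nodes)}


def changed(old, new):
    """Entries (node, start, old_succ, new_succ) that differ for surviving nodes."""
    return [
        (n, start, old[n][start], new[n][start])
        for n in sorted(old.keys() & new.keys())
        for start in old[n]
        if old[n][start] != new[n][start]
    ]


before = all_tables({0, 1, 3})
joined = all_tables({0, 1, 3, 6})     # node 6 joins (inferred id)
left = all_tables({0, 3, 6})          # node 1 leaves (inferred id)

print("after join :", changed(before, joined), "new table:", joined[6])
print("after leave:", changed(joined, left))
```

Only a handful of entries change in each step, which matches the slides: the tables with starts 4, 5, 7 switch from succ. 0 to succ. 6 on the join, and the table with starts 1, 2, 4 switches its first row from succ. 1 to succ. 3 on the leave.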

95 Chord Issues?
