Hadoop Namenode High Availability August 2008 Requirements and Procedures.


Requirements
- Two nodes to satisfy availability requirements.
- High availability for internal components of each node.
  o Disk redundancy
  o Network redundancy
- Redundant network architecture.
- Heartbeat mechanism between the two nodes.
- Replication of namenode metadata.
- Automatic fail over with no human action required.

Internal Components
- Disks
  o 2x 300 GB 15k RPM SAS.
  o Hardware RAID 1 mirroring.
  o SMART monitoring.
- Network
  o Dual 1Gbps on-board NICs.
  o Linux bonding with LACP.

Redundant Network Architecture
- Linux bonding
  o See bonding.txt from Linux kernel docs.
  o LACP, aka 802.3ad, aka mode=4.
    (
  o Must be supported by your switches.
  o Throughput advantage
    - Observed at 1.76Gb/s
  o Allows for failure of either NIC, rather than relying on a single heartbeat connection over a crossover cable.
- Switching infrastructure and physical segregation.
  o See diagram

[Figure: Network Diagram]

Heartbeat Between Nodes
- Provided by "heartbeat" package.
  (
- Manage multiple resources:
  o Virtual IP address
  o DRBD Disk
  o Hadoop processes
- /etc/ha.d/haresources example:

    cw-grid101.contextweb.prod IPaddr::
    cw-grid101.contextweb.prod drbddisk::r0
    cw-grid101.contextweb.prod Filesystem::/dev/drbd0::/hadoop::ext3::defaults
    cw-grid101.contextweb.prod hadoop

- Heartbeat uses bond0 network interface. (* Not approved)
- 3 second timeout for "deadtime".
- Created LSB compliant hadoop init script.
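To make the haresources layout concrete, here is a minimal Python sketch that splits one entry into its node name and resources. The helper name parse_haresources_line is hypothetical and is not part of the heartbeat package. It assumes the layout visible in the example above: a node name followed by whitespace-separated resources whose arguments are joined with "::".

```python
def parse_haresources_line(line):
    """Split a haresources entry into (node, [(resource, [args...]), ...])."""
    fields = line.split()
    if not fields:
        raise ValueError("empty haresources entry")
    node, resources = fields[0], []
    for field in fields[1:]:
        # "Filesystem::/dev/drbd0::/hadoop" -> name "Filesystem", two arguments
        name, *args = field.split("::")
        resources.append((name, args))
    return node, resources


# Entry copied from the slide's example.
node, resources = parse_haresources_line(
    "cw-grid101.contextweb.prod Filesystem::/dev/drbd0::/hadoop::ext3::defaults"
)
print(node)
print(resources)
```

A resource with no arguments, such as the hadoop init script, comes back with an empty argument list.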

Replication of Namenode Metadata
- DRBD Replication.
  (

/etc/drbd.conf example:

    global { usage-count no; }
    resource r0 {
      protocol C;
      syncer { rate 110M; } # approximately 50% of total available
      startup { wfc-timeout 0; degr-wfc-timeout 120; }
      on cw-grid101.contextweb.prod {
        device /dev/drbd0;
        disk /dev/sda4;
        address :7788;
        meta-disk internal;
      }
      on cw-grid102.contextweb.prod {
        device /dev/drbd0;
        disk /dev/sda4;
        address :7788;
        meta-disk internal;
      }
    }
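The comment on the syncer rate does not say what "total available" refers to. One plausible reading, which is an assumption and is not stated in the slides, is the 1.76 Gb/s throughput observed on the bonded link. The arithmetic below is consistent with that reading, ignoring the difference between decimal and binary megabytes.

```python
# Assumption: "total available" means the 1.76 Gb/s observed on the bonded link.
observed_bits_per_sec = 1_760_000_000

bytes_per_sec = observed_bits_per_sec // 8       # 220,000,000 bytes/s
total_mb_per_sec = bytes_per_sec // 1_000_000    # 220 MB/s (decimal megabytes)
half_mb_per_sec = total_mb_per_sec // 2          # half of the link for resync

print(half_mb_per_sec)  # 110
```

Capping resynchronization at about half of the link would leave the remainder for regular replication and heartbeat traffic during a resync.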

Fail Over Order of Events
1. Virtual IP fails over.
2. DRBD system switches primary node. (/proc/drbd status)
3. File system fsck and mount at /hadoop.
4. Hadoop started via LSB compliant init script.

End to end fail over time approximately 15 seconds.
Optionally, original master is rebooted to help avoid split-brain.
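The ordering matters because each step depends on the one before it: the file system cannot be mounted until DRBD is primary, and Hadoop cannot start until /hadoop is mounted. The following Python sketch only illustrates that dependency chain. It is not how heartbeat is implemented, and run_failover and the placeholder actions are hypothetical names.

```python
def run_failover(steps):
    """Run (name, action) pairs in order; stop at the first action returning False.

    Returns (completed_step_names, failed_step_name_or_None).
    """
    completed = []
    for name, action in steps:
        if not action():
            return completed, name
        completed.append(name)
    return completed, None


# Placeholder actions that always succeed; a real setup would invoke the
# resource scripts listed in haresources instead.
steps = [
    ("take over virtual IP", lambda: True),
    ("promote DRBD to primary", lambda: True),
    ("fsck and mount /hadoop", lambda: True),
    ("start hadoop via init script", lambda: True),
]
completed, failed = run_failover(steps)
print(completed, failed)
```

If any step fails, the later steps are never attempted, which mirrors why a failed DRBD promotion must not be followed by a mount.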

DRBD Status
- Updating

    # cat /proc/drbd
    version: (api:88/proto:86-88)
    GIT-hash: 3e69822d3bb4920a8c1bfdf7d647169eba7d2eb4 build by :04:55
     0: cs:SyncSource st:Primary/Secondary ds:UpToDate/Inconsistent C r---
        ns: nr:0 dw: dr: al:11746 bm:12767 lo:14 pe:12 ua:246 ap:1 oos:
        [==> ] sync'ed: 18.0% (82459/100465)M
        finish: 0:14:31 speed: 96,904 (77,472) K/sec

- Synchronized

    # cat /proc/drbd
    version: (api:88/proto:86-88)
    GIT-hash: 3e69822d3bb4920a8c1bfdf7d647169eba7d2eb4 build by :04:55
     0: cs:Connected st:Primary/Secondary ds:UpToDate/UpToDate C r---
        ns: nr:0 dw: dr: al:11781 bm:17923 lo:0 pe:0 ua:0 ap:0 oos:0
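A monitoring script could tell the two states above apart by reading the cs, st, and ds fields of the device line. The Python sketch below is one way to do that, assuming the field labels shown in this output. The function name parse_drbd_status is hypothetical, and other DRBD releases may label the role field differently, in which case the pattern would need adjusting.

```python
import re


def parse_drbd_status(line):
    """Extract connection state, roles, and disk states from a /proc/drbd device line."""
    fields = dict(re.findall(r"\b(cs|st|ds):(\S+)", line))
    local_role, peer_role = fields["st"].split("/")
    local_disk, peer_disk = fields["ds"].split("/")
    return {
        "connection": fields["cs"],
        "local_role": local_role,
        "peer_role": peer_role,
        "local_disk": local_disk,
        "peer_disk": peer_disk,
        # Fully replicated only when connected and both disks are up to date.
        "in_sync": fields["cs"] == "Connected"
        and local_disk == peer_disk == "UpToDate",
    }


# Device lines copied from the two outputs above.
updating = parse_drbd_status(
    "0: cs:SyncSource st:Primary/Secondary ds:UpToDate/Inconsistent C r---"
)
synchronized = parse_drbd_status(
    "0: cs:Connected st:Primary/Secondary ds:UpToDate/UpToDate C r---"
)
print(updating["connection"], updating["in_sync"])
print(synchronized["connection"], synchronized["in_sync"])
```

Checking both the connection state and the disk states avoids treating a resync in progress as a healthy replica.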