Hadoop Namenode High Availability August 2008 Requirements and Procedures.

Slides:

Advertisements

Similar presentations

Express5800/ft series servers Product Information Fault-Tolerant General Purpose Servers.

Advertisements

Software Version: DSS ver up01

Virtual Trunk Protocol

Hadoop at ContextWeb February ContextWeb: Traffic Traffic – up to 6 thousand Ad requests per second. Comscore Trend Data:

Transitioning of existing applications to use HDFS August 2008.

MySQL Advanced MySQL Replication MySQL Cluster MySQL Partitioning

Multiplication X 1 1 x 1 = 1 2 x 1 = 2 3 x 1 = 3 4 x 1 = 4 5 x 1 = 5 6 x 1 = 6 7 x 1 = 7 8 x 1 = 8 9 x 1 = 9 10 x 1 = x 1 = x 1 = 12 X 2 1.

Doc.: IEEE /139r0 Submission 16 March, 2004 Francis daCosta, MeshDynamicsSlide 1 Project: IEEE P Working Group for Wireless Personal Area.

Extending Drive-Thru Data Access by Vehicle-to-Vehicle Relay Jing Zhao Todd Arnold Yang Zhang Guohong Cao Pennsylvania State University September 15 th,

1 Introducing the Specifications of the Metro Ethernet Forum.

Satoshi NARIKAWA NTT (G.8032 Acting Co-editor)

Tom Hamilton – America’s Channel Database CSE

Chapter 9 Part III Linux File System Administration

Hands-on RAID on Moxa Computer Prepared by: (40min) Date: mm-dd-yyyy.

The Linux Storage People Simple Fast Massively Scalable Network Storage Coraid EtherDrive ® Storage.

© 2003 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Performance Measurements of a User-Space.

Chapter 1: Introduction to Scaling Networks

Introduction to cloud computing Jiaheng Lu Department of Computer Science Renmin University of China

13,000 Jobs and counting…. Advertising and Data Platform Our System.

Copyright © 2014 EMC Corporation. All Rights Reserved. Linux Host Installation and Integration for Block Upon completion of this module, you should be.

Mecanismos de alta disponibilidad con Microsoft SQL Server 2008 Por: ISC Lenin López Fernández de Lara.

High Availability 24 hours a day, 7 days a week, 365 days a year… Vik Nagjee Product Manager, Core Technologies InterSystems Corporation.

High Availability through Virtualization

1 © Copyright 2010 EMC Corporation. All rights reserved. EMC RecoverPoint/Cluster Enabler for Microsoft Failover Cluster.

High Performance Computing Course Notes High Performance Storage.

1© Copyright 2011 EMC Corporation. All rights reserved. EMC RECOVERPOINT/ CLUSTER ENABLER FOR MICROSOFT FAILOVER CLUSTER.

Hardening Linux for Enterprise Applications Peter Knaggs & Xiaoping Li Oracle Corporation Sunil Mahale Network Appliance Session id:

MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter Ten Configuring Windows Server 2008 for High.

Page 1 Mirror File System A patented breakthrough technology for Business Continuity and Disaster Prevention Twin Peaks Software Inc.

10/02/2004ELFms meeting1 Linux Virtual Server Miroslav Siket FIO-FS.

Networking Features Upon completion of this module, you should be able to: Discuss and configure VNX networking features This module continues the discussion.

Experience and Lessons learnt from running High Availability Databases on Network Attached Storage Ruben Gaspar Manuel Guijarro et al IT/DES.

1 The Google File System Reporter: You-Wei Zhang.

Guide to Linux Installation and Administration, 2e 1 Chapter 9 Preparing for Emergencies.

Nagios Is Down and Your Boss Wants to See You Andrew Widdersheim

High-Availability Linux.  Reliability  Availability  Serviceability.

Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.

Next Generation Security Solutions Next Generation Controllers January 2013.

Page 1 of John Wong CTO Twin Peaks Software Inc. Mirror File System A Multiple Server File System.

Sensitivity of Cluster File System Access to I/O Server Selection A. Apon, P. Wolinski, and G. Amerson University of Arkansas.

High-Availability MySQL DB based on DRBD-Heartbeat Ming Yue September 27, 2007 September 27, 2007.

FailSafe SGI’s High Availability Solution Mayank Vasa MTS, Linux FailSafe Gatekeeper

The application of DRBD in Linux-HA Haibo Zhang 4/28/2014.

Course 301 – Secured Network Deployment and IPSec VPN

GFS. Google r Servers are a mix of commodity machines and machines specifically designed for Google m Not necessarily the fastest m Purchases are based.

1 © 2005 Cisco Systems, Inc. All rights reserved. 111 © 2004, Cisco Systems, Inc. All rights reserved. CNIT 221 Security 2 ver.2 Module 8 City College.

Introduction to HDFS Prasanth Kothuri, CERN 2 What’s HDFS HDFS is a distributed file system that is fault tolerant, scalable and extremely easy to expand.

Mark E. Fuller Senior Principal Instructor Oracle University Oracle Corporation.

CO5023 LAN Redundancy.

Linux Operations and Administration

The 2001 Tier-1 prototype for LHCb-Italy Vincenzo Vagnoni Genève, November 2000.

Fault Tolerance

AFS/OSD Project R.Belloni, L.Giammarino, A.Maslennikov, G.Palumbo, H.Reuter, R.Toebbicke.

Virtual Machine Movement and Hyper-V Replica

How to setup DSS V6 iSCSI Failover with XenServer using Multipath Software Version: DSS ver up55 Presentation updated: February 2011.

High-Availability MySQL with DR:BD and Heartbeat: MTV Japan mobile services ©2008 MTV Networks Japan K.K.

MySQL HA An overview Kris Buytaert. ● Senior Linux and Open Source ● „Infrastructure Architect“ ● I don't remember when I started.

High Availability Clusters in Linux Sulamita Garcia EDS Unix Specialist

Parallel Virtual File System (PVFS) a.k.a. OrangeFS

rain technology (redundant array of independent nodes)

Failover and High Availability

High Availability 24 hours a day, 7 days a week, 365 days a year…

HP ArcSight ESM 6.8c HA Fail Over Illustrated

Cluster Communications

Advanced Network Training

The Basics of Apache Hadoop

Hadoop Distributed Filesystem

DriveScale Log Collection Method of Procedure

Features Overview.

CS 295: Modern Systems Organizing Storage Devices

Presentation transcript:

Hadoop Namenode High Availability August 2008 Requirements and Procedures

2 Requirements  Two nodes to satisfy availability requirements.  High availability for internal components of each node.  Disk redundancy  Network redundancy  Redundant network architecture.  Heartbeat mechanism between the two nodes.  Replication of namenode metadata.  Automatic fail over with no human action required.

Internal Components Disks o 2x 300 GB 15k RPM SAS. o Hardware RAID 1 mirroring. o SMART monitoring. Network o Dual 1Gbps on-board NICs. o Linux bonding with LACP.

4 Redundant Network Architecture Linux bonding o See bonding.txt from Linux kernel docs. o LACP, aka 802.3ad, aka mode=4.  ( o Must be supported by your switches. o Throughput advantage  Observed at 1.76Gb/s o Allows for failure of either NIC instead of a single heartbeat connection via crossover. Switching infrastructure and physical segregation. o See diagram

5 Network Diagram

6 Heartbeat Between Nodes  Provided by "heartbeat" package.  (  Manage multiple resources:  Virtual IP address  DRBD Disk  Hadoop processes  /etc/ha.d/haresources example: cw-grid101.contextweb.prod IPaddr:: cw-grid101.contextweb.prod drbddisk::r0 cw-grid101.contextweb.prod Filesystem::/dev/drbd0::/hadoop::ext3::defaults cw-grid101.contextweb.prod hadoop  Heartbeat uses bond0 network interface. (* Not approved)  3 second timeout for "deadtime".  Created LSB compliant hadoop init script.

7 Replication of Namenode Metadata  DRBD Replication.  (

/etc/drbd.conf example: global { usage-count no; } resource r0 { protocol C; syncer { rate 110M; } # approximately 50% of total available startup { wfc-timeout 0; degr-wfc-timeout 120; } on cw-grid101.contextweb.prod { device /dev/drbd0; disk /dev/sda4; address :7788; meta-disk internal; } on cw-grid102.contextweb.prod { device /dev/drbd0; disk /dev/sda4; address :7788; meta-disk internal; } }

Fail Over Order of Events Virtual IP fails over. DRBD system switches primary node. (/proc/drbd status) File system fsck and mount at /hadoop. Hadoop started via LSB compliant init script. End to end fail over time approximately 15 seconds. Optionally, original master is rebooted to help avoid split-brain.

DRBD Status  Updating # cat /proc/drbd version: (api:88/proto:86-88) GIT-hash: 3e69822d3bb4920a8c1bfdf7d647169eba7d2eb4 build by :04:55 0: cs:SyncSource st:Primary/Secondary ds:UpToDate/Inconsistent C r--- ns: nr:0 dw: dr: al:11746 bm:12767 lo:14 pe:12 ua:246 ap:1 oos: [==> ] sync'ed: 18.0% (82459/100465)M finish: 0:14:31 speed: 96,904 (77,472) K/sec  Synchronized # cat /proc/drbd version: (api:88/proto:86-88) GIT-hash: 3e69822d3bb4920a8c1bfdf7d647169eba7d2eb4 build by :04:55 0: cs:Connected st:Primary/Secondary ds:UpToDate/UpToDate C r--- ns: nr:0 dw: dr: al:11781 bm:17923 lo:0 pe:0 ua:0 ap:0 oos:0 10