0 to 60 in 3.1 – Tyler Carlton and Cory Sessions. Presented by MySQL AB® & O’Reilly Media, Inc.

The Project
• Medium-sized demographics data-mining project
• User base of 1,700,000+
• Hundreds of data points per user

“Legacy” System – Why Upgrade?
• Main DB (external users)
• Offline backup (internal users)
• Weekly manual copy backups
• Maximum of 3 simultaneous data pulls
• 8+ hour pull times for complex data pulls
• Random index corruption

Note: smaller is better. On average, CPU usage with MySQL was 20% lower than with our old database solution.

Why We Chose MySQL Cluster
• Scalable
• Distributed processing
• Five-nines (99.999%) reliability
• Instant data availability between internal and external users
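Note: to put the five-nines reliability target in concrete terms, the sketch below converts an availability percentage into a yearly downtime budget. It is plain arithmetic rather than anything from the talk; the function name and the 365.25-day year are assumptions made for illustration.

```python
# Assumption: a year is taken as 365.25 days for this estimate.
MINUTES_PER_YEAR = 365.25 * 24 * 60


def allowed_downtime_minutes(availability_percent: float) -> float:
    """Return the yearly downtime budget implied by an availability percentage."""
    return MINUTES_PER_YEAR * (1 - availability_percent / 100)


# Five nines leaves only a few minutes of downtime per year.
budget = allowed_downtime_minutes(99.999)
print(f"{budget:.2f} minutes per year")
```

Under these assumptions the budget comes to roughly five minutes of downtime per year.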

What We Built – NDB Data Nodes
• 8-node NDB cluster
  – Dual Core 2 Quad 1.8 GHz
  – 16 GB RAM (data memory)
  – 6× RAID 10 SAS 15k RPM drives

What We Built – API & MGMT Nodes
• 3 API nodes + 1 management node
  – Dual Core 2 Quad 1.8 GHz
  – 8 GB RAM
  – 300 GB 7,200 RPM (RAID 0)

NDB Issues with a Large Data Set
• NDB load times
  – Loading from backup: ~1 hour
  – Restarting NDB nodes: ~1 hour
Note: Load times differ depending on your data size.

NDB Issues with a Large Data Set
• Indexing issues
  – Force the index (NDB picks the wrong one)
  – Index creation/modification order matters (seriously!)
• Local checkpoint tuning
  – TimeBetweenLocalCheckpoints: a value of 20 means 4 MB (4 × 2^20 bytes) of write operations
  – NoOfFragmentLogFiles: the number of sets of four 16 MB redo log files
  – No log file is deleted until 3 local checkpoints have completed
  – On startup, the local checkpoint buffers would overrun
  – RTFM (two, maybe three times)
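Note: the two checkpoint parameters on this slide are easier to reason about as byte counts. The sketch below works through the arithmetic stated on the slide: TimeBetweenLocalCheckpoints is a base-2 exponent applied to 4-byte words, and NoOfFragmentLogFiles counts sets of four 16 MB files. The function names and the example value of 16 log-file sets are assumptions for illustration, not settings from the talk.

```python
WORD_BYTES = 4                 # the slide's "4 x 2^20" uses 4-byte words
REDO_FILE_BYTES = 16 * 2**20   # each redo log file is 16 MB
FILES_PER_SET = 4              # redo log files come in sets of four


def lcp_write_threshold_bytes(time_between_local_checkpoints: int) -> int:
    """Bytes of write operations that trigger a new local checkpoint."""
    return WORD_BYTES * 2**time_between_local_checkpoints


def redo_log_bytes(no_of_fragment_log_files: int) -> int:
    """Total redo log space implied by NoOfFragmentLogFiles."""
    return no_of_fragment_log_files * FILES_PER_SET * REDO_FILE_BYTES


print(lcp_write_threshold_bytes(20))  # 4194304 bytes, i.e. 4 MB
print(redo_log_bytes(16) // 2**20)    # redo log space in MB for an example value of 16
```

Because each step of TimeBetweenLocalCheckpoints doubles the threshold, small changes to the value have a large effect on how often checkpoints start.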

NDB Network Issues
• Network transport packet size
  – The buffer would fill and overrun
  – This caused nodes to miss their heartbeats and drop
• This would happen when:
  – A backup was running
  – A local checkpoint was running at the same time
• Solved by: increasing the network packet buffer

Issues – IN Statements
• IN statements with a set of approximately 10,000 or more values fail when engine_condition_pushdown=ON (in our case, lists of zip codes).
• We really need engine_condition_pushdown=ON, but it broke these queries for us, so we had to disable it.
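Note: the fix described on this slide was to disable engine_condition_pushdown. Purely as an illustration of a different technique that the talk does not mention, the sketch below splits a long value list into smaller batches so that no single IN list approaches the problematic size. The function name, the chunk size of 1,000, and the table and column names are all hypothetical, and whether batching would have avoided the failure is an assumption.

```python
from typing import Iterator, List, Sequence, Tuple


def chunked_in_queries(
    table: str,
    column: str,
    values: Sequence[str],
    chunk_size: int = 1000,
) -> Iterator[Tuple[str, List[str]]]:
    """Yield (sql, params) pairs, each with at most chunk_size IN values.

    table and column are interpolated directly into the SQL text, so they
    must be trusted identifiers; the values themselves stay as parameters.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    for start in range(0, len(values), chunk_size):
        chunk = list(values[start:start + chunk_size])
        placeholders = ", ".join(["%s"] * len(chunk))
        yield f"SELECT * FROM {table} WHERE {column} IN ({placeholders})", chunk


# Hypothetical usage: 10,000 zip codes become 10 queries of 1,000 values each.
zip_codes = [f"{n:05d}" for n in range(10000)]
batches = list(chunked_in_queries("users", "zip_code", zip_codes))
print(len(batches))
```

Each pair can be passed to a DB-API cursor that uses the `%s` parameter style, with the results merged in the application.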

Structuring Apps: Redundancy
• Redundant power supplies + dual power sources
• Port trunking with redundant Gig-E switches
• Number of NDB replicas: 2 (2×4 setup), 64 GB max data size
• MySQL (API node) heads: load balanced with automatic failover
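Note: the 64 GB maximum data size follows from the hardware and replica count given in the talk: 8 data nodes with 16 GB of data memory each, storing 2 copies of every row. The sketch below reproduces that arithmetic; the function name is an assumption, and the figure ignores per-node overhead such as index memory.

```python
def max_data_size_gb(data_nodes: int, memory_per_node_gb: int, replicas: int) -> int:
    """Upper bound on distinct data held by an in-memory cluster.

    Every row is stored once per replica, so only one node's worth of
    memory per node group holds unique data.
    """
    if data_nodes % replicas != 0:
        raise ValueError("data_nodes must be a multiple of replicas")
    node_groups = data_nodes // replicas
    return node_groups * memory_per_node_gb


# 8 nodes with 2 replicas form 4 node groups (the slide's 2x4 setup).
print(max_data_size_gb(data_nodes=8, memory_per_node_gb=16, replicas=2))  # 64
```

The same formula shows the trade-off: raising the replica count improves redundancy but reduces the capacity available for distinct data.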

Structuring Apps: Internal Apps
• Ultimate goal: offload the data-intensive processing to the MySQL nodes

The Good Stuff: Stats!
Queries per Second (over 20 days)
• Average Queries / Sec during our peak times
• Average 250 Queries / Sec

Website Traffic Stats for March 2008

Net Usage: NDB Node
• All NDB data nodes have nearly identical network bandwidth usage
• MySQL (API) nodes use about 9 MB/s max under our current structure
• Totaling 75 MB/s during peak (600 Mb/s)

Monitoring & Maintenance
• SNMP monitoring: CPU, network, memory, load, disk
• Cron scripts:
  – Node status & node-down notification
  – Backups
  – Database maintenance routines
• The MySQL Clustering book provided the base for the scripts
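Note: the talk does not show its cron scripts. As a rough sketch of what a node-status check might look like, the code below parses the text printed by the management client's `show` command (`ndb_mgm -e show`) and lists nodes reported as not connected. The sample output, regular expressions, and function names are assumptions based on the usual layout of that output; they are not the authors' scripts.

```python
import re
import subprocess
from typing import List, Tuple

# Assumed layout: "[ndbd(NDB)] 2 node(s)" headers followed by "id=N ..." lines.
SECTION_RE = re.compile(r"^\[(\w+)\((\w+)\)\]")
DOWN_RE = re.compile(r"^id=(\d+)\s*\(not connected")

SAMPLE_OUTPUT = """\
Cluster Configuration
---------------------
[ndbd(NDB)]\t2 node(s)
id=2\t@192.168.0.30  (Nodegroup: 0, Master)
id=3 (not connected, accepting connect from 192.168.0.40)

[ndb_mgmd(MGM)]\t1 node(s)
id=1\t@192.168.0.10

[mysqld(API)]\t1 node(s)
id=4 (not connected, accepting connect from any host)
"""


def find_down_nodes(show_output: str) -> List[Tuple[str, int]]:
    """Return (node_type, node_id) for every node reported as not connected."""
    down: List[Tuple[str, int]] = []
    section = "unknown"
    for raw_line in show_output.splitlines():
        line = raw_line.strip()
        header = SECTION_RE.match(line)
        if header:
            section = header.group(2)  # NDB, MGM or API
            continue
        node = DOWN_RE.match(line)
        if node:
            down.append((section, int(node.group(1))))
    return down


def read_cluster_status() -> str:
    """Run the management client; requires ndb_mgm on PATH and a reachable cluster."""
    result = subprocess.run(
        ["ndb_mgm", "-e", "show"], capture_output=True, text=True, check=True
    )
    return result.stdout


print(find_down_nodes(SAMPLE_OUTPUT))
```

A cron job could call `find_down_nodes(read_cluster_status())` every few minutes and send a notification whenever the returned list is not empty.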

Net Usage: Next Steps…
• Dolphin NIC testing
  – 4-node test cluster
  – 4× overall performance
  – Brand-new patch to handle automatic Ethernet failover / Dolphin failover (beta as of March 28)

Questions?

Contact Information
• Tyler Carlton
• Cory Sessions
CorySessions.com
OrangeSoda.com