Express5800/ft series servers Product Information Fault-Tolerant General Purpose Servers.


1 Express5800/ft series servers Product Information Fault-Tolerant General Purpose Servers

2 Express5800/ft Series Servers High Availability Technologies

3 © NEC Corporation 2013 Page 3 Approaches to Reliability and Availability
Select and combine hardware and software technologies for availability.
Single server (typical servers): partially redundant hardware (e.g. HDD, PSU).
Fault tolerant server: redundant hardware (dual modular architecture). Higher availability of a single server. Continuous operation despite hardware failures. Simplified installation and operation.
Cluster: cluster software. Failover across multiple servers. Higher availability of the system. For large-scale systems with scalable nodes etc.
FT server + cluster (FT server cluster): enhanced HW/SW failure resilience.
Diagram axes: enhance fault tolerance of the hardware; enhance availability of the system.
Select the best availability solution according to system requirements.

4 FT Server and Cluster Solution Comparison
Fault tolerant server
  Technology: Lockstep (CPU and memory) and failover (I/O), synchronized in normal conditions
  Resilience: Hardware failures
  Failover process: Isolate faulty component (failure isolation across CPU, memory, HDD)
  Service during failure: Continuous operation (no interruption)
  Performance enhancement: Add CPU
  Supported apps: General applications. No modifications needed.
  Aim: High availability of a single server. System configuration requires no app modifications. Continuous operation without interruption. Ideal for 24-7 systems, email and Web servers.
Cluster system (EXPRESSCLUSTER)
  Technology: Failover, load balancing
  Resilience: Hardware/software failures
  Failover process: Failover to other servers
  Service during failure: Operation is interrupted for the failover process (several minutes to 10 minutes)
  Performance enhancement: Add CPU or node. Supports servers with 4 or more sockets.
  Supported apps: Failover settings are required for each app (creation of script batch files)
  Aim: Achieve availability / scalability / load balancing. Features load balancing as well as availability. Software failure-resilient. Suitable for large-scale systems (scalable nodes).
ft servers provide hardware availability and can be installed quickly and easily.
The ft servers + EXPRESSCLUSTER solution takes advantage of both solutions.

5 Recovery Process from HW Failures
Cluster system
  In service -> Failure -> Start failover process
  1. Interruption (a few secs)
  2. Determine failover host (a few secs to 1-2 mins)
  3. Takeover of cluster resources (e.g. NW settings and disks) (a few secs to 1 min)
  4. Restart apps (a few secs to a few mins)
  Failover complete -> Restart service -> In service
  Service intermittence: system down for a few mins to 10 mins. The failed server is then repaired or replaced.
Express5800/ft series server
  Module #0 and Module #1 process in lockstep.
  1. Instantaneous isolation of the faulty module (non-stop service)
  2. Resynchronization after replacement of the faulty module
  Recovery complete
  Continuous operation: processing continues while the faulty module is isolated.
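The ft server recovery sequence on this slide (isolate the faulty module, keep serving, resynchronize after replacement) can be illustrated with a toy model. This is a minimal sketch under stated assumptions, not NEC's implementation: the `Module` and `FtServer` classes, their methods, and the integer `state` standing in for CPU and memory contents are all hypothetical.

```python
class Module:
    """One CPU/memory module of the pair (hypothetical model, not NEC code)."""

    def __init__(self, name):
        self.name = name
        self.state = 0        # stands in for CPU and memory contents
        self.healthy = True

    def step(self, value):
        self.state += value
        return self.state


class FtServer:
    """Two modules that apply every operation in lockstep."""

    def __init__(self):
        self.modules = [Module("Module #0"), Module("Module #1")]

    def healthy_modules(self):
        return [m for m in self.modules if m.healthy]

    def process(self, value):
        # Every healthy module executes the same operation.
        results = [m.step(value) for m in self.healthy_modules()]
        if not results:
            raise RuntimeError("no healthy module left")
        if any(r != results[0] for r in results):
            raise RuntimeError("modules diverged")
        return results[0]

    def isolate(self, index):
        # Step 1: take the faulty module out of service; the other keeps running.
        self.modules[index].healthy = False

    def replace_and_resync(self, index):
        # Step 2: install a replacement and copy state from the surviving module.
        survivor = self.healthy_modules()[0]
        replacement = Module(self.modules[index].name)
        replacement.state = survivor.state
        self.modules[index] = replacement


server = FtServer()
server.process(5)             # both modules run in lockstep
server.isolate(1)             # Module #1 fails and is isolated
server.process(7)             # service continues on Module #0 alone
server.replace_and_resync(1)  # replacement module copies the survivor's state
server.process(1)             # lockstep resumes on both modules
```

In this model the caller never sees an interruption: `process` keeps returning results while one module is isolated, which is the contrast the slide draws with the cluster's multi-step failover.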

6 Express5800/ft Series Servers Optional Features to Increase Fault Tolerance

7 Express Report Service
Support: Express Report Service
Isolate the failed components to continue operation.
Monitor hardware status at the service center.
Support the system proactively to ensure continuous availability.
Diagram: hardware monitoring and detection on the client's server (CPU, memory, HDD) raises an alert, and a notification goes to the NEC service center (monitoring center). The server moves through failure, isolation, continuous operation, replacement, and recovery.
Only the alert information will be sent out, with dedicated software (secure environment), via the internet (mail server) or a public line (modem connection).

8 Support for Redundant Peripheral Devices
Peripheral Devices
Selection of LTO or DAT and support for redundant backup*
  Double backup configuration is supported to provide for failures during backup.
  LTO or DAT drives are offered for selection.
  Diagram labels: ft series, Module #1, Module #2, SAS controllers, backup devices.
  Data is output from each module to achieve backup redundancy.
  Both backups are created almost simultaneously.
  * Configuration of standalone backup is also supported.
A two-UPS configuration provides tolerance against UPS defects*
  Diagram labels: ft series, Module #1, Module #2, PSU, uninterruptible power supplies.
  UPS is controlled through the network.
  Connecting each UPS to separate power sources helps avoid being affected by failures of the power sources.
  * Single UPS configuration is also supported.

9 ft series + EXPRESSCLUSTER for Higher Availability
Enhancement: SW
Clusters with ft servers enhance both HW and SW availability.
Hardware failure: handled within the ft series server (Module #0 and Module #1).
Software failure: EXPRESSCLUSTER monitors SW (OS, apps) and fails over from the ft server (primary) to the ft server (secondary).
Highest level of availability suitable for critical systems.

10 Benefits of ft Series + EXPRESSCLUSTER
Enhancement: SW
Clusters using ft servers deliver the benefits of both solutions.
Columns: (A) Express5800/ft server; (B) Cluster system (configured by normal servers); (C) Cluster system (configured by ft servers)
Function
  A: Lockstep and failover (within a server)
  B: Failover (between multiple servers)
  C: Failover (between multiple servers)
HW failure tolerance, treatment
  A: Isolate faulty module (within the server)
  B: Failover from the primary server to the secondary server
  C: Isolate faulty module within the primary server (no failover between nodes)
HW failure tolerance, treatment time
  A: Instantaneous
  B: Few minutes (depends on the time necessary to start up apps)
  C: Instantaneous
SW failure tolerance, treatment
  A: - (app-level failures can be resolved by SingleServerSafe software)
  B: Failover from the primary server to the secondary server
  C: Failover from the primary server to the secondary server
SW failure tolerance, treatment time
  A: -
  B: Several minutes (depends on the time necessary to start up apps)
  C: Several minutes (depends on the time necessary to start up apps)
Periodical maintenance (SW update)
  A: Active Upgrade enables OS patches to be applied with only a short interruption
  B: Each node can be separated for upgrade
  C: Each node can be separated for upgrade
Performance enhancement
  A: Add CPU
  B: Add CPU or nodes
  C: Add CPU
Apps settings
  A: General apps can be used without special modifications
  B: Takeover process is required for each app
  C: Takeover process is required for each app
Legend (rating symbols not preserved): Excellent, Good, Fair
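The treatment rows of this comparison can be read as a lookup from configuration and failure type to recovery action. The sketch below only transcribes the slide's entries into a table. The dictionary keys, the `recovery` function, and the shortened wording are hypothetical and not part of any NEC or EXPRESSCLUSTER interface.

```python
# Treatments transcribed from the slide; keys and names are illustrative only.
TREATMENT = {
    ("ft server", "hw"): ("isolate faulty module", "instantaneous"),
    # The slide lists "-" here and notes that SingleServerSafe can
    # resolve app-level failures.
    ("ft server", "sw"): None,
    ("cluster of normal servers", "hw"): ("failover to secondary server", "few minutes"),
    ("cluster of normal servers", "sw"): ("failover to secondary server", "several minutes"),
    ("cluster of ft servers", "hw"): ("isolate faulty module within primary", "instantaneous"),
    ("cluster of ft servers", "sw"): ("failover to secondary server", "several minutes"),
}


def recovery(config, failure):
    """Return the slide's treatment and treatment time as one string."""
    entry = TREATMENT[(config, failure)]
    if entry is None:
        return "no treatment listed"
    action, duration = entry
    return f"{action} ({duration})"


for failure in ("hw", "sw"):
    print(failure, "->", recovery("cluster of ft servers", failure))
```

Laid out this way, the combined configuration takes the instantaneous hardware treatment from the ft server column and the software failover from the cluster column.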

11 ft server + Hyper-V + EXPRESSCLUSTER
Enhancement: SW
Clusters configured on Hyper-V on an ft server.
Hardware failure: handled by the ft series server (Module #0 and Module #1) running Hyper-V 2.0.
Software failure: EXPRESSCLUSTER monitors SW (guest OS, apps). In the event of a SW failure, the operation fails over to another guest OS.
High HW and SW availability for virtualized environments.

12 EXPRESSCLUSTER X SingleServerSafe
Enhancement: SW
SW is monitored on the ft server and automatically restarted in the event of a failure.
SingleServerSafe (SSS) monitors the server and SW status at all times.
In the event of a failure, SSS restarts the service, process, OS etc. to resume operation.
Diagram labels: OS (reboot), service, process and apps (restart).
The ft server and SSS in tandem can handle both HW and SW failures.
SW availability can be improved even for a single ft server.
By enabling failure detection and restart/reboot, SSS helps handle a wide range of failures with a single server.
By using the optional monitoring function of EXPRESSCLUSTER, SSS is capable of more detailed monitoring, including the detection of stalled databases.
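The detect-and-restart behavior described for SingleServerSafe follows a familiar watchdog pattern, which a short sketch can make concrete. The code below is a generic illustration, not the SingleServerSafe implementation or its configuration: `MonitoredService`, `monitor_once`, and the in-memory log are hypothetical stand-ins for a real service and its health check.

```python
class MonitoredService:
    """Stand-in for a service or process under supervision (hypothetical)."""

    def __init__(self, name):
        self.name = name
        self.running = True
        self.restarts = 0

    def is_alive(self):
        return self.running

    def crash(self):
        # Simulate a software failure.
        self.running = False

    def restart(self):
        self.running = True
        self.restarts += 1


def monitor_once(service, log):
    """One polling cycle: check health and restart on failure.

    Returns True if a restart was performed.
    """
    if service.is_alive():
        return False
    log.append(f"{service.name}: failure detected, restarting")
    service.restart()
    return True


log = []
web = MonitoredService("web")
monitor_once(web, log)   # healthy, nothing to do
web.crash()
monitor_once(web, log)   # failure detected and service restarted
```

A real monitor would run this check on a timer and could escalate from a service restart to an OS reboot; the slide does not specify the order or the thresholds, so the sketch leaves them out.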


