Presentation on theme: "On-Demand Architecture for Network-Critical Physical Infrastructure"— Presentation transcript:
1 On-Demand Architecture for Network-Critical Physical Infrastructure Large Data CentresSmall & Medium Data CentresComputer Rooms & ClosetsI’d like you to consider my proposal to change to the way you think about data centers.Now, when I talk about data centers, I am referring to the classical large corporate data center, but also the small data centers and LAN rooms that actually house the majority of network computing power.I am going to focus my discussion on a single aspect of the data center: The Physical power, cooling, and rack Infrastructure that supports the IT equipment.And I want to talk about: how new ways of assessing the business value of this infrastructure are going to drive how we believe data centers are going to be deployed in the future.David BlumanisAPC, Data Centre Consultant, Asia Pacific
2 Network-Critical Physical Infrastructure (NCPI) Essential foundation of reliabilitySo what exactly do we mean by Network-Critical Physical Infrastructure?We are talking about the lowest level of your networks. Your network and business depends on people supported by process, supported by IT technology, and the IT technology is supported by power-rack-cooling equipment and the services and management that keep them running reliability.In your business, NCPI is the foundation of reliability of your IT operations. Ideally, it is out of sight and out of mind, is there when you need it, does not fail, requires only a small investment, and lasts a long time.You have NCPI and you depend on it. Lets examine your investment in NCPI.
3 You have a large NCPI investment Electricity: 27%Service: 13%Racks: 3%Space: 17%Engineering &Installation: 19%Cooling Equipment: 6%This pie chart shows where your investment in NCPI goes.The chart of Total Cost of Ownership, or TCO, and includes both capital costs and operating costs over a 10 year period. It does represent a typical data center.The biggest TCO cost is electricity…… (explain around the pie)Now obviously the total cost associated with a particular data center is going to depend on the size of the facility. However, it is possible to express total cost in a way so you can think about it for any sized data center. CLICK. If you express the cost per rack, the value is fairly consistent. A rule of thumb is that the TCO for NCPI is K PER RACK.Systems without redundancy come in at the low end of this range, and systems with high amounts of redundancy come in at the high end.But if you do the math, you will see that at a typical value of $100K per rack you are talking about a substantial investment, which may even be comparable to the IT equipment investment. It is a big cost, and you need to understand the value you are receiving for it.Power Equipment: 17%Typical 10 yr lifecycle TCO from $80K to $140K per rackSource: APC White Paper #6
4 Network-Critical Physical Infrastructure Challenges Rapid changes in IT technologyHigh densityHigh operating/service costsIncreasing availability expectationsServer consolidationRegulatory requirementsIn addition to the desire to understand and control the costs, there are also some substancial challenges that many companies face regarding NCPI. These include……In response to all of these issues, APC contends that a radical change is required to the way we approach data center design.Standardize process and architectureEliminate one-time engineeringReduce today’s plan-specify-engineer-purchase-install-test-commission cycle time by 90%Pre-engineer, pre-configure, pre-manufacture, and pre-test NCPI using modular componentsCreate systems that adapt to changing changing requirementsSystematically drive out the main cause of downtime: Human errorIntegrate power, cooling, and rack into a single management architectureEmpower IT professionals to specify, operate, and manage NCPIAPC has the world’s largest R&D budget focused on this problemSo what can we do about all this and what is the entitlement for improved performance and reduced cost?In response, APC is changing the way the world designs, installs, operates, manages and maintains Network-Critical Physical Infrastructure (NCPI)
5 What are the barriers to improving value by 10x? The Business Value Equation for Network-Critical Physical InfrastructureAvailabilityxAgilityValue=TCOAny rational business must assess the value received for their investment in critical power infrastructure.Businesses have a goal for availability which they have determined is required for their enterprise. The value obtained while achieving higher availability has two additional key factors.TCO. Naturally, the total cost of owenership factors directly into the return on the availabilty investment and its benefit.But second, an underrecognized value is created by the adaptability of the system. The ability of a system to adapt to change is central to the value proposition for today's rapidly changing environments. Adaptability allows rapid deployment. Scaling to meet changing requirements and Reduction in waste due to oversizing.Understanding the value proposition gives rise to a vision of what data centers will need to be like in the future. Lets look at these three issues one by one, asking the question “what are the barriers to…..What are the barriers to improving value by 10x?
6 Your entitlement: Reduce downtime by 90% 10 XAvailabilityxAgilityValue=TCOEquipment ReliabilityMean-Time-To-RecoverHuman Error80% less50% lessAvailability is imperative for Network Critical Physical Infrastructure. We suggest that you have at least a 10X entitlement to improving availability. There are three key elements drving availability in today’s data center.Equipment reliability. APC and other vendors provide highly reliable equipment for NCPI. We believe at APC we make the most reliable equipment available, but that is a difficult claim for a customer to assess. However, our value proposition for improving business value does not depend on claiming improved reliability, and as you will see there are other factors that have an even more dramatic effect on availability than reliability does.MTTR is just as important a contributor to availability as reliability is. A piece of equipment that fails half as frequently is said to be twice as reliable, but if takes four times as long to repair is only half as available. At APC, we have taken this fact into consideration in order to provide modular swapable systems that can be repaired 10 times faster. In some cases the customer can self repair, which can save ANOTHER factor of 10 in repair time. This allows us to achieve dramatically higher availability and is an obvious benefit of our approach.A less obvious contributor to down time is Human Error. We believe our approach to NCPI can reduce human error by 50% or more. Lets look at this in more detail.Your entitlement: Reduce downtime by 90%
7 Case Study of Downtime Failures Consistent with Surveys which attribute 60% or more of failures to Human ErrorA large financial company kept records over an extended period for all of their mission-critical facilities to identify the causes of downtime. We are talking here about extremely sophisticated customers with state-of-the-art high availability installations.This pareto chart shows the breakdown of the root causes of downtime for these mission critical facilities.We believe this data is also representative of typical customer experience.Consider the root causes…As a component supplier, all I can do is make the left bar better. And of course we continue to make progress in this area. However, note that even if equipment never fails I can only reduce downtime 30%. The big opportunity for improvement is in the red bars: human error.
8 Standardisation enables improved availability StandardiseTo EliminateModularisePre-test systems in the factoryHot swappablePre-Integrate Power-Cooling-Rack Management & diagnosticsOne-Time EngineeringUnique training requirementsOn-site assemblyOn-site testingCustom programmingHere is what we need to do; take data center infrastructure and…..Can you specify the exact location of the headlight on your new car? Or the location of the dashboard knobs? (CLICK)Eliminate one time engineeringWe must design products thatAre pre-configuredDon’t need extra engineered infrastructure like raised floorsAre pluggableAre expandableAre reconfigurableAre scalableUse standardized management toolsPeople need to figure out that when they ask for something unique they are going to get unique problems to go along with it.Standardisation accelerates learning at all levels, reducing human error and increasing availability
9 Your entitlement: As much as 10 x the agility AvailabilityxAgilityValue=TCOSpeed of DeploymentAbility to ScaleAbility to Reconfigure80% less time100% easier100% easierThe ability to adapt to changing business requirements is essential in today’s evolving data center. We consider here three of the core capabilities that make NCPI agile and adaptable.Increasing the speed of deployment is an essential part of APC’s approach, which we will discuss in more detail.The inherent ability to scale is a basic property of APC’s modular approach to physical infrastructure. It enables rightsizing, simplifies planning, and reduces initial capital outlay. But more importantly, it allows you to react to changing business requirements without shutting down critical loads.The equipment in a typical data center changes out every 2 years, but a data center has an expected life of over 10 years. By using modular, hot swappable components, our approach provides the ability to reconfigure for different power density, for different levels of redundancy, and for different voltages and plug types. Without ripping apart you data center or forcing unplanned maintenance. Our approach can even facilitate relocation of a data center.Taken together, these benefits can deliver business value by making your NCPI over 10X more adaptable.ADD Flight Center Customer testimonialOne of our customers Flight Center is adding 15 sites a month on his network. At the same time they are consolidating all their IT in the Brisbane head office. Knowing they have 700 branches that is a very difficult mission.This customers has also availability needs of %.Do you think this customer could achieve the integration of 15 new site a month while delivering high availability without a scalable, flexible, reconfigurable manageable NCPI?Your entitlement: As much as 10 x the agility
10 Data Centre traditional design What if the user has a separate battery room?
11 What if……The Design Concept is built on standardised parts that can be scaled to meet the need?Construction Design incorporates a “Lego Block” approach of pre-engineered components?Construction is mainly done in a factory instead of on-site?Commissioning tests pre-integrated infrastructure systems.Working with our clients, APC has asked the following questions: What if……..The design concept is built around standardized parts that can be scaled to meet the need? Could we not begin to make the design simpler, more effective and faster to implement?What if the Construction Design employs a Lego Block approach? For example, what happens when the UPS system has the power distribution system integrated into the design from the start? Doesn’t this lead to a shorter lead time/What if the construction transfers fabrication from the site to the factory? For example, what if the electrical distribution system used pre-measured, pre-fabricated branch circuit wiring as opposed to piece by piece fabrication on site? Wouldn’t that be faster? Not only that, but a recent independent analysis shows that factory wiring is 15X more reliable than field wiring.And finally, what if the commissioning process tests pre-integrated systems as opposed to mismatched components trying to be systems? This would lead to a commissioning process that involves days, not weeks.
12 Your entitlement: As much 20% overall NCPI TCO AvailabilityxAgilityValue=20% lessTCOCapital CostElectricity CostsService Costs30% less10% less30% lessIf you are a typical customer, you are entitled to substantial advantages in TCO. The single biggest opportunity is thorugh scalability: Matching the Capacity to the Load to avoid over-investment in capital. I will discuss this in more detail in a moment.But in addition to capital costs, there is a large entitlement posible in Efficiency – Using less energy to meet the load requirements through operating efficiency and reducing the redundant module size. And as I showed earlier, electricity is the single largest category of TCO.In addition, there are significant opportunities for saving service costs. Fist, Maintainability – Products with predictive maintenance can reduce the need for programmed maintenance, second, Integration – System features that avoid the need for custom installation services. And Third, Ease of Installation – Reduced design effort, site time preparation and labor services.Oversizing is the dominant contributor to excess costs in all of these categories. Lets look at this subject further.Your entitlement: As much 20% overall NCPI TCO
13 Waste due to oversizing Oversizing data center infrastructure is a ubiquitous problem.This graph is a result of surveys we have done regarding the capacity utilization of actual data centers. Here is what we found.Users have an expected load which they determine through various estimation methods and pad for uncertainty. This requirement typically starts at some value and then grows by a factor of 3, reaching final value about halfway through the lifetimeThey provide a room capacity and an installed power and cooling capacity to meet the expected requirement.However, on average the actual load is much less than the planned load, because on average the systems don’t start out to be as large as was expected, and they don’t grow as rapidly as was expected.This results in considerable waste due to oversizing. This waste includes excess capital expense but also creates excess operating expense, including inefficiency. On average, over 50% of the installed capacity is wasted. This waste is caused by uncertainty.The answer cannot be to simply reduce plans, because remember this graph only represents the average, and there certainly are many cases where the actual load ends up equaling or exceeding the expected value. The risk of underbuilding data center capacity is real, and in the past the only practical way to avoid it is through padding the design and oversizing.Lets examine a case study….Reference: APC White paper #37
14 Availability x Agility Value = TCO The Business Value Equation for Network-Critical Physical InfrastructureAvailabilityxAgilityValue=TCOTo summarize: Any rational business must assess the value received for their investment in critical power infrastructure, and demand improvements in that value over time. In addition, there are very real immediate challenges related to power density and availability.CLICK Businesses have a goal for availability which they have determined is required for their enterprise. Achieving the entitlement here requires reducing human error through standardization and improving maintainability of equipment. But the overall value obtained while achieving higher availability has two additional key factors.CLICK Naturally, the total cost of owenership factors directly into the return on the availabilty investment and its benefit. Rightsizing is the key to controlling TCOCLICK But just as important, an underrecognized value is created by the adaptability of the system. The ability of a system to adapt to change is central to the value proposition for todays rapidly changing enviornments. Adaptability allows rapid deployment. Scaling to meet changing requirements and Reduction in waste due to oversizing. Even portability.Understanding the value proposition gives rise to a vision of what data centers will need to be like in the future. CLICK And the future is that you have an entitlement of over 10X the current value you are getting.CLICKAPC is working hard to help customer’s achieve that entitlement. And our response to that challenge is our InfrastruXure system.
15 Brayton International “ InfraStruXure™ gives control of the power to the IT professional. I can allocate the power the way I want to, without calling in an electrician.”INDUSTRYManufacturingAPPLICATIONNew data centreREQUIREMENTAdaptability and controlMarkus M. Hill, Director, Technology & Information ServicesHigh Point, NCUnited States
16 Olympus Corporation“ Above all we wanted to avoid having to spend a fortune on initial costs before the server room was even filled.”INDUSTRYTechnologyAPPLICATIONNew server roomREQUIREMENTScalabilityMasashi AsanoAssistant ManagerInformation Systems DepartmentTokyo, Japan
17 ADCS Inc“ With InfraStruXure™, the total cost of ownership was much lower. This was the deciding factor.”INDUSTRYFinanceAPPLICATIONNew data centerREQUIREMENTHigher availability with lower total cost of ownershipCliff RittelDirector of ITPoway, CAUnited States
18 Avon Old Farms School“ With only two systems people to support the entire campus, it’s an immense help to have such an easy-to-manage solution as InfraStruXure™.”INDUSTRYEducationAPPLICATIONNew data CentreDeb CaseDirector of Information TechnologyAvon, CTREQUIREMENTAbility to manage with limited personnel
19 Just Some Australian Customers TelstraToyotaWoolworthsLend LeaseTenix DefenseCommonwealth BankGeneral Motors Holden