Why Predictive modeling is essential for managing a modern Data Center computing facility
William Dougherty is VP of Information Technology for RagingWire Data Centers, with over 15 years designing, building, securing and operating high-availability computing systems. He tweets regularly on industry issues @bdognet.
Tour any three data centers and you’ll be left scratching your head trying to differentiate between them. As a result, price and proximity become the primary decision points in an otherwise seemingly level playing field. However, there are crucial differences between data center providers that can drive up both your costs and your IT downtime risk. Ask the following questions of your potential data center provider and you will keep your costs and your downtime risk as low as possible.
Which components of the data center facility are both concurrently maintainable and fault tolerant?
Many data centers claim to be N+1 or N+2 redundant. Sounds great! But data center facilities have to maintain their equipment, too. The question you need to ask is, what impact does concurrent maintenance have on the fault tolerance of the data center? Pose the following scenario on your facility tour: generator #1, UPS #2, and CRAH unit #3 are unavailable due to maintenance; the facility loses utility power and generator #4, UPS #5 and CRAH unit #6 all fail to operate. What happens to their customers?
Your critical IT infrastructure operates in a world where utility outages or equipment failures happen. In the above case, N+1 redundancy won’t protect you. Your IT infrastructure needs isolation from multiple simultaneous events (N+2 or 2N redundancy at every level).
What are the average and maximum power densities of the facility on a watts per square foot AND watts per cabinet basis?
About 10 years ago, the first data centers were designed for much lower rack power densities than are required today. As a work-around, some facilities space cabinets farther apart (while charging you more) to accommodate higher-density clients. Most data centers are built to support an average of 100W – 175W per square foot with more modern data centers supporting an average of 225W per square foot and scalable on an individual basis up to 400W per square foot.
Cabinet power densities are equally important. Ten years ago, a 2kW cabinet was sufficient to power a full 42U of x86 servers. With today’s multi-core, high density blade servers, 8kW – 10kW is the new rack power minimum. Expect your required power density to climb and make sure your data center has the infrastructure to grow with you.
How often does the data center load test its generators?
Fuel consumption and expensive test equipment makes load testing generators a costly maintenance item. Sometimes, in lieu of regular load testing, data centers will use unexpected utility power outages as a way to “load test” their generators on client IT loads. If they aren’t regularly load testing, then they are likely to only identify generator problems when utility power fails, which is precisely the wrong time to find an issue. Ask your data center provider if they put every generator on an extended load test – not just a start-up – at least quarterly.
What are the highest risk natural disasters for the area, and what has the data center done to mitigate their impact?
Every data center is subject to natural disasters, but some are more vulnerable than others. The Uptime Institute has published an excellent study of composite risk from earthquakes, hurricanes, tornadoes, snow and other disasters. There are relatively few regions in the U.S. that fit in the low-risk zones. Another scenario: the data center survives a massive earthquake, but utility power is still out with no estimated repair timeline. How long can the facility’s generators run on normal on-hand fuel supply and how many suppliers are contracted for refuel services? It is critically important that you a) understand what disaster scenarios are likely for the facility; and b) work with your provider to make contingency plans based on likely risks.
What are the minimum skill sets of the remote hands and eyes staff?
It is an absolute certainty that at some point your equipment will need to be physically serviced. You can either drive to the data center or use their remote hands and eyes services. Some data centers cut corners by using security guards to provide remote services. You want to make sure the remote hands staff provided by the data center consists of true IT professionals. Ask for minimum job requirements and speak with the service manager so that you know who is answering the phone at 2 AM. Quality remote hands staff can reduce the relative importance of proximity in your decision making process.
What certifications has the data center earned, and do they undergo annual audits to maintain them?
SSAE 16, PCI DSS, LEED Gold, Energy Star, FISMA, HIPAA, SCIF, Tier IV. Each standard is a useful tool in differentiating your data center choices. If you process credit cards, you want your upstream providers to support your PCI compliance by maintaining their own PCI ROC. If you are a financial organization, you need your data center to be SSAE 16. If your company is environmentally conscious, especially if you purchase carbon credits to offset your power consumption, you want a data center with EPA ENERGY STAR and LEED Gold certifications. Ask your data center for proof of their certifications. This information is invaluable because it represents an independent analysis of the facility’s quality, reliability and security.
Bonus question #7 – Does the physical security include sharks with frickin’ laser beams attached to their heads?
Why? Because it would be cool if it did. Look under the raised floor sometime. Maybe there’s something interesting down there…
Cloud computing could lead to billions in energy savings http://bit.ly/nXHTSg
GoDaddy Bolsters Cloud Computing Competition with Data Center On Demand http://bit.ly/lFrCP8
Cloud Computing’s Tipping Point http://bit.ly/i9j5Ui
HP signs cloud computing agreement with Centrica http://bit.ly/iff4Qo