A new report demonstrating how the July 2022 heatwave led to unplanned NHS data center outages underlines the need to review existing continuity plans with weather in mind.

The NHS report found that the hotter weather compromised cooling systems at two facilities that provided backup to one another, disrupting digital services at London hospitals to the cost of £1.4m.

It also found that the incident could have been prevented if cooling systems had been adequately prepared for spiking temperatures.

Additionally, though concerns about the facility’s utility provision had been raised as early as 2018, funding requests to replace the systems were not approved.

In this article, Billy Durie, global sector head for data centers at Aggreko, highlights how different strategies to equipment procurement during crises could prevent further incidents.

“The fact issues with these cooling systems were pointed out in advance demonstrates that identifying potential concerns is not an issue here. Instead, where concerns arise is in what needs to be done, and what course of action can be taken within existing continuity plans.

“Namely, in an environment where there are delays to signing off capital expenditure on new, permanent equipment, there must be other contingencies available to guard against potential downtime.

“These backup plans need to be independent of existing infrastructure, which is coming under increasing pressure as extreme weather events such as last July’s heatwave become more common.”

WP.23.Aggreko.UptimeOnTheLine.jpg

Uptime on the Line?

Opportunities for power and temperature control based on research insights from 700 European data center consultants

When tackling issues caused by rising temperatures, Durie believes a rapid response is required to ensure problems caused by unplanned downtime are minimized. Specifically, he is imploring facility stakeholders to explore rapid response temporary hire options that can be used in emergencies such as the one back in July.

“When operating data centers, any unplanned downtime is rightly seen as unacceptable. However, the worst can always occur, and when it does, every minute spent offline can lead to snowballing costs and reputational damage.

“Consequently, getting emergency utility provision to site as quickly as possible is vital, as demonstrated in this incident, where full operations were not restored for six weeks with considerable disruption for patients and staff.

“Data center managers should therefore ensure they are fully aware of the options available, including pre-emptive loadbank testing, and work with equipment suppliers that can provide rapid response remedial services as an ongoing OpEx cost. By doing so, they can reduce facility downtime while also avoiding the CapEx constraints that indirectly led to this incident.”