30
Jun

picture-1114As many of you know, a lot of the sites that use Rackspace as their hosting provider were down for about an hour yesterday. That’s because Rackspace went down. Apparently, it was a power outage at a data center that caused it, an incident report that we’ve obtained explains.

While Rackspace has backup systems in place, a series of event apparently caused those backups to fail, resulting in the servers going down. Here’s the key nugget:

The breaker on the primary utility feeder tripped, initiating a sequence of events that ultimately caused a power interruption in Phase I and Phase II of the data center. All systems initially came up on generator power without customer impact. The ‘A’ bank of generators, which support UPS clusters A and B in Phase I and UPS cluster E in Phase II, then experienced excitation failure which escalated to the point where the generators were no longer able to maintain the electrical load. Rackspace then attempted to switch to our secondary utility feeder, but was unable to do so due to an issue in the Pad Mounted Switch (PMS). At approximately 3:15pm CDT, power supply through UPS clusters A, B and E was lost when the batteries in those clusters discharged, and equipment receiving power through those clusters experienced an interruption in service.

The service says only one of its nine servers were affected by this failure, but main high profile sites collapsed as a result, including EventBrite, Justin Timberlake’s site and Michelle Malkin’s popular political blog. As Rackspace noted yesterday that “We owe better, and will deliver.”

Below, find the full incident report.

picture-1011

Crunch Network: CrunchGear drool over the sexiest new gadgets and hardware.


Related posts:

  1. What Went Down At Rackspace Yesterday? A Power Outage And Some Backup Failures. As many of you know, a lot of the sites...
  2. Someone Needs To Stop Tripping Over The Power Cord At Rackspace As much of the web seemed to notice this morning,...
  3. Rackspace Goes Down. Again. Takes The Internet With It. Again. Another day, another Rackspace outage. The hosting company had a...
  4. MySQL.com Down Due to Massive Power Outage As of July 22nd, all MySQL Web services have become...
  5. Palo Alto Power Outage Affects Up To 240 Startups The tragic plane crash which killed three Tesla employees flying...

Related posts brought to you by Yet Another Related Posts Plugin.

Comments are closed.