Posted in Main on March 20th, 2009 by Pingdom
Digg had an outage today that was most likely the result of the ongoing migration of Digg’s servers to a new data center.
That didn’t stop people from wondering what was going on when the site went offline, as you could see on Twitter, for example.
Posted in Main on March 19th, 2009 by Pingdom

One of the most recent software errors with widely noticed consequences was Google’s Gmail outage in February. The problem in that case was, according to Google, a bug in the software that distributed load between its different data centers.
The Gmail outage only resulted in people not having access to their email for a few hours. No one got killed. Nothing exploded. It was an inconvenience, and while it was a significant inconvenience for some of Gmail’s users, it was still just that: an inconvenience.
This article is about some of the more dire consequences of software errors through the years. Incidents that make the Gmail outage seem rather trivial.
Posted in Main on March 10th, 2009 by Pingdom
Last week the BitTorrent site Mininova was hit by a large-scale DDoS attack that caused a total of 14 hours of downtime. Regardless of what you think about torrent sites, this was an interesting example of how a website can be incapacitated by a DDoS attack.
We chose this example to illustrate the effect of a DDoS attack because Mininova shared some relevant information about the attack, especially a very telling traffic graph from their Internet connection. This, coupled with some Pingdom monitoring data, gave us a chance to look closely at the effects of a DDoS attack.
Posted in Main on March 4th, 2009 by Pingdom
Sooner or later, every site or application will fail. However, the consequences depend not only on how the failure is managed but also on how it is communicated. Recently, the web hosting company Media Temple and even Google have illustrated well how hard it is for modern connected organizations to respond quickly enough to system outages. Here’s a suggested crisis checklist, along with notes on the difficulties of always following it.
Posted in Main on February 25th, 2009 by Pingdom
If you missed that Google had a 2.5-hour Gmail outage yesterday, you were probably hiding under a rock, or possibly in one of those sensory deprivation chambers. Every major tech blog and news outlet was on it (not to mention Twitter users).
It was night-time in the US, which limited the impact there, but the rest of the world wasn’t so lucky. For example, in Europe the outage started at 9:30 in the morning.
Google has now put a number on how much the potential productivity loss for Gmail users was worth.
Posted in Main on February 16th, 2009 by Pingdom
You may remember the incident Google had on January 31, when for 55 minutes it accidentally flagged every URL containing “/” as a potential malware site. This meant that every single site on the Internet was marked as harmful, including Google.com.
This is a look at some interesting side effects of that incident.
Posted in Outages on February 10th, 2009 by Pingdom
The ever-popular Slashdot was unreachable for over an hour yesterday evening due to massive amounts of traffic hitting its network. Normally, Slashdot is known for bringing other sites down with the traffic it generates (the so-called Slashdot effect, or slashdotting).
Posted in Main on February 3rd, 2009 by Pingdom
Yesterday a Twitter post (a tweet) by Mashable’s Pete Cashmore became so popular that traffic from Twitter crashed a blog. This sounds very similar to a common social media phenomenon originally known as the Slashdot effect (and later also the Digg effect), where a post on a popular social media site pushes more traffic than the target site can handle.
An interesting thing here is the mechanics of Twitter, which is fundamentally different from Digg and Slashdot. It’s not a social news site with a front page that all visitors go to. We won’t go into the details of how Twitter works (that’s better covered elsewhere), but it’s worth noting that it’s a very different beast. These will be interesting times if Twitter is about to join the ranks of Slashdot and Digg as a potential “site crasher”.
For lack of a better term, we will call the phenomenon of sites crashing as a result of traffic from Twitter “the Twitter Effect”. (Or perhaps “the Tweet effect” would be catchier…?)
But now on to the big question: How could a single tweet generate that much traffic?
Posted in Main on February 2nd, 2009 by Pingdom

Both the video-sharing site Revver and the personalized start page service Pageflakes have been down since last Thursday, January 29. As of this writing, that is more than three-and-a-half days of straight downtime.
Our monitoring shows that both sites went offline soon after 9 p.m. CET (3 p.m. US EST).
The connection between the two? Both are owned by Live Universe, whose site is also unavailable.
The outage is apparently not supposed to be permanent, but something has definitely gone very wrong. Last Friday Live Universe told CNET that the sites would be back within a few hours.
Posted in Outages on January 27th, 2009 by Pingdom
President Barack Obama is said to be significantly more tech-savvy than his predecessor, so one might wonder how he felt when the White House mail server suffered a meltdown yesterday, leaving his entire press office without access to the official White House email addresses for much of the day.