Pingdom Home

US + international: +1-212-796-6890

SE + international: +46-21-480-0920

Business hours 3 am-11:30 am EST (Mon-Fri).

Pingdom Blog

Royal Pingdom

Ramblings from the Pingdom team about the Internet and web tech

RSS Feed

Gmail could be unavailable for more than 21 hours in a day, and Google could still tell you that according to their SLA, the service has had 100% uptime.

It sounds impossible, but it’s a direct consequence of how Google has written its SLA for Google Apps (which includes Gmail, Google Docs, Google Calendar and more). We will explain this in detail further down, but let’s first look at what the SLA actually says.

What the Google Apps SLA says

Here are the key parts, quoted from the Google Apps SLA, emphasis added by us:

“Downtime Period” means, for a domain, a period of ten consecutive minutes of Downtime. Intermittent Downtime for a period of less than ten minutes will not be counted towards any Downtime Periods.

[...]

“Monthly Uptime Percentage” means total number of minutes in a calendar month minus the number of minutes of Downtime suffered from all Downtime Periods in a calendar month, divided by the total number of minutes in a calendar month.

So, “downtime periods” are what’s ultimately used for counting the uptime percentage for Google Apps, and these downtime periods ignore all downtime that lasts less than 10 minutes.

A worst-case scenario

Now back to our initial statement. How does the SLA make it possible to have more than 21 hours of downtime in a day and yet Google would call it 100% uptime?

Here is the problem: What if Google Apps was down for 9 minutes, up for 1 minute, down 9 minutes, etc. That would mean 54 minutes of downtime each hour, but Google still wouldn’t count it because none of the individual downtimes lasted 10 minutes of more.

Over a day (24 hours), that’s 21 hours and 36 minutes of downtime that Google would simply ignore when calculating the final uptime percentage.


Above: Red is downtime, green is uptime. Note that none of the downtime periods above last 10 minutes or more and thus are not counted according to the Google Apps SLA.

It’s an extreme and very unlikely worst-case scenario, but we wanted to illustrate the consequence of how Google’s SLA sums up its downtime and calculates its uptime percentage.

A more likely scenario

Now let’s take a much more likely example of intermittent problems:

3m, 8m, 12m, 5m, 9m, 14m, 4m = 57 minutes of actual downtime

But Google would only count this as 26 minutes of downtime, including only the downtime periods lasting 12 and 14 minutes.


Above: In this scenario, only the downtime periods lasting 12 and 14 minutes (marked with yellow) would be counted according to the Google Apps SLA.

Short outages are common in the real world

The problem with the Google Apps SLA is that short outages, less than 10 minutes in length, are actually very common in the real world.

As you may know, we here at Pingdom run an uptime monitoring service, and we know from our own experience (and a LOT of data from thousands of sites) that it’s much more common for sites to have multiple short intervals of downtime instead of a few long ones.

The 99.9% monthly uptime guarantee in the Google Apps SLA allows for 43 minutes of downtime in a 30-day month, but ignoring problems that last less than 10 minutes at a time will definitely make it much easier for Google to honor its SLA.

Want to test your site every minute?








You will get an email with your login information.

12 Comments

Actually it is even more interesting, if they monitor by the second, which I’m sure they do.

9 Mins and 59 seconds would not count as downtime, so you can have 9:59 mins down and 1 second up, which would give you basically about 2 minutes of uptime daily.

As Super Bowl 46 is approaching, fans will flock to the Lucas Oil Stadium in Indianapolis, Indiana, and to TV sets around the world to follow the New York Giants battle it out with the New England Patriots.

Kickoff is scheduled for 6:30EST on Sunday, February 5, and we’re already monitoring Superbowl.com to see how the site will handle the event.

What team will win Super Bowl 46? How will the site cope? We can only wait to find out.

Read more

Weekend must-read articles #2

Every Friday we bring you a collection of links to places on the web that we find particularly newsworthy, interesting, entertaining, and topical. We try to focus on some particular area or topic each week, but in general we will cover Internet, web development, networking, performance, and other geeky topics.h

This week we bring you a collection of articles focusing on cloud, with a few other topics thrown in to boot.

Read more

Out of the 59 US-based e-commerce sites we monitored during the holiday season last year 28 scored a perfect 100% uptime for December.

Whether this helped spur on the booming sales in the US, we don’t know, but retail e-commerce spending in the US reached $37.2 billion for the November to December 2011 period. That was an increase of 15% from the same period in 2010.

We decided to dig into the numbers for these e-commerce sites to see how well they did in terms of uptime and performance. After massaging the data coming from our Pingdom probes, it turns out that the sites overall performed well during December 2011 in terms of uptime, but response time was an issue for several sites.

Read more

Pingdom Podcast #5

Pingdom’s Mobile Podcast is a weekly show about Internet, web, and mobile stuff.

In this show, Saleh also gives us an update on the pending submission of his Carbon for Windows Phone Twitter client. We’re also joined by Mario Lurig, who talks about using Amazon S3 and Cloudfront to speed up a website.

Read more

Want to be able to download a DVD worth of data in about 38 minutes? It may not seem very impressive, but that’s with the average Internet speed in South Korea, according to the latest “State of the Internet” report by Akamai.

Covering Q3 2011, the report again puts South Korea at the top of the list of countries with the fastest Internet connections. The country scored an average connection speed of 16.7 Mbps in Q3 2011.

Read more