
Infrastructure is hard. Modern businesses rely on payment processing, paycheck generation, marketing analytics, and other SaaS tools, and they trust those tools not to fail when they’re needed most. For example, Amazon’s search went down for a few hours recently, and even the most conservative estimates say it cost them tens of millions of dollars in revenue.

How can companies guarantee that their most-needed infrastructure stays up, or at least get insurance for when it inevitably goes down? That’s the idea behind a Service Level Agreement (SLA), an insurance policy that kicks in when the service you paid for doesn’t operate as advertised.

In general, the more critical a service is to a company’s operations, the stronger the SLA, or promise of continual operation from the provider, will need to be to satisfy the customers’ worries.

As a SaaS provider and a customer of several external services for both our search platform and our operations, we’ve seen more than our fair share of SLAs. As we evolved and improved our own SLA over the past few years – providing an increasingly strong and transparent promise to our customers – we began going through other companies’ SLAs with a fine-toothed comb. As it turns out, they’re not all created equal.

Busting the myth of the 100% SLA

In recent years it’s become fashionable for companies to include 100% uptime guarantees in their SLA – and, in some cases, even more than 100%, despite its mathematical impossibility.

Now, don’t get us wrong – all service providers have an obligation to put 100% of their effort into keeping their service running like a well-oiled machine; however, the detection of an outage itself can sometimes even be impossible… until it’s too late, of course. Being a SaaS provider on the internet implies dozens of dependencies on intermediary devices and networks, which themselves have downtime. When you promise 100% uptime, every millisecond of downtime counts – however, what if you can’t detect the outage? How can one tell if the issue comes from your connection dropping data, the service provider, or any of the dozens of intermediaries in between? To resolve this issue, SLAs define a minimum outage necessary in order to be triggered. The market standard is typically 1 minute; however, 1 minute of downtime per month means 99.9977% uptime – so what exactly is 100% uptime then?
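The arithmetic behind that figure is easy to check. Here is a minimal sketch, assuming a 30-day billing month (43,200 minutes); the function names are illustrative and not part of any Algolia tooling:

```python
from datetime import timedelta

# Assumption: a billing month is treated as 30 days (43,200 minutes).
MONTH = timedelta(days=30)


def uptime_percent(downtime: timedelta, period: timedelta = MONTH) -> float:
    """Uptime over the period, as a percentage, for a given total downtime."""
    return 100.0 * (1.0 - downtime / period)


def downtime_budget(uptime_pct: float, period: timedelta = MONTH) -> timedelta:
    """Longest total downtime that still meets the given uptime percentage."""
    return period * (1.0 - uptime_pct / 100.0)


# One minute of downtime in a 30-day month:
print(f"{uptime_percent(timedelta(minutes=1)):.4f}%")  # prints "99.9977%"
```

The same helpers work in the other direction: `downtime_budget` converts any advertised uptime percentage into the amount of downtime it quietly allows.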

One SaaS provider on the market today promises 100% uptime, yet their SLA only promises any refunds or credits after 0.05% downtime per month, which is a little over 20 minutes! Their service could go down for 19 minutes, which could take down your site and cause you to lose revenue, but they wouldn’t be responsible for compensating you for any of that.

We knew we could do better than this.

How we did better

When we set out to design our SLA, we had three goals:

  1. Make it simple – it needs to be understood by our users, and it’d hardly be fair to expect them to take in something worded like a legal document.
  2. Make it transparent – no one wants surprises, especially in the already stressful situation of their services not working.
  3. Trust our platform – we trust the system we built, and we want an SLA that speaks to that trust.

At Algolia, we currently have two different setups for our customers:

  • Enterprise: we replicate your search on at least three different machines hosted by two different providers in different data centers and autonomous systems.
  • Premium: we replicate your search on at least three different machines hosted by three different providers in three different data centers with three autonomous systems using at least two different Tier1 upstream providers.

These setups differ not just on paper but in their underlying infrastructure, and each comes with its own SLA:

  • Enterprise: 99.99% uptime. Each minute of downtime makes you eligible for a refund equal to 100 minutes of service, up to a cumulative value of 100% of the monthly service billing.
  • Premium: 99.999% uptime. Each minute of downtime makes you eligible for a refund equal to 1,000 minutes of service, up to a cumulative value of 600% of the monthly service billing over a year.

Our outage detection starts at 30 seconds (0.001% of a month) instead of 1 minute. This is too granular to measure with traditional monitoring architecture, so we built our own monitoring network that continuously monitors our API infrastructure, which gives us a fairly unique ability to detect downtime this fast.
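To make the 30-second threshold concrete, here is a minimal sketch of how a run of failed health checks could be turned into a reportable outage. It is an illustration under stated assumptions, not a description of Algolia's monitoring network: the 10-second probe interval, the sample data, and the `find_outages` helper are all hypothetical.

```python
from datetime import datetime, timedelta
from typing import Iterable, List, Tuple

Probe = Tuple[datetime, bool]       # (time of the check, did the API answer?)
Outage = Tuple[datetime, datetime]  # (first failed check, first check that succeeded again)


def find_outages(probes: Iterable[Probe],
                 min_outage: timedelta = timedelta(seconds=30)) -> List[Outage]:
    """Return failure spans lasting at least `min_outage`.

    Probes must be ordered by time. A failure still in progress when the
    input ends is not reported, because its duration is not yet known.
    """
    outages: List[Outage] = []
    failing_since = None
    for ts, ok in probes:
        if not ok and failing_since is None:
            failing_since = ts  # first failed check of a new span
        elif ok and failing_since is not None:
            if ts - failing_since >= min_outage:
                outages.append((failing_since, ts))
            failing_since = None  # service recovered
    return outages


# Hypothetical data: one check every 10 seconds.
start = datetime(2024, 1, 1)
results = [True, True, False, False, False, False, True, True, False, True]
probes = [(start + timedelta(seconds=10 * i), ok) for i, ok in enumerate(results)]

for began, ended in find_outages(probes):
    print(began.time(), "->", ended.time(), ended - began)
```

In this sample the 40-second failure span is reported, while the single failed check near the end (a 10-second blip) stays below the threshold and is ignored.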

Here’s what our refund policy looks like in practice:

Search downtime   Total refund of the monthly service bill
                  Enterprise SLA   Premium SLA
30 seconds        0%               1%
1 minute          0%               2.3%
5 minutes         1%               11.6%
30 minutes        7%               70%
45 minutes        10%              100%
1 hour            13.8%            138%
2 hours           27.7%            277%
4 hours           55.5%            555%
8 hours           100%             600%
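The figures in the table can be approximated from the two SLA definitions. The sketch below is one reading of those rules, not the contractual formula. It assumes a 30-day month, that no refund is due while downtime stays within the uptime target, that all downtime counts once the target is missed, and that the caps (including Premium's 600%, which the SLA states over a year) apply directly. The published numbers appear to be rounded, so results match the table only approximately.

```python
MINUTES_PER_MONTH = 30 * 24 * 60  # assumption: 30-day billing month

# plan -> (uptime target %, refund minutes per downtime minute, cap as % of monthly bill)
PLANS = {
    "enterprise": (99.99, 100, 100.0),
    "premium": (99.999, 1000, 600.0),
}


def refund_percent(plan: str, downtime_minutes: float) -> float:
    """Approximate refund, as a percentage of the monthly bill."""
    uptime_target, multiplier, cap = PLANS[plan]
    allowed = MINUTES_PER_MONTH * (1 - uptime_target / 100)
    if downtime_minutes <= allowed:
        return 0.0  # still within the promised uptime
    refund = 100.0 * downtime_minutes * multiplier / MINUTES_PER_MONTH
    return min(refund, cap)


for minutes in (0.5, 1, 5, 30, 45, 60, 120, 240, 480):
    print(f"{minutes:>5} min  "
          f"enterprise {refund_percent('enterprise', minutes):6.1f}%  "
          f"premium {refund_percent('premium', minutes):6.1f}%")
```

Under these assumptions, the Enterprise plan pays nothing for the first 4.32 minutes of monthly downtime, while the Premium allowance is about 26 seconds, which is why a 30-second outage already triggers a Premium refund.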

As you can see, with our Premium SLA, if our service is down for 45 minutes, we refund you 100% of your monthly bill – it doesn’t get much simpler than that.

Is an SLA just SaaS insurance?

Most people don’t see an SLA as much more than a form of SaaS insurance – at Algolia, we see it as something much greater: a way to remind our customers of our reliability. We back our Premium SLA with our reinforced infrastructure, and our goal is to provide the best service in the market – we don’t want downtime any more than you do, and we put our money where our mouth is. We incentivize ourselves to do everything possible to keep the probability of an outage as close to zero as possible!

It has been a year since we introduced our three-provider setup and, with it, we’ve been able to allay the worries of even the most cautious customers. Our setup has been extensively tested by outages of entire data centers and networks, and we’ve still been able to maintain 100% uptime.

To the best of our knowledge, our Premium SLA is unique in the market in terms of simplicity, transparency, and refund guarantee. We’d love to tell you more about it if you have any questions or would like to see how your current SLA stacks up against ours!

About the author
Adam Surak

Director of Infrastructure & Security @ Algolia
