Looking for something?


We use marketplaces every day. Whether looking up new apps for our phones and computers, or while doing a little shopping on Etsy or Amazon, we’re always looking for something.

By design, marketplaces are built to deal with millions of objects, which makes search the critical element of the shopper’s experience. Navigating through that massive number of products should be made as easy and intuitive as possible.

But search is complex. Relevant search is far more complex. On top of that, you only have the few dozen words that describe each object to work with, and you still need to always return first the most relevant results your demanding users are looking for.

Relevancy is more than just relevant versus irrelevant: it’s all the gray zone in between. Without adequate search engine technology, search results are not always as relevant as you would like them to be.

 

How do you implement the best marketplace search ever? We’re giving you the whole recipe and an open-source project to implement it.

Check out the GitHub repository

 

The Psychology of Product Search

What your shoppers expect

Your end-users are used to the Google search bar: they want to find what they are searching for however they write their queries and regardless of the number of typing mistakes. They want to find it on the first results page, within the top 3 results.

It turns out that searching that kind of unstructured (free text) content well cannot be taken for granted. Most of the time, the objects your end-users are searching for are described by a 3-4 word title and a short description.

What your publishers/sellers dream of

On the other side of the fence, publishers and sellers will do everything they can to appear on the first search results page. Sometimes they will even come close to the kinds of SEO techniques spammers use on Google to game its ranking algorithm.

Your search engine must be able to work around these hacks and keep returning relevant results, regardless of how the publishers name their products.

What You Need to Do

1. Deal with user-generated content

Even if most marketplaces have strict rules and guidelines on object titles & descriptions, you will need to deal with edge-case submissions that might respect the rules but that will trick your search algorithm with SEO hacks.

One trick in the book consists of injecting trendy keywords into the object title and/or description. For instance, adding “Facebook” to an object title would make the search engine retrieve that object every time the word “facebook” is queried, even if the underlying object has nothing to do with it.

The number of times a query word matches is also often taken into account. But what if a query word matches multiple times in the object description? Is that better than a single match? That is typically a question you need to answer. Imagine all the Apple accessory resellers who will rush to add long lists of compatible devices (“iphone, iphone 3G, iphone 4, iphone 4S, iphone 5, iphone 5S, iphone 5C, iphone 6, iphone 6”) just to trigger a “very relevant” match on any “iphone” query. That is probably not something you want, at least not ranked before the actual iPhones.

Algolia doesn’t give more importance to records that match several times than to records that match once. Instead:

  • Algolia ranks the hits depending on the matching attribute weight (more important first),
  • Algolia (optionally) ranks hits based on the position of the matching word in the attribute (considering the “iPhone 6” more important than “Leather case for iPhone 6” for the “iphone” query),
  • Algolia requires all query words to match by default and falls back to treating all words as optional if there are no results. Hits matching more words are then ranked before others.
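
The three rules above can be illustrated with a small, self-contained sketch. This is not Algolia’s implementation: the `Record` class, the `SEARCHABLE_ATTRIBUTES` list, the `match_key` and `search` functions, and the relative order of the three criteria are all assumptions made for illustration. For simplicity it also treats every query word as optional from the start instead of falling back only when nothing matches.

```python
from dataclasses import dataclass

# Hypothetical attribute order: attributes listed first weigh more.
SEARCHABLE_ATTRIBUTES = ["title", "description"]


@dataclass
class Record:
    title: str
    description: str = ""


def match_key(record, query):
    """Return a sort key for `record`, or None if no query word matches."""
    query_words = query.lower().split()
    matched = set()
    best = None  # (attribute index, word position) of the best match
    for attr_index, attr in enumerate(SEARCHABLE_ATTRIBUTES):
        words = getattr(record, attr).lower().replace(",", " ").split()
        for position, word in enumerate(words):
            if word in query_words:
                matched.add(word)
                if best is None or (attr_index, position) < best:
                    best = (attr_index, position)
    if best is None:
        return None
    # More matched query words first, then attribute weight, then position.
    # How many times a word repeats is deliberately ignored.
    return (-len(matched), best[0], best[1])


def search(records, query):
    keyed = [(match_key(r, query), r) for r in records]
    hits = [(key, r) for key, r in keyed if key is not None]
    return [r for key, r in sorted(hits, key=lambda pair: pair[0])]


records = [
    Record("Cheap cable", "iphone, iphone 3G, iphone 4, iphone 4S"),
    Record("Leather case for iPhone 6"),
    Record("iPhone 6"),
]
for hit in search(records, "iphone"):
    print(hit.title)
```

In this sketch the keyword-stuffed description ends up last: it matches only in the less important attribute, and repeating “iphone” four times earns it nothing.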

Searchable Attributes

2. Embrace typos

Your users will make typing mistakes. A lot. And even more as the volume of mobile searches keeps growing. Handling them is all the more challenging because typo tolerance also has to work with find-as-you-type search, which retrieves objects before the query is even completed.

Algolia natively supports typing mistakes and as-you-type searches. Hits with fewer typos are ranked before others. The highlighting feature still works, helping your users understand where the match occurred.
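
To make “fewer typos first” concrete, here is a minimal sketch based on the classic Levenshtein edit distance, combined with prefix matching to mimic as-you-type search. It is a generic illustration, not a description of how Algolia computes typos internally; the function names and the `max_typos` limit of 2 are assumptions.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: insertions, deletions and substitutions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def typos_for_prefix(query: str, word: str) -> int:
    """Fewest typos between the query and any prefix of `word`."""
    return min(edit_distance(query, word[:n]) for n in range(len(word) + 1))


def rank_by_typos(query, titles, max_typos=2):
    scored = []
    for title in titles:
        typos = min(typos_for_prefix(query.lower(), word)
                    for word in title.lower().split())
        if typos <= max_typos:
            scored.append((typos, title))
    # Stable sort: titles with the same number of typos keep their order.
    return [title for typos, title in sorted(scored, key=lambda pair: pair[0])]


print(rank_by_typos("rihana", ["Rihanna Greatest Hits", "Rihana Cover Band", "Madonna"]))
```

With these inputs the exact match “Rihana Cover Band” (0 typos) comes before “Rihanna Greatest Hits” (1 typo), and “Madonna” is dropped for being too far from the query.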

That said, when I search for “rihana” (misspelled, with a single ‘n’), I still want to see the popular “rihanna” objects first. That’s tricky, because it means that even if some objects match the misspelled “rihana” exactly, you want to see the real “rihanna” first.

To implement such a ranking strategy, your ranking formula must be able to consider the popular objects separately, and apply the actual sorting twice:

  • first display the popular objects, comparing them against each other to show the most relevant popular hit first,
  • then display the other objects, comparing them against each other to show the most relevant non-popular hit first.

An eCommerce website could apply the same type of formula for discounted and featured products.

Algolia’s ranking algorithm (described in “Algolia’s Ranking Algorithm Unveiled”) is the ideal solution for such a ranking strategy: it compares one ranking criterion after another, moving to the next criterion only if the results are tied.

To deal with such “uber popular” objects, you would tag your objects with a “popular” flag and inject it into Algolia’s ranking formula. Putting that flag as the most important ranking criterion will always retrieve “uber popular” hits first (whatever their number of typos) and the non-popular ones after.
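
A tie-breaking comparison of this kind maps naturally onto tuple sorting: tuples are compared element by element, and the next element is consulted only when the previous ones are equal. The sketch below is illustrative only; the hits, field names, and download counts are made up, and the typo counts are assumed to have been computed already.

```python
# Hypothetical hits for the query "rihana"; all values are invented.
hits = [
    {"name": "Rihana (tribute band)", "popular": 0, "typos": 0, "downloads": 120},
    {"name": "Rihanna", "popular": 1, "typos": 1, "downloads": 90_000},
    {"name": "Rihanna Remixes", "popular": 0, "typos": 1, "downloads": 4_000},
    {"name": "Rihanna Live", "popular": 1, "typos": 1, "downloads": 250_000},
]


def ranking_key(hit):
    # 1. popular flag (1 before 0), 2. fewer typos, 3. more downloads.
    return (-hit["popular"], hit["typos"], -hit["downloads"])


ranked = sorted(hits, key=ranking_key)
for hit in ranked:
    print(hit["name"])
```

Both popular hits come first despite their typo, ordered between themselves by downloads. Only then do the non-popular hits appear, with the exact match ahead of the one containing a typo.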

Algolia's Ranking Formula

3. Redefine Popularity

There may be several business metrics you want to use to refine your ranking algorithm. Those metrics probably include:

  • the number of rating stars (out of 5),
  • the number of reviews, downloads or installations
  • the best selling products ($ales)
  • etc.

The more, the better. But you eventually need to deal with the mathematical formula ruling them all \o/ Not to mention that you also need to consider text relevance.

So what if the average rating of an object is 5/5 based on 3 reviews, compared to another object that averages 4.3/5 based on 1000 reviews?

Algolia doesn’t combine your ranking criteria automatically because that’s really too business-specific: there isn’t any generic way to mix those business metrics.

In Algolia’s default ranking formula, the business metrics are only used to compare hits that match equally well from a text-relevance point of view. The goal is first to display results that match the user’s query words and then, if several matches are found, to sort them based on that business data.

Custom Ranking

If you’re dealing with ratings and number of reviews, you should take a look at the Bayesian Average.
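
As a rough illustration, a Bayesian average pulls each object’s rating toward a prior (typically the average rating across the whole catalog), and the pull weakens as the number of reviews grows. The sketch below applies it to the 5/5 versus 4.3/5 example above. The prior mean of 3.5 and the prior weight of 50 reviews are arbitrary assumptions that you would tune to your own data.

```python
def bayesian_average(avg_rating: float, num_reviews: int,
                     prior_mean: float = 3.5, prior_weight: int = 50) -> float:
    """Blend an object's average rating with a catalog-wide prior.

    `prior_mean` and `prior_weight` are hypothetical values: the prior acts
    like `prior_weight` extra reviews that all rate the object `prior_mean`.
    """
    total = prior_weight * prior_mean + num_reviews * avg_rating
    return total / (prior_weight + num_reviews)


few_reviews = bayesian_average(5.0, 3)       # 5/5 based on 3 reviews
many_reviews = bayesian_average(4.3, 1000)   # 4.3/5 based on 1000 reviews

print(round(few_reviews, 2), round(many_reviews, 2))
```

With these assumed parameters the object rated 4.3/5 by 1000 reviewers scores about 4.26, well above the object rated 5/5 by only 3 reviewers, which scores about 3.58. The resulting value can then be stored on each object and used as a single business metric.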

Demo

Here is an implementation we’ve done that combines all those best practices. It’s a search of WordPress plugins:

  • 38K plugins indexed,
  • Default typo-tolerance settings,
  • Popular flag: set to “1” if downloaded at least 10K times,
  • Business metric used to customize the ranking: the number of downloads of each plugin.
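
A configuration along these lines could look like the sketch below. It is a guess at the shape of such a setup, not the demo’s actual code: the `to_record` helper and the `downloads` field name are hypothetical. The `ranking` and `customRanking` keys are Algolia index settings, but the values shown here (the default ranking criteria with a `desc(popular)` criterion placed first) are an assumption about how the flag was injected.

```python
POPULAR_THRESHOLD = 10_000  # "downloaded at least 10K times"


def to_record(plugin: dict) -> dict:
    """Add the popular flag to a plugin before indexing (hypothetical helper)."""
    popular = 1 if plugin["downloads"] >= POPULAR_THRESHOLD else 0
    return {**plugin, "popular": popular}


# Assumed index settings: the popular flag is the first ranking criterion,
# and the number of downloads breaks ties via the custom ranking.
INDEX_SETTINGS = {
    "ranking": [
        "desc(popular)",
        "typo", "geo", "words", "filters",
        "proximity", "attribute", "exact", "custom",
    ],
    "customRanking": ["desc(downloads)"],
}

print(to_record({"name": "Example plugin", "downloads": 25_000}))
```

Typo tolerance needs no entry here because the demo relies on the default settings.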

Building such an experience literally took a few minutes!

Try it live: WordPress Plugins Search


About the author
Sylvain Utard

VP of Engineering
