Our data shows that when people search for a certain product, most of them use roughly 1.5 words. These short queries, unfortunately, make it hard for full-text searches to offer them relevant results. While there is an improvement in using filters, there are often so many that it can be confusing.
One of the ways to make searching more effective is to use the ‘learning to rank’ approach, which creates an optimal ranking of results. However, even this machine-learning method is not all-mighty – and that’s why we’ve come up with Query Understanding, a great companion to ‘learning to rank.’
Full-text search is excellent if you use longer queries, say four or more words, or special terms, such as a product code. In these cases, it usually gives you exactly what you are looking for in the first position or shows you a “No results” page. Either way, both situations are better than offering you a list of completely irrelevant results, which is often the case if you use shorter queries but minimize or leave out unique words.
How search works in the real world
This is actually how most people search. Our data shows that, on average, people use 1.43 words per query(with a standard deviation of +/- 0.58, computed for our 150+ most active clients). While we have clients with many product code-only queries (this depends on a domain), the median of product code-only queries is only 2.9%.
People using short, non-specific queries is the cornerstone of full-text search technology issues. The drawbacks of full-text search are the same as its main advantages – it can find anything that matches the query, literally anywhere. And so it usually finds tons of results and leaves the searcher to sort through them. The full-text score, computed from short, non-specific queries, is not good enough to produce the most relevant results ranking.
How should e-commerce businesses respond?
That’s why ‘learning to rank’ exists. This machine learning-based method, which orders results by a numerical value representing their relevance, combines human behavior with full-text metrics and creates an optimal results ranking. At Luigi’s Box, we’ve already incorporated this mechanism into our search-related products. We have seen, however, that despite being very helpful, ‘learning-to-rank’ is not the silver bullet for the ultimate search solution. It is challenging to learn the right ranking model for a query that is seen once a month, and there are many domains where a long tail of such queries exists and must be taken care of.
The solution would be to nudge users to rely more on filters to narrow the search results based on the particular parameters of their interest. However, when considering filters, there can be overwhelming options. For instance, one of our clients has more than 2,000 different parameters (depending on the product category) and almost 19,000 different values for those parameters. It’s impossible to build an intuitive and simple interface for all of them.
Query Understanding available now
Our approach to this problem is to recognize users’ search intent and turning the maximum number of query terms into key-value filters. For instance, if a user types in “open headphones,” we automatically recognize the “Acoustic system: open” filter, which is usually unavailable through a standard faceted interface, and informs the user that they can turn it off should they change their mind. We can cope with any of the thousands of parameters available and turn on filters that are otherwise not accessible to the users when the system recognizes a need.
A unique but quite frequent case is when people search for a category. In that case, we redirect them to the category page, thereby driving them to the (often) manually-curated page with the e-commerce site’s context-dependent content, such as banners, promotions, etc. Now they’ve been shown a relevant result and your current deals!
Our feature, Query Understanding, can recognize categories, brands, and parameters of products, which helps customers get more relevant results for short queries. It is available for all of our customers.
Tomáš is the CTO & Co-Founder of Luigi’s Box. He received Ph.D. in Computer Software Engineering from FIIT STU. Tomáš has been researching cutting-edge search technologies for more than ten years and continually works to revolutionize the concept of search and navigation to provide the best possible customer experience for site search users.More blog posts from this author