Elasticsearch Pagination – Everything You Need to Know

Master Elasticsearch pagination for efficient web data display. Optimize performance and enhance user experience.

April 25, 2024
Search

Elasticsearch Pagination – Everything You Need to Know

What is Elasticsearch?
Why use Elasticsearch pagination
What is Elasticsearch pagination?
Elasticsearch typical pagination
Deep pagination and why avoid it
Search_after pagination
Scroll pagination and scroll API
Conclusion

When building a webpage and using Elasticsearch to display data stored, you need to consider some other things. The vast information within the index often can’t be handled by the API Gateway at once.

That’s why it’s essential to paginate results so the client can get a predictable and manageable data amount returned every time.

However, before you can start painting the results with the client, you must paginate backend storage data. Most data storage solutions have functions that allow users to paginate, filter, and sort data.

Before we get to all the search requests, search engine configurations, and size parameters, let’s go back and explain some of the simpler concepts before we can talk about Elastic Search pagination and how it works.

What is Elasticsearch?

Elasticsearch is an analytics and distributed search engine. It’s currently one of the most popular search engines used for operational intelligence, text search, business analytics, log analytics, security intelligence, etc. Elasticsearch lets users send data as JSON documents using API or various ingestion tools.

Elasticsearch stores original documents automatically and adds searchable references for them within the cluster’s index. Users can also use the Elasticsearch API to search and retrieve desired documents.

You can also use various visualization tools for building interactive dashboards and visualizing data.

Why use Elasticsearch pagination

When building a webpage that needs to display a large amount of data stored in Elasticsearch, there is so much information in the index that the API Gateway can’t handle. The best solution in this scenario is to paginate results so the client gets predictable data returned every time.

However, before you can paginate your results with the client, you will have to paginate backed storage data. Luckily, most data storage platforms, including Elasticsearch, have various functions allowing users to paginate, filter, and sort data.

Your data structure and requirements are vital to determining what paginating methods you should use. Today, we’ll take a look at multiple pagination methods and explain how they work.

What is Elasticsearch pagination?

Pagination is a known technique used for web presentations. This default search mechanism in Elasticsearch is used for fetching larger data results. When sending queries to Elasticsearch, default values are used to return the first or the most important ten documents.

Limiting the presentation to around five pages is generally a good idea. That helps users know which page they are on, go to the next or the previous page, and select a specific page they want to navigate to. There are three pagination types you should know, including:

Cursor-based
Keyset
Offset

However, since our topic today is Elasticsearch pagination, we shall focus on these three types:

Scroll pagination
Search-after pagination
Pagination

In most applications, ten hits are displayed on the initial page, and there are different options users can “see more.” There’s a button for the next or previous page, the pages are listed, and users can jump on them, or a scrolling option. Now, let’s take a look at each pagination method individually.

Elasticsearch typical pagination

As mentioned earlier, traditional pagination is the default mechanism for fetching more results in Elasticsearch. When sending search queries to Elasticsearch, it uses the default values to send the first or most relevant documents with a maximum of 10 results.

This pagination method sets the upper default limit at 10,000. Pagination doesn’t allow returning over 10,000 documents. However, this configuration can be changed by using the (index.max_result_window) command line. Many developers use this change, but it could cause issues down the road.

The search requests consist of two phases. The first phase follows the initial request, and it’s the query phase. The second phase is the fetch phase.

The query phase (first phase)

Data nodes calculate the scores during the initial phase and match the documents while returning a list of document IDs and a list of scores.

The list is built on the data node and subsequently forwarded to the node in charge of the search request, where it’s sorted and retained in memory. The query score-ID list can become really large over time.

The fetch phase (second phase)

During the second phase, the document JSON Source is fetched from all nodes holding the documents. It ultimately becomes a “Multi Get-request” based on the ID for all the documents that are parts of the pages that need to be returned.

Even though these requests are efficient, all of the query-related information must be kept in memory until the response is sent to the client, and that’s why it’s generally an excellent option to use smaller page sizes.

This method can easily lead to deep pagination, which leads to using up all cluster memory and performance loss.

Deep pagination and why avoid it

Deep pagination leads to extensive memory usage, causes cluster latency, and disrupts the performance of your cluster overall. With this approach, you’re allowing access to all pages. That might sound good in theory, but even Google has a limited number of pages displayed.

Elasticsearch always recalculates hits while sorting and storing the whole Score-ID list within the memory. Instead, the focus should be on providing relevant scoring, filters, and UI to make your users happy with the results they get on the first page.

In other words, a single request and page result should be enough for most people.

Search_after pagination

The search_after pagination is ideal for applications where you don’t need to jump from specific pages to other pages or when you use infinite scrolling. The search_after pagination lets you tell Elasticsearch which was the last hit viewed so that it can ignore all previous hits.

Rather than storing the entire score-ID list for the request within the memory and having to perform actions like sorting so that the right page results can be provided, this method uses a tiebreaker from the previous hit on the last search request.

With this search method, you can show many hits efficiently. With search_after, it’s possible to show over 10,000 hits without worrying about memory usage or using pre-calculated pages.

Live index updates and pagination

Elasticsearch is good for supporting live index updates without causing performance issues. It’s easy to add documents, delete, or update while performing the same-index queries.

Even though these key concepts can be really useful, they could lead to inconsistencies with search result pages when pagination is involved.

For example, if you insert a document relevant to the query and the user clicks on the second page, they will likely get the last document viewed displayed on the top. Search_after and pagination are stateless, meaning there’s no guarantee the order of the search results will be the same when users change pages.

You need to use stateful pagination to ensure that the search experience is consistent after a specific number of repetitions. That brings us to the next search method called Point in Time API.

Using the Point in Time API with pagination or search_after

You can use the Point in Time API to extend your Search_After or traditional pagination and turn them into stateful functions. Users will always get the same index version after a certain time.

All the updates will be sidelined or ignored, or at least users won’t notice them, meaning their search experience will be consistent. Users won’t see random documents showing up when navigating back and forth across pages.

Scroll pagination and scroll API

You can use Scroll API for iterating many documents that match a query and sometimes even all documents that are matching. Even though this API has the name Scroll API, you should never use it for implementing infinite scrolling. At the same time, it shouldn’t be used for frequent end-user requests.

This search operation can handle a scroll request, but Scroll API is completely stateful, meaning all index updates are ignored during the scroll request. Elasticsearch must store the snapshot of the current index version and keep it alive during the scroll’s lifespans to achieve this function.

Actively updated indexes can have difficulties keeping the initial search context alive. The scroll API can be used for retrieving a broad collection of documents with a single request.

You will need a scroll_id for the scroll API, and you can get it by adding a specific scroll argument within query requests.

Conclusion

To sum things up in a simple manner, you should use traditional pagination whenever you need to access pages freely, and you have no need for deep pagination.

The search_after method is the best solution when you and your users want to use the “next” & “previous” buttons, and there’s a need for wide access to multiple pages.

If you need consistency across search result pages, it’s best to use Point in Time API, and the Scroll method is ideal when you want to list all query hits or when you need consistent ordering across multiple search result pages.

We hope our post has helped you understand Elasticsearch pagination and how to choose the best method for your needs.

Barbora Bončová Product Marketing Specialist

Barbora does magic with words in Luigi's Box as a product marketing specialist. She got into writing while studying at university as a volunteer for various civic associations. Besides being part of Luigi's Box marketing team, she co-organizes the TEDxBratislava conference, where she cares about marketing and PR.

Read on

April 4, 2024
Search

What Is Semantic Search: Definition, Examples, And Usefulness

Take a look at our article and learn about what semantic search is, how it differs from other types of search, and how it can be used.

February 27, 2024
Search

Mastering E‑Commerce UX: What is Faceted Navigation and How to Leverage it

Harness faceted navigation's power to enhance your e-commerce. Avoid pitfalls and learn 9 top implementation practices.

October 25, 2023
Search

Mobile Search UI: Discover 8 Tips and Best Practices for Improvement

Looking to improve your mobile search UI? Take a look at our tips, tricks and best practices for a better mobile search.

October 19, 2023
Search

Neural Search 101: A Guide to NLP-Powered Neural Search Experiences

Explore the transformative power of neural search in enhancing online search experiences with Luigi's Box insights.

September 21, 2023
Search

Elasticsearch Pagination – Everything You Need to Know

Master Elasticsearch pagination for efficient web data display. Optimize performance and enhance user experience.

May 29, 2023
Search

Understanding The Different Types of Search Algorithms

Curious about different types of search algorithms and how they work? Take a look at our article and learn what you need about search algorithms.

May 28, 2023
Search

10 Tips to Avoid No Results Found Pages

If your e-commerce business loses customers due to no results found pages, here's what you can do to fix it.

April 25, 2024
Search

Power of Autocomplete for an E-Commerce Search

Refine your e-commerce search bar with autocomplete to boost sales. Discover best practices to enhance the user experience.

April 25, 2024
Search

How Does Luigi’s Box Affect the SEO of Your Website?

Find out more about how Luigi's Box can affect the SEO of your website and if it can bring a positive impact on your SEO performance.

April 6, 2023
Search

Tips for Search Box Optimization & Best Practices You Need to Know

Learn a few useful tips on how to best optimize your search box to improve your customers' experience and increase revenue.

March 2, 2023
Search

Advanced Product & Website Search (When to Implement)

Advanced product and website searches take many forms, from simple keyword searches to more advanced natural language processing.

February 17, 2023
Search

Why My E-Shop Needs Product Recommendation Software

Retaining customers is more challenging than ever. Learn how to keep them interested with personalized product recommendation software.

February 9, 2023
Search

The Perfect Search Bar Design: UI/UX Best Practices

Implement these search bar UI design best practices to boost your e-commerce store conversions.

January 30, 2023
Search

Predictive Search: Why is it Essential For a Modern E-Shop

How can predictive search help enhance user experience, generate more sales and increase revenue? Let's explore it.

December 6, 2022
Search

E-Commerce Search Trends

Leverage Luigi's Box Analytics which provides valuable reports on the most searched phrases and products/categories.

November 15, 2022
Search

How to Build a Search Engine for Your Website

Useful web search with features, that will make shopping easier and thus increase the chance of purchase is important in every good e-shop.

October 30, 2022
Search

What Is Federated Search and Why Is It Important

What is federated search, and how it's different from traditional search engines? Discover all its advantages.

October 10, 2022
Search

Voice Search Challenges: What It Needs to Be Able to Do

How to optimize for voice search? We know it's becoming increasingly popular and have prepared a list of challenges for easier implementation.

October 1, 2022
Search

E-Commerce Search Metrics Explained

For a successful e-commerce site, performance metrics are vital. What to measure and how? Learn more in our blog.

September 19, 2022
Search

How to Optimize Product Catalog for Maximum Search Efficiency

Improve the quality of your product catalog. Manage and optimize it without any trouble with this Luigi's Box list and increase sales.

June 21, 2022
Search

Why Your E-Shop Needs Faceted Filtering

It's vital to offer customers a smooth shopping experience. If they can’t find desired products, they'll instantly head elsewhere.

May 20, 2022
Search

Why Your Website Needs Faceted Search

Faceted search can massively improve your website usability and boost your business. Learn how to use it to increase your sales.

May 18, 2022
Search

Image Search Software: All Nuts and Bolts

Have you ever knew what something you wanted to buy looked like, but you didn’t know the name? Visual search has got you covered.

February 25, 2022
Search

Mobile Web Search Challenges & How to Overcome Them

Mobile web searches are becoming more popular every year. Explore the crucial challenges and discover effective solutions.

February 16, 2022
Search

Voice Search for E-Commerce Businesses

Voice search enables you to use speech to search for products and information on a website. Learn more about it in this guide.

January 19, 2022
Search

15 E-Commerce Search Best Practices to Look Out For

When was the last time you checked your site search performance? Here’s a 15-point checklist that’ll help you determine how fit your site search is.

January 3, 2022
Search

E-Commerce Website Development Steps in 2024 [Checklist]

Can you develop a website that satisfies customers' needs and keeps them coming back? Here are the features your e-shop needs.

September 6, 2021
Search

[Explained] Trendings

The Trendings tab in Luigi’s Box Analytics provides useful reports on the most searched phrases and items. Learn how to leverage this data.

May 26, 2021
Search

[Explained] Custom Keywords

Custom Keywords are an excellent way to group products or items that don’t share an already designated attribute or category on your site.

April 26, 2021
Search

[Explained] Synonyms and Synonym Recommendations

Synonyms can be a nightmare for standard searches. Learn how Luigi's Box Search makes search with synonyms easier.

February 25, 2021
Search

[Explained] Boosted Items and Boosted Terms

What's the difference between boosted items and terms? How and when to use them? Read and learn more.

August 1, 2019
Search

A Lesson on Search Satisfaction From Enterprise Search

World's tech giants invest heavily in their search. What makes enterprise search interesting, and what can e‑commerce businesses learn from it?

June 3, 2019
Search

Here’s the latest news on site search in 2019 (only what you need to know and nothing more!)

Site search is an important aspect of you success. We dived deep into a huge bulk of search data, so you don't have to.

May 14, 2019
Search

Query Understanding: An Efficient Way to Deal With Long Tail Queries

People don't use whole words for searching. So how to ensure your search always delivers relevant search results?

May 10, 2019
Search

How Should You Paginate Search Results on Your Website?

Read on to learn how to paginate search results to ensure your customers' most convenient browsing experience.

April 25, 2024
Search

How to Increase Conversion Rate on Your E-Commerce Site by Improving Your Site Search

Why is site search so important for e-shops? Because approximately one third of visitors use this function while looking for a product on a website!

January 30, 2019
Search

Offline Synthetic Testing: A Quick and Safe Method to Improving Search Results

Search optimization requires a lot of testing. Want to be sure the changes impact your website positively? Try offline synthetic testing.

August 24, 2018
Search

What Site Search Data Can Teach Your E-Commerce Brand About Buyer Intent

The power of search goes far beyond helping customers to find what they are looking. Here’s what you need to know about site search data!

January 9, 2018
Search

How Good is Your Search?

Have you ever wondered how good your search is? It's probably good enough for you, but is it good enough for your customers?

Visit Category

Elasticsearch Pagination – Everything You Need to Know

What is Elasticsearch?

Why use Elasticsearch pagination

What is Elasticsearch pagination?

Elasticsearch typical pagination

The query phase (first phase)

The fetch phase (second phase)

Deep pagination and why avoid it

Search_after pagination

Live index updates and pagination

Using the Point in Time API with pagination or search_after

Scroll pagination and scroll API

Conclusion

Read on

This website uses cookies