Performance characteristics and implementation details of context/completion suggester

Srinivasan_Ramaswamy · August 18, 2020, 11:58pm

Hi Everyone

I would like to understand the performance characteristics of the context suggester. I am planning to use the completion suggester on a very large index (billions of documents, but only a few fields). The field is an array of strings, which is likely to contain duplicates across documents.

Here are some things I would like to understand:

How does adding a context affect performance? Does it create one FST for each unique combination of contexts?
What is the recommended number of shards (if I decide to put this in a separate index)? Should I keep the number of shards to a minimum?
I am planning to use the skip_duplicates flag to filter out duplicates. What is the cost of using this flag?
I read that it builds an FST that is kept on the heap. What are some recommendations for optimizing performance and memory footprint?
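
For concreteness, here is a minimal sketch of the kind of setup these questions are about: a completion field with one category context, a document carrying an array of inputs, and a suggest request with skip_duplicates enabled. The field name "tags_suggest", the context name "category", the suggestion name "tag_suggestions", and the helper functions are all hypothetical and chosen only for illustration. The snippet just builds the JSON request bodies and does not talk to a cluster.

```python
import json


def completion_mapping(field: str, context_name: str) -> dict:
    """Index mapping with a completion field that has one category context."""
    return {
        "mappings": {
            "properties": {
                field: {
                    "type": "completion",
                    "contexts": [{"name": context_name, "type": "category"}],
                }
            }
        }
    }


def suggest_document(field, inputs, context_name, context_values) -> dict:
    """Document body: an array of input strings tagged with context values."""
    return {
        field: {
            "input": list(inputs),
            "contexts": {context_name: list(context_values)},
        }
    }


def suggest_query(field, prefix, context_name, context_values, size=5) -> dict:
    """Suggest request filtered by context, with duplicate suggestions skipped."""
    return {
        "suggest": {
            "tag_suggestions": {  # hypothetical suggestion name
                "prefix": prefix,
                "completion": {
                    "field": field,
                    "size": size,
                    "skip_duplicates": True,
                    "contexts": {context_name: list(context_values)},
                },
            }
        }
    }


if __name__ == "__main__":
    # Hypothetical field/context names and sample values.
    print(json.dumps(completion_mapping("tags_suggest", "category"), indent=2))
    print(json.dumps(
        suggest_document("tags_suggest", ["elastic", "elasticsearch"],
                         "category", ["search"]), indent=2))
    print(json.dumps(
        suggest_query("tags_suggest", "ela", "category", ["search"]), indent=2))
```

The same inputs appearing in many documents is the case where skip_duplicates matters, since without it the same suggestion text can be returned once per matching document.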

Any detailed explanation of the internal implementation would also help me understand it better. I read http://blog.mikemccandless.com/2010/12/using-finite-state-transducers-in.html to get the overall idea, and I am going through the source in https://github.com/elastic/elasticsearch/tree/master/server/src/main/java/org/elasticsearch/search/suggest/completion. Some overall guidance would help me understand the source code better.

Thanks
Srini

system · September 15, 2020, 11:58pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
