[es/search] failed: [search_phase_execution_exception] all shards failed

FTOR · June 7, 2023, 10:05am

Hello,

I would like to retrieve documents from an index that contains a large number of documents (~1 million).
I am using ElasticsearchClient to connect to Elasticsearch and fetch the data. I tested the solution with a small number of documents and it works well, but I got an error when testing with size(105000). Do you have any idea how to solve this problem?

Below are the implementation of the connection, the request query, and the error.

warkolm · June 7, 2023, 11:02pm

Please don't post pictures of text, logs or code. They are difficult to read, impossible to search and replicate (if it's code), and some people may not even be able to see them.

FTOR · June 8, 2023, 8:40am

OK, thank you, noted. I updated the index settings to increase the result window:
PUT /MyIndex/_settings
{
  "index": {
    "max_result_window": 2100000
  }
}

I also updated the request in the code to get the total hit count, using .trackTotalHits(t -> t.enabled(true)):

SearchResponse<MyClass> response = client.search(s -> s
        .index(index)
        .size(1900000)
        .trackTotalHits(t -> t.enabled(true))
        .query(QueryBuilders.matchAll().build()._toQuery()),
    MyClass.class
);

I now get more documents, but still fewer than the number of documents in the Elasticsearch index.

Christian_Dahlqvist · June 8, 2023, 8:45am

What is the specification of your cluster? How much heap do you have assigned?

Increasing that limit will put a lot more load on the cluster and it is not clear it is able to handle this. Is there anything in the Elasticsearch logs?
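
To make the load concern concrete: with from/size paging, each shard collects up to from + size hits and the coordinating node then merges what the shards return. The following is a rough back-of-the-envelope sketch only. It assumes a hypothetical 5 primary shards (the shard count is not stated in this thread), counts hit entries rather than bytes, and gives an upper bound, since a shard cannot return more hits than it holds. The class and method names are made up for illustration.

```java
// Back-of-the-envelope estimate, not a measurement of any real cluster.
class ResultWindowCost {

    // Upper bound on the number of hit entries the coordinating node
    // may have to merge for a single from/size request.
    static long maxEntriesToMerge(int shards, long from, long size) {
        return (long) shards * (from + size);
    }

    public static void main(String[] args) {
        int assumedShards = 5; // assumption: not given in the thread

        // Request from the thread: size(1900000), from = 0
        System.out.println(maxEntriesToMerge(assumedShards, 0, 1_900_000)); // 9500000

        // Same cluster with the default window of 10000
        System.out.println(maxEntriesToMerge(assumedShards, 0, 10_000)); // 50000
    }
}
```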

dadoonet · June 8, 2023, 9:03am

You can use:

- the size and from parameters to display by default up to 10000 records to your users. If you want to change this limit, you can change the index.max_result_window setting, but be aware of the consequences (i.e. memory).
- the search after feature to do deep pagination.
- the Scroll API if you want to extract a resultset to be consumed by another tool later. (Not recommended anymore)
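
The search after option above boils down to a loop: sort on a unique key, fetch one page, then pass the sort value of the last hit as the starting point for the next page. Here is a minimal sketch of that loop. It assumes an in-memory sorted list as a stand-in for the index and does not call Elasticsearch at all; the class `SearchAfterSketch` and the methods `fetchPage` and `fetchAll` are hypothetical names, not part of the Java client.

```java
import java.util.ArrayList;
import java.util.List;

// In-memory illustration of search_after style paging.
// The "index" is a list of unique keys sorted in ascending order.
class SearchAfterSketch {

    // Returns up to pageSize keys strictly greater than searchAfter.
    // A null searchAfter means "start from the beginning".
    static List<Long> fetchPage(List<Long> sortedIndex, Long searchAfter, int pageSize) {
        List<Long> page = new ArrayList<>();
        for (Long key : sortedIndex) {
            if (searchAfter != null && key <= searchAfter) {
                continue; // already returned on an earlier page
            }
            page.add(key);
            if (page.size() == pageSize) {
                break;
            }
        }
        return page;
    }

    // Pages through the whole index, carrying the last sort value forward.
    static List<Long> fetchAll(List<Long> sortedIndex, int pageSize) {
        List<Long> all = new ArrayList<>();
        Long searchAfter = null;
        while (true) {
            List<Long> page = fetchPage(sortedIndex, searchAfter, pageSize);
            if (page.isEmpty()) {
                break; // an empty page means everything has been read
            }
            all.addAll(page);
            searchAfter = page.get(page.size() - 1);
        }
        return all;
    }

    public static void main(String[] args) {
        List<Long> index = new ArrayList<>();
        for (long i = 0; i < 25; i++) {
            index.add(i);
        }
        System.out.println(fetchAll(index, 10).size()); // prints 25
    }
}
```

With a real cluster, each call to the hypothetical `fetchPage` would be one search request whose size stays at or below 10000, so index.max_result_window does not need to be raised.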

FTOR · June 12, 2023, 1:30pm

Thank you for your response!
I tried search after with a PIT. However, I ran into two issues: performance, and the total number of documents returned is still too low.

Time performance: the query takes 4 s in Dev Tools, but 10 s through the Java API.
Document count: I do not get the exact number of documents in the Elasticsearch index. I think the query does not consider duplicates.

FTOR · June 12, 2023, 1:31pm

I cannot change the cluster parameters. They are fixed by the architects.

system · July 10, 2023, 1:32pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
