Querying Elasticsearch 2.3.0 Causes node to exit cluster

AvantGardeDreams · April 29, 2016, 9:35pm

I have a five node cluster recently upgraded from ES 1.7 to ES 2.3.0.

Each node has 13 gig of memory dedicated to ES. I have observed that making a large search query will cause a node to exit the cluster. Error logs make reference to zen ping being unable to reach the missing node. Restarting the missing node will allow it to rejoin the cluster, but it subsequently hangs on re-sharding.

Any help in troubleshooting and understanding this issue would be greatly appreciated.

warkolm · April 30, 2016, 1:32am

Did you check the logs on the node that disconnected?

AvantGardeDreams · May 2, 2016, 12:28pm

Yes, the following entries appear in the log:

[2016-05-02 08:17:30,447][WARN ][index.translog ] [Node1] [aggblip][3] unexpected error while checking whether the translog needs a flush. rescheduling
java.lang.OutOfMemoryError: Java heap space
[2016-05-02 08:17:30,447][DEBUG][action.search ] [Node1] [120] Failed to execute fetch phase
RemoteTransportException[[Failed to deserialize response of type [org.elasticsearch.search.fetch.FetchSearchResult]]]; nested: TransportSerializationException[Failed to deserialize response of type [org.elasticsearch.search.fetch.FetchSearchResult]]; nested: OutOfMemoryError[Java heap space];
Caused by: TransportSerializationException[Failed to deserialize response of type [org.elasticsearch.search.fetch.FetchSearchResult]]; nested: OutOfMemoryError[Java heap space];

warkolm · May 2, 2016, 10:05pm

That's why.
What is the query, what are your node specs, what is the config (heap etc).

AvantGardeDreams · May 16, 2016, 12:35pm

Sorry for the late reply.

The heap space was the issue. Elasticsearch had been started with the following arguments:

./elasticsearch -d --Xms=20g --Xmx=20g

The Xms/Xmx args were not utilized when the JVM started, so it used the system defaults - 256M/1g

I set the ES_HEAP_SIZE environment variable to the desired value and the issue has been resolved. Thanks for the input.

Topic		Replies	Views
Elasticsearch Cluster data node comes out of cluster frequently Elasticsearch	1	378	May 3, 2018
[OutOfMemoryError[Java heap space]] Elasticsearch	2	2660	July 6, 2017
Elasticsearch cluster down due to Elasticsearch:java.lang.OutOfMemoryError: Java heap space Elasticsearch	4	374	June 25, 2019
Handling node failure in ES cluster Elasticsearch	3	2277	July 6, 2017
Unresponsive cluster after too large of a query (OutOfMemoryError: Java heap space)? Elasticsearch	7	775	July 6, 2017

Querying Elasticsearch 2.3.0 Causes node to exit cluster

Related topics