Tribe node to ignore cluster events

bennett · May 18, 2017, 7:38pm

Hi,

I have tribe nodes that look at multiple clusters, but every time there's a index rotation or any sort of maintenance, the tribe node goes down until things are allocated and back to normal which could take over an hour.

I heard there was a patch that was given to blizzard for the tribe node to ignore cluster events. I was wondering if I could get a similar patch or tell me how to patch it?

Thanks!
Krystle

jasontedor · May 19, 2017, 2:34am

Completely ignoring cluster "events" is inherently dangerous yet there are aspects of cluster state updates that are expensive yet can be ignored on tribe nodes.

Upgrade to a version that includes Skip shard management code when updating cluster state on client/tribe nodes by ywelsch · Pull Request #20731 · elastic/elasticsearch · GitHub (first released in 2.4.2, and also included in 5.0.0).

bennett · May 19, 2017, 3:19pm

I'm running version 2.4.4 . wouldn't that have the change if 2.4.2 has it?

bennett · May 19, 2017, 3:24pm

What else could I do to for more stability for the tribe nodes?

jasontedor · May 19, 2017, 3:31pm

Yes.

jasontedor · May 19, 2017, 3:31pm

I don't know because I don't have enough detail to assess what your problem is.

bennett · May 19, 2017, 3:34pm

basically when the indices rotate (create new ones and delete old ones), the tribe node goes in a red state saying it can't connect to elasticsearch. This happens for an hour sometimes as the clusters are trying to stabilize. We have users trying to access the tribe nodes but get errors. We are looking to reduce the downtime of the tribe node. And when I was listening to blizzard (which looks like we have a similar setup) they mentioned there was a patch that was applied that helped with the downtime of the tribe node.

jasontedor · May 20, 2017, 1:56pm

Please provide logs.

system · June 17, 2017, 1:56pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Tribe node lost connection to Cluster Elasticsearch	5	1364	July 5, 2017
Known tribe bug for ES v1.7.1? Elasticsearch	4	642	July 5, 2017
Fault tolerant tribe nodes? Elasticsearch	1	313	July 6, 2017
Tribe nodes between different Cluster-Versions? Elasticsearch	2	883	July 5, 2017
Elasticsearch cluster instability Elasticsearch	13	2821	July 6, 2017

Tribe node to ignore cluster events

Related topics