Hello all!
So I have been having this issue lately with elasticseach 2.4.
When I run
POST /_reindex
{
"source": {
"index": "indextest-2016.12.04"
},
"dest": {
"index": "indextest-2016.12.04-2"
}
}
I see random nodes leaving the cluster momentarily then joining back up. If I do requests_per_second=50 then that seems like a kind of sweet spot where it barely happens.
Here is a response once its finished (the failure nodes change)
{
"took": 182120,
"timed_out": false,
"total": 2999257,
"updated": 7200,
"created": 600,
"batches": 9,
"version_conflicts": 0,
"noops": 0,
"retries": 0,
"throttled_millis": 0,
"requests_per_second": "unlimited",
"throttled_until_millis": 0,
"failures": [
{
"shard": -1,
"index": null,
"reason": {
"type": "node_not_connected_exception",
"reason": "[NODE_2][IP:9300] Node not connected"
}
},
{
"shard": -1,
"index": null,
"reason": {
"type": "node_not_connected_exception",
"reason": "[NODE_2][IP:9300] Node not connected"
}
},
{
"shard": -1,
"index": null,
"reason": {
"type": "node_not_connected_exception",
"reason": "[NODE_6][IP:9300] Node not connected"
}
}
]
}
I have never noticed this before and I am wondering how can I best go about seeing why a node is leaving and joining? In the master node logs and the data node I notice leaving there is no logs. Very weird!
Was hoping for insight as to what stats or logs to look for.
Thanks