ElasticSearch nodes fail at random

Saurabh_Minni · March 21, 2014, 7:32am

Hi,
I am facing this weird issue where I have created an ES cluster with 3 data
nodes and 1 master node.

All data nodes are dual octa core CPU with 32 GB RAM and the master is a
quad core 16GB machine
I am trying to insert around 3000 records per second with replica set as 1
. Each record is 3KB in size.
All goes well with the setup but suddenly one of the machine's load shoots
up dramatically and then finally the machines becomes unreachable.
It happens with 1 of the 3 data nodes at random.
On further inspection I can see that page faults on that system for java
process increases very quickly compared to others. System IO% also shoots
up.

I have used mlockall and made refresh index rate as -1.

But inspite of all this, the cluster keeps on failing. I was hoping that
insert rate with ElasticSearch would be really high reading at posts at
various places.

Please let me know if you know of any setting that needs to be changed.

Thanks for your help in advance.
Saurabh

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/22264709-ddd5-4389-ac6f-9640c7523036%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Topic		Replies	Views
Please help - ES 2.1.1 cluster randomly crashing Elasticsearch	18	2544	July 5, 2017
ES cluster fails at random times Elasticsearch	5	1247	December 29, 2016
Random data node disconnections on AWS Elasticsearch	1	515	March 14, 2017
Elasticsearch cluster fails to stabilize Elasticsearch	5	929	July 6, 2017
Data node high CPU Elasticsearch	19	3646	February 26, 2018

ElasticSearch nodes fail at random

Related topics