Cluster details
100 data nodes, 3 dedicated master nodes
Data node JVM: 30 GB on each node
Data node cores: 64
Dedicated master node cores: 64
Cluster data details
Total 500TB
Shards have a uniform size of 20GB
Total number of indices: 3000, each with 5 primary shards and 1 replica
Total number of segments: 900,000
Mappings are close to 115 MB; the cluster state is close to 158 MB.
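For a rough sanity check, the figures above can be turned into per-node and per-shard numbers. This is only a back-of-the-envelope sketch: it assumes "1 replica" means one replica copy of every primary shard, that shards are spread evenly across the data nodes, and that 1 TB = 1000 GB.

```python
# Back-of-the-envelope shard math from the figures quoted above.
# Assumptions (not stated in the post): one replica copy per primary,
# even shard distribution across data nodes, and 1 TB = 1000 GB.
data_nodes = 100
indices = 3000
primaries_per_index = 5
replicas_per_primary = 1
shard_size_gb = 20
total_segments = 900_000

total_shards = indices * primaries_per_index * (1 + replicas_per_primary)
shards_per_node = total_shards / data_nodes
segments_per_shard = total_segments / total_shards
implied_data_tb = total_shards * shard_size_gb / 1000

print(f"total shards:       {total_shards}")
print(f"shards per node:    {shards_per_node:.0f}")
print(f"segments per shard: {segments_per_shard:.0f}")
print(f"implied data size:  {implied_data_tb:.0f} TB")
```

Under these assumptions that works out to 30,000 shards, about 300 shards per data node and about 30 segments per shard. The implied size of 600 TB is somewhat above the quoted 500 TB, so the numbers above are presumably rounded.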
Requests on a data node: no indexing and no queries, just management requests
[2017-10-06T04:36:53,017][INFO ][c.a.c.e.logger ] [_qYdend] HEAD / - 200 OK 345 0
[2017-10-06T04:36:53,053][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes filter_path=nodes.*.version%2Cnodes.*.http.publish_address%2Cnodes.*.ip 200 OK 11999 36
[2017-10-06T04:36:53,056][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes/_local filter_path=nodes.*.settings.tribe 200 OK 2 0
[2017-10-06T04:36:53,066][INFO ][c.a.c.e.logger ] [_qYdend] GET /_cluster/health/.kibana timeout=5s 200 OK 411 9
[2017-10-06T04:36:53,066][INFO ][c.a.c.e.logger ] [_qYdend] POST /.kibana/config/_search - 403 FORBIDDEN 253 0
[2017-10-06T04:36:55,568][INFO ][c.a.c.e.logger ] [_qYdend] HEAD / - 200 OK 345 0
[2017-10-06T04:36:55,607][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes filter_path=nodes.*.version%2Cnodes.*.http.publish_address%2Cnodes.*.ip 200 OK 11999 38
[2017-10-06T04:36:55,610][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes/_local filter_path=nodes.*.settings.tribe 200 OK 2 1
[2017-10-06T04:36:55,624][INFO ][c.a.c.e.logger ] [_qYdend] GET /_cluster/health/.kibana timeout=5s 200 OK 411 14
[2017-10-06T04:36:55,625][INFO ][c.a.c.e.logger ] [_qYdend] POST /.kibana/config/_search - 403 FORBIDDEN 253 0
[2017-10-06T04:36:58,127][INFO ][c.a.c.e.logger ] [_qYdend] HEAD / - 200 OK 345 0
[2017-10-06T04:36:58,165][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes filter_path=nodes.*.version%2Cnodes.*.http.publish_address%2Cnodes.*.ip 200 OK 11999 37
[2017-10-06T04:36:58,168][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes/_local filter_path=nodes.*.settings.tribe 200 OK 2 1
[2017-10-06T04:36:58,183][INFO ][c.a.c.e.logger ] [_qYdend] GET /_cluster/health/.kibana timeout=5s 200 OK 411 15
[2017-10-06T04:36:58,184][INFO ][c.a.c.e.logger ] [_qYdend] POST /.kibana/config/_search - 403 FORBIDDEN 253 0
[2017-10-06T04:37:00,686][INFO ][c.a.c.e.logger ] [_qYdend] HEAD / - 200 OK 345 0
[2017-10-06T04:37:00,724][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes filter_path=nodes.*.version%2Cnodes.*.http.publish_address%2Cnodes.*.ip 200 OK 11999 37
[2017-10-06T04:37:00,727][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes/_local filter_path=nodes.*.settings.tribe 200 OK 2 1
[2017-10-06T04:37:00,742][INFO ][c.a.c.e.logger ] [_qYdend] GET /_cluster/health/.kibana timeout=5s 200 OK 411 15
[2017-10-06T04:37:00,743][INFO ][c.a.c.e.logger ] [_qYdend] POST /.kibana/config/_search - 403 FORBIDDEN 253 0
[2017-10-06T04:37:03,247][INFO ][c.a.c.e.logger ] [_qYdend] HEAD / - 200 OK 345 0
[2017-10-06T04:37:03,286][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes filter_path=nodes.*.version%2Cnodes.*.http.publish_address%2Cnodes.*.ip 200 OK 11999 38
[2017-10-06T04:37:03,288][INFO ][c.a.c.e.logger ] [_qYdend] GET /_nodes/_local filter_path=nodes.*.settings.tribe 200 OK 2 1
[2017-10-06T04:37:03,302][INFO ][c.a.c.e.logger ] [_qYdend] GET /_cluster/health/.kibana timeout=5s 200 OK 411 13
[2017-10-06T04:37:03,303][INFO ][c.a.c.e.logger ] [_qYdend] POST /.kibana/config/_search - 403 FORBIDDEN 253 0
[2017-10-06T04:37:05,806][INFO ][c.a.c.e.logger ] [_qYdend] HEAD / - 200 OK 345 1
My guess was that it was related to a slow file system, e.g. a slow or congested SAN, so I am a bit surprised if you are seeing this with locally attached SSDs.