Hi everyone,
I'm facing an issue with the memory usage of ES.
We have a 5 nodes cluster to store our logs sent from graylog.
I've searching and reading around for a week now I made some change on my configuration but nothing seems to be relevant or helping me to solve this issue.
Here is my elasticsearch.yml
bootstrap.memory_lock: true
network.host: 0.0.0.0
http.port: 9200
discovery.zen.ping.unicast.hosts: ["log01", "log02", "log03", "log04", "log05"]
discovery.zen.minimum_master_nodes: 3
cluster.routing.allocation.disk.watermark.low: 90%
cluster.routing.allocation.disk.watermark.high: 95%
indices.fielddata.cache.size: 20%
indices.memory.index_buffer_size: 20%
My jvm.option :
-Dcom.sun.management.jmxremote
-Dcom.sun.management.jmxremote.port=9999
-Dcom.sun.management.jmxremote.authenticate=false
-Dcom.sun.management.jmxremote.ssl=false
-Dcom.sun.management.jmxremote.local.only=false
-Xms31g
-Xmx31g
-XX:NewRatio=2
-XX:ParallelGCThreads=20
-XX:+UseConcMarkSweepGC
-XX:CMSInitiatingOccupancyFraction=75
-XX:+UseCMSInitiatingOccupancyOnly
10-:-XX:-UseConcMarkSweepGC
10-:-XX:-UseCMSInitiatingOccupancyOnly
10-:-XX:+UseG1GC
10-:-XX:InitiatingHeapOccupancyPercent=75
-Des.networkaddress.cache.ttl=60
-Des.networkaddress.cache.negative.ttl=10
-XX:+AlwaysPreTouch
-Xss1m
-Djava.awt.headless=true
-Dfile.encoding=UTF-8
-Djna.nosys=true
-XX:-OmitStackTraceInFastThrow
-Dio.netty.noUnsafe=true
-Dio.netty.noKeySetOptimization=true
-Dio.netty.recycler.maxCapacityPerThread=0
-Dlog4j.shutdownHookEnabled=false
-Dlog4j2.disable.jmx=true
-Djava.io.tmpdir=${ES_TMPDIR}
-XX:HeapDumpPath=/var/lib/elasticsearch
-XX:ErrorFile=/var/log/elasticsearch/hs_err_pid%p.log
Some more informations :
{
"name" : "log01.sib.fr",
"cluster_name" : "elasticC01",
"cluster_uuid" : "xxxxxxxxxxxxxxxx",
"version" : {
"number" : "6.8.13",
"build_flavor" : "default",
"build_type" : "rpm",
"build_hash" : "be13c69",
"build_date" : "2020-10-16T09:09:46.555371Z",
"build_snapshot" : false,
"lucene_version" : "7.7.3",
"minimum_wire_compatibility_version" : "5.6.0",
"minimum_index_compatibility_version" : "5.0.0"
},
"tagline" : "You Know, for Search"
}
Every few days I have to restart my whole cluster because it's going down due to memory usage.
I am planning to make special nodes only to do the "master" role but I guess it won't be enough.
I am certainly missing something here (I'm quite new to ES at least the recent versions.
Any help would be appreciate.
Regards,