Hi,
I have cluster with 5 nodes: 3 master and 2 data nodes.
Data nodes have 24 Gb heap, 12Gb Xmx.
I use daily indexes which automatically created at midnight. Each index is about 30Gb.
Index has 5 shards and 1 replica.
I have one node which receives data in bulk form from producer and second one node as replica. To second one node also Kibana is connected.
Today at midnight I have OutOfMemory exception on both data nodes.
Node 1 - is replica
Node 2 - is data receiver
Logs:
Node2:
[2018-01-26T00:00:14,648][ERROR][o.e.t.n.Netty4Utils ] fatal error on the network layer
at org.elasticsearch.transport.netty4.Netty4Utils.maybeDie(Netty4Utils.java:185)
at org.elasticsearch.http.netty4.Netty4HttpRequestHandler.exceptionCaught(Netty4HttpRequestHandler.java:89)
at io.netty.channel.AbstractChannelHandlerContext.invokeExceptionCaught(AbstractChannelHandlerContext.java:285)
at io.netty.channel.AbstractChannelHandlerContext.notifyHandlerException(AbstractChannelHandlerContext.java:850)
at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:364)
at
[2018-01-26T00:00:21,306][WARN ][o.e.m.j.JvmGcMonitorService] [kib-vm06-data-2-es_data_instance] [gc][4244281] overhead, spent [3.3s] collecting in the last [3.5s]
[2018-01-26T00:00:24,358][WARN ][o.e.m.j.JvmGcMonitorService] [kib-vm06-data-2-es_data_instance] [gc][4244282] overhead, spent [3s] collecting in the last [3s]
[2018-01-26T00:00:24,360][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [kib-vm06-data-2-es_data_instance] fatal error in thread [elasticsearch[kib-vm06-data-2-es_data_instance][bulk][T#2]], exiting
java.lang.OutOfMemoryError: Java heap space
[2018-01-26T00:00:24,361][ERROR][o.e.i.e.Engine ] [kib-vm06-data-2-es_data_instance] [.monitoring-es-6-2018.01.25][0] tragic event in index writer
java.lang.OutOfMemoryError: Java heap space
[2018-01-26T00:00:24,362][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [kib-vm06-data-2-es_data_instance] fatal error in thread [elasticsearch[kib-vm06-data-2-es_data_instance][bulk][T#4]], exiting
java.lang.OutOfMemoryError: Java heap space
[2018-01-26T00:00:17,793][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [kib-vm06-data-2-es_data_instance] fatal error in thread [elasticsearch[kib-vm06-data-2-es_data_instance][search][T#3]], exiting
java.lang.OutOfMemoryError: Java heap space
at org.elasticsearch.common.util.PageCacheRecycler$1.newInstance(PageCacheRecycler.java:99) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.PageCacheRecycler$1.newInstance(PageCacheRecycler.java:96) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.DequeRecycler.obtain(DequeRecycler.java:53) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.AbstractRecycler.obtain(AbstractRecycler.java:33) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.DequeRecycler.obtain(DequeRecycler.java:28) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.FilterRecycler.obtain(FilterRecycler.java:39) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.Recyclers$3.obtain(Recyclers.java:119) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.FilterRecycler.obtain(FilterRecycler.java:39) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.PageCacheRecycler.bytePage(PageCacheRecycler.java:147) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.AbstractBigArray.newBytePage(AbstractBigArray.java:117) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.BigByteArray.resize(BigByteArray.java:143) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.BigArrays.resizeInPlace(BigArrays.java:449) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.BigArrays.resize(BigArrays.java:496) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.BigArrays.grow(BigArrays.java:513) ~[elasticsearch-6.0.1.jar:6.0.1]
Node 1
[2018-01-26T00:00:15,215][ERROR][o.e.x.m.c.n.NodeStatsCollector] [kib-vm05-data-1-es_data_instance] collector [node_stats] timed out when collecting data
[2018-01-26T00:00:20,967][WARN ][o.e.m.j.JvmGcMonitorService] [kib-vm05-data-1-es_data_instance] [gc][3224345] overhead, spent [22.7s] collecting in the last [22.7s]
[2018-01-26T00:00:30,672][WARN ][o.e.m.j.JvmGcMonitorService] [kib-vm05-data-1-es_data_instance] [gc][3224346] overhead, spent [9.6s] collecting in the last [9.7s]
[2018-01-26T00:00:30,678][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [kib-vm05-data-1-es_data_instance] fatal error in thread [elasticsearch[kib-vm05-data-1-es_data_instance][search][T#4]], exiting
java.lang.OutOfMemoryError: Java heap space
at org.elasticsearch.common.util.PageCacheRecycler$1.newInstance(PageCacheRecycler.java:99) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.PageCacheRecycler$1.newInstance(PageCacheRecycler.java:96) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.DequeRecycler.obtain(DequeRecycler.java:53) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.AbstractRecycler.obtain(AbstractRecycler.java:33) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.DequeRecycler.obtain(DequeRecycler.java:28) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.FilterRecycler.obtain(FilterRecycler.java:39) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.Recyclers$3.obtain(Recyclers.java:119) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.recycler.FilterRecycler.obtain(FilterRecycler.java:39) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.PageCacheRecycler.bytePage(PageCacheRecycler.java:147) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.AbstractBigArray.newBytePage(AbstractBigArray.java:117) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.BigByteArray.resize(BigByteArray.java:143) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.BigArrays.resizeInPlace(BigArrays.java:449) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.BigArrays.resize(BigArrays.java:496) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.common.util.BigArrays.grow(BigArrays.java:513) ~[elasticsearch-6.0.1.jar:6.0.1]
at org.elasticsearch.search.aggregations.metrics.cardinality.HyperLogLogPlusPlus.ensureCapacity(HyperLogLogPlusPlus.java:202) ~[elasticsearch-6.0.1.jar:6.0.1]
at