Elasticsearch 5.1.1 keeps dying after 20 minutes

Hi there,

I hope somebody can help me.

I have an ELK instance on Ubuntu 16.04 (2 GB of RAM, 30 GB of HDD). I can set up visualisations, dashboards and all.

BUT the Elasticsearch instance keeps dying after 20 minutes or so (around 6,000 rows of input).

I've tried adding the memlock settings to /etc/security/limits.conf:

elasticsearch soft memlock unlimited
elasticsearch hard memlock unlimited

still no luck.
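For reference, my understanding is that those limits.conf entries only matter if memory locking is also requested on the Elasticsearch side, and that a service started by systemd takes its limit from a unit override rather than from limits.conf. A rough sketch, assuming the default Ubuntu package paths:

# /etc/elasticsearch/elasticsearch.yml -- ask Elasticsearch to lock the heap in RAM
bootstrap.memory_lock: true

# /etc/systemd/system/elasticsearch.service.d/override.conf -- raise the limit for the systemd unit
[Service]
LimitMEMLOCK=infinity

# reload and restart, then check that "mlockall" is reported as true
sudo systemctl daemon-reload && sudo systemctl restart elasticsearch
curl -s 'http://localhost:9200/_nodes?filter_path=**.mlockall&pretty'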

Can anyone point me to where I should start debugging? I've tried checking /var/log/elasticsearch/elasticsearch.log, but there is nothing much about the "dying" part; I can only see that it started.

Cheers,

Nathan

Hi, how much of that 2 GB of RAM is allocated in /etc/elasticsearch/jvm.options?

You say the Elasticsearch instance keeps dying: what exactly are you experiencing on the operating system itself?

Cheers,

By dying, do you mean the process is killed or unresponsive?
Does it only fail when you are feeding it new docs?
Are you using any unusual plugins? (e.g. I remember reading Zookeeper can call System.exit when unhappy).

Plugins cannot call System#exit; we have the permissions for that locked down now.

Hi Jymit,

I've checked /etc/elasticsearch/jvm.options and it shows:

-Xms1g
-Xmx1g

Is that enough?

The Elasticsearch process just stops running after a few minutes (15-20 minutes). The other processes (Kibana, Nginx, Logstash) are still running fine.

Here is a screenshot of Kibana:

Hi Mark,

Correct me if I'm wrong, but it seems the process is being killed:

● elasticsearch.service - Elasticsearch
   Loaded: loaded (/usr/lib/systemd/system/elasticsearch.service; enabled; vendo
   Active: failed (Result: signal) since Tue 2017-01-03 05:01:55 UTC; 22h ago
     Docs: http://www.elastic.co
  Process: 1461 ExecStart=/usr/share/elasticsearch/bin/elasticsearch -p ${PID_DI
  Process: 1422 ExecStartPre=/usr/share/elasticsearch/bin/elasticsearch-systemd-
 Main PID: 1461 (code=killed, signal=KILL)

Yes, it only fails when I am feeding new docs.

The only plugin I have is Timelion, which was installed by default.

Since you're running Elasticsearch with a 1 GB heap on a machine with 2 GB of RAM, I suspect that your instance is being killed by the OS OOM killer. Check your kernel logs.
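Something along these lines should show whether the OOM killer is the culprit (exact log locations vary by distribution, so adjust as needed):

# recent kernel messages mentioning the OOM killer
dmesg -T | grep -i -E 'oom|killed process'

# or search the kernel log file directly on Ubuntu
grep -i -E 'oom|killed process' /var/log/kern.log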

Hi Jason,

I checked my /var/log/kern.log:

[ 1295.629750] node invoked oom-killer: gfp_mask=0x24201ca, order=0, oom_score_adj=0
[ 1295.629876]  [<ffffffff81192722>] oom_kill_process+0x202/0x3c0
[ 1295.630149] Out of memory: Kill process 1461 (java) score 679 or sacrifice child
[ 1295.631466] Killed process 1461 (java) total-vm:3642564kB, anon-rss:1367924kB, file-rss:21488kB

Looks like you are right. Do you have any suggestions on what I should do?
Decrease the heap or increase my RAM?

Cheers!

The immediate problem is running Elasticsearch, Logstash, Kibana, and nginx on a machine with 2 GB of RAM. Even if you drop the heap by half you're likely to still have trouble, and then you're more likely to run into heap space issues in Elasticsearch. I think you need to either move some of those other processes off this host, or get more RAM.
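If you do want to experiment with a smaller heap in the meantime, it's the same two flags you already found in /etc/elasticsearch/jvm.options, and they should be kept equal to each other, for example:

-Xms512m
-Xmx512m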

Hi Jason,

I am going to bump it to 4 GB of RAM; I hope this will help.

Would you know how much RAM is normal for an ELK stack on a single machine?

Cheers!

Hi, this would very much depend on what you are expecting to use this server for.
If this is a server purpose-built for testing 5.1.1, then the resources you have may well suffice. It all goes hand in hand with what you are looking to achieve here.

Hi Jymit,

In fact, I was thinking of making this a production ELK server. How do you normally judge the server requirements for ELK? Based on the number of docs coming in, or something else?

Cheers!

Running all three on a single machine with only 4 GB might be too much, especially combined with an nginx server (it really depends on your use-case though). Elasticsearch loves the filesystem cache, but if all the memory not dedicated to the Elasticsearch heap is going to other processes, there is not going to be any room left over for the filesystem cache.
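A quick way to see what is actually left over for the filesystem cache once everything is running is plain old free; the buff/cache and available columns are the interesting ones:

free -m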

Hello Jason,

I've just tried reducing the heap (-Xms) to 750 MB and, fortunately (fingers crossed), the server has been running fine for a few days now. I am feeding it around 40k hits every 15 minutes or so.
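In case it helps anyone else, that is just a matter of editing the heap flags in /etc/elasticsearch/jvm.options, for example (both values are normally kept equal):

-Xms750m
-Xmx750m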

Thank you for your help!

You're very welcome.
