Elasticsearch + Deepstream connection no ACK after a couple hours

danielleushuis · May 29, 2017, 7:59am

Dear Elasticsearch forum users,

We are using four droplets right now at Digitalocean:

An elasticsearch droplet (8GB ram, 4 CPU's)
A deepstream droplet (512MB ram, 1 CPU)
A redis droplet (512MB ram, 1 CPU)
A java Tomcat droplet (1GB ram, 1 CPU)

The flow would be as following: a request is received by deepstream by either the java Tomcat droplet or our Android app, and data is send or retrieved by redis and elasticsearch.

Saving data and retrieving data is really easy this way and fast. Especially when using the subscribe methods of Deepstream to sync data client side instantly.

Now to our problem, with a single deepstream client connection, we do receive the message "No ACK message received in time for eventPrefix/." after a while. The connection no longer receives update from the subscribe path but we can still make requests on the getRecord or has path.

We did see great improvements by expanding the Elasticsearch droplet resources. We started from 512MB ram Elastcisearch heap size to 2GB heap size to (now) 4GB heap size. With 512MB ram received the "No ACK" message instantly, with 2GB heap size after about 30 seconds and with 4GB heap size after about 5 minutes.

Also, the java Tomcat droplet will get stuck on an "has" command of deepstream. The server simply doesn't respond and the script hangs on that call.

We did disable swapping of the Elasticsearch server by setting the following:

MAX_LOCKED_MEMORY=unlimited
bootstrap.mlockall: true
And ofcourse with our 8GB system memory the ES_HEAP_SIZE=4g

Question: Will expanding the Elasticsearch resources (especially more RAM) help to fix our issues we are facing right now? Is there anything that comes to your mind that we might try?

Kind regards,

Daniel Leushuis

warkolm · May 30, 2017, 5:36am

It's not really clear what the problem is here.

You have to assume we have no idea of what Deepstream is, what it does, or what any of these concepts mean.
Are you able to explain them?

danielleushuis · May 30, 2017, 7:54pm

Hi Warkolm,

Thanks for responding. Allow me elaborate about Deepstream.

Deepstream allows you to quickly retrieve and sync data with a storage (in this case Elasticsearch). We use three core commands of deepstream:

getRecord, to retrieve data from a record or write data to it (https://deepstream.io/docs/client-java/RecordHandler/ - https://deepstream.io/docs/client-java/Record/)
has, to check if a record exists (https://deepstream.io/docs/client-java/RecordHandler/)
subscribe, to sync data as soon as something changes on the record (https://deepstream.io/docs/client-java/Record/)

Now, we have a tomcat server running a script that goes on forever, the script updates about 50 records per minute (shouldn't be too much). The script has a routine, the routine starts with the "has" command. After about 24 hours on average, the script is stuck on that "has" command. We have about 8 threads running and all of them gets stuck at the same time on that command. It seems like there is no answer from the storage anymore. While the tomcat server script is stuck, we can still use calls from our Android app.

Now to our Android app, upon subscribing to a path, a record and an update it written to that specific path we subscribed to, we receive the following messages:

No ACK message received in time for SUBSCRIBE {RECORD_NAME}
No message received in time for READ {RECORD_NAME}

After this, every record subscribed to no longer receives any update and we can no longer use getRecord or has or whatsoever on the same Deepstream client object. The connection seems dead.

I think there is a connection between the "No ack in time" and our back-end getting stuck on the "has" command.

Do you have any ideas on this issue?

Kind regards,

Daniel Leushuis.

warkolm · May 31, 2017, 12:12am

What is happening in ES at the time this hangs?

system · June 28, 2017, 12:12am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Need help/suggestion for the massive queries user case Elasticsearch	16	562	July 6, 2017
Getting Timeout Exceptions with Elasticsearch Elasticsearch	12	10353	July 6, 2017
Gc overhead reduces ElasticSearch Performance Elasticsearch	14	13509	September 22, 2018
Errors while doing bulk update, Am I doing this wrong? Elasticsearch	10	1073	July 5, 2017
Our Elastic search server can not serve many connections for a long time Elasticsearch	1	704	July 6, 2017

Elasticsearch + Deepstream connection no ACK after a couple hours

Related topics