Rest client v5.5.0 compatibility with v5.6.x

abhijat · October 6, 2017, 7:06am

Are there any known compatibility issues between rest client v5.5.0 trying to index data into elasticsearch 5.6.2? We have been seeing, intermittently, indexing taking more than 10s.

We have our app that is using elastic rest client v5.5.0 and the cluster has recently been upgraded to v5.6.2.

warkolm · October 6, 2017, 7:12am

I haven't seen or heard of anything, that is not canon though.

Are you using Monitoring to check the status of your cluster while this happens?

abhijat · October 6, 2017, 7:14am

What exactly am I looking for on the monitoring tool? Any specific charts you want me to look at ?

warkolm · October 6, 2017, 7:17am

Merges, GC, refreshes, CPU increases.
It's hard to say because this could be anything, but look for anomalies around the time the longer requests take.

Also, you're using _bulk right?

abhijat · October 6, 2017, 7:21am

no... our use case is more like a realtime update... so no we are not using _bulk.

But our rate of writes is very low... and this is happening in a test env where we are writing data sequentially... think about an integration test running... indexing a record, reading it and then deleting it... the record size is about 1-2 KB.

dadoonet · October 6, 2017, 7:59am

Yeah. This is expected. You are basically doing a fsync on every single operation which is a costly operation.

Look at https://www.elastic.co/guide/en/elasticsearch/client/java-rest/current/java-rest-high-document-bulk.html#java-rest-high-document-bulk-processor

I love this class.

You could change that "index.translog.durability": "async" index setting at your own risk (but well it's an integration Test here): https://www.elastic.co/guide/en/elasticsearch/reference/5.6/index-modules-translog.html#_translog_settings_2

abhijat · October 6, 2017, 7:06pm

Thank you @dadoonet .

Additional FYI:

We are using ES rest client v5.5.0. But the cluster is at v5.6.2 - This is a pre-production test cluster.
When do indexing, immediately after that we invoke the refresh API as well. I know it is not the most efficient way to make data available sooner for searching but since our write rates are low, we thought we could live with this inefficiency. If there is a better way to handle this scenario then please let us know.

Based on what you have suggested and my usecase, I cannot use BulkProcesser API. In addition, if, by default, fsync is happening on each write, do I need to invoke refresh API to make the document searchable?

EDIT:
Wanted to add that the behavior I describe above in #2 is actually running live in our production cluster. We were evaluating v5.6.2 so that we can upgrade production to that newer version. For we have a bunch of integration tests that run sequentially. And during that test run, we ran into this issue. We are going to try and create new cluster with v5.5.0 to see if this issue persists or not. We had not seen this issue previously.

Thanks.

system · November 3, 2017, 7:06pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Upgrading to ES5 and Java TransporClient Elasticsearch	8	785	November 1, 2017
Bulk index with java rest client Elasticsearch	5	1295	February 14, 2018
Bulk throughput issues Elasticsearch	15	1674	July 6, 2017
Inserting documents are very slow in elastic 6.8.5 server Elasticsearch language-clients	2	422	October 27, 2020
Question on REST Client and Java API Elasticsearch	5	1599	September 13, 2017

Rest client v5.5.0 compatibility with v5.6.x

Related topics