Compress Elasticsearch bulk request using Java high-level REST client and bulk processor

Is there any way to compress the Elasticsearch bulk request payload using the Java high-level REST client and the bulk processor?

I have tried the following:

requestConfigBuilder.setContentCompressionEnabled(true);           

But it does not seem to be compressing the request payload.
Do I need to enable any other settings?

Any inputs?

Hey, any chance you could create a code snippet that can be tested in an IDE and that follows the steps from your setup?

Thank you!

RestClientBuilder restClientBuilder = RestClient.builder(httpHosts);

SniffOnFailureListener sniffOnFailureListener = new SniffOnFailureListener();
if (!config.isEsNodeSniffingDisabled()) {
    restClientBuilder.setFailureListener(sniffOnFailureListener);
}

restClientBuilder.setHttpClientConfigCallback(httpClientBuilder -> {
    httpClientBuilder
            .setDefaultConnectionConfig(ConnectionConfig.custom().setCharset(StandardCharsets.UTF_8).build());
    return httpClientBuilder;
});

restClientBuilder.setRequestConfigCallback(requestConfigBuilder -> {
    requestConfigBuilder.setSocketTimeout(config.getEsSocketTimeout() * 1000);
    requestConfigBuilder.setContentCompressionEnabled(true);
    requestConfigBuilder.setConnectTimeout(config.getEsConnectionTimeout() * 1000);
    requestConfigBuilder.setConnectionRequestTimeout(config.getEsConnectionTimeout() * 1000);
    return requestConfigBuilder;
});

return new RestHighLevelClient(restClientBuilder);

Just to give more context, we were trying to do a load test on an ES cluster (with 150 data nodes) deployed on AWS EC2 instances. We were not able to get the desired ingestion throughput despite making all the recommended settings for better indexing performance.
Then we saw that the problem was with the network bandwidth. The AWS i3.2xlarge instances come with up to 10 Gbit/s of network bandwidth, and this network was saturated.
So we were not able to ingest the data at a higher throughput. I was thinking we could improve the throughput by compressing the bulk payload, but I could not find any documentation on how to compress the payload using the Java high-level REST client. Any inputs would be really helpful.

I believe that the only effect of this line is to indicate to Elasticsearch that the client will accept a compressed response. If you want the client to compress the requests too you should call RestClientBuilder#setCompressionEnabled instead.

Note that compression takes substantial extra CPU effort. An i3.2xlarge only has 8 CPUs, I think you'd need several times that number to reach 10Gbps of throughput.
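For anyone landing on this thread later: with elasticsearch-rest-client 7.10.0 or newer, request compression is enabled on the `RestClientBuilder`, not on the `RequestConfig`. A minimal sketch (the class name and hosts are placeholders, not from the setup above):

```java
import org.apache.http.HttpHost;
import org.elasticsearch.client.RestClient;
import org.elasticsearch.client.RestClientBuilder;
import org.elasticsearch.client.RestHighLevelClient;

public class CompressedClientFactory {

    // Builds a high-level client that gzip-compresses outgoing request bodies,
    // including bulk payloads. Requires elasticsearch-rest-client >= 7.10.0.
    public static RestHighLevelClient create(HttpHost... hosts) {
        RestClientBuilder builder = RestClient.builder(hosts)
                .setCompressionEnabled(true);
        return new RestHighLevelClient(builder);
    }
}
```

Note this replaces the `setContentCompressionEnabled(true)` call in the request-config callback, which only controls whether the client accepts compressed responses.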

Thank you @DavidTurner ,
Let me try this and see if I see any improvement in throughput. Thank you!

@DavidTurner ,
I see that this method

RestClientBuilder#setCompressionEnabled

is available only from elasticsearch-rest-client version 7.10.1.
But we are running elasticsearch server version 7.8.0.
Is it possible to use rest client version 7.10.1 to compress bulk request and ingest into elasticsearch server 7.8.0?

Ah yes, it was introduced in 7.10.0 (see below). I don't know another way to do it with the high-level REST client, although of course it's pretty simple to use a bare HTTP client to send bulk requests with compression.
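To illustrate the bare-HTTP-client route for clients stuck below 7.10.0: gzip the NDJSON bulk body yourself and send it with a `Content-Encoding: gzip` header. A rough JDK-only sketch (the endpoint URL and body are placeholders, not from this thread):

```java
import java.io.ByteArrayOutputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPOutputStream;

public class GzipBulk {

    // Gzip the NDJSON bulk body so it can be sent with Content-Encoding: gzip.
    public static byte[] gzip(String body) throws Exception {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(buf)) {
            gz.write(body.getBytes(StandardCharsets.UTF_8));
        }
        return buf.toByteArray();
    }

    // Sketch: POST a compressed bulk request to a node; returns the HTTP status.
    public static int sendBulk(String endpoint, String ndjsonBody) throws Exception {
        byte[] compressed = gzip(ndjsonBody);
        HttpURLConnection conn = (HttpURLConnection) new URL(endpoint).openConnection();
        conn.setRequestMethod("POST");
        conn.setDoOutput(true);
        conn.setRequestProperty("Content-Type", "application/x-ndjson");
        conn.setRequestProperty("Content-Encoding", "gzip");
        try (OutputStream out = conn.getOutputStream()) {
            out.write(compressed);
        }
        return conn.getResponseCode();
    }
}
```

The server side needs `http.compression: true` (the default) for Elasticsearch to accept gzip-encoded request bodies.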

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.