ElasticSearch configuration for high performance

HARSHAL_CHAUDHARI · November 13, 2018, 2:29pm

Hi,

We have developed .NetCore WebApi Application using NEST library for performing CRUD (Create, read, update, delete) Operation in ElasticSearch.

We have setup ElasticSearch with Ingest plug-in on kubernetes cluster with HeapSize 2gb (On cloud).

Goal: Add/Push 100,000 documents (Per Document Size: 13MB to 15MB) in ElasticSearch in 10 - 15 Minutes

Could you please suggest the ideal ElasticSearch configuration or ElasticSearch configuration for high performance for above requirement.

Thanks in Advanced.

Christian_Dahlqvist · November 13, 2018, 2:37pm

Indexing can be CPU intensive and even more so if you are using ingest node. Given the size of your documents and the speed at which you want to index this, the cluster sounds quite small. How many CPU cores do you have? What type of storage?

What level of throughput are you seeing with the current setup? What is limiting performance?

jeroen1 · November 13, 2018, 4:20pm

Optimize your mappings: fewer analyzers = (way) faster.
Use as many shards as CPU cores available (system uses 1 core / shard).
Prefer fewer faster CPU cores (and thus shards see above) over more slower cores (more efficient storage and searches).
Use local SSD storage.

HARSHAL_CHAUDHARI · November 13, 2018, 4:35pm

Thanks for your reply,

How many CPU cores do you have?
We deployed on IBM kubernetes cluster and we have 3 worker nodes each node have 8 Cores 32 GB RAM

What type of storage?
We have used IBM storage volume.beta.kubernetes.io/storage-class: ibmc-block-silver

What level of throughput are you seeing with the current setup?
Speed is adding the 1000 document (Document size 13 to 15MB) per hour

What is limiting performance?
it is taking time to adding document, observing the running process.

Please suggest us on high performance kubernetes Elasticsearch configuration.

HARSHAL_CHAUDHARI · November 13, 2018, 4:37pm

Thanks for your reply.

My Configuration as below

We deployed on IBM kubernetes cluster and we have 3 worker nodes each node have 8 Cores 32 GB RAM
We have used IBM storage volume.beta.kubernetes.io/storage-class: ibmc-block-silver

Please let us know Elasticsearch on Kubernetes configuration for High performance.

Christian_Dahlqvist · November 13, 2018, 4:42pm

I would recommend looking at the following resources:

https://www.elastic.co/guide/en/elasticsearch/reference/6.4/tune-for-indexing-speed.html#tune-for-indexing-speed

https://www.elastic.co/guide/en/elasticsearch/reference/6.4/tune-for-disk-usage.html

Then run tests and try to identify what system resource that is limiting performance, e.g. CPU and/or disk I/O. I generally index a lot smaller documents, so am not sure how to best tune for your particular use-case.

If I am calculating correctly, that is about 1.33TB of raw data. If that is the case you will most likely need a lot larger cluster to be able to ingest that in 15 minutes...

HARSHAL_CHAUDHARI · November 14, 2018, 11:36am

Thanks for reply.

system · December 12, 2018, 11:36am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Performance issues when indexing data Elasticsearch docker	7	2442	September 29, 2022
Improving log ingestion speed and faster elasticsearch indexing Elasticsearch	6	1037	November 1, 2021
Elastic search AWS EC2 cluster indexing performance is decreased compared to single node performance Elasticsearch	5	2840	July 6, 2017
Doubts about hardware requirements for elasticsearch Elasticsearch	11	1525	July 3, 2020
How to Insert 50Million documents per 30sec in elasticsearch cluster? Elasticsearch docker	7	1614	October 8, 2023

ElasticSearch configuration for high performance

Related topics