Improve ElasticSearch Read Cache

strfx · September 21, 2015, 9:05am

Hi there

I'm doing heavy bulk updating on my elasticsearch cluster on a daily base. While is usually works fine on a fresh index (50k updates/sec), after a few bulks the process slows down terribly (1-5k updates/sec). Investigation shows that Elasticsearch keeps hitting the disk for reads (~1'200 K/s reads per process) - I guess it is checking for the existence of the document to be updated.

Is there a way to improve the read cache, so it doesn't has to hit the disk so frequent?

Specs:
24 cores, 36 GB RAM (24GB Heap Size, so 12GB are left for the OS disk cache)

warkolm · September 21, 2015, 10:05am

Move to SSDs would be one

otisg · September 21, 2015, 7:33pm

Hi,

Maybe you can make your heap a little smaller and give the OS a bit more RAM and see if that makes a difference?

Otis

Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Elasticsearch Consulting & Support * http://sematext.com/

Topic		Replies	Views
High disk read on one node out of 3 Elasticsearch	1	912	September 18, 2018
Using elasticsearch as a cache with frequent updates Elasticsearch	1	873	November 16, 2015
Elasticsearch : Hight disk read + slow indexing Elasticsearch	0	415	March 11, 2020
Scaling elastic search for read heavy applications Elasticsearch language-clients	11	1992	November 11, 2022
Clear Cache After Bulk Insertion Elasticsearch	8	1021	December 12, 2017

Improve ElasticSearch Read Cache

Otis

Related topics