I've been experimenting with my 3-node cluster with a focus on pushing ingestion performance. The data set I use has 3 million lines, ~840MB, going in through Logstash. Although I get decent performance ingesting into an empty index, ~6k lines/second, ingestion slows down as the number of documents in the index grows. After a while I see entries in the Logstash log file indicating the Elasticsearch ingestion endpoint is not responding. Looking at the monitoring tab and running REST query calls with Postman, I see the number of segments fluctuates constantly, seemingly indicating frequent segment merging, which I thought can get expensive as the segments grow in size and cause Elasticsearch to throttle ingestion.
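For reference, these are roughly the kinds of calls I've been running from Postman (shown here as curl; `my_index` is a placeholder for my actual index name, and the cluster is assumed to be on `localhost:9200`):

```shell
# List segments per shard, to watch the segment count fluctuate as merges happen
curl -s "http://localhost:9200/_cat/segments/my_index?v"

# Merge stats for the index -- in particular total_throttled_time_in_millis,
# which would confirm whether merging is actually throttling ingestion
curl -s "http://localhost:9200/my_index/_stats/merge?pretty"

# Hot threads per node, to see whether merge threads are dominating CPU/IO
curl -s "http://localhost:9200/_nodes/hot_threads"
```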
I do have my 3 hosts on VMs sharing a spinning disk managed by VMware ESXi, but before I try switching to an SSD datastore, does anybody have any suggestions on how I can debug on the Elasticsearch side and narrow down or confirm the cause of my slowing ingestion performance?