Elasticsearch and reliability

dollboot · March 1, 2020, 5:51pm

For a while i've been using Elasticsearch and I have a few questions regarding reliability, replication and ingest processors.

For this example we have the following setup:
3 Elasticsearch nodes in a cluster (master + data + ingest on every node)
1 Logstash

When you send a document to Elasticsearch, with Logstash, it will initially arrive in memory.

What happens if in this exact moment the node, that receives the document, will die?
Even if the connection is TCP is this also on the application level, does this mean that logstash will re-send the message or can it happen that a document will get lost?
How are ingest processors implemented, before or after replication?
When an ingest node dies could there be a possibility that a received document will be lost, or does Logstash just resend the document?

system · March 29, 2020, 5:51pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Clustering data consistency question Elasticsearch	3	363	January 14, 2020
Logstash to Elasticsearch resilience Elasticsearch	5	231	April 29, 2022
Huge concurrent data ingestion to ElasticSearch Elasticsearch	16	2829	September 18, 2018
Improve Logstash data resiliency Logstash	5	498	December 17, 2021
Client/Ingest node buffer/backpressure handling Elasticsearch	2	979	December 28, 2016