I'm running a single-node Elasticsearch test environment and generating indices with the pattern prefix-[type]-yyyy.mm.dd.
Information is generated in log files, which are sent with Filebeat to Logstash for processing.
There are different values for [type], so each day I get several indices.
Today I realized that I had not had any new information in Elasticsearch since yesterday. Looking in the Logstash log files, I saw that when it sent the information to Elasticsearch, the latter had reached the maximum number of shards and was not able to create a new shard.
I then increased the value of cluster.max_shards_per_node and the new information coming from Filebeat got indexed; however, the logs that had failed to be indexed during the night were not indexed.
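For anyone hitting the same limit, the setting can be changed at runtime through the cluster settings API, along these lines (a sketch; the value 2000 and the localhost address are just examples, and you can use "persistent" instead of "transient" if you want the change to survive a full cluster restart):

```shell
# Raise the shard limit at runtime. "transient" means the change is
# lost on a full cluster restart; the default limit is 1000 shards
# per data node. 2000 below is only an example value.
curl -X PUT "localhost:9200/_cluster/settings" \
  -H 'Content-Type: application/json' \
  -d '{ "transient": { "cluster.max_shards_per_node": 2000 } }'
```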
Isn't Logstash supposed to retry if it cannot reach the output? Or, since in this case the output was reachable but Elasticsearch returned an error, does Logstash not try to index the information again?
If Elasticsearch was non-responsive I would expect Logstash to queue. If Elasticsearch returned an error then you would lose events (unless you have a dead letter queue (DLQ) configured and it is a retryable error).
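Enabling the DLQ is done in logstash.yml; a minimal sketch (the path below is an example):

```yaml
# logstash.yml -- enable the dead letter queue so events that the
# elasticsearch output cannot deliver are written to disk instead of
# being dropped. The path is an example; it must be writable by Logstash.
dead_letter_queue.enable: true
path.dead_letter_queue: /var/lib/logstash/dlq
```

Events captured there can later be replayed with the dead_letter_queue input plugin.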
Having a large number of small shards is inefficient and can cause performance and stability problems. I would recommend reading this blog post and looking to reduce the number of shards in the cluster, e.g. by switching from daily to weekly or monthly indices. The limit is there for a reason and is set quite high, so just increasing it is not a great solution.
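Switching to monthly indices is typically just a change to the index option of the elasticsearch output in the Logstash pipeline; a sketch (the prefix, host, and field name are examples, not your actual config):

```
output {
  elasticsearch {
    hosts => ["localhost:9200"]
    # Monthly instead of daily: +yyyy.MM rather than +yyyy.MM.dd,
    # so one index per [type] per month instead of per day.
    index => "prefix-%{[type]}-%{+yyyy.MM}"
  }
}
```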
Thanks Christian. I had read that document while researching the original problem, and I'm changing the way indices are generated so that they are monthly. After I removed some indices, I set the parameter back to 1000.