Enable filebeat and logstash caching during unavailablity

parthmaniar · June 29, 2022, 9:23am

Hello,

I hope you and your loved ones are safe and healthy.

I am running a cluster that collects logs from sources on the internet. I need to enable caching of logs in case the next hop is not reachable as dropping logs is detrimental to my project.
Following is my architecture:

Various log sources (mostly running Linux) send logs using Filebeat to my homelab which are collected by Logstash.
A. This is where the first unavailability can occur. As I use home ISP and do not commercial agreement, there are availability issues. How do I enable caching of logs (up to 48 hours) at filebeat in case the next hop (logstash hosted in my homelab) is not available.

I have enabled deduplication in filebeat + logstash using: Deduplicate data | Filebeat Reference [8.3] | Elastic

Logstash sends logs to two Elasticsearc nodes, while I have a solid (wired) connectivity in the homelab if I want logstash to collect and store logs if ES nodes are down, how do I enable that?
My logstash instance is running on an RPi 4 (2 GB RAM) and my logs events per second (EPS) is around 700, the storage on the RPi is 128 GB with a NAS volume mounted that has 10 TB

Thank you very much.

Badger · June 29, 2022, 4:37pm

Maybe insert kafka between filebeat and logstash, and add persistent queues in logstash with enough disk space to handle the longest possible elasticsearch outage.

parthmaniar · June 30, 2022, 9:15am

Yes that could be one way:

I could host Kafka on a seperate cloud service provider (CSP)to cache the logs. Making availablity a cloud CSP ownership. However,
instead of the additional cost is there no way to increase caching at Filebeat?
As in if Filebeat cannot reach Logstash for up to 48 hours it keeps the logs? From the linux configuration perspective the logs are on the disk until 7 days and I can easily keep it for 30 days.

Badger · June 30, 2022, 4:41pm

A question about increasing caching in filebeat should be in the filebeat forum, not the logstash forum.

parthmaniar · June 30, 2022, 6:25pm

Sure, I'll repost it there and keep this one for Logstash? Thank you very much.

system · July 28, 2022, 6:25pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Enable filebeat caching during unavailablity Beats filebeat	5	704	August 20, 2022
Is there a time limit for Filebeat or Logstash to attempt to retransmit? Elasticsearch	6	577	October 4, 2022
Question: what happens when Logstash is unavailable for long? Beats filebeat	2	591	March 12, 2020
Increasing throughput from Filebeat to Logstash Beats filebeat	1	1191	November 1, 2019
Logstash - What happen if logstash can't contact elasticsearch node Logstash	2	546	April 8, 2020

Enable filebeat and logstash caching during unavailablity

Related topics