Target my ES cluster with ~40 FileBeat instances

Yarden_Bar · July 31, 2016, 2:20pm

Hello,
Last year I found that Logstash doesn't load-balance between its inputs and 'prefers' inputs that have no pushback mechanism, or for which pushback is not relevant (like the syslog input).

I'm thinking about pointing my whole FileBeat army (around 40 instances) directly at our ES cluster, shipping JSON-lines formatted logs through FileBeat's ES output.
Existing Logstash instances will still process logs that can't be shipped directly to ES.

The root cause is that there's a lot of contention on our Logstash instances, which I'd like to minimize.

Can anyone comment on her/his experience with pointing FileBeat directly at an ES cluster?

Thank you,
Yarden

Yarden_Bar · July 31, 2016, 2:59pm

It is mentioned here that FileBeat specifically ignores the max_retries configuration option.

On one hand, it's good that it'll keep trying to ship an event if ES returns a failure response.
On the other hand, what if the failed event has an incorrect structure that doesn't match the index mapping? Will it keep trying to ship the broken event, or will it fail on the ES side with an error logged to file?

ruflin · August 2, 2016, 1:59pm

Are you sending JSON logs to Elasticsearch, or just the basic default content? If the latter, I would not expect any events that don't match the mapping. What is the structure of your event?

Yarden_Bar · August 2, 2016, 2:37pm

My logs are formatted as valid JSON (one event per line), and the Logstash filter that handles those events only renames some fields. (Today I implemented the rename operation in our application, so the filter is actually redundant.)

The index template on the ES side has the relevant mapping to accommodate the events I'm going to shoot at it.

Event example:
{"method":"GET","request":"/notifications","format":"js","controller":"user_notifications","action":"new_notification_count","status":200,"duration":187.01,"view":0.55,"db":8.61,"true_ip":"XXX.XXX.XXX.XXX","route":"user_notifications#new_notification_count","request_id":"SOME_REQUEST_ID","type":"rails","user_id":"123456","source":"unknown","tags":["request"],"@timestamp":"2016-08-01T16:28:22.171Z","@version":"1"}
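Since the worry is an event whose structure doesn't match the index mapping, a small pre-check over the log lines can catch obvious mismatches before anything is shipped. This is only a sketch: `EXPECTED_TYPES` and `check_line` are hypothetical names, and the types listed are assumptions read off the example event, not the actual index template.

```python
import json

# Hypothetical expectations for a few fields, guessed from the example event.
# The real index template may map these fields differently.
EXPECTED_TYPES = {
    "status": int,
    "duration": (int, float),
    "user_id": str,
    "tags": list,
}


def check_line(line, expected=EXPECTED_TYPES):
    """Return a list of problems found in one JSON log line (empty if none)."""
    try:
        event = json.loads(line)
    except ValueError as exc:  # json.JSONDecodeError is a ValueError subclass
        return ["invalid JSON: %s" % exc]
    if not isinstance(event, dict):
        return ["top-level value is not a JSON object"]

    problems = []
    for field, types in expected.items():
        if field not in event:
            continue  # missing fields are not treated as an error here
        value = event[field]
        # bool is a subclass of int in Python, so reject it explicitly;
        # none of the expected types above is bool.
        if isinstance(value, bool) or not isinstance(value, types):
            problems.append(
                "field %r has unexpected type %s" % (field, type(value).__name__)
            )
    return problems
```

Running `check_line` over each line of a log file and printing the non-empty results gives a rough idea of how many events might be rejected on the ES side.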

Thank you,
Yarden

steffens · August 2, 2016, 5:49pm

Whether an event is retried depends on the status code returned by Elasticsearch.

See status code check. If the event is dropped, the warning message "Can not index event ..." will be logged.
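As a rough illustration of that kind of check, here is a Python sketch. The thresholds are an assumption about how the output treats per-item statuses (success below 300, 429 and 5xx retried, other 4xx dropped), not a copy of the Beats code; `classify_bulk_item` and `split_bulk_result` are hypothetical names.

```python
def classify_bulk_item(status):
    """Return 'ok', 'drop', or 'retry' for one per-event status code.

    Assumed rules (not taken from the Beats source):
      - below 300: indexed successfully
      - 429 or 5xx: temporary problem on the cluster side, retry
      - any other 4xx (e.g. 400 for a mapping conflict): retrying cannot help
    """
    if status < 300:
        return "ok"
    if status < 500 and status != 429:
        return "drop"
    return "retry"


def split_bulk_result(events, statuses):
    """Pair each event with its status; return (to_retry, dropped) lists."""
    to_retry, dropped = [], []
    for event, status in zip(events, statuses):
        outcome = classify_bulk_item(status)
        if outcome == "retry":
            to_retry.append(event)
        elif outcome == "drop":
            # This is the point where a "Can not index event ..." style
            # warning would be logged.
            dropped.append(event)
    return to_retry, dropped
```

Under these assumptions, an event rejected for not matching the mapping would be dropped with a warning rather than retried indefinitely, while events rejected because the cluster is overloaded would be sent again.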

Yarden_Bar · August 2, 2016, 6:02pm

Thank you @steffens,

I'll experiment by pointing 1, 2, 3, ... N FileBeat instances at the ES cluster and report my findings.

Yarden

system · August 21, 2016, 2:20pm

This topic was automatically closed after 21 days. New replies are no longer allowed.
