Extract elasticsearch "index" field from event field

surajs · November 1, 2016, 12:10am

In filebeat for Kafka output, we are able to dynamically select the topic-name using data from the an event field using something like %{[type]}.

In a similar fashion, is it possible to dynamically select the index name for an elasticsearch output using data from the event field ?

ruflin · November 1, 2016, 9:30am

It should work in 5.0 with format strings, but I never tested it TBH. Let me know if it works as expected.

steffens · November 1, 2016, 1:24pm

See index and indices settings.

surajs · November 2, 2016, 10:06pm

Thanks @steffens , @ruflin!

Another quick question around the same lines.

Would it make sense for filebeat to expose the filename that its reading from in order to determine the index name( or kafka topic for that matter).
For example,
if its currently tailing from a file "/var/log/docker/api.log" then,
include a field in the event{"event_source" : "api" } or even {"event_source" : "api.log" }along with other metadata?

I know that filebeat already exposes an absolute path in the source field, but that is not enough for determining which index / kafka topic to write to if you're doing this at scale. Also, adding a field in the event is going to require change in the way the events are logged which makes the transition to filebeat much more difficult.

What we need is something very similar to : Using filename from filebeat in index pattern

ruflin · November 3, 2016, 9:35am

It would be totally possible but for advance processing / routing I recommend to use Logstash in the middle.

The reason we expose the full path and no the filename because the file name is not necessarly unique.

surajs · November 3, 2016, 5:48pm

I totally see the worth in exposing the full path.

As more and more large scale organizations start to consider beats as their option for log-tailing, as seen in few other questions on Stackoverflow as well as the Elastic forum, this feature is going to be something that could be really helpful to add.

Additional overhead of maintaining logstash for doing simple extractions/inductions based on either fields in events or path is something that I feel will hamper the adoption of filebeat (or even worse, could potentially lead to adopters maintaining their own versions of filebeat) when used at scale.

Thoughts?

steffens · November 3, 2016, 10:06pm

One workaround would be to make use of prospector fields.

e.g. adds create a prospector per file type and set document_type accordingly or use ```

filebeat.prospectors.X.fields:
  source_type: "api"

then you can use %{[fields.source_type]}.

Using indices or topics for kafka one can use conditionals todo some more processing.

But your request makes me think about introducing some kind of template-processors/functions as supported by more common templating systems. This could look somewhat like %{[source:basename]} or %{[source]|basename}. The former only on fields extracted from events, the second potentially on other value sources. I kind of like the pipe-symbol here. Imagine %{[source]|basename|trimRight('.log')}. Well, just some idea so far. Will have to think more about this.

ruflin · November 7, 2016, 4:36pm

@surajs Are you just referring the the feature of the file name or more general processing?

surajs · November 8, 2016, 1:34am

@ruflin,

I was originally thinking to extract index/topic name using the basename as an original request.

However, what @steffens mentioned, i can totally see the value of providing a scripting interface to existing metadata.

I'd be happy to contribute on that feature should you feel that we need to add that to filebeat or need to discuss potential use-cases that this request may suffice.

system · November 22, 2016, 12:10am

This topic was automatically closed after 21 days. New replies are no longer allowed.

Topic		Replies	Views
Best practice for setting index names when using filebeats to logstash Beats filebeat	3	2195	March 4, 2018
Using filename from filebeat in index pattern Beats filebeat	2	2026	October 18, 2016
How to customize index name with multipule inputs Logstash	4	1217	March 28, 2017
Re: Filebeat > Kafka > Logstash > ElasticSearch Logstash	5	1952	September 26, 2017
How to use filebeat fields name value in logstash config Beats filebeat	5	14732	May 2, 2017

Extract elasticsearch "index" field from event field

Related topics