Send parts of a message to different Elasticsearch indices from a single Logstash instance

powerful_clouds · November 15, 2022, 12:06pm

Suppose my message has fields A and B. I want to separately send each of the fields to the same ES instance, but to different indices. Is that possible?
E.g. if this is the message: {"A": some text, "B": some more text}, I want "some text" to end up in Index_1, and "some more text" in Index_2. Both indices are on the same host. I don't know if this is important, but the B field only gets added to the message in the filter section.

If only the elasticsearch output plugin had the ability to parse the message like other plugins do, I could just extract the desired field in each of the elasticsearch output blocks. However, that doesn't seem to be possible.

Of course, what I'm describing can be achieved using two Logstash instances, but can it be achieved using only one?

Note: I'm looking for a solution that can be put into a Logstash pipeline.conf file.

Christian_Dahlqvist · November 15, 2022, 12:24pm

You can use a clone filter to create two events that you can then format separately and send to different indices.

Krishna_Teja · November 15, 2022, 12:34pm

Hello powerful_clouds

Have you tried _reindex with filter?

Something like

POST /_reindex
{
  "source": {
    "index": "SOURCE INDEX",
    "query": {
      "match": {
        "A": "some text"
      }
    }
  },
  "dest": {
    "index": "DESTINATION_INDEX"
  }
}


POST /_reindex
{
  "source": {
    "index": "SOURCE INDEX",
    "query": {
      "match": {
        "B": "some more text"
      }
    }
  },
  "dest": {
    "index": "DESTINATION_INDEX"
  }
}

Regrads
Krishna

leandrojmp · November 15, 2022, 1:50pm

As @Christian_Dahlqvist already said, this can be done using the clone filter.

You would need something like this.

filter {
    clone {
        clones => ["index-field-b"]
    }
    if [type] == "index-field-b" {
        mutate {
            remove_field => ["A"]
        }
        mutate {
            ... logic that you use to add field B ...
        }
    }
}
output {
    if [type] == "index-field-b" {
        elasticsearch {
            ... output to the index  for the field B
        }
    } else {
        elasticsearch {
            ... output to the index for the field A
        }
    }
}

You just need to check if you have pipeline.ecs_compatibility enabled for your pipeline or not as this will change the behavior of the filter, you can check the documentation for more examples, but basically with pipeline.ecs_compatibility set to v1 or v8, you will need to use the tags field in the conditional.

Something like:

if "index-field-b" in [tags]

powerful_clouds · November 15, 2022, 4:14pm

@Christian_Dahlqvist @leandrojmp Thank you very much for your help. I've managed to get it working.
There's one thing I wanted to note: I noticed that the number of docs in each of the ES indices was double what I was expecting. I looked at the documents in the B index and saw that half of them had the desired field (e.g. a field of type string called desired_b_field), whereas half of them didn't. After some googling, I realized that the simplest solution is to add an and part to the if in the output section in order to check that the desired field is not null. E.g. for messages of type B:

if "index-field-b" in [tags] and ("" in [desired_b_field]) {
...
}

If you have any additional info on why that duplication happened, feel free to share it here. I'm sure people will find it useful.

Also, @Krishna_Teja, thanks for your proposed solution. Although it's not what I was looking for, I'm sure others will find it useful.

system · December 13, 2022, 4:14pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Send to multiple elasticsearch index with different fields Logstash	4	553	September 13, 2017
How to duplicate message in 2 elasticsearch index Logstash	1	772	July 6, 2017
Send all data to one index and specific fields to another at same time Logstash	3	860	May 12, 2022
Two outputs with different data Logstash	4	532	May 20, 2018
Split event into multiple different messages Logstash	5	1854	July 27, 2018

Send parts of a message to different Elasticsearch indices from a single Logstash instance

Related topics