Logstash elasticsearch input plugin duplicates logs

Hi,
I use Logstash 5.6.8 with the configuration below to forward logs from Elasticsearch to a syslog server. I have scheduled the input plugin to read every minute, but I can see the same old logs being read every minute and sent to the syslog server. How can I avoid this duplication of logs?

    input {
      elasticsearch {
        id       => "_logs"
        hosts    => "localhost:9200"
        index    => "_audit_logs-*"
        query    => '{ "query": { "query_string": { "query": "*" } } }'
        size     => 500
        scroll   => "5m"
        docinfo  => false
        schedule => "* * * * *"
      }
    }

    output {
      syslog {
        host => "localhost"
        port => 601
      }
    }

Thanks

The elasticsearch input doesn't have any functionality for skipping already processed documents, so there's no simple way of avoiding duplicates with the design you've chosen.
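If the documents carry a timestamp, one partial workaround is to restrict the query to roughly the last scheduling interval, so each run only picks up documents indexed since the previous run. A minimal sketch, assuming the documents have an @timestamp field (the field name is an assumption; use whatever your documents actually carry):

    input {
      elasticsearch {
        hosts => "localhost:9200"
        index => "_audit_logs-*"
        # Fetch only documents stamped within the last minute, matching
        # the one-minute schedule below. "@timestamp" is an assumed field.
        query => '{ "query": { "range": { "@timestamp": { "gte": "now-1m" } } } }'
        schedule => "* * * * *"
      }
    }

Be aware that the window edges are fragile: documents that arrive late or land exactly on a boundary can still be missed or fetched twice, so this reduces duplication rather than guaranteeing exactly-once delivery.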

How do the documents end up in ES? Would it be possible to hook into the pipeline earlier on?

Logs are provided by a team in Elasticsearch from multiple sources, and I do not have control over this, but I am allowed to read from Elasticsearch. So I am trying to use Logstash to read from Elasticsearch and forward the logs to a syslog server.

Is there nothing like the sincedb for the file input, which keeps track of the last record read?

> Logs are provided by a team in Elasticsearch from multiple sources, and I do not have control over this, but I am allowed to read from Elasticsearch. So I am trying to use Logstash to read from Elasticsearch and forward the logs to a syslog server.

That's a flawed architecture. Don't use ES as a message-passing mechanism.

> Is there nothing like the sincedb for the file input, which keeps track of the last record read?

No, the elasticsearch input has no equivalent of the file input's sincedb.
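That said, if whatever receives the syslog stream can deduplicate, you can enable docinfo so each event carries the source document's _id in its metadata, and expose it as a field. A sketch; the es_doc_id field name is made up for illustration:

    input {
      elasticsearch {
        hosts   => "localhost:9200"
        index   => "_audit_logs-*"
        # Copy each hit's _index, _type and _id into [@metadata].
        docinfo => true
      }
    }
    filter {
      mutate {
        # [@metadata] is not serialized to outputs, so copy the document
        # id into a regular field the syslog receiver can see.
        add_field => { "es_doc_id" => "%{[@metadata][_id]}" }
      }
    }

The deduplication itself would then have to happen in whatever consumes the syslog stream; Logstash still re-reads and re-sends every document on each scheduled run.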
