I currently have an index where I set unique IDs, and sometimes duplicate data comes in. In my use case, older data is often more accurate than newer data, so I would like Logstash to insert a document only if its _id does not already exist in the index. If the _id exists, the event shouldn't be sent to Elasticsearch; if it doesn't exist, it should be sent. Any thoughts on how I can do this?
Set the action option on the Elasticsearch output to "create" -- create indexes a document, but fails if a document with that ID already exists in the index. Hopefully Logstash does not endlessly retry on those failures.
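A minimal sketch of what that output block could look like. The index name (my-index) and the field holding the unique ID (my_unique_id) are placeholders you'd replace with your own:

```
output {
  elasticsearch {
    hosts       => ["localhost:9200"]
    index       => "my-index"             # placeholder index name
    document_id => "%{my_unique_id}"      # assumed field containing your unique id
    action      => "create"               # fails if a document with this _id already exists
  }
}
```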
What happens specifically is that when I'm running it as a command (not as a service), it runs through the pipeline, then terminates it and starts the pipeline all over again, continually.