Is it possible?

ailengcon · August 12, 2021, 11:50am

Can someone answer me this.

I have a database with several configurations, and each configuration can be linked to the same serial_number. To be able to analyze the data correctly, I need to filter så that each serial_number is grouped as one configuration.
So can I use an ingest pipeline to set up condition to say " if there is some document here with the same serial_number, only ingest the newest one of each"?

Is anything possible with these processors and the painless language?

warkolm · August 16, 2021, 4:57am

Can you elaborate more on why you only want the latest one stored?

ailengcon · August 16, 2021, 5:42am

the configurations get stored in the database when a user press save in the program, and if the user presses save multiple times the same configuration get's stored again. This can happen if a user is inexperienced or forgets a setting before saving. So it's a incorrect design of the system I guess, they admit this, but to get accurate statistics from the database they want to filter the redundant data out.

warkolm · August 16, 2021, 5:45am

You have two options;

Keep all state changes and only provide the latest, using a top hits aggregation. This lets you track changes over time and do analysis on them
Use the serial as a document ID and then it'll overwrite with the latest each time

ailengcon · August 16, 2021, 5:58am

Thank you very much, I don't understand what state changes are, but at least I know it is possible.

warkolm · August 16, 2021, 5:58am

State change means every time the config is changed, ie it goes from one state to another.

ailengcon · September 6, 2021, 7:17am

Hi, can I ask you to elaborate on the first option more? I got hired to implement elasticsearch so I need more information on how to move forward.

warkolm · September 6, 2021, 8:27am

Elaborate in what sense?

ailengcon · September 6, 2021, 9:00am

I'm not sure if I understand the concept of state changes yet. There are several configurations with the same serial_number, how would their state change? And would you do still do this with ingest pipelines? What processor?

I guess it is not possible to both keep all documents and also use just one of each serial number to analyze statistics?

warkolm · September 6, 2021, 11:07pm

Basically you want to use Elasticsearch as a time series datastore.
Where each event comes in with a timestamp and serial and whatever else is logged. Then you can graph changes over time, either at an individual serial level, or on an aggregated level.

If you want to retrieve only the latest event, which contains whatever state is logged, then you can do that easily.

system · October 4, 2021, 11:08pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Insert ingest pipeline if id is new Elasticsearch	1	436	April 16, 2019
Ingest Pipeline update existing array Elasticsearch ingest-pipeline	1	218	July 12, 2022
Time series datastore altering Kibana	6	360	December 12, 2021
Observability_Logging Elastic Training	2	485	August 17, 2020
Dec 13th, 2018: [EN][Elasticsearch] Chaining Ingest Pipelines Advent Calendar	1	1892	December 1, 2019

Is it possible?

Related topics