Sum with unique object

For example, if I have a CSV file with these fields:

Car, Parking, Date of entry
V1, p1, 12-03-2020 12:30
V1, p1, 12-03-2020 11:30
V2, p2, 12-03-2020 10:30
V2, p2, 12-03-2020 10:45

How can I calculate the number of times V1 entered p1? And the same thing for the other cars?
Any ideas that could help me, please!

Use an aggregate filter.

Thanks for the quick response. I tried this code and it works:

aggregate {
    task_id => "%{Car}"
    code => "
        map['sum'] ||= 0    # initialize the counter on the first event for this car
        map['sum'] += 1     # count one entry
        event.cancel        # drop the original event
    "
    push_map_as_event_on_timeout => true
    timeout_task_id_field => "Car"
    timeout => 10
}

My problem now is that this code eliminates the other columns, for example the date of entry of each car.
How can I keep the other data related to each car and just add a new column with the number of times it entered this parking?

If there are additional columns you want to add to the event then add them to the map.
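
For example, something like this (a sketch; the Parking and Date_of_entry field names are assumptions based on the CSV sample above):

aggregate {
    task_id => "%{Car}"
    code => "
        # copy the extra columns onto the map so they survive into the pushed event
        map['Parking'] ||= event.get('Parking')
        map['Date_of_entry'] ||= event.get('Date_of_entry')
        map['sum'] ||= 0
        map['sum'] += 1
        event.cancel
    "
    push_map_as_event_on_timeout => true
    timeout_task_id_field => "Car"
    timeout => 10
}

Note that ||= only assigns when the map entry is still nil, so this keeps just the first value seen for each car.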

OK, I added the other columns, but I have a question: why does it not display all the entry dates for each car? It only keeps one entry date per car.
Is there another method to keep all the data, as in my CSV file, and just add a column which calculates the sum?

Well, event.cancel is optional. If you remove that you will get all the original events as well as events with the aggregated data.

Alternatively, for each value of the task id, create an array on the map and add each event's hash to it:

map['events'] ||= []              # create the array once per car
map['events'] << event.to_hash    # append the whole event

then split the array into separate events using a split filter once the aggregation is done.
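
Put together, that could look something like this (a sketch; the events field name and the 10 second timeout are just illustrations):

aggregate {
    task_id => "%{Car}"
    code => "
        map['events'] ||= []
        map['events'] << event.to_hash   # keep every original row
        map['sum'] ||= 0
        map['sum'] += 1
        event.cancel
    "
    push_map_as_event_on_timeout => true
    timeout_task_id_field => "Car"
    timeout => 10
}
# turn the array back into one event per original row
split { field => "events" }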

Thank you so much for your help; I really appreciate it.
I understood correctly and tried everything; in the end the original data is stored separately and the aggregated data is stored with the sum.
My last question: can I eliminate @timestamp and @version in the aggregated data? How can I add that to this code?
aggregate {
    task_id => "%{Car}"
    code => "
        map['Car_registration'] ||= []    # initialize the array once per car
        map['Car_registration'] << { 'Car' => event.get('Car') }
        map['sum'] ||= 0
        map['sum'] += 1
    "
    push_map_as_event_on_timeout => true
    timeout_task_id_field => "Car"
    timeout => 5
}
split {
    field => "Car_registration"
}

I don't think those fields are optional, but you could try

mutate { remove_field => [ "@timestamp", "@version" ] }
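
If you only want to remove them from the aggregated events, you could guard the mutate with a conditional (a sketch, assuming only the pushed map events carry a [sum] field):

if [sum] {
    mutate { remove_field => [ "@timestamp", "@version" ] }
}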

It works, thanks! You are so helpful :wink:

Can I ask one more question, please?
For each CSV file it calculates the number of times that a car uses this parking, but when I add another file it calculates the same thing separately.
I receive the files in real time, and each time I add a file I want it to calculate the sum across all of them together.
Any help?

The aggregate filter will only aggregate data that arrives within the timeout. Extending the timeout may help. Otherwise I think you would need to aggregate the aggregates in Elasticsearch.

I think extending the timeout will not help me, because I receive the files every day. But for the second solution, aggregating the aggregates, how can I do that? I want to try it; can you explain more?

That is really an elasticsearch question.

Hi @Badger, I saw this topic: Extract Month and Year from date field.
I have the same problem, and I want a new field containing, for example, 2020-02.
I tried this code

grok { match => { "Date_of_entry" => "^%{YEAR:year}-%{MONTHNUM}" } }

but it extracts just the month, so what should I add to this code to get a result like 2020-02?
Can you answer me, please?

I would expect that to extract the year into [year]. It would not extract the month, but it would fail to match if the month was not present. If you want both year and month in a single field you could use

        pattern_definitions => { "YM" => "%{YEAR}-%{MONTHNUM}" }
        match => { "message" => "^%{YM:yearAndMonth}" }
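
As a complete filter applied to the field from earlier in this thread, that might look like this (a sketch; it assumes Date_of_entry starts with the year, as in your grok attempt above):

grok {
    pattern_definitions => { "YM" => "%{YEAR}-%{MONTHNUM}" }
    match => { "Date_of_entry" => "^%{YM:yearAndMonth}" }
}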

So beautiful!! It works, thank you so much :heart_eyes: :heart_eyes: :heartbeat:
