Is there any way to aggregate an average without outliers?

ofir_y · December 31, 2020, 7:47am

I need a way to create a transform that will aggregate the average of a field but without the outliers (let's say all values that falls between 10%-90% percentiles). for example if I have the following values:
[1,2,3,4,5,6,7,8,9,10]

it will calculate the average of 2-9

Hendrik_Muhs · January 12, 2021, 2:33pm

Unfortunately there is no out of the box solution for this.

What I can think of:

You could in addition to you existing group_by further group by histogram. In the aggregation you need to calculate an average for that bucket and you need the count of documents (value_count on one of the group_by fields).

The transform would create a document for every histogram bucket. In a 2nd pass you can query the transform dest index, using a range query to filter out the outliers and aggregate using a weighted average aggregations, this is where you need the count as weight.

The other idea: filter the outliers already in the transform or use a filter aggregation in the transform with a avg child aggregation.

To get the left and right cut off value you can use a percentiles aggregation.

To sum it up, I do not see a solution that can be easily implemented but both ideas require extra work and additional queries or at least 2 transforms.

system · February 9, 2021, 2:33pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Visualizing Average with Removing Outliers Kibana vega	5	923	July 15, 2021
Filter outliers elastic 5 Elasticsearch	3	1239	February 15, 2017
Avarage without outliers Elasticsearch	4	2436	July 5, 2017
How is an 'avg' aggregation updated in transforms when old records are deleted? Elasticsearch	3	377	May 11, 2020
Pipeline aggregation to compute histogram of average values of a field, bucketed by term? Elasticsearch	12	622	November 13, 2019

Is there any way to aggregate an average without outliers?

Related topics