Best approach to totals when aggregating many documents

tg_fb · March 21, 2017, 3:09pm

We're want to use elasticsearch to create reports based on financial data.
Currently we are still creating our mappings. Now we are not certain which approach is best.
We want to compute different totals in our report and the value field of each document should only be included in a certain total based on the "nature" of this document (e.g. expenses for cloth => sum(cloth_expenses); expenses for food => sum(food_expenses) etc.).

First idea was to create one field that holds the value we want to total up and have several boolean fields that indicate which sum this documents value should be added to:
.subAggregation(AggregationBuilders.terms("myAgg")
.field("someBooleanField")
.subAggregation(AggregationBuilders.sum("sumMyField").field("myField").missing("0"))
.subAggregation(AggregationBuilders.terms("myOtherAgg")
.field("anotherBooleanField")
.subAggregation(AggregationBuilders.sum("sumMyField").field("myField").missing("0"))
Second idea was to create one field that holds the value we want to total up and use type to indicate to which sum we need to add this documents value. Haven't tried it yet.

Now we are afraid that filtering/selecting the value based on another field might be too expensive we are wondering now, if we should have several value fields and place the value in the correct filed on indexing time.

How does elasticsearch handle such requests, will filtering/selecting have a huge impact on performace?
Is there a best approach to this problem?

system · April 18, 2017, 3:10pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Get aggregation value and source field in one st Elasticsearch	2	435	March 16, 2018
Elasticsearch querying time Elasticsearch	5	342	October 6, 2020
Sum Aggregation using multiple fields Elasticsearch	6	861	January 13, 2023
Aggregate counting/sum query Elasticsearch	5	3497	March 3, 2019
Aggregation on most recent document in a group Elasticsearch	5	900	May 22, 2018

Best approach to totals when aggregating many documents

Related topics