Document size effect on aggregations

shaharmor · April 18, 2017, 7:24am

Hi,

I am wondering how much does the document size itself affect the query & aggregation performance.

Lets say I have documents that look like this:

{
  datetime: '2017-04-18T07:00:00.000Z',
  value: 10
}

and I have 10 million of them, and the query & aggregation time is X.

What if the documents would look like this:

{
  datetime: '2017-04-18T07:00:00.000Z',
  value: 10,
  somefield: 9999999.51,
  anothetfield: 'tons of unneeded information that is not related to the aggregation',
  anothetfield2: 'tons of unneeded information that is not related to the aggregation',
 anothetfield3: 'tons of unneeded information that is not related to the aggregation',
 ...
}

How will the performance be different? does it have any affect at all? (I'm only worried about query & aggregation performance, not indexing (Which is obvious that will take more time))

s1monw · April 21, 2017, 8:01am

if you query the same fields it won't really matter much. When you load the documents from disk to display them we obviously need to load more into memory from disk that will affect perf but since we do that only for the top N it won't make much of a difference.

hope that helps.

system · May 19, 2017, 8:05am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How does document size affect ElasticSearch performance? Elasticsearch	5	1475	February 8, 2021
Document Count vs. Document Size on search performance Elasticsearch	1	365	March 29, 2020
Performance / memory impact by size of term aggregation? Elasticsearch	1	705	May 16, 2017
How does an index's document count affect performance? Elasticsearch	2	1098	July 5, 2017
Slow aggregation no matter the size of the result set Elasticsearch	3	479	October 26, 2018

Document size effect on aggregations

Related topics