Fluctuating Index Sizes

dawiro · December 16, 2016, 8:32am

Hi,
We're seeing significant fluctuations in index sizes in our log aggregation cluster for old indices that are no longer being updated. For example, the index for Dec 12th doubled in size in the days after its last update before reducing in size by around 15%.

Can someone help explain what is/may be going on here, how we can track the processes at work and what we can do to deal with it?

Regards,
David

warkolm · December 19, 2016, 1:17am

It could be due to merging segments, with sparse docvalues it can increase the size of the segments.

dawiro · December 20, 2016, 7:44am

Hi,
The thing is we optimise all of our indices after 1 day and the index I'm referring to here was growing after 3 days. Ended up being 2.2TB in size based on around 500 million docs before settling back to 1.7TB. When the last write came in the index was around 950GB!

BTW, is there a way of measuring the sparsity of our field data? Would I need to run lucene commands to do that? We're running on 2.1.2...

Regards,
David

jpountz · December 20, 2016, 10:19am

You can run an exists query https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-exists-query.html on a given field and compare the result with the number of docs that you have in your index.

system · January 17, 2017, 10:19am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Size of index is increasing abnormally? Elasticsearch	13	5138	July 5, 2017
Index size variation when updating Elasticsearch	1	397	November 25, 2019
Regarding Unexplained Index Growth Elasticsearch	6	411	November 1, 2017
Optimize is blowing up index size Elasticsearch	6	769	December 11, 2017
Index size with elasticseach 8 increasing? Elasticsearch	5	529	April 19, 2022

Fluctuating Index Sizes

Related topics