So I'm on 1.7.x and I ran an optimize on a large (tens of TBs) index that hadn't been optimized in a while; all went well and the index size went down.
I am currently optimizing another, however, and its size has gone up almost 25% so far, with no signs of slowing down. What could be the issue?
Both were run with only_expunge_deletes=true. The first index has fewer writes to it, while this second one is updated constantly. On the first index, the document counts, e.g. 5,000 (9,999), dropped pretty quickly, e.g. from 5,000 (9,999) down to 5,000 (6,999).
On the second one, the doc counts are not really budging at all, especially the deleted counts (in parentheses).
I am already planning to move everything to 6.0, but this is an immediate problem on 1.7.x.
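For reference, this is the shape of the call I'm running on both indices (index name below is a placeholder):

```shell
# ES 1.7.x expunge-deletes-only optimize; "my_index" stands in for the real index name
curl -XPOST 'http://localhost:9200/my_index/_optimize?only_expunge_deletes=true'
```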
As segments merge, extra space will initially be consumed. It should free back up again once the source segments are marked for deletion and then actually deleted (after their live documents have been successfully copied into the larger segment). It could take a long time for this to happen, though.
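You can watch this happen at the segment level. Something like the following (assuming a locally reachable cluster and a placeholder index name) shows per-segment sizes and deleted-doc counts while the merge runs:

```shell
# Per-segment stats for the index being optimized; the cat API exists on 1.7.x
curl 'http://localhost:9200/_cat/segments/my_index?v'
```

If the deleted counts in the old segments aren't shrinking as new, larger segments appear, the merge hasn't gotten to the delete-heavy segments yet.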
Now I am wondering whether sparse doc values are the culprit. Maybe 5% of docs (guessing) have a few fields with KBs of data each, while in the other docs those same fields are empty. I see that the Lucene doc values files are getting big.
If I have to terminate the merge, will that stop the size explosion? How do I stop the optimize?