Automatically increase number of shards

MaVa · October 4, 2022, 8:17am

Hi! I have few indices where all the data is equally relevant no matter how old it is. Over time these indices will increase in size (from 5-10MB to 1000GB). I want to optimize the number of shards in these indices, and increase them as the amount grows.

I've looked into ILM, but did not seem that any of the options there would suit this purpose. Does anyone have any other suggestions on how to solve this in a smooth way?

warkolm · October 4, 2022, 9:00am

Welcome to our community!

What sort of data is this?

MaVa · October 5, 2022, 6:21am

Thank you!

The data is information about photos (all the metadata and other information needed for each image). They should all be as easily accessed and updated, no matter how old the data is.
(The actual files are stored elsewhere)

Christian_Dahlqvist · October 5, 2022, 6:22am

Are you updating the data?

MaVa · October 5, 2022, 6:28am

Are you updating the data?

Yes

Christian_Dahlqvist · October 5, 2022, 6:43am

You can not change the number of primary shards of an index once it has been created. You can however use the split index API to create a new index with a greater number of primary shards. If you are querying through an alias you can have it point at the old index until the new one is ready and they flip it. The issue here is that you will need to pause any updates, inserts and deletes during the time the index is being split. This allows you to continue using a single index, which is convenient when you update data.

Using time-based indices using ILM and rollover means that you are always indexing into a single index and that this will change over time. You can easily query all indices but updates become more expensive as you first need to find in which index the document you are to update resides before you actually perform the update. The rollover feature will allow you to generate new indices of a spacific target size over time and may be an option if you do not perform a lot of updates or deletes and can take the extra cost. ILM also supports different lifecycle stages, but that is not really applicable to you as all data is equelly relevant.

system · November 2, 2022, 6:43am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Auto split an index's shard when certain size reached Elasticsearch ilm-index-lifecycle-management	2	780	October 27, 2021
How to Increase number of shards of an existing Index Elasticsearch	7	4769	July 2, 2019
Increase number of shards for an existing data stream Elasticsearch	1	87	February 27, 2024
Shard Configuration Elasticsearch ilm-index-lifecycle-management	2	219	August 19, 2022
Increasing the number of index shards Elasticsearch	8	208	November 22, 2022

Automatically increase number of shards

Related topics