Index Management considerations on ES used as search agent on top of cassandra

Naren_Sree · August 9, 2017, 2:50am

Hi We are using ES as our search engine for our Cassandra DB. We are dumping our business model data to both Cassandra and Elastic search. I want to design an index management strategy for this scenario. How would I do it ?

For example: Lets say we User Model data, which persists all the user related (first, last, address, phone number etc)

Should I actually just create just one index for all the users or Create weekly/monthly indices based on when user gets created in our system ?
How many shards do I need to allocate if the user data is like 1G.
Lets say the scenario completely changes and we decide to all more data into ES. And then the data might exponentially grow to 200GB or so. If so, then whats the criterion for allocating the more shards to ES. How do I calculate the shards etc.
4.Since I would not know how my system grows ahead of my time, lets say i make mistake in allocation shards (either too little or too many) Then is there a way to dyanimically shrink or expand them as and when more data is dumped into ES.

Thank you very much for your help..

warkolm · August 9, 2017, 4:55am

A single index sounds fine.
Depends on what size that is in Elasticsearch, what queries you use and their rates, what response SLAs you need, the underlying infrastructure, etc.
We recommend no more than 50GB per shard. But how large depends on 2
You can shrink easily with _shrink. To expand you need to _reindex.

system · September 6, 2017, 4:55am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Questions from a newbie Elasticsearch	15	417	July 6, 2017
Best Source for In-Depth Understanding of Indices and Shards Elasticsearch	7	367	July 19, 2018
How many indices can be created Elasticsearch	8	13236	August 14, 2018
Sizing and configuration for multi-tenant application Elasticsearch	6	819	March 28, 2019
Scaling ElasticSearch for many indexes Elasticsearch	2	18	October 22, 2024

Index Management considerations on ES used as search agent on top of cassandra

Related topics