Records per shard

TheCowboy · June 25, 2015, 1:23pm

Hi

What's the best practice/main thing to consider when deciding upon the number of shards in your index vs the number of documents you will have?

For example in our dev system we've had a problem highlighted here:

https://www.elastic.co/guide/en/elasticsearch/guide/current/relevance-is-broken.html

Is there a formula or "best practice guide" to consider when deciding this?

warkolm · June 26, 2015, 11:25pm

One shard per node is a good starting point, as the data is then spread evenly across the nodes. However, this changes once you take things like relevance into account.

How big is your dataset?

TheCowboy · June 29, 2015, 7:20am

At the moment we are developing against only 6-8 records, which I accept is hardly anything, but we needed to build incrementally and understand how the scoring changes as we add, remove, and edit records.

Going forward we expect it to be approx 147,000 documents.
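
To illustrate why scoring looks odd with 6-8 records spread over several shards, here is a minimal sketch. It assumes the classic Lucene TF/IDF form of inverse document frequency, idf = 1 + ln(numDocs / (docFreq + 1)), computed per shard. The shard layout and term counts are hypothetical and not taken from the index discussed in this thread.

```python
import math


def classic_idf(doc_freq, num_docs):
    """Inverse document frequency in the classic Lucene TF/IDF form (assumed here)."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Hypothetical layout: 8 documents over 2 shards.
# Each entry is (documents on the shard, documents on the shard containing the term).
shards = {"shard_0": (4, 3), "shard_1": (4, 1)}

# Each shard scores with its own local statistics.
for name, (num_docs, doc_freq) in shards.items():
    print(name, round(classic_idf(doc_freq, num_docs), 3))

# With a single shard (or global statistics) every document sees the same IDF.
total_docs = sum(num_docs for num_docs, _ in shards.values())
total_freq = sum(doc_freq for _, doc_freq in shards.values())
global_idf = classic_idf(total_freq, total_docs)
print("global", round(global_idf, 3))
```

With so few documents, the same term gets a noticeably different weight depending on which shard a document lands on. With a large, evenly distributed dataset the per-shard statistics tend to converge, so the effect fades.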

warkolm · June 29, 2015, 8:06am

And how big are they?

TheCowboy · July 7, 2015, 12:55pm

Sorry for the slow reply, and excuse my ignorance, but how could I find that out?

EDIT: In Sense I ran:

GET /IndexName/_stats

And got:

"docs": {
  "count": 145261,
  "deleted": 0
},
"store": {
  "size_in_bytes": 23120808,
  "throttle_time_in_millis": 95
}
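
For reference, those two numbers give a rough answer to "how big are they?". The sketch below only does the arithmetic on the values pasted above. Note that size_in_bytes is the on-disk index size, so the per-document figure is an average that includes index overhead, not the raw source size.

```python
# Values copied from the _stats output above.
stats = {
    "docs": {"count": 145261, "deleted": 0},
    "store": {"size_in_bytes": 23120808, "throttle_time_in_millis": 95},
}

doc_count = stats["docs"]["count"]
size_bytes = stats["store"]["size_in_bytes"]

# Average on-disk footprint per document, and the total in MiB.
avg_bytes_per_doc = size_bytes / doc_count
size_mib = size_bytes / (1024 * 1024)

print(f"{avg_bytes_per_doc:.0f} bytes per document on average")
print(f"{size_mib:.1f} MiB in total")
```

That works out to roughly 159 bytes per document and about 22 MiB for the whole index.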

warkolm · July 7, 2015, 11:02pm

You can also use the _cat API.

Given that size (about 23 MB), you should aim for a single shard. Otherwise, check out the dfs_query_then_fetch search type: https://www.elastic.co/guide/en/elasticsearch/reference/current/search-request-search-type.html#dfs-query-then-fetch
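
As a rough sketch of what those two suggestions look like as requests, the snippet below builds the URLs for the _cat shards API and for a search using search_type=dfs_query_then_fetch. The host and port are assumptions (a local node on the default port), "IndexName" is the placeholder used earlier in this thread, and the helper function names are made up for illustration. Nothing is sent over the network.

```python
from urllib.parse import urlencode

BASE_URL = "http://localhost:9200"  # assumption: local node, default HTTP port
INDEX = "IndexName"  # placeholder index name from this thread


def cat_shards_url(index):
    """URL listing the shards of one index, with column headers (v)."""
    return f"{BASE_URL}/_cat/shards/{index}?" + urlencode({"v": "true"})


def dfs_search_url(index):
    """URL for a search that gathers global term statistics before scoring."""
    query = urlencode({"search_type": "dfs_query_then_fetch"})
    return f"{BASE_URL}/{index}/_search?{query}"


print(cat_shards_url(INDEX))
print(dfs_search_url(INDEX))
```

The query body itself stays the same in both cases; only the search_type parameter changes how term statistics are collected before scoring.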

TheCowboy · July 8, 2015, 7:33am

Thanks for your reply Mark, much appreciated. Would I be on the right track thinking there might be a performance impact with dfs-query-then-fetch? I guess it's a trade-off we would have to consider but for now I think it's easier for us to have a single shard.
