We have started noticing that query performance is beginning to suffer for some of our datasets, which span the roughly one year of data we keep online. We are looking into optimizations we can make to our index/shard configuration, and I was wondering whether there is a preferable way to configure our indexes and shards. Right now we create a new index each week with 22 shards per index (we have 22 data nodes). Would it be more optimal to reduce the number of indexes (index by month) and have larger shards? Our documents are only kilobytes in size, so they are not all that big; we just have a lot of them.
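For reference, the weekly setup described above could be expressed as an index template so every new weekly index picks up the same shard count automatically. This is only a sketch against the 1.x-era REST API; the template name, index pattern (`logs-*`), and replica count are placeholders, not from the original post:

```shell
# Hypothetical template: any index whose name matches logs-* is created
# with 22 primary shards (one per data node, per the setup above).
curl -XPUT 'localhost:9200/_template/weekly_logs' -d '{
  "template": "logs-*",
  "settings": {
    "number_of_shards": 22,
    "number_of_replicas": 1
  }
}'
```

Switching to monthly indexes would only require changing the naming scheme used at index-creation time; the template itself could stay the same.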
The feedback we typically get from support is just "test and see." That is something we can do, but it would take a fair amount of effort and time only to find out that it gives us no benefit. I was hoping some of the more experienced folks here could offer input on possible approaches. If all else fails, we can always test different configs.
Thanks for the reply! We have roughly 13 TB of data and about 40 indexes (one index per week). Each index has 22 shards (one for every data node).
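Putting the numbers above together gives a rough sense of the shard sizes involved. This is back-of-the-envelope arithmetic assuming data is spread evenly across indexes and shards, which the original post does not state:

```python
# Figures from the thread: ~13 TB total, 40 weekly indexes, 22 shards each.
total_gb = 13 * 1024          # ~13 TB expressed in GB
shards_per_index = 22

# Current weekly layout.
weekly_indexes = 40
weekly_shard_gb = total_gb / weekly_indexes / shards_per_index

# Hypothetical monthly layout: ~12 indexes for a year of data.
monthly_indexes = 12
monthly_shard_gb = total_gb / monthly_indexes / shards_per_index

print(f"weekly:  ~{weekly_shard_gb:.0f} GB per shard")   # ~15 GB
print(f"monthly: ~{monthly_shard_gb:.0f} GB per shard")  # ~50 GB
```

So moving to monthly indexes would roughly triple the shard size while cutting the total shard count from about 880 to about 264, which changes the trade-off between per-shard overhead and per-shard query cost.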
On Monday, January 5, 2015 2:27:24 PM UTC-8, Mark Walkom wrote:
One shard per node is ideal, as it spreads the load.
Reducing the shard count can help, but it depends on a few things.
How much data do you have in your cluster, and how many indexes?