Hi,
I am evaluating Elasticsearch for a data warehouse project. I have a
9-node cluster with 1 replica. I have loaded 6 days' worth of data.
Things seem sluggish all around. Here is the health of the nodes.
Am I missing some configuration that would speed things up, or did I do
something wrong? The status is "RED" and it seems that there are still
nodes initializing, even though the cluster has been up for over 24
hours. I used Logstash to push the data into Elasticsearch.
Any help would be appreciated. I can provide any additional information
that is needed.
Yes, something is very wrong. That output shows a huge number of shards,
and the cluster is clearly struggling to allocate all of them.
You said you have 9 nodes and 1 replica, but you didn't say how many
shards each index has.
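For anyone reading along, here is a rough sketch of how the shard counts in a cluster health response can be turned into a shards-per-node figure. It assumes the response has already been fetched from the cluster health API and parsed into a dict. The function name `summarize_health` is hypothetical, and the numbers in `example` are made up for illustration; they are not the poster's actual output.

```python
def summarize_health(health):
    """Return (total_shards, shards_per_data_node) from a cluster-health dict."""
    # Active, initializing and unassigned shards together cover every
    # shard copy the cluster is supposed to hold.
    total = (health["active_shards"]
             + health["initializing_shards"]
             + health["unassigned_shards"])
    per_node = total / health["number_of_data_nodes"]
    return total, per_node


# Hypothetical values for illustration only, not real cluster output.
example = {
    "status": "red",
    "number_of_data_nodes": 9,
    "active_shards": 900,
    "initializing_shards": 18,
    "unassigned_shards": 882,
}

total, per_node = summarize_health(example)
print(f"{total} shards, about {per_node:.0f} per data node")
```

A large per-node figure alongside a non-zero unassigned count is the kind of pattern being described above.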
Set the number of shards (splits) of an index (5 by default):
index.number_of_shards: 64
Set the number of replicas (additional copies) of an index (1 by default):
index.number_of_replicas: 1
I have it set to 64 shards and 1 replica. Is there a recommended number
of shards that I should be using?
On Thursday, January 22, 2015 at 1:45:23 PM UTC-5, Sam Flint wrote:
Hi,
I am evaluating Elasticsearch for a data warehouse project. I have a
9-node cluster with 1 replica. I have loaded 6 days' worth of data.
Things seem sluggish all around. Here is the health of the nodes.
Am I missing some configuration that would speed things up, or did I do
something wrong? The status is "RED" and it seems that there are still
nodes initializing, even though the cluster has been up for over 24
hours. I used Logstash to push the data into Elasticsearch.
Any help would be appreciated. I can provide any additional information
that is needed.
How many shards to use is a complicated question and depends on the
specific use case. For testing in this scenario, though, simply matching
the number of nodes you have is likely a good choice. Then each index can
have 1 primary shard on each node.
That said, it also looks like you may be creating more indices than is
ideal. 64 shards with 1 replica and 6 days of data in daily indices
should produce only 768 shards in total (64 x 2 x 6). Do you perhaps have
1 index per hour?
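The arithmetic behind that 768 figure can be sketched as below. This is only an illustration: the function name `total_shards` is made up, and the hourly case is the guess from this reply, not something confirmed in the thread.

```python
def total_shards(primaries_per_index, replicas, index_count):
    """Primary shards plus their replica copies, summed over all indices."""
    return primaries_per_index * (1 + replicas) * index_count


# 64 primaries, 1 replica, one index per day for 6 days.
daily = total_shards(64, 1, 6)

# Same settings, but assuming one index per hour for 6 days.
hourly = total_shards(64, 1, 6 * 24)

# Primaries matched to the 9 nodes, still with daily indices.
matched = total_shards(9, 1, 6)

print(daily, hourly, matched)  # 768 18432 108
```

Dropping from 64 primaries to 9 per index cuts the daily-index total from 768 to 108, and hourly indices multiply whatever total you have by 24.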