I'm wondering if anyone has seen something like this before: I have a 5 node cluster with a bunch of large date-based indices (i.e. each daily index can have 50m docs). I have some Kibana dashboards which visualize the data. Every so often one of the histogram graphs displays incorrect data - it creates a huge spike for one day and everything else is unreadable. The spike is incorrect (i.e. the data Elasticsearch returns is actually wrong). The weirder thing is that even on the same graph it is inconsistent - it looks OK at one point, then after a refresh it shows the bad data, and then it randomly switches back and forth. Is it possible that a replica or shard is corrupt? (The status of the cluster is green.) Any thoughts on what may cause this or an approach for debugging it?
Just realized I forgot to mention: this is with version 1.4.4. Also I've
been able to localize it to a specific day's index, but I'm not sure what
the next steps should be to find the root cause and prevent/correct it.
On Monday, March 23, 2015 at 4:34:04 PM UTC-4, MC wrote:
I'm wondering if anyone has seen something like this before: I have a 5
node cluster with a bunch of large date-based indices (i.e. each daily
index can have 50m docs). I have some Kibana dashboards which visualize
the data. Every so often one of the histogram graphs displays incorrect
data - it creates a huge spike for one day and everything else is
unreadable. The spike is incorrect (i.e. the data Elasticsearch returns is
actually wrong). The weirder thing is that even on the same graph it is
inconsistent - it looks OK at one point, then after a refresh it shows
the bad data, and then it randomly switches back and forth. Is it possible
that a replica or shard is corrupt? (The status of the cluster is green.)
Any thoughts on what may cause this or an approach for debugging it?
I've found with Kibana 3 (K3) that if I update filters without waiting for
the previous change to finish loading, there seem to be two concurrent
processes updating the histogram. Refreshing the page using the in-page
refresh, not the browser reload, seems to resolve the display problem for
me.
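
On the server-side possibility raised in the question (a replica that has diverged from its primary), one rough check is to compare the document counts of each shard copy for the suspect index. Below is a minimal Python sketch, assuming the cluster's HTTP API is reachable. The base URL and index name shown in the comment are hypothetical placeholders, the helper names are my own, and the sample listing is made up for illustration. It reads the `_cat/shards` listing (columns `shard`, `prirep`, `docs`, `node`) and flags shards whose copies disagree. A mismatch is only a hint, since counts can differ briefly while an index is still being written to.

```python
import urllib.request
from collections import defaultdict


def fetch_cat_shards(base_url, index):
    """Fetch the plain-text shard listing for one index from _cat/shards."""
    url = "%s/_cat/shards/%s?h=shard,prirep,docs,node" % (
        base_url.rstrip("/"), index)
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode("utf-8")


def parse_shard_docs(cat_output):
    """Group doc counts by shard number.

    Returns {shard: [(prirep, docs, node), ...]}. Rows without a doc count
    (for example unassigned copies) are skipped.
    """
    shards = defaultdict(list)
    for line in cat_output.splitlines():
        fields = line.split()
        if len(fields) < 4 or not fields[2].isdigit():
            continue
        # Node names may contain spaces, so re-join the trailing fields.
        node = " ".join(fields[3:])
        shards[int(fields[0])].append((fields[1], int(fields[2]), node))
    return dict(shards)


def find_mismatched_shards(shards):
    """Return only the shards whose copies report different doc counts."""
    return {
        shard: copies
        for shard, copies in shards.items()
        if len({docs for _, docs, _ in copies}) > 1
    }


# Made-up sample listing; against a real cluster you would instead call
# something like (hypothetical URL and index name):
#   fetch_cat_shards("http://localhost:9200", "myindex-2015.03.20")
SAMPLE = """\
0 p 10012345 node-1
0 r 10012345 node-2
1 p 9987654 node-2
1 r 9991000 node-3
2 r
"""

mismatched = find_mismatched_shards(parse_shard_docs(SAMPLE))
for shard, copies in sorted(mismatched.items()):
    print("shard %d copies disagree: %s" % (shard, copies))
```

If a shard is flagged, repeating the same histogram query against it and watching whether the result flips between copies would help confirm whether the bad spike comes from one particular copy.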