Inconsistent facet results


(aromasca) #1

Hi, we are running a 7 machine cluster in EC2 and when were expirencing a
little inconsistency when we run the same query against a different machine.
We have a load balancer in front so each request is handled by a different
node, however we are getting about 5% mismatch with the facets.
We have tried flushing the index, and clearing the index cache but none of
these help and the facet counts still change from node to node.

here is an example query:

{
"filtered" : {
"query" : {
"term" : {
"tem1" : "abcde-fghijk"
}
},
"filter" : {
"range" : {
"timestamp" : {
"from" : 1325368800000,
"to" : 1345496399999
}
}
}
}
},
"facets" : {
"tab_types" : {
"terms" : {
"field" : "pageType",
"size" : 20
}
}
}

--


(David Pilato) #2

I have already seen this some weeks ago.
One of my node had inconsistent data. I stopped the node, let the shards been replicated in the cluster, delete data dir and restart ES.

But, if you are unlucky, you will perhaps have to reindex...

HTH

--
David :wink:
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 26 août 2012 à 13:44, aromasca aromasca@gmail.com a écrit :

Hi, we are running a 7 machine cluster in EC2 and when were expirencing a little inconsistency when we run the same query against a different machine.
We have a load balancer in front so each request is handled by a different node, however we are getting about 5% mismatch with the facets.
We have tried flushing the index, and clearing the index cache but none of these help and the facet counts still change from node to node.

here is an example query:

{
"filtered" : {
"query" : {
"term" : {
"tem1" : "abcde-fghijk"
}
},
"filter" : {
"range" : {
"timestamp" : {
"from" : 1325368800000,
"to" : 1345496399999
}
}
}
}
},
"facets" : {
"tab_types" : {
"terms" : {
"field" : "pageType",
"size" : 20
}
}
}

--


(system) #3