We have reindexed one big index [shards=1, size=220GB, documents=110M] to multiple nodes (9 shards & 3 nodes) using Elasticsearch 6.5.4
with java 1.8.0_191
However, now we are seeing that when doing a search the scoring between shards is not consistent (not 100% sure, but it might only be when shards are on different nodes). Identical documents (given the search criteria) can have quite different scores. Is this really expected behavior? How can we have consistent results?
Example query:
{
"query": {
"bool": {
"must": [
{
"bool": {
"should": [
{
"match_phrase": {
"searchableName.exact": {
"query": "example",
"boost": 3.0
}
}
},
{
"match_phrase": {
"searchableName": {
"query": "example",
"boost": 2.0
}
}
},
{
"prefix": {
"searchableName": {
"value": "example",
"boost": 1.45
}
}
},
{
"prefix": {
"searchableName.lower_latin": {
"value": "example",
"boost": 1.45
}
}
},
{
"prefix": {
"searchableName.reverse_lower_latin": {
"value": "elpmaxe",
"boost": 1.0
}
}
}
]
}
},
{
"bool": {
"should": [
{
"term": {
"status": {
"value": "Valid",
"boost": 0.9
}
}
},
{
"term": {
"status": {
"value": "Pending",
"boost": 0.09
}
}
},
{
"term": {
"status": {
"value": "GracePeriod",
"boost": 0.009
}
}
},
{
"term": {
"status": {
"value": "Expired",
"boost": 0.0009
}
}
},
{
"term": {
"status": {
"value": "Invalid",
"boost": 0.00009
}
}
},
{
"term": {
"status": {
"value": "Unknown",
"boost": 0.000009
}
}
}
]
}
}
]
}
}
}
Example of two hits with different scores: https://pastebin.com/0TU4ngPc