Hi , I am trying to run a like_text query with tf set to false. I am seeing the queryNorm being inconsistent across shards. This is causing issues with sorting.
Here is my query :
{
"flt_field": {
"identity.first_name": {
"like_text": "mike",
"ignore_tf": true
}
}
}, {
"flt_field": {
"identity.last_name": {
"like_text": "roger",
"ignore_tf": true
}
}
}
Here are my results :
"_shard": 3,
"_node": "NeAtgK8zT9GGDiRpql921Q",
"_index": "myindex",
"_type": "v1",
"_id": "226912189",
"_score": 0.76921964,
"fields": {
"identity.last_name": [ "ROGER" ],
"identity.first_name": [ "MIKE" ]
},
"_explanation": {
"value": 0.7692197,
"description": "sum of:",
"details": [ {
"value": 0.04058554,
"description": "ConstantScore(cache(internal.status.value:A)), product of:",
"details": [ {
"value": 1.0,
"description": "boost"
}, {
"value": 0.04058554,
"description": "queryNorm"
} ]
}
}, {
"_shard": 3,
....
....
}
}, {
"_shard": 2,
"_node": "NeAtgK8zT9GGDiRpql921Q",
"_index": "myindex",
"_type": "v1",
"_id": "380786027",
"_score": 0.7689439,
"fields": {
"identity.last_name": [ "ROGER" ],
"identity.first_name": [ "MIKE" ]
},
"_explanation": {
"value": 0.76894397,
"description": "sum of:",
"details": [ {
"value": 0.040466454,
"description": "ConstantScore(cache(internal.status.value:A)), product of:",
"details": [ {
"value": 1.0,
"description": "boost"
}, {
"value": 0.040466454,
"description": "queryNorm"
} ]
}, {
"value": 0.2910072,
"description": "sum of:",
"details": [ {
"value": 0.2910072,
"description": "ConstantScore(identity.first_name:mike)^7.1913195, product of:",
"details": [ {
"value": 7.1913195,
"description": "boost"
}, {
"value": 0.040466454,
"description": "queryNorm"
} ]
} ]
}, ]
} ]
}
}
Here you can see 3 records all having the same first name and last name . First two are from one shard and the third result is from a different shard. The score seems to vary even though the values are the same.