Okay @dadoonet, I will follow the instructions.
Issue: I indexed the same 14 PDF files with the same settings and mappings on 2 Elasticsearch nodes, but the same query returns different results on each.
Case-1: Elasticsearch deployed on Amazon EC2 (Windows)
Indexed 14 PDF files
Index name: testindex
Query:
{
  "_source": "url",
  "query": {
    "match": {
      "content": {
        "query": "windows install",
        "operator": "and"
      }
    }
  }
}
Response (the last path segment of each url is the file name):
"hits": [
  {
    "_index": "testindex",
    "_type": "_doc",
    "_id": "5",
    "_score": 2.230532,
    "_source": {
      "url": "http://127.0.0.1:5000/js/Linux/linux _faq_3_manual.pdf"
    }
  },
  {
    "_index": "testindex",
    "_type": "_doc",
    "_id": "8",
    "_score": 2.084747,
    "_source": {
      "url": "http://127.0.0.1:5000/js/Linux/the-linux-faq.pdf"
    }
  }
]
Case-2: Elasticsearch deployed on Red Hat Linux
Indexed the same 14 PDF files
Index name: testindex
Query:
{
  "_source": "url",
  "query": {
    "match": {
      "content": {
        "query": "windows install",
        "operator": "and"
      }
    }
  }
}
Response:
"hits": [
  {
    "_index": "testindex",
    "_type": "_doc",
    "_id": "11",
    "_score": 2.6487362,
    "_source": {
      "url": "http://filesystemwef.com/Windows_Issues/31831392.pdf"
    }
  },
  {
    "_index": "testindex",
    "_type": "_doc",
    "_id": "12",
    "_score": 1.2416239,
    "_source": {
      "url": "http://http://filesystemwef.com/Windows_Issues/357786482.pdf"
    }
  }
]
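To make the mismatch between the two cases easier to see, here is a minimal Python sketch that compares the file names in the two hit lists above. It is not part of my setup: the helper names `file_name` and `compare_hits` are hypothetical, and the hits are copied by hand from the two responses (only the `url` field is kept). It assumes, as noted above, that the last path segment of each url is the file name.

```python
def file_name(url):
    """Return the last path segment of a URL (assumed to be the file name)."""
    return url.rsplit("/", 1)[-1]


def compare_hits(hits_a, hits_b):
    """Compare two Elasticsearch hit lists by the file name in _source.url."""
    names_a = {file_name(hit["_source"]["url"]) for hit in hits_a}
    names_b = {file_name(hit["_source"]["url"]) for hit in hits_b}
    return {
        "only_in_a": sorted(names_a - names_b),
        "only_in_b": sorted(names_b - names_a),
        "in_both": sorted(names_a & names_b),
    }


# Hits copied from Case-1 (Windows on EC2), reduced to the url field.
case1_hits = [
    {"_source": {"url": "http://127.0.0.1:5000/js/Linux/linux _faq_3_manual.pdf"}},
    {"_source": {"url": "http://127.0.0.1:5000/js/Linux/the-linux-faq.pdf"}},
]

# Hits copied from Case-2 (Red Hat Linux), reduced to the url field.
case2_hits = [
    {"_source": {"url": "http://filesystemwef.com/Windows_Issues/31831392.pdf"}},
    {"_source": {"url": "http://http://filesystemwef.com/Windows_Issues/357786482.pdf"}},
]

diff = compare_hits(case1_hits, case2_hits)
print(diff)
```

For the two responses above, `in_both` comes back empty: the two nodes return completely different files for the same query.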