Elasticsearch version:
version : {
"number" : "2.4.1",
"build_hash" : "c67dc32e24162035d18d6fe1e952c4cbcbe79d16",
"build_timestamp" : "2016-09-27T18:57:55Z",
"build_snapshot" : false,
"lucene_version" : "5.5.2"
}
Plugins installed: []
JVM version: "1.8.0_102"
OS version: OSX El Capitan 10.11.6
Description of the problem including expected versus actual behavior:
when we do a NOT based query on a large dataset of 300 million rows the time it takes to get data is very slow.
time taken: 12 minutes
number of rows: 300 million
returning roughly 80% of the data.
Index fields are all analyzed
curl -XGET 'http://localhost:9200/cars/item/_search?size=200' -d '
{ "profile": true, "query": { "bool": { "must_not": [ { "match": { "color": "red" } }, { "match": { "description": "car" } } ] } }, "aggs": { "description": { "terms": { "field": "description", "size": 100 } } }} '
Steps to reproduce:
- Do a simple curl command of a query in ES
- Observe the time it takes from the profile information