The first time a query (a classic full-text query with a filter and a sort) is run, it takes about 15-20 s to return a result. Subsequent runs take only a few milliseconds. Could anybody explain why? And how can I fix this to get a decent response time on the first run?
Here is our configuration:
ES version 6.2.3
5 nodes (5 data nodes)
1 index with 2P Shards / 2R - 900 000 docs (35GB)
1 index with 1P Shard / 2R - 13 000 000 docs (10GB)
2 CPUs / node with 8GB RAM
You'll find the kind of request we have to run at the bottom. I know the number of results I want to return is pretty big, but for now I'm stuck with this behaviour. Basically, what I need is to retrieve the ids of all matching documents.
Before this, we had another design where documents were split across many indices (several thousand). We had issues when indexing, but search was very fast.
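To make the "retrieve all ids" requirement concrete, here is a minimal Python sketch of one way it could be done with the scroll API, fetching ids only (`"_source": false`) and sorting by `_doc`. It assumes the order of the ids does not matter, which may not hold for our real query since it has a sort. The field name `content`, the filter field and value, and the `fetch_page` callable are all hypothetical placeholders, not our actual mapping or client code.

```python
from typing import Callable, Dict, Iterator, Optional


def build_id_query(text: str, filter_field: str, filter_value: str,
                   page_size: int = 1000) -> Dict:
    """Build a full-text query with a filter that returns ids only."""
    return {
        "size": page_size,          # hits per scroll page, not the total
        "_source": False,           # do not load document bodies
        "sort": ["_doc"],           # index order, the cheapest order for scrolling
        "query": {
            "bool": {
                # "content" is a hypothetical full-text field name
                "must": {"match": {"content": text}},
                "filter": {"term": {filter_field: filter_value}},
            }
        },
    }


def iter_ids(fetch_page: Callable[[Optional[str]], Dict]) -> Iterator[str]:
    """Yield every matching _id by walking scroll pages.

    fetch_page is a hypothetical callable: called with None it should run
    the initial search (with ?scroll=1m), and called with a scroll id it
    should request the next page. It returns the parsed JSON response.
    """
    scroll_id = None
    while True:
        response = fetch_page(scroll_id)
        hits = response["hits"]["hits"]
        if not hits:                # an empty page means the scroll is done
            break
        for hit in hits:
            yield hit["_id"]
        scroll_id = response["_scroll_id"]
```

With a real client, `fetch_page` would send the body from `build_id_query` on the first call and the scroll id on later calls. This only changes how the ids are paged out; it does not by itself explain the slow first run.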