For a query, I observe a performance difference if I include or not the _source.
With: "_source": true, the query needs about 250 ms to complete.
With: "_source": false, the query needs about 50 ms to complete.
In the mapping, I already disabled the _all.
I search about how to optimise this use-case and could not found any interesting information.
If you have any tips:
about a possible configuration (disk, kernel, JVM...)
about how to work-around this.
For example, has anybody tried to put the JSON as a String into one field, like {"json": "<the json of the document>"}?
Yes I am speaking exactly about this single field _source. When I deactive the retrieving of _source in the query results, I observe this difference of performance.
I am retrieving 20 documents.
I am not sure about what you ask precisely with FileSystem Cache. The free command outputs:
$ free
total used free shared buffers cached
Mem: 15396852 12732848 2664004 4296 153104 3033332
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.