There are many aspects to caching. it stats with the OS file system cache.
The main other cache that is expensive is the one used for sorting /
facets, which you can clear, but those can take a long time to load for the
initial query. Are you using facets / sorting (not on score)? Another cache
is the filters cache, are you using filters?
To clear those, across all indices, you can: curl -XPOST
localhost/_cache/clear. What you do, is clear the cache for the _river
index, thats not the actual index you have and query...
On Wed, Nov 2, 2011 at 1:34 PM, Felix Sappelt f.sappelt@clueda.com wrote:
Hi!
I'm trying to do performance tests on various queries we're using; I'm
interested in the worst-case performance, that is, how long it takes
to run the query for the first time without any caching etc. (I
already know that cached queries are acceptably fast in all cases).
What I can't figure out is how to achieve this; when I run a query for
the second time, it is blazingly fast, even if it takes seconds to
finish on the initial attempt. It appears that ES caches it somewhere,
but I've tried clearing caches and flushing, to no avail.I've tried doing the following things before re-running a query (yes,
ES is running on port 4002, and the index is called my_river):
curl -XPOST 'http://localhost:4002/my_river/_cache/clear'
curl -XPOST 'http://localhost:4002/my_river/_flush'What else do I need to do to eliminate this caching behavior?
Thanks!
Felix