Improve performance on first time query

pr2012 · March 18, 2013, 11:44pm

I am seeing somewhat lackluster performance on the first execution of
queries; subsequent requests hit the cache and seem to be all right. I'd like
to improve performance on the first request, though, as this will be the
majority of usage.

We have a 5-node cluster of EC2 Large instances with IOPS at 600.
Elasticsearch has 50% of the available memory on each node. All requests
are over Thrift. I'm still looking for ways to tweak this configuration.
We have about 10 million documents spread roughly evenly across 4 different
indexes.

The query matches between 100k and 300k documents, in indexes of
2-4 million documents.
A number of facets are calculated.

The first time the query is executed, response time is ~5s.
Subsequent executions are sub 1s.
Without facets, first execution is under 100ms.
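
The cold-versus-warm and with/without-facets comparison above can be reproduced with a small timing harness. This is only a sketch under stated assumptions: `run_query` is a hypothetical stand-in for whatever client call actually sends the request (Thrift in this setup), and the facet name and field in the sample body are made up for illustration. Only the term clause is taken from the gist.

```python
import copy
import time


def without_facets(body):
    """Return a deep copy of a search body with the top-level "facets" key removed."""
    stripped = copy.deepcopy(body)
    stripped.pop("facets", None)
    return stripped


def time_runs(run_query, body, runs=3, clock=time.perf_counter):
    """Call run_query(body) `runs` times and return each call's duration in seconds.

    The first entry is the cold (first-execution) time; later entries are warm.
    """
    durations = []
    for _ in range(runs):
        start = clock()
        run_query(body)
        durations.append(clock() - start)
    return durations


if __name__ == "__main__":
    # Hypothetical request body: the facet below is an assumption, not the
    # real query, which is truncated in the gist preview.
    body = {
        "query": {"bool": {"must": [{"term": {"fanscore.page_id": "30596038768"}}]}},
        "facets": {"example_facet": {"terms": {"field": "example_field"}}},
    }

    def run_query(request_body):
        # Stand-in for the real client call; replace with the actual request.
        time.sleep(0.01)

    print("with facets:   ", time_runs(run_query, body))
    print("without facets:", time_runs(run_query, without_facets(body)))
```

Comparing the first entry of each list against the later entries separates the one-time cost of the first execution from the steady-state cost, and running both bodies shows how much of that first-execution cost the facets account for.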

Here is an example query:

gist.github.com

https://gist.github.com/phillro/803701269a1b2a384c32

gistfile1.json

{
    "query": {
        "bool": {
            "must_not": [],
            "should": [],
            "must": [
                {
                    "term": {
                        "fanscore.page_id": "30596038768"
                    }

(This file has been truncated; see the gist for the full query.)

All numerical fields are mapped as integers, but beyond that I haven't
optimized the mappings at all.

Any tips on how to optimize this would be much appreciated.

Thx!

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

otisg · March 19, 2013, 3:39am

Hi,

Consider using

Otis

ELASTICSEARCH Performance Monitoring - Sematext Monitoring | Infrastructure Monitoring Service

