Elasticsearch _msearch api useless?

acv2 · June 12, 2017, 3:51pm

Hello there,
I'm running tests on my Elasticsearch cluster to optimize parallel queries. Surprisingly, the _msearch query is consistently much slower than the individual _search API.

My test:

From the same endpoint, I'm doing this 10 times:

[GET] {index}/document/_search

    {
      "size": 10,
      "sort": {
        "_script": {
          "script": "Math.random()",
          "type": "number",
          "params": {},
          "order": "asc"
        }
      },
      "query": {
        "query_string": {
          "query": "*:*"
        }
      }
    }

Then I send just one call with:

[POST] (binary) /_msearch
    {"index":"jobs","type":"document"}
    {"size": 10,"sort" : {"_script" : {"script" : "Math.random()","type" : "number","params" : {},"order" : "asc"}},"query": {"query_string": {"query": "*:*"  }}}
    {"index":"jobs","type":"document"}
    {"size": 10,"sort" : {"_script" : {"script" : "Math.random()","type" : "number","params" : {},"order" : "asc"}},"query": {"query_string": {"query": "*:*"  }}}
    {"index":"jobs","type":"document"}
    {"size": 10,"sort" : {"_script" : {"script" : "Math.random()","type" : "number","params" : {},"order" : "asc"}},"query": {"query_string": {"query": "*:*"  }}}
    {"index":"jobs","type":"document"}
    {"size": 10,"sort" : {"_script" : {"script" : "Math.random()","type" : "number","params" : {},"order" : "asc"}},"query": {"query_string": {"query": "*:*"  }}}
    {"index":"jobs","type":"document"}
    {"size": 10,"sort" : {"_script" : {"script" : "Math.random()","type" : "number","params" : {},"order" : "asc"}},"query": {"query_string": {"query": "*:*"  }}}
    {"index":"jobs","type":"document"}
    {"size": 10,"sort" : {"_script" : {"script" : "Math.random()","type" : "number","params" : {},"order" : "asc"}},"query": {"query_string": {"query": "*:*"  }}}
    {"index":"jobs","type":"document"}
    {"size": 10,"sort" : {"_script" : {"script" : "Math.random()","type" : "number","params" : {},"order" : "asc"}},"query": {"query_string": {"query": "*:*"  }}}
    .
    .
    .
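For reference, the _msearch body is newline-delimited JSON: one header line followed by one search body line per query, and the whole payload has to end with a newline. Below is a minimal sketch of how the payload above could be assembled. The helper name `build_msearch_body` is hypothetical and not part of any client library; the header and query are copied from the post, and ten repetitions are assumed to match the ten individual requests.

```python
import json


def build_msearch_body(searches):
    """Serialize (header, body) pairs into newline-delimited JSON.

    Hypothetical helper for illustration only.
    """
    lines = []
    for header, body in searches:
        lines.append(json.dumps(header))  # header line: target index/type
        lines.append(json.dumps(body))    # body line: the search itself
    # The multi search payload must be terminated by a newline.
    return "\n".join(lines) + "\n"


# Header and query taken from the post.
header = {"index": "jobs", "type": "document"}
query = {
    "size": 10,
    "sort": {
        "_script": {
            "script": "Math.random()",
            "type": "number",
            "params": {},
            "order": "asc",
        }
    },
    "query": {"query_string": {"query": "*:*"}},
}

# Assumption: ten searches, mirroring the ten individual _search calls.
payload = build_msearch_body([(header, query)] * 10)
```

The resulting string can be sent as the raw request body of a single POST to /_msearch.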

Both approaches return exactly the same thing. Note that I'm using a random sort to prevent Elasticsearch from caching any filter or results.

Doing 10 individual queries (not even using persistent connections, so we need to add the HELO, 10 x 50ms = 500ms extra) is consistently faster than using _msearch.

results individual ~4400ms
results multi ~10230ms
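A comparison like this could be timed with a small harness along the following lines. This is only a sketch under stated assumptions: the individual requests are issued one after another (the post does not say whether they ran sequentially), and `fake_request` is a hypothetical stand-in for the real HTTP call so the snippet runs without a cluster.

```python
import time


def time_total_ms(run_once, repeats=1):
    """Return the wall-clock time in milliseconds for `repeats` calls."""
    start = time.perf_counter()
    for _ in range(repeats):
        run_once()
    return (time.perf_counter() - start) * 1000.0


calls = []


def fake_request():
    # Hypothetical stand-in: replace with the real _search or _msearch call.
    calls.append(time.perf_counter())


# Ten sequential individual requests versus a single combined request.
individual_ms = time_total_ms(fake_request, repeats=10)
multi_ms = time_total_ms(fake_request, repeats=1)
```

Running each variant several times and comparing the medians would make the numbers less sensitive to one-off spikes.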

That is more than 2x the time. Any ideas?

I really hope I'm doing something stupid, but I'm following the documentation exactly, and I'm not getting anything good out of this multi crap stuff.

Thanks in advance for any help.

Daniel.

system · July 10, 2017, 3:51pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
