Search slow when high concurrency?

famoss · October 16, 2016, 10:52am

I have a index which need to save about 100 million docs,and the index-type have about 30 fields..

Here's what I do.

template:

{
    "template": "index*",
    "settings": {
        "number_of_shards": 6,
        "number_of_replicas": 1
    },
    "mappings": {
        "type": {
            "_all": {
                "enabled": false
            },
            "dynamic_templates": [
                {
                    "notanalyzed": {
                        "match_mapping_type": "string",
                        "mapping": {
                            "type": "string",
                            "index": "not_analyzed"
                        }
                    }
                }
            ],
            "properties": {
                "field1":{
                     "type": "string"
                },
                "field2":{
                    "type": "string"
                },
                "field3":{
                    "type": "string",
                     "index": "not_analyzed"
                },
                "field4":{
                    "type": "long"
                },
                "field5":{
                    "type": "string",
                     "index": "not_analyzed"
                },
                "field6":{
                    "type": "string",
                     "index": "not_analyzed"
                }
            }
        }
    }
}

dsl:

{
"filtered": {
    "filter": {
        "bool": {
            "should": [
                {
                    "term": {
                        "field3": "xxx"
                    }
                },
                {
                    "range": {
                        "field4": {
                            "lt": "xxx",
                            "gte": "xxx"
                        }
                    }
                }
            ],
            "must": [
                {
                    "term": {
                        "field6": "xxx"
                    }
                }
            ],
            "from": 0,
            "sort": [
                {
                    "field5": "asc"
                }
            ]
        }
    }
}

{"term":{"field6":"?"},field6 about have 1 million different values,and need to be used for query every time, field4 is a time stamp，different query has different value.

server:
3 servers(ssd,98G memory,32 core cpu),each server i deploy 3 nodes,1 master node(10G heap size),2 data nodes(20G heap size).

EDIT:
I find if i use 500 thread to search,and the es response very slow,and many search need be executed exceed 20000 millisecond,and es search.queue sometimes exceed 500,So i want to optimize my cluster or search dsl.

danielmitterdorfer · October 19, 2016, 6:32am

Hi @famoss,

500 concurrent search requests against a three node cluster are just too much. It's natural for systems that response times increase, when systems are overloaded. To give you an analogy: Think of 500 customers calling a hotline with 3 agents that answer the phone. Naturally, customers will have to wait. If you're interested in the theory just google for "Little's Law".

So, you should scale horizontally (see the Definitive Guide for specific suggestions). It's also not ideal that you run multiple nodes per machine as they will steal resources from each other (see some tips for running multiple nodes per machine).

Daniel

Topic		Replies	Views
Optimize slow aggregation Elasticsearch	1	698	April 6, 2017
Dynamic template vs Index template with explicit mapping in terms of performance Elasticsearch	5	758	December 20, 2022
Template for elastic indexing Elasticsearch	11	976	May 12, 2017
How to make a template for a document which has too many fields? Elasticsearch	3	421	December 29, 2018
Big mapping and few fields on index Elasticsearch	3	317	May 25, 2021

Search slow when high concurrency?

Related topics