Need help in improving search speed (about 20s) (elastic 6.3)

Convergence · May 25, 2019, 4:13am

Hello community,

I have an Elasticsearch cluster of 2 nodes (with 4 GB of RAM given to Elasticsearch on each). It holds one index of about 80 million docs, split across 10 shards. I run many aggregations and searches on it. After reading up in many places, I converted many long fields to string keyword fields with eager loading of global ordinals enabled. After all this my results are still terrible (20-30 seconds), and knowing Elasticsearch this should take less than a second. Please help me find out what I am doing wrong and how I can fix it.
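For readers following along: the "eager loading" mentioned above is presumably the `eager_global_ordinals` mapping parameter on keyword fields. The actual mapping is only available at the pastebin link, so the following is a minimal sketch under assumptions: the field names `a` and `b` are taken from the query in this post, the `text` parent type and the helper name `keyword_subfield_mapping` are hypothetical, and the `doc` type level reflects the 6.x mapping layout.

```python
import json


def keyword_subfield_mapping(field_names, doc_type="doc"):
    """Build a 6.x-style mapping body where each field has a `.keyword`
    subfield with eager global ordinals enabled.

    The parent field type ("text") is an assumption; the poster's real
    mapping may differ.
    """
    properties = {
        name: {
            "type": "text",
            "fields": {
                "keyword": {
                    "type": "keyword",
                    # Build global ordinals at refresh time rather than
                    # on the first aggregation that needs them.
                    "eager_global_ordinals": True,
                }
            },
        }
        for name in field_names
    }
    return {"mappings": {doc_type: {"properties": properties}}}


body = keyword_subfield_mapping(["a", "b"])
print(json.dumps(body, indent=2))
```

A body like this would be sent when creating the index; it moves the cost of building ordinals from search time to refresh time.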

Index settings

 "settings": {
    "index": {
      "refresh_interval": "-1",
      "number_of_shards": "10",
      "translog": {
        "durability": "async"
      },
      "provided_name": "a",
      "creation_date": "1558624380928",
      "priority": "100",
      "number_of_replicas": "1",
      "uuid": "6PyhVEhFTcSNxhqyUE5SaQ",
      "version": {
        "created": "6030299"
      }
    }
  },

index mapping
https://pastebin.com/rgJvEAvT

query

POST /doc/doc/_search?pretty
{ 
  "profile": "true",
  "from": 0,
  "size": 20,
  "query": {
    "bool": {
      "must": [
        {
          "term": {
            "a.keyword": "hello"
          }
        },{
          "term": {
            "b.keyword": 260464
          }
        }
      ]
    }
  },
  "aggs": {
    "a": {
      "terms": {
        "field": "a.keyword"
      }
    },
    "b": {
      "terms": {
        "field": "b.keyword"
      }
    },
    "c": {
      "terms": {
        "field": "c.keyword"
      }
    },
    "d": {
      "terms": {
        "field": "d.keyword"
      }
    },
    "e": {
      "terms": {
        "field": "d.keyword"
      }
    },
    "f": {
      "terms": {
        "field": "e.keyword"
      }
    },
    "g": {
      "terms": {
        "field": "f.keyword"
      }
    },
    "h": {
      "terms": {
        "field": "g.keyword"
      }
    },
    "_id_count": {
      "value_count": {
        "field": "_id"
      }
    }
  },
  "sort": [
    {
      "updated_at": {
        "order": "desc"
      }
    }
  ]
}

profile of one shard
https://pastebin.com/AjJR1Baz

dadoonet · May 25, 2019, 11:02am

The query you shared and the query which is profiled seem different: +a:a +b:b +c:c. Could you explain that? BTW, could you run the profiler with the ?human parameter so the times are more readable?
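If the raw profile output is all you have, the nanosecond timings can also be converted after the fact. Below is a rough sketch that sums the top-level `time_in_nanos` values per shard for the query, collector, and aggregation sections of a profile response. The function name `summarize_profile` and the `sample` response are made up for illustration; the numbers are not from the poster's cluster.

```python
def summarize_profile(response):
    """Return per-shard query, collector and aggregation times in seconds.

    Only top-level entries are summed, since a parent's time_in_nanos
    already includes the time of its children.
    """
    rows = []
    for shard in response["profile"]["shards"]:
        searches = shard.get("searches", [])
        query_ns = sum(q["time_in_nanos"] for s in searches for q in s.get("query", []))
        collector_ns = sum(c["time_in_nanos"] for s in searches for c in s.get("collector", []))
        agg_ns = sum(a["time_in_nanos"] for a in shard.get("aggregations", []))
        rows.append({
            "shard": shard["id"],
            "query_s": query_ns / 1e9,
            "collector_s": collector_ns / 1e9,
            "aggs_s": agg_ns / 1e9,
        })
    return rows


# Hypothetical, heavily trimmed profile response for a single shard.
sample = {
    "profile": {
        "shards": [
            {
                "id": "[node-1][doc][0]",
                "searches": [
                    {
                        "query": [{"type": "BooleanQuery", "time_in_nanos": 1500000000}],
                        "collector": [{"name": "MultiCollector", "time_in_nanos": 2000000000}],
                    }
                ],
                "aggregations": [
                    {"type": "GlobalOrdinalsStringTermsAggregator", "time_in_nanos": 3000000000},
                    {"type": "ValueCountAggregator", "time_in_nanos": 1000000000},
                ],
            }
        ]
    }
}

for row in summarize_profile(sample):
    print(row)
```

Comparing the three columns across shards gives a quick idea of whether the time is going into matching documents, collecting them, or computing the aggregations.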

Convergence · May 25, 2019, 11:36am

@dadoonet you got me, the data is a bit sensitive. I didn't know there was a human param and had been converting nanoseconds to seconds myself. Thank you for your time!

https://pastebin.com/2PaURqTQ

dadoonet · May 25, 2019, 12:18pm

Was the search cancelled? How long did that one take?

Convergence · May 25, 2019, 7:53pm

Nope, I didn't cancel it; it seems to have completed. The results also appear to be fine.

  "took": 7896,
  "timed_out": false,
  "_shards": {
    "total": 10,
    "successful": 10,
    "skipped": 0,
    "failed": 0
  },

dadoonet · May 25, 2019, 8:54pm

What kind of hardware configuration do you have?

Convergence · May 26, 2019, 3:20pm

Both nodes have the following configuration:

Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                2
On-line CPU(s) list:   0,1
Thread(s) per core:    2
Core(s) per socket:    1
Socket(s):             1
NUMA node(s):          1
Vendor ID:             GenuineIntel
CPU family:            6
Model:                 85
Model name:            Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz
Stepping:              4
CPU MHz:               2500.000
BogoMIPS:              5000.00
Hypervisor vendor:     KVM
Virtualization type:   full
L1d cache:             32K
L1i cache:             32K
L2 cache:              1024K
L3 cache:              33792K
NUMA node0 CPU(s):     0,1

Both have 8 GB of RAM each, with 4 GB allocated to Elasticsearch. During the setup we also increased the max number of open files to 65k. The complete output of node stats is here:
https://pastebin.com/avGNmfa5
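To put the numbers from this thread side by side, here is a small back-of-the-envelope sketch. All inputs come from the posts above (2 nodes, 10 primary shards, 1 replica, about 80 million docs, 2 CPUs per node); the even spread of shards and documents is an assumption, and the variable names are just for illustration.

```python
# Figures quoted in the thread.
nodes = 2
primary_shards = 10
replicas_per_primary = 1
total_docs = 80_000_000  # "about 80 million"
cpus_per_node = 2

# Assumption: shard copies and documents are spread evenly.
shard_copies = primary_shards * (1 + replicas_per_primary)
copies_per_node = shard_copies // nodes
docs_per_primary = total_docs // primary_shards

# A search fans out to one copy of each primary shard.
shard_requests_per_search = primary_shards
shard_requests_per_cpu = shard_requests_per_search / (nodes * cpus_per_node)

print(f"shard copies in cluster: {shard_copies}")
print(f"shard copies per node:   {copies_per_node}")
print(f"docs per primary shard:  {docs_per_primary:,}")
print(f"shard requests per CPU for one search: {shard_requests_per_cpu}")
```

Under these assumptions each search produces more shard-level requests than there are CPUs in the cluster, which is one reason the CPU usage during a slow query is worth checking.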

Christian_Dahlqvist · May 26, 2019, 4:13pm

What is the total size of the data on disk? What type of disk do you have? What does CPU usage look like when you are running a slow query?

system · June 23, 2019, 4:13pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
