Performance issue with Elastic

manikesh · September 17, 2017, 5:21am

We are using Elasticsearch for search, but one simple search takes ~800 ms. This is when we hit the Elasticsearch API directly.

We have our own cluster with 3 nodes: one is the master and the other two are data nodes. We have only one index with one type, as all items are the same, and it holds around 8 million records. We have 5 primary and 2 replica shards.

My questions are:

Could having 8 million records in one index/type cause this? Should I consider splitting it?
We always hit the master node for both writes and reads. Should I consider calling the data nodes for reads?
Is there anything else I should do differently to get better performance?
Thanks very much in advance.

warkolm · September 18, 2017, 6:32am

That's not good, see "Important Configuration Changes" in Elasticsearch: The Definitive Guide [2.x].

Unlikely.

Yes, always use all nodes.

What version, what OS, what JVM, what does the mapping and query look like?

Sambit_Kabi · September 18, 2017, 12:09pm

Performance depends on a lot of things. What is your setup?
Are your nodes running on one machine or on different machines?

Search performance also depends to some extent on the number of shards you use.
Are you performing wildcard searches with the wildcard at the beginning?
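
To illustrate the last question, here is a minimal sketch of the two query shapes being contrasted, written as Python dicts in the Elasticsearch query DSL. The field name `title` and the search strings are hypothetical and not taken from this thread. A pattern that starts with `*` cannot use the sorted term dictionary to narrow down candidates, so it tends to be much slower than a `prefix` query.

```python
import json

FIELD = "title"  # hypothetical field name, not from the thread


def prefix_query(field, text):
    """'Starts with' search: can seek directly into the sorted term dictionary."""
    return {"query": {"prefix": {field: text}}}


def leading_wildcard_query(field, text):
    """'Ends with' search via a leading wildcard: has to scan many terms."""
    return {"query": {"wildcard": {field: "*" + text}}}


if __name__ == "__main__":
    print(json.dumps(prefix_query(FIELD, "elas"), indent=2))
    print(json.dumps(leading_wildcard_query(FIELD, "search"), indent=2))
```

If the slow searches turn out to be of the second shape, that is a likely place to look first.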

manikesh · September 18, 2017, 12:42pm

We don't have VMs; each node is a complete Unix machine. I have three nodes, all geographically distributed.
I have 5 primary shards and one replica for my index. Another question: when I search, will Elasticsearch look in all nodes? Data is stored in shards, and shards are distributed across all nodes. I suspect this could be the issue, as the hosts are not in the same region, and if a request goes to another node there might be latency.

There are not many wildcards; we mostly use term queries, where we search with "starts with" and "ends with".

manikesh · September 18, 2017, 12:44pm

It's a Unix box with Java 8 and 250 RAM. In the mapping, most of the fields are strings only.

The query is really huge, with a lot of OR conditions. Based on the input, we decide between searching some specific fields and a generic search.
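
Since the actual query is not shown in this thread, the following is only a sketch of what "many OR conditions over specific or generic fields" could look like as a `bool` query with `should` clauses. All field names are hypothetical placeholders. The `profile` flag assumes an Elasticsearch version that supports the Profile API; it asks Elasticsearch to report where the time is spent, which may help narrow down which clause is slow.

```python
import json


def build_or_query(text, specific_fields=None, generic_fields=("name", "description")):
    """Combine per-field conditions with OR semantics using bool/should.

    All field names here are hypothetical placeholders.
    """
    # Use the specific fields when the input selects them, else the generic set.
    fields = list(specific_fields) if specific_fields else list(generic_fields)
    should = [{"match": {field: text}} for field in fields]
    return {
        "query": {"bool": {"should": should, "minimum_should_match": 1}},
        "profile": True,  # assumption: the cluster version supports the Profile API
    }


if __name__ == "__main__":
    print(json.dumps(build_or_query("blue shirt", specific_fields=["sku"]), indent=2))
```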

Sambit_Kabi · September 18, 2017, 1:25pm

Well, ES will hit all shards in a round-robin fashion, but there are ways to control this behaviour using routing.
You can configure ES to prefer local shards using shard allocation awareness. Go through the links below; they might help with your setup. I would suggest running ES on only one machine and measuring the query time there.

https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-cluster.html
https://www.elastic.co/guide/en/elasticsearch/reference/current/search.html
https://www.elastic.co/guide/en/elasticsearch/reference/current/index-modules-allocation.html
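
As a rough sketch of the routing idea mentioned above: the `routing` URL parameter on a search request restricts the search to the shard that the routing value maps to. The base URL, index name, and routing value below are assumptions for illustration only. Note that this only returns correct results if the documents were indexed with the same routing value.

```python
from urllib.parse import urlencode

BASE_URL = "http://localhost:9200"  # assumed address of one node
INDEX = "items"                     # hypothetical index name


def search_url(index, **params):
    """Build a _search URL, appending any query-string parameters given."""
    query = urlencode(params)
    return f"{BASE_URL}/{index}/_search" + (f"?{query}" if query else "")


if __name__ == "__main__":
    # Unrouted search: one copy of every shard of the index is queried.
    print(search_url(INDEX))
    # Routed search: only the shard that "customer-42" hashes to is queried.
    print(search_url(INDEX, routing="customer-42"))
```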

manikesh · September 18, 2017, 4:57pm

Even on a single node, it's taking around 500 ms for a simple request.

Sambit_Kabi · September 19, 2017, 8:33am

I think you should see less time for the same search the second time because of caching.

Christian_Dahlqvist · September 19, 2017, 8:38am

For each query, Elasticsearch will hit one copy of each required shard, irrespective of which node it resides on. You can make queries try to use local copies first by specifying the _local preference. Having clusters distributed geographically can result in performance and stability issues, which is why it is not recommended.
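
A minimal sketch of passing that preference, using only the Python standard library. The node address, the index name `items`, and the `match_all` body are assumptions for illustration; the request is only built here, not sent.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request


def local_search_request(base_url, index, body):
    """Build a search request that asks the receiving node to prefer its local shard copies."""
    url = f"{base_url}/{index}/_search?" + urlencode({"preference": "_local"})
    data = json.dumps(body).encode("utf-8")
    return Request(
        url,
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = local_search_request(
        "http://localhost:9200",      # assumed node address
        "items",                      # hypothetical index name
        {"query": {"match_all": {}}},
    )
    print(req.full_url)  # http://localhost:9200/items/_search?preference=_local
```

This only helps if the node receiving the request actually holds a copy of each shard it needs; otherwise the query still goes to other nodes.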

manikesh · September 20, 2017, 10:39am

Thank you very much. This explains a lot; it is what I had been suspecting as well.
I have one more question: if I run multiple Elasticsearch processes on the same host, will that give any advantage?

Christian_Dahlqvist · September 20, 2017, 10:50am

Not unless you have very large hosts.

