Getting worse search performance with a replica shard

akaizora · September 19, 2016, 12:21pm

Hi!

To increase search performance, I tried to add a replica to my cluster.
Initially, I measured a response time of around 700ms for a specific request with a single node with a primary shard.
After adding a node and a replica shard to the cluster, it takes averagely 2000ms to get the result of the request (so it almost tripled).

I'm using these configs for the replica :
cluster.name: findmyfpstore
node.name: fmfs_r1
node.master: false
network.host: ...
http.port: ...
discovery.zen.ping.multicast.enabled: false
discovery.zen.ping.unicast.hosts: ['...']
index.number_of_shards: 1
index.number_of_replicas: 1

Am I doing something wrong ? Feel free to tell me if you need more informations, I'm a beginner at ElasticSearch.

Thanks a lot!

warkolm · September 20, 2016, 1:40am

Did you query multiple times, or just once?

akaizora · September 20, 2016, 8:09am

I query multiple times. For the example, I did it 1000 times.

jpountz · September 20, 2016, 9:52am

Replicas do not help with latency, they can only help with throughput by replicating the data on more nodes.

If you have lots of data, then the slow down of your queries is expected since it means that each shards has less filesystem cache to work with (since there are more shards overall).

akaizora · September 20, 2016, 10:08am

By having lots of data, do you mean having lots of documents ? If you're referring to this, I only have 829 documents.
I tried with a server that is in the same datacenter but it still doesn't improve the response time of the queries (a bit longer).

Excuse me but I don't understand your answer very well, I only have one shard per node (primary on one node, replica on the other one).

jpountz · September 20, 2016, 10:24am

With such a small dataset, it is very hard to reason about what the hot spot might be. I am not sure we can help much here. Even with replicas and 2000ms is a response time, that is still only 2ms per search request, which should be fast enough for most use-cases?

I am reluctant to try to optimize this case since for such a small dataset it would probably be easier to hold everything in RAM and perform a brute-force scan to find matches all the time.

akaizora · September 20, 2016, 10:46am

I understand, thanks for your fast reply.

Does this mean setting the "index.store.type" to "memory" ?
Can you provide more details about "performing a brute-force scan to find matches all the time", please ? Does it require to change anything ?
Is there anything else to know about this ? Thanks !

jpountz · September 20, 2016, 10:53am

I just mean that with such a small data set, everything should be fast and using Elasticsearch is a bit overkill. Does the slow down incurred by the addition of new shards matter to you?

akaizora · September 20, 2016, 11:20am

Mainly, I'm using ElasticSearch to try it and for the geo queries.
Yes, it would matter a bit (it is quite important to push the performance as much as I can) but I only need one primary shard in my case if I understand correctly what you said previously.

Topic		Replies	Views
Number of replicas and query speed Elasticsearch	6	881	July 6, 2017
Very slow performance after setting replicas Elasticsearch	3	1548	July 6, 2017
Number of replicas and query speed Elasticsearch	2	400	July 6, 2017
Time Search with more node in one cluster Elasticsearch	2	479	July 5, 2017
Replicas reduce performance complex queries drastically? Elasticsearch	6	3336	October 17, 2018

Getting worse search performance with a replica shard

Related topics