Number of results per shard

mredaelli · April 9, 2020, 2:05pm

We have a basic ES instance, and I'm considering an index that has only one primary shard and one replica shard.

Doing a basic bool query, hits.total turns out to depend on which shard we hit. I get consistently different numbers specifying ?preference=_primary or ?preference=_replica.

The shards are different in some sense, because with _cat I see:

index       shard prirep state   docs   store ip            node
admin_ch-v1 0     r      STARTED 3220 295.2mb x.x.x.x  JO8tqXw
admin_ch-v1 0     p      STARTED 3220   294mb x.x.x.x aCqEzYQ

However, the document count is the same. I also wrote a script to get all the documents specifically from each shard (using preference=f"_only_nodes:xxx") and comparing them, and, modulo a bug in my script, everything is identical.

So... what is going on?

Christian_Dahlqvist · April 13, 2020, 8:02am

Merging of segments is not coordinated across shards, so even if primary and replica shards hold exactly the same contents their size may differ as they may have merged differently.

mredaelli · April 13, 2020, 3:27pm

That's perfectly fine. My question is why, if the contents are exactly the same, the counts for the same query are different.

Christian_Dahlqvist · April 13, 2020, 3:32pm

Are you making changes to these indices? What is the refesh_interval set to?

mredaelli · April 15, 2020, 12:45pm

Sorry I didn't notice your reply earlier.

I don't see refresh_interval in GET <index>/_settings, so I assume it's the default 1s.

And I don't think it's a matter of heavy write usage: the index receives an average of 2 new documents per day.

Also we got the same two counts, say N and M, for the same query on the two shards, trying it hours apart.

system · May 13, 2020, 12:45pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Primary and Replica shards are giving different results for same query Elasticsearch	1	610	May 23, 2019
Different primary and replica shard size Elasticsearch	8	2256	July 29, 2019
Differnt shards giving different results Elasticsearch	7	1896	July 29, 2019
Why docCount differs on primary and replicas of the same shard Elasticsearch	1	370	October 25, 2018
Shard size is different between primary and replica Elasticsearch	3	1612	October 26, 2018

Number of results per shard

Related topics