Elasticsearch data consumption difference

we have 2 shards 2 replica setting on cluster.
we have 2 data nodes and 2 client nodes, 1 master node.

so according to above setup, we should have same disk consumed on both the data ndoes
whereas, as per what I see we have a difference of 8 gig.
is it normal? or I am doing something wrong?

Shards on different nodes tend to merge independently, so the size can differ between nodes even if the hold exactly the same data. How large is this 8GB error compared to the total data size on your nodes? Do all data nodes hold the same amount of shards?

Total data is 400+ GB, so according to what you said, I think it's fine.

Thanks Christian :slight_smile: