I am looking for a reliable way to tell whether replication is in progress on a node.
From the output below, which parameters can I rely on to tell whether replication is in progress or has completed on a cluster?
I know that the cluster status "yellow" is one indicator. Can we also rely on any parameters other than the status?
This tells you that no shards are currently being initialized or relocated, which means Elasticsearch is done balancing your cluster.
Your cluster is in a yellow state because of these 5 unassigned shards, which will remain unassigned because they are replicas of the 5 primary shards you have. They can't be assigned because your cluster only has one data node:
A replica shard can never be assigned to the same node that hosts its primary (because the main point of replica shards is to provide a backup if a node goes down). There are two ways you can make your cluster state green:
Add a second data node so that replica shards can be assigned.
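To make the reasoning above concrete, here is a minimal sketch in Python that classifies an already-parsed cluster health response. The field names (`initializing_shards`, `relocating_shards`, `unassigned_shards`) are the ones returned by the cluster health API; the function name, the three labels it returns, and the sample values are assumptions made up for illustration, not output from the cluster in this thread.

```python
def replication_state(health):
    """Classify a parsed _cluster/health response (a plain dict).

    The returned labels are hypothetical names chosen for this sketch.
    """
    initializing = health.get("initializing_shards", 0)
    relocating = health.get("relocating_shards", 0)
    unassigned = health.get("unassigned_shards", 0)

    # Shards that are initializing or relocating are being copied right now.
    if initializing > 0 or relocating > 0:
        return "in_progress"
    # Nothing is moving, but some shards have no node to live on.
    if unassigned > 0:
        return "idle_with_unassigned"
    return "complete"


# Hypothetical values modeled on the single-data-node case described above.
sample = {
    "status": "yellow",
    "number_of_data_nodes": 1,
    "active_primary_shards": 5,
    "active_shards": 5,
    "relocating_shards": 0,
    "initializing_shards": 0,
    "unassigned_shards": 5,
}

print(replication_state(sample))  # prints "idle_with_unassigned"
```

With a single data node the result stays at "idle_with_unassigned" indefinitely, which is the yellow state described above. Treat this as a rough heuristic: a shard can be unassigned for a short time before it starts initializing, so a single reading is less reliable than a few readings taken over time.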
I understand how to fix the yellow cluster status.
My question is whether there is any way to identify that the cluster status is yellow because of an ongoing replication process within the cluster. Can we conclude that if the cluster is yellow because of ongoing replication, then the "relocating_shards" and "initializing_shards" values will be non-zero?
This looks a little confusing. The documentation says that this is the response for a live recovery, but the stage is shown as done. Does this make sense? Also, what are the valid recovery stages and recovery types (peer | store | snapshot)?
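One way to separate finished recoveries from ongoing ones is to filter the recovery response on its `stage` field, since that API also lists recoveries that have already completed. The sketch below assumes the response has already been parsed into a dict shaped like the indices recovery output (index name, then a `shards` list whose entries carry `id`, `type` and `stage`). The function name, the index name `logs`, and the sample values are hypothetical, and the exact `type` strings vary between Elasticsearch versions.

```python
def active_recoveries(recovery_response):
    """Return (index, shard id, type, stage) for shards still recovering."""
    active = []
    for index_name, index_info in recovery_response.items():
        for shard in index_info.get("shards", []):
            # Compare case-insensitively; a stage of "done" means finished.
            if str(shard.get("stage", "")).lower() != "done":
                active.append(
                    (index_name, shard.get("id"), shard.get("type"), shard.get("stage"))
                )
    return active


# Hypothetical response: one finished recovery and one still copying files.
sample_recovery = {
    "logs": {
        "shards": [
            {"id": 0, "type": "STORE", "stage": "DONE"},
            {"id": 1, "type": "PEER", "stage": "INDEX"},
        ]
    }
}

print(active_recoveries(sample_recovery))  # prints [('logs', 1, 'PEER', 'INDEX')]
```

An empty result means every listed recovery has reached the done stage, which would match the response described above.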