I've started working with Elasticsearch and have some doubts about shards
and replicas and how they handle data. I don't have any prior knowledge on
As far as I know, Lucene splits data into segments and stores them on disk,
and a shard is itself a Lucene index. Some of the doubts I have are:
There are two ways to configure shard allocation: at the cluster level
through config settings, and at the index level through index settings.
Suppose I set the max shards to 3 at the cluster level and specify 5 shards
at the index level. How will the shards be allocated? I have one cluster
with one node.
Suppose an index has 5 shards and 2 replicas and I'm pushing data through
the bulk API. How will the data be stored? Will the same data be stored in
all 5 shards, or will the data be split and stored in chunks across the 5
shards? How will the replicas hold a backup of the data of all 5 shards?
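
For reference, here is a rough sketch of the bulk-indexing scenario above. Elasticsearch documents that each document goes to exactly one primary shard, chosen by hashing its routing value (the document ID by default) modulo the number of primary shards, and that each replica is a full copy of one primary shard. The names `pick_shard` and `distribute` are hypothetical, and `zlib.crc32` is only a stand-in for the hash Elasticsearch uses internally.

```python
import zlib


def pick_shard(routing: str, num_primary_shards: int) -> int:
    """Map a routing value (the document ID by default) to one primary shard."""
    # Assumption: crc32 stands in for Elasticsearch's internal hash function.
    return zlib.crc32(routing.encode("utf-8")) % num_primary_shards


def distribute(doc_ids, num_primary_shards=5):
    """Group document IDs by the primary shard each one is routed to."""
    shards = {shard: [] for shard in range(num_primary_shards)}
    for doc_id in doc_ids:
        shards[pick_shard(doc_id, num_primary_shards)].append(doc_id)
    return shards


if __name__ == "__main__":
    # Every ID lands in exactly one of the 5 shards; nothing is duplicated.
    for shard, ids in distribute([f"doc-{i}" for i in range(20)]).items():
        print(shard, ids)
```

Under this rule the documents of a bulk request are split across the 5 shards, and the same ID always maps to the same shard.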
Suppose I have 5 nodes and 10 shards distributed over the nodes, 2 shards
each. When I index new documents, how will the data be stored across the
nodes?
Suppose the 5th node, which holds the 9th and 10th shards, suddenly goes
down. Do I lose all the data stored in the 9th and 10th shards, or has the
data already been copied to the rest of the nodes?
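
The node-failure scenario above can be explored with a small simulation. This is a minimal sketch, not Elasticsearch's actual allocator: the placement rule and the names `place_shards` and `shards_lost` are made up for illustration. The only behaviour taken from Elasticsearch is that a replica is never allocated on the same node as its primary. Nodes and shards are numbered from 0, so node 4 is the 5th node and shards 8 and 9 are the 9th and 10th.

```python
def place_shards(num_nodes=5, num_primaries=10, num_replicas=1):
    """Return {node: set of (shard, copy)}; copy 0 is the primary."""
    if num_replicas >= num_nodes:
        raise ValueError("need more nodes than replicas to separate the copies")
    nodes = {node: set() for node in range(num_nodes)}
    for shard in range(num_primaries):
        # Hypothetical layout: primaries are spread evenly, 2 per node here.
        primary_node = shard * num_nodes // num_primaries
        for copy in range(num_replicas + 1):
            # Each further copy goes to the next node, never the same one.
            nodes[(primary_node + copy) % num_nodes].add((shard, copy))
    return nodes


def shards_lost(nodes, failed_node):
    """List the shards that have no surviving copy once failed_node is gone."""
    all_shards = {shard for copies in nodes.values() for shard, _ in copies}
    surviving = {
        shard
        for node, copies in nodes.items()
        if node != failed_node
        for shard, _ in copies
    }
    return sorted(all_shards - surviving)


if __name__ == "__main__":
    print(shards_lost(place_shards(num_replicas=0), failed_node=4))  # [8, 9]
    print(shards_lost(place_shards(num_replicas=1), failed_node=4))  # []
```

In this toy layout the 9th and 10th shards are lost only when the index has no replicas; with at least one replica, a copy of each survives on another node.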