Failed to start shard


(gnulinux) #1

Hi

I am evaluating ElasticSearch (0.17.8) for a spatial search platform. I was able to set up a two-node cluster, and everything was working fine. But after rebooting both nodes, I am getting the following error on each of them.

[2011-10-19 06:02:45,243][WARN ][indices.cluster ] [linux Ubu2] [books][1] failed to start shard
org.elasticsearch.index.gateway.IndexShardGatewayRecoveryException: [books][1] shard allocated for local recovery (post api), should exists, but doesn't
    at org.elasticsearch.index.gateway.local.LocalIndexShardGateway.recover(LocalIndexShardGateway.java:99)
    at org.elasticsearch.index.gateway.IndexShardGatewayService$1.run(IndexShardGatewayService.java:179)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
    at java.lang.Thread.run(Thread.java:679)
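
To see which shards are affected on each node, the warning lines can be pulled out of the log with a short script. This is only a minimal sketch: it assumes every warning follows the same bracketed layout as the line above, and the names `parse_shard_log_line` and `failed_shards` are made up for illustration, not part of ElasticSearch.

```python
import re

# Assumed layout, taken from the single log line quoted above:
# [timestamp][LEVEL ][component ] [node name] [index][shard] message
LOG_LINE = re.compile(
    r"^\[(?P<timestamp>[^\]]+)\]"
    r"\[(?P<level>[^\]]+)\]"
    r"\[(?P<component>[^\]]+)\]\s*"
    r"\[(?P<node>[^\]]+)\]\s*"
    r"\[(?P<index>[^\]]+)\]\[(?P<shard>\d+)\]\s*"
    r"(?P<message>.*)$"
)


def parse_shard_log_line(line):
    """Return the fields of a shard-level log line, or None if it does not match."""
    match = LOG_LINE.match(line.strip())
    if match is None:
        return None
    fields = {key: value.strip() for key, value in match.groupdict().items()}
    fields["shard"] = int(fields["shard"])
    return fields


def failed_shards(lines):
    """Collect sorted, de-duplicated (node, index, shard) triples that failed to start."""
    failures = set()
    for line in lines:
        parsed = parse_shard_log_line(line)
        if parsed and "failed to start shard" in parsed["message"]:
            failures.add((parsed["node"], parsed["index"], parsed["shard"]))
    return sorted(failures)
```

Running `failed_shards(open(path))` against the log file of each node (with `path` pointing at wherever that node writes its log) gives a list of the shards that did not recover there. Stack trace lines do not match the pattern and are skipped.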

Config Files:

Node01 (Master)

cluster:
  name: gnulinux

node.name: "linux Ubu2"
node.master: true
node.data: true
node.rack: rack01

network:
  bindHost: 192.168.2.10
  publishHost: 192.168.2.10

index.engine.robin.refreshInterval: -1
index.gateway.snapshot_interval: -1
index.gateway.type: local
index.number_of_shards: 5
index.number_of_replicas: 1

gateway.recover_after_nodes: 2
gateway.recover_after_time: 5m
gateway.expected_nodes: 2
cluster.routing.allocation.node_initial_primaries_recoveries: 4
cluster.routing.allocation.node_concurrent_recoveries: 2
indices.recovery.concurrent_streams: 5

index:
  store:
    fs:
      memory:
        enabled: true
discovery:
  jgroups:
    config: tcp
    bind_port: 9700
    bind_address: 192.168.2.10
    tcpping:
      initial_hosts: 192.168.2.10[9700], 192.168.2.11[9700]

Node02:

cluster:
  name: gnulinux

node.name: "linux Ubu1"
node.master: false
node.data: true
node.rack: rack01

network:
  bindHost: 192.168.2.11
  publishHost: 192.168.2.11

index.engine.robin.refreshInterval: -1
index.gateway.snapshot_interval: -1
index.gateway.type: local
index.number_of_shards: 5
index.number_of_replicas: 1

gateway.recover_after_nodes: 2
gateway.recover_after_time: 5m
gateway.expected_nodes: 2
cluster.routing.allocation.node_initial_primaries_recoveries: 4
cluster.routing.allocation.node_concurrent_recoveries: 2
indices.recovery.concurrent_streams: 5

index:
  store:
    fs:
      memory:
        enabled: true
discovery:
  jgroups:
    config: tcp
    bind_port: 9700
    bind_address: 192.168.2.11
    tcpping:
      initial_hosts: 192.168.2.10[9700], 192.168.2.11[9700]
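
Since the two files are nearly identical, it may help to look only at the settings that differ between the nodes. The following is a rough sketch, assuming the configs have already been loaded into nested dictionaries (only the per-node settings from the files above are typed in here by hand); `flatten` and `diff_settings` are hypothetical helper names, not ElasticSearch functions.

```python
def flatten(settings, prefix=""):
    """Turn nested dictionaries into a flat {"a.b.c": value} mapping."""
    flat = {}
    for key, value in settings.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat


def diff_settings(first, second):
    """Return {dotted key: (value in first, value in second)} for every differing key."""
    flat_first, flat_second = flatten(first), flatten(second)
    keys = sorted(set(flat_first) | set(flat_second))
    return {
        key: (flat_first.get(key), flat_second.get(key))
        for key in keys
        if flat_first.get(key) != flat_second.get(key)
    }


# Hand-copied subset of the two configs above (per-node settings plus one shared one).
NODE01 = {
    "node": {"name": "linux Ubu2", "master": True},
    "network": {"bindHost": "192.168.2.10", "publishHost": "192.168.2.10"},
    "index": {"gateway": {"type": "local"}},
    "discovery": {"jgroups": {"bind_address": "192.168.2.10"}},
}
NODE02 = {
    "node": {"name": "linux Ubu1", "master": False},
    "network": {"bindHost": "192.168.2.11", "publishHost": "192.168.2.11"},
    "index": {"gateway": {"type": "local"}},
    "discovery": {"jgroups": {"bind_address": "192.168.2.11"}},
}

for key, (left, right) in diff_settings(NODE01, NODE02).items():
    print(f"{key}: {left!r} vs {right!r}")
```

For the subset above, this lists the node name, the `node.master` flag, and the bind and publish addresses. Shared settings such as `index.gateway.type` do not appear.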

