Hi,
We have cluster of 3 ES nodes and around 200 million documents (number of
shards 10 and number of replicas 2). We did an upgrade from version 0.18.7
to 0.19.0 across the cluster. Following are the steps we followed:
- Stop live data indexing.
- Flush index.
- Stop ES on all the nodes and upgrade it.
- Start ES.
But while recovering old indices, master node is continuously spewing out
errors of the following sort:
[2012-03-07 11:05:51,374][WARN ][cluster.action.shard ] [Scrambler]
sending failed shard for [tweets][6], node[DZ_lKvWFRTuUXMGeitDHIA], [P],
s[INITIALIZING], reason [Failed to start shard, message
[IndexShardGatewayRecoveryException[[tweets][6] failed to recover shard];
nested: StringIndexOutOfBoundsException[String index out of range: 0]; ]]
[2012-03-07 11:05:51,374][WARN ][indices.cluster ] [Scrambler]
[tweets][9] failed to start
shardorg.elasticsearch.index.gateway.IndexShardGatewayRecoveryException:
[tweets][9] failed to recover shard
at
org.elasticsearch.index.gateway.local.LocalIndexShardGateway.recover(LocalIndexShardGateway.java:201)
at
org.elasticsearch.index.gateway.IndexShardGatewayService$1.run(IndexShardGatewayService.java:177)
at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
Caused by: java.lang.StringIndexOutOfBoundsException: String index out of
range: 0
at java.lang.String.charAt(String.java:686) at
org.elasticsearch.index.mapper.MapperService.add(MapperService.java:180)
at
org.elasticsearch.index.mapper.MapperService.add(MapperService.java:172)
at
org.elasticsearch.index.mapper.MapperService.documentMapperWithAutoCreate(MapperService.java:298)
at
org.elasticsearch.index.shard.service.InternalIndexShard.prepareIndex(InternalIndexShard.java:310)
at
org.elasticsearch.index.shard.service.InternalIndexShard.performRecoveryOperation(InternalIndexShard.java:624)
at
org.elasticsearch.index.gateway.local.LocalIndexShardGateway.recover(LocalIndexShardGateway.java:196)
Other (non-master) ES nodes are throwing error:
[2012-03-07 11:08:18,026][WARN ][cluster.action.shard ] [Shingen
Harada] received shard failed for [tweets][7],
node[DZ_lKvWFRTuUXMGeitDHIA], [P], s[INITIALIZING], reason [Failed to start
shard, message [IndexShardGatewayRecoveryException[[tweets][7] failed to
recover shard]; nested: StringIndexOutOfBoundsException[String index out of
range: 0]; ]]
Any suggestion what is possibly going wrong here?
Cheers
Nitish