CheckIndex dies with bus error ... how else can I fix shard?

ecweaver · June 12, 2015, 9:14pm

I have an index shard that seems to have been damaged by a server reboot (it appears I neglected to stop indexing and flush first). It won't finish replicating, and the master is chattering madly on the network unless the index is closed.

I ran CheckIndex on that shard (with the index closed), and it consistently errors out on the file whose name shows up in the ES logs. It says "Bus Error", which I think means dereferencing a misaligned pointer (one with not enough zeroes on the right). Is there any way to just manually remove that particular file from the shard's consciousness?

I have made a snapshot that ended in PARTIAL status. Might that be restorable, so I can at least recover the rest of the shards?

Thanks.

warkolm · June 13, 2015, 4:34am

I'd say your shard is lost. Did you have replicas?

ecweaver · June 14, 2015, 8:10pm

Unfortunately not. That was the primary; it would hang when trying to replicate. I took a snapshot and am trying a restore with partial:true. The damaged shard is coming up empty, as expected.
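
For anyone who lands here with the same problem, here is a rough sketch of what that restore call looks like. It uses the snapshot restore endpoint (`POST /_snapshot/<repository>/<snapshot>/_restore`) with `"partial": true` in the body. The host, repository name, snapshot name, and index name below are placeholders I made up, not values from my cluster, and the helper function is only for illustration. It builds the request without sending it, so it can be inspected first.

```python
import json
import urllib.request


def build_partial_restore_request(base_url, repository, snapshot, indices=None):
    """Build (but do not send) a snapshot restore request with partial=true.

    All arguments are caller-supplied; nothing here is specific to one cluster.
    """
    # "partial": true lets the restore proceed even if some shards are
    # missing from the snapshot; those shards come back empty.
    body = {"partial": True}
    if indices:
        # The restore API accepts a comma-separated list of index names.
        body["indices"] = ",".join(indices)

    url = f"{base_url.rstrip('/')}/_snapshot/{repository}/{snapshot}/_restore"
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    req = build_partial_restore_request(
        "http://localhost:9200", "my_backup", "snapshot_1", ["my_index"]
    )
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # To actually run the restore: urllib.request.urlopen(req)
```

The shards that made it into the snapshot are restored normally. Any shard the snapshot could not capture is created empty, which matches what I saw with the damaged one.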
