Failed shards + loss of Kibana data

szemlyanoy · January 28, 2016, 1:15pm

Hi all,

I have run into a very annoying problem.
Elasticsearch crashed and all shards ended up in the UNASSIGNED state.

Errors in the log:

[2016-01-27 13:23:27,101][DEBUG][action.search.type] [elk-ID1] All shards failed for phase: [query]
RemoteTransportException[[elk-ID1][127.0.0.1:9300][indices:data/read/search[phase/query]]]; nested: IllegalIndexShardStateException[CurrentState[RECOVERING] operations only allowed when shard state is one of [POST_RECOVERY, STARTED, RELOCATED]];
Caused by: [logstash-2016.01.27][[logstash-2016.01.27][3]] IllegalIndexShardStateException[CurrentState[RECOVERING] operations only allowed when shard state is one of [POST_RECOVERY, STARTED, RELOCATED]]
    at org.elasticsearch.index.shard.IndexShard.readAllowed(IndexShard.java:974)
    at org.elasticsearch.index.shard.IndexShard.acquireSearcher(IndexShard.java:808)
    at org.elasticsearch.search.SearchService.createContext(SearchService.java:640)
    at org.elasticsearch.search.SearchService.createAndPutContext(SearchService.java:617)
    at org.elasticsearch.search.SearchService.executeQueryPhase(SearchService.java:368)
    at org.elasticsearch.search.action.SearchServiceTransportAction$SearchQueryTransportHandler.messageReceived(SearchServiceTransportAction.java:368)
    at org.elasticsearch.search.action.SearchServiceTransportAction$SearchQueryTransportHandler.messageReceived(SearchServiceTransportAction.java:365)
    at org.elasticsearch.transport.TransportService$4.doRun(TransportService.java:350)
    at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)

I recovered the shards by posting the following query for each one ("index" and shard stand for the actual index name and shard number):

curl -XPOST 'localhost:9200/_cluster/reroute' -d '{
  "commands" : [ {
    "allocate" : {
      "index" : "index",
      "shard" : shard,
      "node" : "127.0.0.1",
      "allow_primary" : true
    }
  } ]
}'
sleep 3

All shards seem to have recovered, but I unexpectedly lost all data in Kibana, and the .kibana shard is still UNASSIGNED. This has happened twice in the last day.

Is this a well-known issue?

BR,
Sergey

warkolm · January 29, 2016, 2:07am

Forcing primary shard allocation will cause data loss, see here.

What version of ES are you on?
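
Before forcing any allocation, it can help to see exactly which shards are unassigned and whether they are primaries or replicas. The following is a minimal sketch, not a definitive procedure: it assumes the cluster listens on localhost:9200 and that `_cat/shards` returns its default columns (index, shard, prirep, state, ...). The function names and the `ES_URL` variable are made up for illustration.

```shell
#!/usr/bin/env bash
# Sketch: list unassigned shards before deciding how to recover them.
# ES_URL is an assumed address; adjust it for your cluster.
ES_URL="${ES_URL:-localhost:9200}"

# Keep only the rows of `_cat/shards` output whose 4th column (state)
# is UNASSIGNED, and print index, shard number and p/r (primary/replica).
filter_unassigned() {
  awk '$4 == "UNASSIGNED" { print $1, $2, $3 }'
}

# Fetch the shard table from the cluster and filter it.
list_unassigned() {
  curl -s "$ES_URL/_cat/shards" | filter_unassigned
}

# Usage (requires a running cluster):
#   list_unassigned
```

An unassigned replica can usually be left for the cluster to rebuild from its primary. An unassigned primary is the case where a reroute with "allow_primary" : true risks the data loss mentioned above.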

szemlyanoy · January 29, 2016, 8:55am

Version : 2.1.1
Release : 1

Elasticsearch failed again today: today's index failed, and Kibana lost its dashboards again.

[2016-01-29 09:40:54,742][DEBUG][action.admin.indices.stats] [elk-ID1] [indices:monitor/stats] failed to execute operation for shard [[logstash-2016.01.29][3], node[hFTc1KGEQOO3lZMAYIOIaA], [P], v[3], s[INITIALIZING], a[id=LNjkbhGXSU-DwizjhEi0aA], unassigned_info[[reason=CLUSTER_RECOVERED], at[2016-01-29T08:37:49.831Z]]]
[logstash-2016.01.29][[logstash-2016.01.29][3]] BroadcastShardOperationFailedException[operation indices:monitor/stats failed]; nested: IllegalIndexShardStateException[CurrentState[RECOVERING] operations only allowed when shard state is one of [POST_RECOVERY, STARTED, RELOCATED]];
at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$BroadcastByNodeTransportRequestHandler.onShardOperation(TransportBroadcastByNodeAction.java:405)
at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$BroadcastByNodeTransportRequestHandler.messageReceived(TransportBroadcastByNodeAction.java:382)
at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$BroadcastByNodeTransportRequestHandler.messageReceived(TransportBroadcastByNodeAction.java:371)
at org.elasticsearch.transport.TransportService$4.doRun(TransportService.java:350)
at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
Caused by: [logstash-2016.01.29][[logstash-2016.01.29][3]] IllegalIndexShardStateException[CurrentState[RECOVERING] operations only allowed when shard state is one of [POST_RECOVERY, STARTED, RELOCATED]]
at org.elasticsearch.index.shard.IndexShard.readAllowed(IndexShard.java:974)
at org.elasticsearch.index.shard.IndexShard.acquireSearcher(IndexShard.java:808)
at org.elasticsearch.index.shard.IndexShard.docStats(IndexShard.java:628)
at org.elasticsearch.action.admin.indices.stats.CommonStats.<init>(CommonStats.java:131)
at org.elasticsearch.action.admin.indices.stats.TransportIndicesStatsAction.shardOperation(TransportIndicesStatsAction.java:165)
at org.elasticsearch.action.admin.indices.stats.TransportIndicesStatsAction.shardOperation(TransportIndicesStatsAction.java:47)
at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$BroadcastByNodeTransportRequestHandler.onShardOperation(TransportBroadcastByNodeAction.java:401)
... 7 more

szemlyanoy · January 29, 2016, 9:51am

Another question: how can I back up the Kibana dashboards at the filesystem level, to be safe in case of such failures?

Thanks

szemlyanoy · January 29, 2016, 10:48am

Unfortunately, I was forced to remove the .kibana index because it failed to start. I lost all my dashboards, and I am not sure how to avoid this issue in the future.
Could you please advise what is going on with Elasticsearch in my case?

BR
Sergey

dadoonet · January 29, 2016, 11:05am

What I did in that case: I opened the .kibana index with an older version of Elasticsearch and used the elasticsearch-knapsack plugin to export the .kibana docs to disk. Then I started a completely new instance of Elasticsearch 2.1.1, started Kibana, and imported the .kibana index again from disk.

Not sure if it's ideal but at least I was able to get back my dashboards.

dadoonet · January 29, 2016, 11:06am

Oh, I misread the thread. I was not hitting the same issue as you; in my case it was a mapping issue.

Feel free to ignore my comment...

szemlyanoy · February 2, 2016, 3:22pm

So, any ideas on this? Indexes keep crashing, which is pretty annoying.

Thanks
Sergey

warkolm · February 3, 2016, 10:17pm

I'd suggest you upgrade to the latest 2.1 release and see if that helps.

szemlyanoy · February 3, 2016, 10:31pm

So you mean downgrade, since I'm running 2.2.0?

warkolm · February 4, 2016, 2:44am

Ahh, well you mentioned 2.1.1 previously

szemlyanoy · February 4, 2016, 8:02am

Ah yes, sorry, it was automatically upgraded to 2.2.0 in the meantime.
So I was forced to recreate all indexes; only test data is stored there for now.

I would also like to understand how to back up the Kibana objects, namely searches, visualizations, and dashboards.

warkolm · February 4, 2016, 8:42pm

You can use snapshot + restore, or just export everything from Kibana manually.
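
As a rough illustration of the snapshot + restore route for the .kibana index, here is a minimal sketch using the Elasticsearch snapshot API with a shared-filesystem repository. The repository name, backup directory, snapshot name, and function names are all hypothetical. It also assumes the backup directory is listed under path.repo in elasticsearch.yml, which Elasticsearch requires for "fs" repositories.

```shell
#!/usr/bin/env bash
# Sketch: snapshot and restore only the .kibana index.
ES_URL="${ES_URL:-localhost:9200}"   # assumed cluster address
REPO="kibana_backup"                 # hypothetical repository name
REPO_DIR="/var/backups/es"           # hypothetical path; must be under path.repo

# JSON body that registers a shared-filesystem repository at the given path.
repo_body() {
  printf '{"type":"fs","settings":{"location":"%s"}}' "$1"
}

# JSON body that limits a snapshot or restore to a single index.
snapshot_body() {
  printf '{"indices":"%s","include_global_state":false}' "$1"
}

# Register the repository (a one-time step).
register_repo() {
  curl -s -XPUT "$ES_URL/_snapshot/$REPO" -d "$(repo_body "$REPO_DIR")"
}

# Take a snapshot named $1 that contains only .kibana.
snapshot_kibana() {
  curl -s -XPUT "$ES_URL/_snapshot/$REPO/$1?wait_for_completion=true" \
       -d "$(snapshot_body .kibana)"
}

# Restore .kibana from the snapshot named $1.
restore_kibana() {
  curl -s -XPOST "$ES_URL/_snapshot/$REPO/$1/_restore" \
       -d "$(snapshot_body .kibana)"
}

# Usage (requires a running cluster):
#   register_repo
#   snapshot_kibana kibana-2016-02-04
#   restore_kibana  kibana-2016-02-04
```

A restore only works if no open index with the same name exists, so an existing .kibana index has to be closed or deleted first.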
