Elastic `_cluster/health` showing unassigned shards

Hello!

I inherited a 3-node Elasticsearch 7.17 cluster, and yesterday its health status turned red.

_cluster/health?pretty=true
{
  "cluster_name" : "graylog-production",
  "status" : "red",
  "timed_out" : false,
  "number_of_nodes" : 3,
  "number_of_data_nodes" : 3,
  "active_primary_shards" : 2208,
  "active_shards" : 2253,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 3,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_number" : 99.86702127659575
}
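As a quick sanity check on the numbers above, `active_shards_percent_as_number` looks like the active shard count divided by the sum of active, initializing, and unassigned shards. The Python sketch below only reproduces that arithmetic; the formula is an assumption that happens to match the reported figure, and the function name is made up for illustration.

```python
# Values copied from the _cluster/health output above.
health = {
    "active_shards": 2253,
    "initializing_shards": 0,
    "unassigned_shards": 3,
}


def active_percent(h):
    """Assumed formula: active / (active + initializing + unassigned) * 100."""
    total = h["active_shards"] + h["initializing_shards"] + h["unassigned_shards"]
    return 100.0 * h["active_shards"] / total


print(round(active_percent(health), 4))
```

So the 3 unassigned shards out of 2256 account for the whole gap below 100%.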

A rolling restart of the nodes usually clears things up, but not this time. I reviewed the Common Cluster Issues documentation and also noted that we have no snapshots to fall back on, so I am loath to use the reroute API until I understand it a little better.

I noticed that the unassigned shards report `no_valid_shard_copy`, but there is also a 4th shard that looks available. Its explain output says "the cluster has unassigned shards and cluster setting cluster.routing.allocation.allow_rebalance is set to indices_all_active". All 4 shards of this index are primaries, but there are only 3 nodes.

I was thinking there might be a way to copy the "good" shard onto the others (is that rebalancing?). Since this is log data, I am willing to accept data loss, though hopefully only as a last resort. Any suggestions or clarifications are appreciated.
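For reference, my current reading of the reroute API is that a primary can only be forced with `allocate_stale_primary` (reuse an on-disk copy the cluster no longer trusts) or `allocate_empty_primary` (start the shard empty), and that both require `accept_data_loss` to be set to `true`. The Python sketch below only builds the JSON body for `POST _cluster/reroute` and sends nothing. The helper name and the node chosen in the example are illustrative assumptions, not a recommendation.

```python
import json

# The two reroute commands that force a primary; both can lose data.
FORCE_COMMANDS = {"allocate_stale_primary", "allocate_empty_primary"}


def build_reroute_body(command, index, shard, node):
    """Build (but do not send) a _cluster/reroute request body."""
    if command not in FORCE_COMMANDS:
        raise ValueError("unsupported command: " + command)
    return {
        "commands": [
            {
                command: {
                    "index": index,
                    "shard": shard,
                    "node": node,
                    # Required by both commands; an explicit acknowledgement.
                    "accept_data_loss": True,
                }
            }
        ]
    }


# Hypothetical example: shard 0 on the node that holds its (corrupt) copy.
body = build_reroute_body("allocate_stale_primary", "graylog_39", 0, "elastic-02-in-prod")
print(json.dumps(body, indent=2))
```

Posting such a body with `?dry_run=true` should return the resulting cluster state without applying the change, which seems like the safer way to experiment.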

I am appending the `_cluster/allocation/explain` output for the bad shards, followed by the good shard for reference:

"Bad Shards"

root@elastic-03-in-prod:~# curl -X GET "http://elastic-03-in-prod:9200/_cluster/allocation/explain?pretty" -H 'Content-Type: application/json' -d'{"index": "graylog_39","shard": 0,"primary": true}'
{
  "index" : "graylog_39",
  "shard" : 0,
  "primary" : true,
  "current_state" : "unassigned",
  "unassigned_info" : {
    "reason" : "ALLOCATION_FAILED",
    "at" : "2023-06-21T16:45:40.991Z",
    "failed_allocation_attempts" : 2,
    "details" : "failed shard on node [POHn_aN0R-CE7kteDGVRfA]: shard failure, reason [corrupt file (source: [start])], failure CorruptIndexException[Problem reading index. (resource=/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/0/index/_b_Lucene84_0.tim)]; nested: NoSuchFileException[/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/0/index/_b_Lucene84_0.tim]; ",
    "last_allocation_status" : "no_valid_shard_copy"
  },
  "can_allocate" : "no_valid_shard_copy",
  "allocate_explanation" : "cannot allocate because all found copies of the shard are either stale or corrupt",
  "node_allocation_decisions" : [
    {
      "node_id" : "POHn_aN0R-CE7kteDGVRfA",
      "node_name" : "elastic-02-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787419136",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "in_sync" : true,
        "allocation_id" : "C-m7pTmoRJKzZglZBrydRg",
        "store_exception" : {
          "type" : "corrupt_index_exception",
          "reason" : "failed engine (reason: [corrupt file (source: [start])]) (resource=preexisting_corruption)",
          "caused_by" : {
            "type" : "i_o_exception",
            "reason" : "failed engine (reason: [corrupt file (source: [start])])",
            "caused_by" : {
              "type" : "corrupt_index_exception",
              "reason" : "Problem reading index. (resource=/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/0/index/_b_Lucene84_0.tim)",
              "caused_by" : {
                "type" : "no_such_file_exception",
                "reason" : "/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/0/index/_b_Lucene84_0.tim"
              }
            }
          }
        }
      }
    },
    {
      "node_id" : "UZTEb2h7Q7SiiBWAEvU8yg",
      "node_name" : "elastic-03-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787484672",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "found" : false
      }
    },
    {
      "node_id" : "tv0duWT1Q_OQ9Xslv7drEQ",
      "node_name" : "elastic-01-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787484672",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "found" : false
      }
    }
  ]
}
root@elastic-03-in-prod:~# curl -X GET "http://elastic-03-in-prod:9200/_cluster/allocation/explain?pretty" -H 'Content-Type: application/json' -d'{"index": "graylog_39","shard": 2,"primary": true}'
{
  "index" : "graylog_39",
  "shard" : 2,
  "primary" : true,
  "current_state" : "unassigned",
  "unassigned_info" : {
    "reason" : "ALLOCATION_FAILED",
    "at" : "2023-06-21T16:45:40.837Z",
    "failed_allocation_attempts" : 2,
    "details" : "failed shard on node [tv0duWT1Q_OQ9Xslv7drEQ]: shard failure, reason [corrupt file (source: [start])], failure CorruptIndexException[Problem reading index. (resource=/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/2/index/_b_Lucene84_0.tim)]; nested: NoSuchFileException[/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/2/index/_b_Lucene84_0.tim]; ",
    "last_allocation_status" : "no_valid_shard_copy"
  },
  "can_allocate" : "no_valid_shard_copy",
  "allocate_explanation" : "cannot allocate because all found copies of the shard are either stale or corrupt",
  "node_allocation_decisions" : [
    {
      "node_id" : "POHn_aN0R-CE7kteDGVRfA",
      "node_name" : "elastic-02-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787419136",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "found" : false
      }
    },
    {
      "node_id" : "UZTEb2h7Q7SiiBWAEvU8yg",
      "node_name" : "elastic-03-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787484672",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "found" : false
      }
    },
    {
      "node_id" : "tv0duWT1Q_OQ9Xslv7drEQ",
      "node_name" : "elastic-01-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787484672",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "in_sync" : true,
        "allocation_id" : "Du7lJexcQwWG7jcH45obTw",
        "store_exception" : {
          "type" : "corrupt_index_exception",
          "reason" : "failed engine (reason: [corrupt file (source: [start])]) (resource=preexisting_corruption)",
          "caused_by" : {
            "type" : "i_o_exception",
            "reason" : "failed engine (reason: [corrupt file (source: [start])])",
            "caused_by" : {
              "type" : "corrupt_index_exception",
              "reason" : "Problem reading index. (resource=/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/2/index/_b_Lucene84_0.tim)",
              "caused_by" : {
                "type" : "no_such_file_exception",
                "reason" : "/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/2/index/_b_Lucene84_0.tim"
              }
            }
          }
        }
      }
    }
  ]
}
root@elastic-03-in-prod:~# curl -X GET "http://elastic-03-in-prod:9200/_cluster/allocation/explain?pretty" -H 'Content-Type: application/json' -d'{"index": "graylog_39","shard": 3,"primary": true}'
{
  "index" : "graylog_39",
  "shard" : 3,
  "primary" : true,
  "current_state" : "unassigned",
  "unassigned_info" : {
    "reason" : "ALLOCATION_FAILED",
    "at" : "2023-06-21T16:39:25.682Z",
    "failed_allocation_attempts" : 2,
    "details" : "failed shard on node [POHn_aN0R-CE7kteDGVRfA]: shard failure, reason [corrupt file (source: [start])], failure CorruptIndexException[Problem reading index. (resource=/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim)]; nested: NoSuchFileException[/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim]; ",
    "last_allocation_status" : "no_valid_shard_copy"
  },
  "can_allocate" : "no_valid_shard_copy",
  "allocate_explanation" : "cannot allocate because all found copies of the shard are either stale or corrupt",
  "node_allocation_decisions" : [
    {
      "node_id" : "POHn_aN0R-CE7kteDGVRfA",
      "node_name" : "elastic-02-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787419136",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "in_sync" : true,
        "allocation_id" : "KnnvS0goSM2tlczol_G5Rg",
        "store_exception" : {
          "type" : "corrupt_index_exception",
          "reason" : "failed engine (reason: [corrupt file (source: [start])]) (resource=preexisting_corruption)",
          "caused_by" : {
            "type" : "i_o_exception",
            "reason" : "failed engine (reason: [corrupt file (source: [start])])",
            "caused_by" : {
              "type" : "corrupt_index_exception",
              "reason" : "Problem reading index. (resource=/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim)",
              "caused_by" : {
                "type" : "no_such_file_exception",
                "reason" : "/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim"
              }
            }
          }
        }
      }
    },
    {
      "node_id" : "UZTEb2h7Q7SiiBWAEvU8yg",
      "node_name" : "elastic-03-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787484672",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "found" : false
      }
    },
    {
      "node_id" : "tv0duWT1Q_OQ9Xslv7drEQ",
      "node_name" : "elastic-01-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787484672",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "no",
      "store" : {
        "found" : false
      }
    }
  ]
}
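Since these responses are long, here is a minimal Python sketch that boils one explain response down to one row per node: whether a copy of the shard was found on disk and, if so, which store exception it raised. The sample dict is a hand-trimmed copy of the shard 0 response above, and the function name and status labels are my own.

```python
# Hand-trimmed copy of the shard 0 explain response; unused fields dropped.
SAMPLE = {
    "index": "graylog_39",
    "shard": 0,
    "node_allocation_decisions": [
        {
            "node_name": "elastic-02-in-prod",
            "node_decision": "no",
            "store": {
                "in_sync": True,
                "allocation_id": "C-m7pTmoRJKzZglZBrydRg",
                "store_exception": {"type": "corrupt_index_exception"},
            },
        },
        {"node_name": "elastic-03-in-prod", "node_decision": "no", "store": {"found": False}},
        {"node_name": "elastic-01-in-prod", "node_decision": "no", "store": {"found": False}},
    ],
}


def summarize_copies(explain):
    """Return (node_name, node_decision, status) for each node in the response."""
    rows = []
    for decision in explain.get("node_allocation_decisions", []):
        store = decision.get("store", {})
        exc = store.get("store_exception")
        if exc is not None:
            status = "store_exception: " + exc["type"]
        elif store.get("found") is False:
            status = "no copy on disk"
        else:
            status = "copy present"
        rows.append((decision["node_name"], decision["node_decision"], status))
    return rows


for row in summarize_copies(SAMPLE):
    print(row)
```

For all three bad shards the picture is the same: exactly one node holds a copy, and that copy fails with `corrupt_index_exception`.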

"Good Shard"

root@elastic-03-in-prod:~# curl -X GET "http://elastic-03-in-prod.webtech.uits.iu.edu:9200/_cluster/allocation/explain?pretty" -H 'Content-Type: application/json' -d'{"index": "graylog_39","shard": 1,"primary": true}'
{
  "index" : "graylog_39",
  "shard" : 1,
  "primary" : true,
  "current_state" : "started",
  "current_node" : {
    "id" : "UZTEb2h7Q7SiiBWAEvU8yg",
    "name" : "elastic-03-in-prod",
    "transport_address" : "xx.xx.xx.xx:9300",
    "attributes" : {
      "ml.machine_memory" : "16787484672",
      "ml.max_open_jobs" : "512",
      "xpack.installed" : "true",
      "ml.max_jvm_size" : "15032385536",
      "transform.node" : "true"
    },
    "weight_ranking" : 2
  },
  "can_remain_on_current_node" : "yes",
  "can_rebalance_cluster" : "no",
  "can_rebalance_cluster_decisions" : [
    {
      "decider" : "cluster_rebalance",
      "decision" : "NO",
      "explanation" : "the cluster has unassigned shards and cluster setting [cluster.routing.allocation.allow_rebalance] is set to [indices_all_active]"
    }
  ],
  "can_rebalance_to_other_node" : "no",
  "rebalance_explanation" : "rebalancing is not allowed, even though there is at least one node on which the shard can be allocated",
  "node_allocation_decisions" : [
    {
      "node_id" : "tv0duWT1Q_OQ9Xslv7drEQ",
      "node_name" : "elastic-01-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787484672",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "yes",
      "weight_ranking" : 1
    },
    {
      "node_id" : "POHn_aN0R-CE7kteDGVRfA",
      "node_name" : "elastic-02-in-prod",
      "transport_address" : "xx.xx.xx.xx:9300",
      "node_attributes" : {
        "ml.machine_memory" : "16787419136",
        "ml.max_open_jobs" : "512",
        "xpack.installed" : "true",
        "ml.max_jvm_size" : "15032385536",
        "transform.node" : "true"
      },
      "node_decision" : "worse_balance",
      "weight_ranking" : 3
    }
  ]
}

Again, my thanks!
Robert

I went looking for more errors in the logs and here is what I see so far...

Elastic errors:

[2023-06-21T07:39:24,219][WARN ][o.e.c.r.a.AllocationService] [elastic-01-in-prod] failing shard [failed shard, shard [graylog_39][3], node[POHn_aN0R-CE7kteDGVRfA], [P], s[STARTED], a[id=KnnvS0goSM2tlczol_G5Rg], message [shard failure, reason [lucene commit failed]], failure [NoSuchFileException[/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim]], markAsStale [true]]
java.nio.file.NoSuchFileException: /var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim
        at sun.nio.fs.UnixException.translateToIOException(UnixException.java:92) ~[?:?]
        at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:106) ~[?:?]
        at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:111) ~[?:?]
        at sun.nio.fs.UnixFileSystemProvider.newFileChannel(UnixFileSystemProvider.java:224) ~[?:?]
        at java.nio.channels.FileChannel.open(FileChannel.java:308) ~[?:?]
        at java.nio.channels.FileChannel.open(FileChannel.java:367) ~[?:?]
        at org.apache.lucene.util.IOUtils.fsync(IOUtils.java:469) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.store.FSDirectory.fsync(FSDirectory.java:331) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.store.FSDirectory.sync(FSDirectory.java:286) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.store.FilterDirectory.sync(FilterDirectory.java:84) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.store.FilterDirectory.sync(FilterDirectory.java:84) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.store.LockValidatingDirectoryWrapper.sync(LockValidatingDirectoryWrapper.java:68) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.IndexWriter.startCommit(IndexWriter.java:5099) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.IndexWriter.prepareCommitInternal(IndexWriter.java:3460) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.IndexWriter.commitInternal(IndexWriter.java:3770) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.IndexWriter.commit(IndexWriter.java:3728) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.elasticsearch.index.engine.InternalEngine.commitIndexWriter(InternalEngine.java:2793) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.engine.InternalEngine.flush(InternalEngine.java:2075) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.shard.IndexShard.flush(IndexShard.java:1432) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.shard.IndexShard$8.doRun(IndexShard.java:3818) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:777) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:26) ~[elasticsearch-7.17.10.jar:7.17.10]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144) [?:?]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642) [?:?]
        at java.lang.Thread.run(Thread.java:1623) [?:?]
[2023-06-21T07:39:24,418][INFO ][o.e.c.r.a.AllocationService] [elastic-01-in-prod] Cluster health status changed from [GREEN] to [RED] (reason: [shards failed [[graylog_39][3]]]).
[2023-06-21T07:39:25,682][WARN ][o.e.c.r.a.AllocationService] [elastic-01-in-prod] failing shard [failed shard, shard [graylog_39][3], node[POHn_aN0R-CE7kteDGVRfA], [P], recovery_source[existing store recovery; bootstrap_history_uuid=false], s[INITIALIZING], a[id=KnnvS0goSM2tlczol_G5Rg], unassigned_info[[reason=ALLOCATION_FAILED], at[2023-06-21T16:39:24.209Z], failed_attempts[1], delayed=false, details[failed shard on node [POHn_aN0R-CE7kteDGVRfA]: shard failure, reason [lucene commit failed], failure NoSuchFileException[/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim]], allocation_status[no_valid_shard_copy]], message [shard failure, reason [corrupt file (source: [start])]], failure [CorruptIndexException[Problem reading index. (resource=/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim)]; nested: NoSuchFileException[/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim]; ], markAsStale [true]]
org.apache.lucene.index.CorruptIndexException: Problem reading index. (resource=/var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim)
        at org.apache.lucene.index.SegmentCoreReaders.<init>(SegmentCoreReaders.java:144) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.SegmentReader.<init>(SegmentReader.java:83) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.ReadersAndUpdates.getReader(ReadersAndUpdates.java:171) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.ReadersAndUpdates.getReadOnlyClone(ReadersAndUpdates.java:213) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.IndexWriter.lambda$getReader$0(IndexWriter.java:571) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.StandardDirectoryReader.open(StandardDirectoryReader.java:108) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.IndexWriter.getReader(IndexWriter.java:629) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.DirectoryReader.open(DirectoryReader.java:121) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.DirectoryReader.open(DirectoryReader.java:97) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.elasticsearch.index.engine.InternalEngine.createReaderManager(InternalEngine.java:669) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.engine.InternalEngine.<init>(InternalEngine.java:261) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.engine.InternalEngine.<init>(InternalEngine.java:199) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.engine.InternalEngineFactory.newReadWriteEngine(InternalEngineFactory.java:14) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.shard.IndexShard.innerOpenEngineAndTranslog(IndexShard.java:2064) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.shard.IndexShard.openEngineAndRecoverFromTranslog(IndexShard.java:2028) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.shard.StoreRecovery.internalRecoverFromStore(StoreRecovery.java:472) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.shard.StoreRecovery.lambda$recoverFromStore$0(StoreRecovery.java:90) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.action.ActionListener.completeWith(ActionListener.java:436) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.shard.StoreRecovery.recoverFromStore(StoreRecovery.java:88) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.index.shard.IndexShard.recoverFromStore(IndexShard.java:2361) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.action.ActionRunnable$2.doRun(ActionRunnable.java:62) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:777) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:26) ~[elasticsearch-7.17.10.jar:7.17.10]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144) [?:?]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642) [?:?]
        at java.lang.Thread.run(Thread.java:1623) [?:?]
Caused by: java.nio.file.NoSuchFileException: /var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim
        at sun.nio.fs.UnixException.translateToIOException(UnixException.java:92) ~[?:?]
        at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:106) ~[?:?]
        at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:111) ~[?:?]
        at sun.nio.fs.UnixFileSystemProvider.newFileChannel(UnixFileSystemProvider.java:224) ~[?:?]
        at java.nio.channels.FileChannel.open(FileChannel.java:308) ~[?:?]
        at java.nio.channels.FileChannel.open(FileChannel.java:367) ~[?:?]
        at org.apache.lucene.store.MMapDirectory.openInput(MMapDirectory.java:238) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.elasticsearch.index.store.FsDirectoryFactory$HybridDirectory.openInput(FsDirectoryFactory.java:126) ~[elasticsearch-7.17.10.jar:7.17.10]
        at org.apache.lucene.store.FilterDirectory.openInput(FilterDirectory.java:100) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.store.FilterDirectory.openInput(FilterDirectory.java:100) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.codecs.blocktree.BlockTreeTermsReader.<init>(BlockTreeTermsReader.java:141) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.codecs.lucene84.Lucene84PostingsFormat.fieldsProducer(Lucene84PostingsFormat.java:441) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.codecs.perfield.PerFieldPostingsFormat$FieldsReader.<init>(PerFieldPostingsFormat.java:315) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.codecs.perfield.PerFieldPostingsFormat.fieldsProducer(PerFieldPostingsFormat.java:395) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        at org.apache.lucene.index.SegmentCoreReaders.<init>(SegmentCoreReaders.java:114) ~[lucene-core-8.11.1.jar:8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:46:43]
        ... 25 more
[2023-06-21T07:45:38,828][WARN ][o.e.i.e.Engine           ] [elastic-01-in-prod] [graylog_39][2] failed engine [lucene commit failed]
java.nio.file.NoSuchFileException: /var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/2/index/_b_Lucene84_0.tim
...
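To see at a glance which shard files these log entries refer to, a small Python sketch like the one below can pull the index UUID, shard number, and file name out of each path. The regular expression is an assumption based on the path layout seen in these logs (`.../indices/<uuid>/<shard>/index/<file>`), and the two sample lines are copied from the excerpts above.

```python
import re

# Assumed layout: .../indices/<index uuid>/<shard number>/index/<file name>
PATH_RE = re.compile(r"/indices/(?P<uuid>[^/\s]+)/(?P<shard>\d+)/index/(?P<file>[\w.]+)")

SAMPLE_LOG = """\
java.nio.file.NoSuchFileException: /var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/3/index/_l_Lucene84_0.tim
java.nio.file.NoSuchFileException: /var/lib/elasticsearch/nodes/0/indices/utxx9Zc_TKaeZqRwpTRkfw/2/index/_b_Lucene84_0.tim
"""


def shard_files_mentioned(log_text):
    """Return sorted, de-duplicated (index_uuid, shard, file_name) tuples."""
    found = set()
    for match in PATH_RE.finditer(log_text):
        found.add((match["uuid"], int(match["shard"]), match["file"]))
    return sorted(found)


for entry in shard_files_mentioned(SAMPLE_LOG):
    print(entry)
```

Running the same function over the full log file would show whether any files other than the `.tim` ones are reported missing.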
