I am using the Elastic Endpoint agents with 7.9.2.
Elasticsearch is running in a single node cluster
I am getting an error when in the network screen saying 5 of 7 shards have failed.
curl -u elastic -X GET "127.0.0.1:9200/_cluster/health"?pretty=true
{
"cluster_name" : "elasticsearch",
"status" : "yellow",
"timed_out" : false,
"number_of_nodes" : 1,
"number_of_data_nodes" : 1,
"active_primary_shards" : 287,
"active_shards" : 287,
"relocating_shards" : 0,
"initializing_shards" : 0,
"unassigned_shards" : 20,
"delayed_unassigned_shards" : 0,
"number_of_pending_tasks" : 0,
"number_of_in_flight_fetch" : 0,
"task_max_waiting_in_queue_millis" : 0,
"active_shards_percent_as_number" : 93.48534201954396
One of the indices is failing. I am not sure how to fix it.
curl -u elastic -X GET "127.0.0.1:9200/_cluster/allocation/explain"?pretty=true
{
"index" : ".ds-logs-endpoint.events.network-default-000001",
"shard" : 0,
"primary" : false,
"current_state" : "unassigned",
"unassigned_info" : {
"reason" : "CLUSTER_RECOVERED",
"at" : "2020-10-20T19:32:30.113Z",
"last_allocation_status" : "no_attempt"
},
"can_allocate" : "no",
"allocate_explanation" : "cannot allocate because allocation is not permitted to any of the nodes",
"node_allocation_decisions" : [
{
"node_id" : "Q2WsarzlQMOXy-Ptu5RF2A",
"node_name" : "ml-monitor2",
"transport_address" : "127.0.0.1:9300",
"node_attributes" : {
"ml.machine_memory" : "8349188096",
"xpack.installed" : "true",
"transform.node" : "true",
"ml.max_open_jobs" : "20"
},
"node_decision" : "no",
"deciders" : [
{
"decider" : "same_shard",
"decision" : "NO",
"explanation" : "a copy of this shard is already allocated to this node [[.ds-logs-endpoint.events.network-default-000001][0], node[Q2WsarzlQMOXy-Ptu5RF2A], [P], s[STARTED], a[id=FH2eDC0-QUWUtUpEwt2Uyw]]"
}
]
}
]
}
I have lots of duplicate shards, this is a small subset. I am not sure how to clean them up. One is started, the other is not assigned.
curl -u elastic -X GET "127.0.0.1:9200/_cat/shards"?pretty=true
.ds-logs-endpoint.events.network-default-000001 0 p STARTED 50943 19.7mb 127.0.0.1 ml-monitor2
.ds-logs-endpoint.events.network-default-000001 0 r UNASSIGNED
.ds-metrics-system.process_summary-default-000001 0 p STARTED 10628 2.1mb 127.0.0.1 ml-monitor2
.ds-metrics-system.process_summary-default-000001 0 r UNASSIGNED
.ds-metrics-system.cpu-default-000001 0 p STARTED 10628 3.1mb 127.0.0.1 ml-monitor2
.ds-metrics-system.cpu-default-000001 0 r UNASSIGNED
.siem-signals-default-000001 0 p STARTED 0 208b 127.0.0.1 ml-monitor2
.siem-signals-default-000001 0 r UNASSIGNED
Any thoughts?
thanks,
Geoff