Hello,
I hope this messages finds you and your loved ones safe and healthy.
I have a 3 node Elastic cluster with 2 data nodes and 1 voting node. Nodes are running Ubuntu 20.04.2 LTS with the stack at the latest current release - 7.12.1. Primary purpose of the stack is to collect data from 30+ honeypots & I will be creating virtual networks and labs to emulate attacker behavior and reate detection rules under Elastic SIEM. I plan to use ML analysis to create anomaly detection of tunneling data for exfiltration.
I am fortunate enough to be provided with a license by Elastic to continue my research (thank you very much team Elastic). However, when I try to create ML jobs I get an error that 2000/2000 shards are active.
I have also lost the ability to monitor the cluster and get the following error:
[index_closed_exception] closed, with { index_uuid="OeFKiokSRviOkkjE9i12JA" & index="metricbeat-7.9.3-2021.02.21-000005" }: Check the Elasticsearch Monitoring cluster network connection or the load level of the nodes.
HTTP 400
**Edit -1: **
The cluster just failed with the following error:
index [.async-search] blocked by: [TOO_MANY_REQUESTS/12/disk usage exceeded flood-stage watermark, index has read-only-allow-delete block];
Edit - 2:
I am unable to login and I am getting following error:
{"statusCode":429,"error":"Too Many Requests","message":"cluster_block_exception"}
Edit - 3:
I am getting following error while I try to start Kibana:
{"type":"log","@timestamp":"2021-05-01T14:12:04+00:00","tags":["fatal","root"],"pid":3188,"message":"Error: Unable to complete saved object migrations for the [.kibana_task_manager] index. Please check the health of your Elasticsearch cluster and try again. Error: [cluster_block_exception]: index [.kibana_task_manager_7.12.1_001] blocked by: [TOO_MANY_REQUESTS/12/disk usage exceeded flood-stage watermark, index has read-only-allow-delete block];\n at migrationStateActionMachine (/usr/share/kibana/src/core/server/saved_objects/migrationsv2/migrations_state_action_machine.js:138:13)\n at processTicksAndRejections (internal/process/task_queues.js:93:5)\n at async Promise.all (index 1)\n at SavedObjectsService.start (/usr/share/kibana/src/core/server/saved_objects/saved_objects_service.js:163:7)\n at Server.start (/usr/share/kibana/src/core/server/server.js:283:31)\n at Root.start (/usr/share/kibana/src/core/server/root/index.js:58:14)\n at bootstrap (/usr/share/kibana/src/core/server/bootstrap.js:100:5)\n at Command.<anonymous> (/usr/share/kibana/src/cli/serve/serve.js:169:5)"}
Edit - 4:
The cluster is yellow even though Elasticsearch is running on both the data and single voting node:
{
"cluster_name": "data-analytics-1",
"status": "yellow",
"timed_out": false,
"number_of_nodes": 3,
"number_of_data_nodes": 2,
"active_primary_shards": 1150,
"active_shards": 1150,
"relocating_shards": 0,
"initializing_shards": 0,
"unassigned_shards": 1150,
"delayed_unassigned_shards": 0,
"number_of_pending_tasks": 0,
"number_of_in_flight_fetch": 0,
"task_max_waiting_in_queue_millis": 0,
"active_shards_percent_as_number": 50.0
}
Edit - 5:
I just saw that one of the node's HDD has gone to 95% usage. This is the primary node in the cluster. However, there has been no influx of logs to justify this. It was 89% at the start of the day.
I want to do housekeeping on the cluster since I will be using it for the next six months to finish my research at college. I expect around 2 TB of more data will be written to the cluster. I was hoping to get assistance in making the cluster usable beyond just the ingestion of data. Historial data is very important for my analysis.
Could I kindly request assistance from the community for cleaning up things on the cluster.
Thank you very much.