Hi,
Env details:
I am using ELK 7.0.1 in a Kubernetes environment. I have a 9-node cluster with 3 master pods, 3 data pods and 3 client pods.
Memory and CPU limits are configured as:
Master pods: RAM: no limit, CPU: 1000m, JVM: -Xms1g -Xmx1g
Data pods: RAM: no limit, CPU: 2000m, JVM: -Xms4g -Xmx4g
Client pods: RAM: no limit, CPU: 2000m, JVM: -Xms4g -Xmx4g
Problem -
I am not getting a response from some of the Elasticsearch REST APIs.
GET _cluster/health - works.
GET _cat/health - works.
For example, GET /_cluster/health returns:
{"cluster_name":"elk-efkc","status":"green","timed_out":false,"number_of_nodes":9,"number_of_data_nodes":3,"active_primary_shards":136,"active_shards":273,"relocating_shards":0,"initializing_shards":0,"unassigned_shards":0,"delayed_unassigned_shards":0,"number_of_pending_tasks":4,"number_of_in_flight_fetch":0,"task_max_waiting_in_queue_millis":4964575,"active_shards_percent_as_number":100.0}
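One thing that stands out in this output, even though the status is green, is the pending-task queue. Below is a minimal Python sketch that pulls those fields out of the health response and converts the wait time to minutes. The function name summarize_pending is only illustrative, and the JSON is trimmed to the relevant fields from the response above.

```python
import json

# Health response from above, trimmed to the fields of interest.
health_json = (
    '{"cluster_name":"elk-efkc","status":"green",'
    '"number_of_pending_tasks":4,'
    '"task_max_waiting_in_queue_millis":4964575}'
)


def summarize_pending(health: dict) -> str:
    """Return a one-line summary of the pending cluster tasks."""
    pending = health["number_of_pending_tasks"]
    # Convert the longest queue wait from milliseconds to minutes.
    wait_min = health["task_max_waiting_in_queue_millis"] / 60000
    return (
        f"status={health['status']} "
        f"pending_tasks={pending} max_wait={wait_min:.1f} min"
    )


print(summarize_pending(json.loads(health_json)))
# prints: status=green pending_tasks=4 max_wait=82.7 min
```

So the oldest of the 4 pending tasks has been waiting in the queue for roughly 83 minutes.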
But there is NO response from _cat/nodes, _cat/shards, _cat/indices, _nodes/stats and many other such APIs. The curl command remains stuck for hours.
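To check which endpoints hang without leaving curl stuck, a small probe with a client-side timeout can help. This is only a rough sketch using the Python standard library; the base URL http://localhost:9200, the 10-second timeout, and the helper names build_url and probe are assumptions for illustration, not part of my actual setup.

```python
import urllib.error
import urllib.request

# Assumption: Elasticsearch is reachable here (e.g. via a port-forward).
ES_URL = "http://localhost:9200"

# The endpoints mentioned above: the first two respond, the rest hang.
ENDPOINTS = [
    "_cluster/health",
    "_cat/health",
    "_cat/nodes",
    "_cat/shards",
    "_cat/indices",
    "_nodes/stats",
]


def build_url(base: str, endpoint: str) -> str:
    """Join the base URL and an endpoint with exactly one slash."""
    return base.rstrip("/") + "/" + endpoint.lstrip("/")


def probe(base: str, endpoint: str, timeout_s: float = 10.0) -> str:
    """Call one endpoint and report its HTTP status or a failure reason."""
    try:
        with urllib.request.urlopen(build_url(base, endpoint), timeout=timeout_s) as resp:
            return f"HTTP {resp.status}"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status code.
        return f"HTTP {exc.code}"
    except OSError as exc:
        # Covers timeouts, refused connections and other socket errors.
        return f"no response ({exc})"


if __name__ == "__main__":
    for endpoint in ENDPOINTS:
        print(f"{endpoint}: {probe(ES_URL, endpoint)}")
```

The timeout only stops the client from waiting; it does not cancel whatever the request is blocked on inside the cluster.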
I saw an older post with a similar problem: "CAT api doesn't respond".
I have shared the response of GET /_nodes/hot_threads?threads=9999 here:
https://gist.github.com/aggarwalShivani/e49e3d359f06f6ec5e9a9e10067819db
Can you please help me figure out what is wrong with the cluster and how we can resolve it?
Thanks,
Shivani