Troubleshooting performance and load

Is there a centralise repository or knowledge base that we can start for general troubelshooting.

For scenrios not limiting to below.

  • Elastic fleet server Health Check. What are the things to watch out for?

  • Overloading. Which specific of observability is the highest possible cause?

  • Slow performance

  • Delay in logs

  • Polling gap

Kindly advice

Hello @Whoami1980

You can check below available documents for all components :

If you drill down or navigate further will find issues related to Elastic Agent :

Thanks!!

@Tortoise

Thanks for the the documents. Now thats quite a lot of information!!!

Is there a suggested workflow especially for troubleshooting issues?

For example. For a start a high level check that everything is working fine without any major concern or issue.

Next maybe we would like to go a bit deeper to gradually zoom into the issue and eventually the RCA

Or maybe I should ask another way. There are so many users support cases that customer open.

In general, when Elastic supports have access to customer environment web console or their diagnostics bundle

What are the things that they definitely support will check through or they will take note for further analysis?

I am coming from a perspective that as a customer how can I help support to help ourselves

so as to cut short some back and forth to resolve issue faster. Hope that clears my intention.

Have you looked at Elastic AutoOps, which I believe was recently launched?

@Christian_Dahlqvist Looks very cool. Unfortunately, I highly doubt we have the luxury due to whatever reasons. Back to square one.

Your ask seems really broad, a sort of Holy Grail for support teams. Real life usage is so varied, way too hard for one tool or checklist or approach. It's not Lord of the Rings :slight_smile:

If such a thing existed, every thread on here would be a lot less interesting.

What issues?

Thats noted, but you have already been pointed to different documentation and approaches and replied:

which IMO is not the friendliest reaction.

@RainTown No worries. Just trying to do my part within my limitations before things happen within my available resource. Understand there is no holy grail but there must be a certain approach to kick start, Anyway no worries. Will open up individual thread for specific issue when we face them. Thanks all

Why would you not have the luxury given that it is free and available with the free basic license as far as I can see?

@Christian_Dahlqvist i am totally aware is free. its hard to explain and probably not left to explain here. thanks for the assistance

One thing you can do is acquire knowledge.

e.g. Do some Elastic training courses, maybe you or one of the team get certified. It's not clear your current knowledge level, but you have opened 4 threads in 3 days already.

Secondly, you could offload a lot of day to day support by using Elastic Cloud.

And if you dont have one already, get a license and access to the official Elastic Support channels. These community forums are great, really great, but there is no SLA. As one of your other threads was around AD authentication, which IIRC is a licensed feature?, so I guess you have a license already.

1 Like

@RainTown Acquiring knowledge thats what i am trying!!! Either I will swim or drown. lol.

Well, when pointed at the docs, what I would certainly put into the knowledge bucket, you answered:

You are right, it is. Get reading!!

You want a short cut? Others have tried to do something similar, and come up with AutoOps, under banner which reads

"Real-time issue detection, automated root cause analysis, performance recommendations and cost insights for all self-managed clusters. No license required, no infrastructure to manage."

Read your initial post. To be quite honest, YOU seem precisely the target market for that product? But you somehow decided that it's a luxury not open to you, for reasons unspecified!?

Listen to good advice?

1 Like

@RainTown Is there any misunderstanding cause u didnt sound friendly? I am a direct person and I am not expecting to be spoon-fed. Maybe its the way i phrase my question. let me give u some background. I am in my 50s totally white hair man took up this job with a lot of environment constraint which resulted in high turnover rate thats all i can say. i guess i shall stop here. feel free to close the tread. thanks all for the assistance once again

I am also in my 50s. Closer to no hair than white hair :slight_smile: , sadly. That doesn't change the advice. Given the broadness of what you actually asked, I think the responses so far were pretty good advice and gave you a few options going forward. Maybe you expected something different, but a "silver bullet" tool/checklist would be a little optimistic. 2 of your your other threads with more specific questions got more specific answers, and the other 2 are only posted a couple of hours ago.

Anyone with our level of experience, and likely some scars, knows the difficulties when handed a "running system", often a critical environment with many constraints, a complex history, and limitations with what you can practically do/change. I wish you luck and success with it.

To "close" a thread, you can accept any answer you think is closest to resolving your query. Or none. People can, and do, respond to old threads every now and again too.

Good luck!

assuming there is a KPI here for u guys here. but sorry how do i close the thread without accepting a solution. kindly advice

This is a community forum, so there are no KPIs nor any need to explicitly close the thread or even accept an answer. I am here trying to help on my spare time so am like others not measured on anything. I believe that is the case even for people working at Elastic.

1 Like

Yes! That's true :wink: