Load-balancing APM traffic across Elasticsearch nodes

luisramos · June 23, 2020, 4:08pm

Hello, I've come across https://www.elastic.co/guide/en/kibana/current/production.html#load-balancing-es and believe this is the right approach for our Kibana setup.

We're also setting up APM Server, and was wondering if it is common to also run a coordinator ES node on each APM server host to achieve the same functionality.

What's the recommended approach here for APM - hardcoding the ES node addresses in the APM config or the above?

Thanks in advance!

axw · June 24, 2020, 1:35am

Hi @luisramos, welcome to the forum!

I don't think it's at all common for APM Server to have a coordinating ES node running locally.

APM Server is mostly just indexing data, and the few queries to ES it does make are cached. This is unlike Kibana, where queries to Elasticsearch are in the critical path and affect UI responsiveness.

We generally recommend that you position APM Server closer to the Agents to minimise time taken for agents to offload data. That doesn't preclude running a coordinating ES node, but I wouldn't expect it to make a significant difference.

luisramos · June 24, 2020, 4:29pm

Thanks, @axw - that clarifies my question!

We'll explore moving our APM server closer to our agents as you've recommended - that does make sense.

On a slightly different note, what's your recommendation on which Elasticsearch node types should APM server output to - dedicated ingest or directly to the data nodes? (we already have dedicated ingest nodes which our Beats are using, we're just not sure if there's added benefit of one over the other).

Thanks,

Luis

axw · June 25, 2020, 1:51am

On a slightly different note, what's your recommendation on which Elasticsearch node types should APM server output to - dedicated ingest or directly to the data nodes? (we already have dedicated ingest nodes which our Beats are using, we're just not sure if there's added benefit of one over the other).

At the moment, I don't think it would make much difference in general.

Data comes out of APM Server mostly fully formed. We do use ingest node, primarily for GeoIP enrichment and User-Agent parsing. This is mostly relevant for RUM. How heavy the impact on ingest node is depends somewhat on your services, e.g. if you have RUM traffic with many distinct User-Agents, then that would impact User-Agent parsing.

Beats modules (not all, but many) rely more heavily on ingest node pipelines for parsing log messages and so on, so it's more relevant there to scale ingest node independently.

system · July 15, 2020, 9:52pm

This topic was automatically closed 20 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
What type of node should I use for GET and PUT? Elasticsearch	6	450	June 10, 2020
Issue with APM Server through Logstash APM	3	534	April 3, 2019
Build a remote cluster for APM server APM server	5	419	June 10, 2022
Kibana pointing to ES cluster Kibana	3	2135	April 5, 2017
APM and AWS Elasticsearch APM	2	9325	June 4, 2018

Load-balancing APM traffic across Elasticsearch nodes

Related topics