Hello, I have a plan to use ingest pipelines on my cluster.
The plan is:
90 applications will send data to the Elasticsearch cluster, so I'll create around 200 pipelines on my cluster.
How should I scale the ingest nodes on my cluster to make this possible?
More important than how many pipelines you have is how complex they are and what your throughput is in terms of events/min, etc.
The number and size of the ingest nodes will depend on that. Also, unless otherwise specified, your other nodes are probably ingest nodes by default; if you want to isolate ingest processing to dedicated ingest-only nodes, you will need to remove the ingest role from the other nodes.
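On a self-managed node that should no longer do ingest work, that is just a role change in elasticsearch.yml (a minimal sketch, assuming Elasticsearch 7.9+ where roles are declared with node.roles; older versions used node.ingest: false instead):

```
# elasticsearch.yml -- a data node with the ingest role removed
node.roles: [ data ]
```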
Ingest nodes do not need much disk, but CPU and RAM matter. Understanding your ingest volume will help determine what your needs are.
If you provide a little perspective, perhaps we could suggest a starting point.
Hello @stephenb, I'm on a self-managed cluster. To explain the complexity of a pipeline: each pipeline checks a document field to decide the output index.
For example, if field == A, then index name == A, and the event throughput is about 1500 events/min per pipeline.
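Roughly like this sketch (the pipeline, field, and index names here are just placeholders to illustrate; a set processor can rewrite the _index metadata field based on a condition):

```
PUT _ingest/pipeline/route-by-app
{
  "description": "Route each document to an index based on a field value",
  "processors": [
    {
      "set": {
        "if": "ctx.app == 'A'",
        "field": "_index",
        "value": "index-a"
      }
    }
  ]
}
```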
For additional information, I'm running my cluster on VMs.
My cluster has:
1 load-balance node (this node currently also acts as an ingest node)
1 load-balance node (does not act as an ingest node)
1 master node
4 data nodes
Yes, I guess it is a coordinating node.
So is it fine for 1 ingest node to handle 1500 events/min * 200 pipelines? Does this mean my node can handle around 30k events/min total?
First, I misunderstood: I thought you meant 1500 EPM total, but you really meant about 30K EPM total. That should still be fine, as it is really only about 500 events/sec. You should probably be able to scale a 4 CPU / 8 GB RAM ingest node to roughly 7-10x that, about 3500-5000 events/sec, for low-to-medium pipeline complexity. This is just an estimate.
Basically, if the node becomes CPU- or RAM-bound you can increase those resources, up to about the size mentioned above, and then start scaling horizontally.
Again, without close inspection these are just guesses: if your pipelines are not complex you might get more throughput, and if they are more complex you might get less.
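If you want to measure rather than guess, the node stats API reports per-node and per-pipeline ingest counts, failures, and time spent, which shows whether a node or a particular pipeline is becoming the bottleneck:

```
# Per-node and per-pipeline ingest counts, failures, and time spent
GET _nodes/stats/ingest?filter_path=nodes.*.ingest
```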