Can we give more than 32GB Memory to dedicated Machine learning Node?

Umang_Pachaury · February 9, 2023, 3:09pm

As per the documentation it is recommended by Elasticsearch Team that every Elasticsearch node should have the memory slightly less than 32GB. Now My question is that does this apply to a dedicated Machine learning Node as well. And Even if we give more than 32GB memory to the Dedicated Machine Learning Node what might be the repercussions of that.

Regards
Umang Pachaury

droberts195 · February 9, 2023, 3:51pm

The recommendation is that the JVM heap size for Elasticsearch should be less than 32GB. This is so that the JVM can use compressed pointers. A JVM heap of 33GB will use space more wastefully than a JVM heap of 31GB.

The JVM heap size is different to the size of the machine/VM/container where the software is running. It's certainly possible and desirable at large scale for that to be bigger than 32GB.

So this question comes down to what you mean by "node". The Elastic docs are a bit confusing in this regard. Sometimes "node" is used to mean a JVM running Elasticsearch and sometimes "node" is used to mean the whole machine/VM/container that it's running on.

If you mean JVM heap size then no, you shouldn't give an ML JVM heap more than 32GB. In fact, ML nodes use most memory outside the JVM so the JVM on an ML node should be smaller than on a data node.

But you'll be able to run more native ML processes with more memory, so you can certainly have machines/VMs/containers for ML that are bigger than 32GB.

Umang_Pachaury · February 23, 2023, 9:42am

Hello,
We followed this configuration and gave less than 31GB to JVM heap now we have configured one ML job and observed that while the job is running we were able to monitor some changes in JVM heap utilization in kibana stack monitoring. But we didn't see any change in the total used RAM of our machine.
As per the documentation the ML job (ml processes) uses memory outside of JVM heap. Now we first observed the ram used while the job was in closed state and again while the job was in opened state and the datafeed was running, in both cases we were not able to see any changes in the used RAM figure. Can you explain this why is this happpening and why are we only able to see changes in JVM heap utilization and not in total RAM used.

droberts195 · February 23, 2023, 9:55am

How are you measuring total used RAM?

Umang_Pachaury · February 23, 2023, 11:04am

We are looking at the memory usage of the machine using free -m command.

droberts195 · February 23, 2023, 12:00pm

Try using top. While jobs are running you should see autodetect processes in the list of processes that are using CPU. And top will also show that they are using memory as well.

Umang_Pachaury · February 23, 2023, 12:05pm

Yes we did that . When our ml job was in opened state and when it was in closed state we examined the processes using that command. In both cases the figure was unchanged. But we could see some changes in kibana stack monitoring in case of jvm heap. But there was no change in the total ram used. As per my understanding the ml process should use the memory outside of jvm heap. so we should expect some changes in total ram used.

system · March 23, 2023, 12:06pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Max heap size in nodes elasticsearch Elasticsearch elastic-stack-machine-learning	2	483	August 9, 2021
Resource Utilization Machine Learning Elasticsearch elastic-stack-machine-learning	8	1588	June 16, 2022
ML node memory configuration Elastic Cloud Enterprise (ECE) elastic-stack-machine-learning	4	1720	February 26, 2020
JVM Heap size larger than 32 GB Elasticsearch	6	914	April 27, 2023
What happens when you go over 32GiB of JVM heap memory? Elasticsearch	6	1306	April 5, 2024

Can we give more than 32GB Memory to dedicated Machine learning Node?

Related topics