Unable to load a trained model on an ML node with sufficient memory

Hello

Here is the scenario.

I have two ML nodes (8 CPUs / 64 GB each) on Elastic 8.15.3.

I start by loading a large bge_m3 model with 1 allocation and 1 thread.
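For reference, this is roughly how I start the deployment (using the start trained model deployment API; the model ID matches the one in the error message further down):

```
POST _ml/trained_models/baai_bge_m3/deployment/_start?number_of_allocations=1&threads_per_allocation=1
```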

As you can see in the picture, one node has most of its memory consumed, while the second node has no model deployed and all of its memory free.

I then try to load a smaller model, such as bge-large-en-v1.5 or multilingual-e5-large, which should easily fit on the second node.
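The attempt looks like this (the model ID here is a placeholder for however the model was imported; substitute your own):

```
POST _ml/trained_models/multilingual-e5-large/deployment/_start?number_of_allocations=1&threads_per_allocation=1
```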

But instead, I get this error:

```json
{
  "error": {
    "root_cause": [
      {
        "type": "illegal_argument_exception",
        "reason": "not enough memory on node [sCw90xbzTs6faJw26Rozog] to assign model [baai_bge_m3]"
      }
    ],
    "type": "illegal_argument_exception",
    "reason": "not enough memory on node [sCw90xbzTs6faJw26Rozog] to assign model [baai_bge_m3]"
  },
  "status": 400
}
```

What is happening here? Is Elasticsearch not seeing my second node? I can't find which configuration setting to change to alter this behavior.
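In case it helps, this is how I check per-node ML memory (the ML memory stats API available in 8.x):

```
GET _ml/memory/_stats?human
```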

Regards