Consider the following situation: the node has 100 GB of RAM, but the ES heap size is 30 GB.
Does this mean that ES will only ever use 30 GB of RAM? And the other 70 GB will go unused?
If not, then how does it use the memory?
P.S. I see lots of articles on setting the correct size of the heap, but can't find anything that describes what it is and how it works in relation to ES.
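(For context on what "the ES heap size is 30 GB" means in practice: the heap is the memory the JVM manages for Elasticsearch itself, and it is normally pinned with the standard JVM heap flags, e.g. in config/jvm.options or, on newer versions, a file under config/jvm.options.d or the ES_JAVA_OPTS environment variable. A minimal sketch of such a setting, with the exact file location depending on the version and install method:)

    # config/jvm.options (location varies by installation)
    # Min and max heap set to the same value so the JVM claims it up front
    -Xms30g
    -Xmx30g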
I think that's not quite the case. It's normal to see the Elasticsearch process using more memory than the configured heap size (usually not more than 2×). It also indirectly uses the rest of the available memory in the system via the filesystem cache. A node with 100GB of memory may well perform better than a node with 64GB of memory even though Elasticsearch's heap size is 30GB in both cases. The filesystem cache is very important. The extra memory is not wasted.
I'm not sure I understand the distinction. The OS provides the RAM that the JVM uses too, both for the heap and for everything else. Your original question was whether this extra memory was used or not, and the answer is that it is indeed used.
Elasticsearch (really Lucene) puts a lot of effort into accessing files in a way that increases the chances that the data it needs is already cached, and a larger filesystem cache can make that much easier.
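To illustrate the mechanism, here is a rough Java sketch (not Lucene's actual code; the file name example.segment is made up, and it assumes the file fits in a single mapping of under 2 GB): memory-mapping a file makes its bytes readable through the OS page cache, so repeated reads of already-cached pages never touch the disk and never consume JVM heap.

    import java.io.IOException;
    import java.nio.MappedByteBuffer;
    import java.nio.channels.FileChannel;
    import java.nio.file.Path;
    import java.nio.file.StandardOpenOption;

    public class MmapSketch {
        public static void main(String[] args) throws IOException {
            Path path = Path.of("example.segment"); // hypothetical file name
            try (FileChannel channel = FileChannel.open(path, StandardOpenOption.READ)) {
                // The mapped bytes are backed by the OS page cache, not the JVM heap.
                MappedByteBuffer buf = channel.map(FileChannel.MapMode.READ_ONLY, 0, channel.size());
                long sum = 0;
                while (buf.hasRemaining()) {
                    sum += buf.get(); // touching a page faults it into the cache if it isn't resident yet
                }
                System.out.println("byte sum = " + sum);
            }
        }
    }

The more spare RAM the OS has, the more of those pages stay resident, which is why the extra memory is not wasted.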
David, what I mean is this. Consider the following (pseudo)code, written as Java for concreteness:
    var content = java.nio.file.Files.readString(java.nio.file.Path.of("foo.txt")); // read the whole file into a heap-allocated String
All the application (ES) did was read the contents of the file into a variable, and RAM was allocated to the application for that variable. Behind the scenes, the OS may also have placed foo.txt into the filesystem cache so that the next time the file is read, the disk doesn't have to be touched. That RAM isn't used directly by the application; it's used by the OS on the application's behalf.
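And you can see that effect directly with something like this small Java sketch (foo.txt is just a placeholder path; the exact timings obviously depend on the machine and OS). The second read is usually much faster because the OS serves it from the page cache, while the JVM heap only ever holds the byte[] copies:

    import java.io.IOException;
    import java.nio.file.Files;
    import java.nio.file.Path;

    public class PageCacheDemo {
        public static void main(String[] args) throws IOException {
            Path path = Path.of("foo.txt"); // placeholder path from the example above

            long t0 = System.nanoTime();
            byte[] first = Files.readAllBytes(path);  // likely hits the disk (cold cache)
            long t1 = System.nanoTime();
            byte[] second = Files.readAllBytes(path); // likely served from the OS page cache
            long t2 = System.nanoTime();

            System.out.printf("first read:  %d bytes in %.2f ms%n", first.length, (t1 - t0) / 1e6);
            System.out.printf("second read: %d bytes in %.2f ms%n", second.length, (t2 - t1) / 1e6);
            // Only the byte[] copies live on the application's heap; the cached
            // pages that make the second read fast belong to the OS.
        }
    }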