I have an Elasticsearch cluster, version 7.8.1, running on Windows Server 2019. There are 3 data nodes, 3 master nodes, and 1 coordinating node in the cluster. The data nodes each have 48 GB of RAM, with 24 GB dedicated to the JVM heap.
I've noticed that with large queries in Kibana, hard faults (reads from virtual memory, i.e. disk) spike massively on my data nodes, but JVM heap usage doesn't get much past 60%.
Here's a snip of one of my nodes' hard faults vs. heap usage:
Any thoughts on why Elasticsearch isn't using more of the JVM heap? The typical proposed solution is "throw more RAM at it", but since it isn't using what it already has, I'm hesitant to do that.
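In case it's useful, here's a rough sketch of how the same per-node heap vs. OS memory numbers can be pulled straight from the nodes stats API, outside of Kibana. It assumes an unauthenticated cluster reachable at http://localhost:9200; adjust the URL and credentials for your own setup:

```python
# Minimal sketch: print JVM heap usage next to OS memory usage per node.
# Assumes http://localhost:9200 with no auth -- adjust for your cluster.
import requests

ES_URL = "http://localhost:9200"  # assumed endpoint

resp = requests.get(f"{ES_URL}/_nodes/stats/jvm,os")
resp.raise_for_status()

for node in resp.json()["nodes"].values():
    heap = node["jvm"]["mem"]
    os_mem = node["os"]["mem"]
    print(
        f"{node['name']}: "
        f"heap {heap['heap_used_percent']}% "
        f"({heap['heap_used_in_bytes'] / 2**30:.1f}/"
        f"{heap['heap_max_in_bytes'] / 2**30:.1f} GiB), "
        f"OS memory {os_mem['used_percent']}% used"
    )
```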
Elasticsearch 7.8.1 is no longer supported and has reached EOL; please upgrade.
If Elasticsearch is constantly accessing files from disk, the OS will transparently cache them for you, which will also reduce these "hard faults". So the question would be: how often are you querying the data?
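One rough way to answer that is to sample the per-node search counters from the nodes stats API over an interval. A sketch of the idea, assuming an unauthenticated cluster at http://localhost:9200 (adjust for your environment):

```python
# Rough sketch: sample per-node search counters twice and print queries/sec.
# The query_total counters are cumulative since node start, so the difference
# between two samples over a known interval gives a rate.
import time
import requests

ES_URL = "http://localhost:9200"  # assumed endpoint
INTERVAL = 60  # seconds between samples

def query_totals():
    resp = requests.get(f"{ES_URL}/_nodes/stats/indices/search")
    resp.raise_for_status()
    return {
        node["name"]: node["indices"]["search"]["query_total"]
        for node in resp.json()["nodes"].values()
    }

before = query_totals()
time.sleep(INTERVAL)
after = query_totals()

for name, total in after.items():
    rate = (total - before.get(name, total)) / INTERVAL
    print(f"{name}: {rate:.2f} queries/sec over the last {INTERVAL}s")
```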
Thanks for the info. I'm wondering whether the OS's transparent disk caching is any different on Windows/NTFS than on *nix? Is there any more documentation on this?