Decreasing Heap Size Results in Better TPS: How Can This Happen?

We have 5 shards (give each size) totaling 76.6GB on a single host, a
c3.8xlarge (http://aws.amazon.com/ec2/instance-types/) system with 60GB
RAM, 32 cores, and 2x320GB SSD.

Our records have multiple fields, but only a single ngram-analyzed field
on which we search. Since we do not use routing, each search must be
performed on all 5 shards of the host to get results.

We observed huge variations in search TPS as we decreased the Elasticsearch
heap size. Bigdesk images for both of the cases below are attached!

CASE 1)
When ess_heap_size = 16gb
Observed search TPS: 50

CASE 2)
When ess_heap_size = 30gb
Observed search TPS: 18
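For reference, heap sizes like those in the two cases above would typically
be set via the `ES_HEAP_SIZE` environment variable (the standard mechanism
in Elasticsearch 1.x; if your setup wraps this in its own `ess_heap_size`
setting, adjust accordingly):

```shell
# Standard Elasticsearch 1.x heap configuration: ES_HEAP_SIZE sets both
# -Xms and -Xmx, so the heap is fixed and never resizes at runtime.
ES_HEAP_SIZE=16g bin/elasticsearch   # CASE 1
ES_HEAP_SIZE=30g bin/elasticsearch   # CASE 2
```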

The surprising thing is that as we decreased ess_heap_size, search TPS
increased. No resource (CPU, memory, etc.) is fully utilized, OS-level
memory usage barely changed, and in both cases we observed a zigzag
pattern in ess_heap usage (repeated rises and drops, perhaps because the
large index has to be brought into RAM), with read I/Os following the same
zigzag pattern.

Please note that we have run this experiment multiple times and observed
the same pattern. Can you guide us on what is going wrong? Why does
decreasing ess_heap increase TPS? Should we decrease it further to achieve
better TPS, or are we doing something wrong?

-Thanks
Srikanth V.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f1d1bf51-ab4d-4242-94e5-4b5c3adf466c%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

So you believe in "the more heap, the better"? You have just proved this
assumption wrong. Note that in your test, GC took half a second at the
16GB heap, while at the 30GB heap GC took around a second (and perhaps
more overhead in stop-the-world pauses). This is a hint about the GC
scalability of current Java 7 JVMs.

Jörg

On Wed, Feb 18, 2015 at 7:45 PM, Srikanth Valiveti <vsrikanth.chinna@gmail.com> wrote:


A couple of questions pertaining to the above graphs:

  1. Yes, the GC time taken is doubled, but the frequency of GC cycles is
     much higher (almost double or triple) at 16gb compared to 30gb.

  2. Why is the full heap not utilized before memory is cleaned up? For
     example, with 30gb the maximum usage is around 20gb; with 16gb it is
     11gb.

  3. This looks like thrashing! Is it because of the large index size
     (76.6GB) on a single host?

PS: The GC used here is G1GC.
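To separate pause frequency from pause duration, GC logging can be enabled.
The flags below are standard HotSpot options available on Java 7 (they are
not taken from this thread), passed through the `ES_JAVA_OPTS` variable
that the Elasticsearch 1.x launch script reads:

```shell
# Enable G1 and GC logging so pause counts and durations can be compared
# at each heap size; PrintGCApplicationStoppedTime reports the total
# stop-the-world time the application actually experienced.
export ES_JAVA_OPTS="$ES_JAVA_OPTS \
  -XX:+UseG1GC \
  -XX:+PrintGCDetails \
  -XX:+PrintGCDateStamps \
  -XX:+PrintGCApplicationStoppedTime \
  -Xloggc:/var/log/elasticsearch/gc.log"
```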

On Thursday, 19 February 2015 00:54:25 UTC+5:30, Jörg Prante wrote:

A smaller JVM heap means more free RAM for the OS to cache hot pages from
your index. In general, you should give the JVM only as much as it needs
(will ever need), plus a bit more for safety, and give the rest to the OS
so it can keep the hot parts of your index in RAM.
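This point can be made concrete with back-of-envelope arithmetic using the
numbers from this thread (60GB host, 76.6GB index, 16GB vs. 30GB heap);
this is a sketch of the trade-off, not a benchmark:

```python
# RAM taken by the JVM heap is RAM the OS cannot use as page cache
# for the on-disk index; a smaller heap leaves more of the index hot.
TOTAL_RAM_GB = 60
INDEX_GB = 76.6

for heap_gb in (30, 16):
    cache_gb = TOTAL_RAM_GB - heap_gb
    print(f"heap={heap_gb} GB -> ~{cache_gb} GB left for the page cache "
          f"(~{cache_gb / INDEX_GB:.0%} of the index)")
```

At a 16GB heap roughly 57% of the index can sit in the page cache, versus
roughly 39% at a 30GB heap, which is consistent with the observed TPS gap.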

Mike McCandless

http://blog.mikemccandless.com

On Wed, Feb 18, 2015 at 3:50 PM, sri krishna <krishnainet@gmail.com> wrote:

We thought the same about the OS cache, but that was proved wrong: the
graphs of OS cache usage are almost identical in both cases.
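One way to check this more directly (a suggestion, not something from the
thread; the data path below is hypothetical and should be replaced with
your actual Elasticsearch data directory) is to measure how much of the
index is actually resident in the page cache:

```shell
# vmtouch reports how many pages of the given files are resident in RAM,
# which is more precise than eyeballing aggregate cache graphs.
vmtouch /var/lib/elasticsearch/*/nodes/0/indices
free -g   # the "cached" column shows total page-cache usage in GB
```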

On Thu, Feb 19, 2015 at 3:06 AM, Michael McCandless <mike@elasticsearch.com> wrote:
