Long GC pause on ES 1.7.0 with JDK 8u40


(Makeyang) #1

22042 2015-10-10T10:36:44.751+0800: 1954567.631: [GC (Allocation Failure) 1954567.631: [ParNew: 1606667K->41270K(1763584K), 1221.6222148 secs] 5895406K->4332232K(16581312K), 1221.6230128 secs] [Times: user=11780.20 sys=0.00, real=1221.44 secs]

Is this a JDK bug, or is there a way to avoid it?
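For anyone trying to pin down a pause like this, more detailed GC and safepoint logging usually helps. A minimal sketch using standard HotSpot flags on JDK 8 (the log path is just an example), added via ES_JAVA_OPTS before starting the node:

    export ES_JAVA_OPTS="$ES_JAVA_OPTS \
      -Xloggc:/var/log/elasticsearch/gc.log \
      -XX:+PrintGCDetails \
      -XX:+PrintGCDateStamps \
      -XX:+PrintGCApplicationStoppedTime \
      -XX:+PrintSafepointStatistics \
      -XX:PrintSafepointStatisticsCount=1"

PrintGCApplicationStoppedTime and the safepoint statistics show whether the stall is really GC work or time spent reaching and holding the safepoint.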


(Mark Walkom) #2

How much data is in your cluster, and how much heap?
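For reference, both numbers are easy to pull from the cat APIs (a sketch; default host and port assumed):

    curl -s 'localhost:9200/_cat/indices?v'   # per-index doc count and store size
    curl -s 'localhost:9200/_cat/nodes?v'     # per-node heap and RAM usage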


(Makeyang) #3

doc count: 4,695,567
index size: 4.7GB (primary shards + replicas)
index settings:
"number_of_shards": "10"
"number_of_replicas": "1"
node count: 4
heap committed: 30GB
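Those settings can be double-checked straight from the API (a sketch; the index name is hypothetical):

    curl -s 'localhost:9200/my_index/_settings?pretty'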


(Makeyang) #4

Could anyone please offer some suggestions?


(Makeyang) #5

Can anyone give any insight into this? It really bothers us a lot.


(Christian Dahlqvist) #6

A bit more information around the issue would be useful:

  • Did the long GC affect one or all nodes?
  • What does the workload look like?
  • What type of hardware are the nodes deployed on?
  • Is there anything else running on the host(s)?
  • Has swap been disabled? (a quick check is sketched below)
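On the swap question, a quick way to verify it on Linux with ES 1.x (a sketch; defaults assumed):

    swapon -s                # lists active swap devices; should print nothing
    curl -s 'localhost:9200/_nodes/process?pretty' | grep mlockall

    # and in elasticsearch.yml (ES 1.x setting name):
    # bootstrap.mlockall: true

mlockall: true in the node info means the heap is locked in RAM and cannot be swapped out.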

(Makeyang) #7

Which version of Elasticsearch and Java are you using?
[answer] As mentioned in the title: ES 1.7.0 and JDK 8u40.
Did the long GC affect one or all nodes?
[answer] Only one node.
What does the workload look like?
[answer] Writes at 500 TPS and searches at 300 TPS.
What type of hardware are the nodes deployed on?
[answer] 256GB RAM, 32-core Xeon 2.6GHz, 1TB spinning disk.
Is there anything else running on the host(s)?
[answer] Yes, there is another ES instance with the same memory config running on that machine.
Has swap been disabled?
[answer] Yes.
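One related thing worth checking: a 30GB heap sits just under the ~32GB compressed-oops cutoff, and losing compressed pointers wastes heap and increases GC pressure. A quick way to confirm on the same JDK 8 (standard HotSpot flags):

    java -Xmx30g -XX:+PrintFlagsFinal -version | grep -i UseCompressedOops
    # "true" means object pointers are still compressed at this heap size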


(Mark Walkom) #8

Are you using parent/child at all?


(Makeyang) #9

No, not at all.


(Makeyang) #11

Can anyone give any insight into this?


(Makeyang) #12

Can you give any insights into this issue? I know it is a tough issue, but you guys are the experts.


(Tin Le) #13

How many shards total for your cluster (primary and replica)?

Tin


(Makeyang) #14

26 primaries and 26 replicas, 52 shards in total.
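The actual layout per node can be confirmed with the cat API (a sketch; defaults assumed):

    curl -s 'localhost:9200/_cat/shards?v'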


(Tin Le) #15

You mentioned 300 TPS for search. What do your fielddata metrics look like?

You can get that from curl -s localhost:9200/_cluster/stats

It could be that your queries are using up the heap.
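To look at fielddata specifically rather than the whole cluster stats, the 1.x node stats and cat APIs break it down per node and per field (a sketch; defaults assumed):

    curl -s 'localhost:9200/_nodes/stats/indices/fielddata?fields=*&pretty'
    curl -s 'localhost:9200/_cat/fielddata?v'

If fielddata does turn out to be the culprit, it can be capped with the indices.fielddata.cache.size setting in elasticsearch.yml (the 30% value below is just an example):

    # elasticsearch.yml
    indices.fielddata.cache.size: 30%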

