Hello, every ones in a while we get into a state where one of our
servers reports high USER and System CPU, as indicated on this graph:
As you can tell, the rest of the cluster is pretty much idle, while
img699 is continuously hot with CPU
top - 14:28:46 up 737 days, 18:21, 1 user, load average: 12.03,
10.37, 10.25
Tasks: 125 total, 1 running, 124 sleeping, 0 stopped, 0 zombie
Cpu(s): 34.9%us, 33.9%sy, 0.0%ni, 18.8%id, 11.7%wa, 0.2%hi,
0.5%si, 0.0%st
Mem: 16472372k total, 16387968k used, 84404k free, 5952k
buffers
Swap: 9775544k total, 5632k used, 9769912k free, 6111504k
cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+
COMMAND
8625 root 20 0 9113m 8.4g 10m S 513.6 53.6 9021:31 java
Please see attached Jstack:
I am not really sure what its doing, all of the health status
indicators are idle, there is no merge or flush in progress, and left
alone, this server will be hot for days. The only way to resolve this
is to restart the process.
Shay, please let me know what you think.
-Jack