Hi all,
We've got a cluster of 8 nodes. Recently we started experiencing
intermittent hangs of our application (we've since added timeouts to
prevent this)… at the time, all nodes in the cluster were reporting
green, but on further examination, one node was throwing the below
errors.
A bounce of the node resolved our issues. Any ideas on what
happened/how to catch this in the future? Thanks!
Errors from the node that was reporting green:
[2011-12-16 15:46:48,435][WARN ][transport.netty ]
[prod-elasticsearch-r04] Exception caught on netty layer [[id:
0x7361b0bc, /10.180.35.110:60042 => /10.180.46.203:9300]]
java.io.StreamCorruptedException: invalid data length: 0
at org.elasticsearch.transport.netty.MessageChannelHandler.callDecode(MessageChannelHandler.java:137)
at org.elasticsearch.transport.netty.MessageChannelHandler.messageReceived(MessageChannelHandler.java:101)
at org.elasticsearch.common.netty.channel.SimpleChannelUpstreamHandler.handleUpstream(SimpleChannelUpstreamHandler.java:80)
at org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:564)
at org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:559)
at org.elasticsearch.common.netty.channel.Channels.fireMessageReceived(Channels.java:274)
at org.elasticsearch.common.netty.channel.Channels.fireMessageReceived(Channels.java:261)
at org.elasticsearch.common.netty.channel.socket.nio.NioWorker.read(NioWorker.java:351)
at org.elasticsearch.common.netty.channel.socket.nio.NioWorker.processSelectedKeys(NioWorker.java:282)
at org.elasticsearch.common.netty.channel.socket.nio.NioWorker.run(NioWorker.java:202)
at org.elasticsearch.common.netty.util.ThreadRenamingRunnable.run(ThreadRenamingRunnable.java:108)
at org.elasticsearch.common.netty.util.internal.DeadLockProofWorker$1.run(DeadLockProofWorker.java:44)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
at java.lang.Thread.run(Thread.java:636)
[2011-12-16 15:48:50,171][INFO ][node ]
[prod-elasticsearch-r04] {0.18.5}[6520]: stopping ...
[2011-12-16 15:48:50,601][WARN ][transport.netty ]
[prod-elasticsearch-r04] Exception caught on netty layer [[id:
0x7361b0bc, /10.180.35.110:60042 :> /10.180.46.203:9300]]
java.io.StreamCorruptedException: invalid data length: 0
at org.elasticsearch.transport.netty.MessageChannelHandler.callDecode(MessageChannelHandler.java:137)
at org.elasticsearch.transport.netty.MessageChannelHandler.cleanup(MessageChannelHandler.java:170)
at org.elasticsearch.transport.netty.MessageChannelHandler.channelDisconnected(MessageChannelHandler.java:119)
at org.elasticsearch.common.netty.channel.SimpleChannelUpstreamHandler.handleUpstream(SimpleChannelUpstreamHandler.java:112)
at org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:564)
at org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendUpstream(DefaultChannelPipeline.java:559)
at org.elasticsearch.common.netty.channel.Channels.fireChannelDisconnected(Channels.java:360)
at org.elasticsearch.common.netty.channel.socket.nio.NioWorker.close(NioWorker.java:595)
at org.elasticsearch.common.netty.channel.socket.nio.NioClientSocketPipelineSink.eventSunk(NioClientSocketPipelineSink.java:101)
at org.elasticsearch.common.netty.channel.DefaultChannelPipeline.sendDownstream(DefaultChannelPipeline.java:574)
at org.elasticsearch.common.netty.channel.Channels.close(Channels.java:720)
at org.elasticsearch.common.netty.channel.AbstractChannel.close(AbstractChannel.java:200)
at org.elasticsearch.transport.netty.NettyTransport$NodeChannels.closeChannelsAndWait(NettyTransport.java:706)
at org.elasticsearch.transport.netty.NettyTransport$NodeChannels.close(NettyTransport.java:695)
at org.elasticsearch.transport.netty.NettyTransport$5.run(NettyTransport.java:332)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
at java.lang.Thread.run(Thread.java:636)