After increasing the RAM of all allocator nodes (this caused a sequential restart) they are no longer able to connect to ZooKeeper.
Here are a couple of logs I found:
services-forwarder.log
[2017-11-21 09:02:20,780][INFO ][org.apache.zookeeper.ClientCnxn] Opening socket connection to server 0.0.0.0/0.0.0.0:2181. Will not attempt to authenticate using SASL (unknown error) {}
[2017-11-21 09:02:20,780][WARN ][org.apache.zookeeper.ClientCnxn] Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect {}
java.net.ConnectException: Connection refused
at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:717)
at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:357)
at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1214)
Hi,
Can you please check if container frc-client-forwarders-client-forwarder is running and if there are any errors? Also, I would be great if you can send logs from that host.
I was able to resolve the problem thanks to a lot of help of @Yuri.
Thank you very much.
The issue was that my automation tried to perform the version upgrade multiple times and during a rollback of such an update some configuration got deleted.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.