I have upgraded from 16.x up to 17.0 in the past and had no problems
connecting a client only node to a master node using unicast settings.
Has there been a change in ver 17.1 behavior regarding networking
between multicast clusters and unicast clients?
After upgrading to 17.1, the unicast client only node is no longer
"seeing" the master node, and not connecting to the cluster. Cluster
names have not changed. No other changes were made in configuration,
except for updating the elastic jar to 17.1. I have also deleted the
data folders to start the cluster from fresh. The master node has
default network settings (ie uses multicast).
I have tested the unicast discovery and it works fine when running the
client and master node on the same computer.
The above posted problem involves a master node and client running on
different subnets (hence, using unicast discovery from the client
node). The master node is bound to 192.168.110.xx subnet, and the
client data only node is on 192.168.109.xx subnet.
Again, prior to 0.17.1, the client node had no problem connecting to
the master. Nothing else that I can think of has changed, other than
updating the elastic jar to 0.17.1.
I just tested that unicast discovery works on the same subnet using a
client only node and a master node running on the same machine. The
client only node is configured for unicast ping discovery, and the
master node is using default network configuration.
The client node (May Parker) is trying to connect to 192.168.109.138:9300 (the master node is Shuma-Gorath) yet it looks like a different node (Longshot) is responding instead.
Of course, Longshot is on a different cluster name so it May Parker should not be joining it.
I can't decipher the logs properly, but it seems like the presence of Long Shot (on the same subnet as May Parker but assigned a different cluster name) is interfering with May Parker joining Shuma-Gorath.
I don't have the time now, but I'm guessing that if I shutdown Longshot everything will work as expected.
In this sample, the client only node is named [Bes]. The target master that Bes is trying to connect to should be [Shuma-Gorath]. Again, [Longshot] seems to be getting in the way of success.
Heya, first, the logic did not change between 0.17.1 and 0.17.0, but, after
analyzing the output you provided, I think I found the problem. It does rely
on ordering of events, so its not really something that is consistent and
you might just started to see it.
In this sample, the client only node is named [Bes]. The target master that
Bes is trying to connect to should be [Shuma-Gorath]. Again, [Longshot]
seems to be getting in the way of success.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.