Hi,
I am trying to index data on a 20 node cluster using the Bulk API through
the Java client in my application. After running my application for some
time, BigDesk shows that the load is taken mostly by a single node on the
cluster. I tried indexing my data in the two following ways:
- Added all the hosts and ports to the client using the addTransportAddresses
on the TransportClient instance (this should distribute the requests in a
RoundRobin fashion) - Setting the master's node.data configuration setting to false and
sending all bulk requests to the master node.
In both cases, i could see that the number of IndexRequests reported by
BigDesk is extremely high on one node, and 0 (or close to 0) on the
remainder of the cluster. Note that my client, does not use sniffing that
the cluster is unable to use Multicast (all hosts and ports are configured
in Unicast configuration).
Has anyone ever experienced this issue? And is there a way to ensure a
uniform distribution of load on the cluster?
Thanks in advance!
--