Hi guys, I was wondering that the maximum scroll context is a node level property across all the indices right? If that’s true, then does it mean that any elastic cluster wherein let’s assume number of primary shards = number of data nodes, with each primary shard on 1 data node, the cluster cannot handle more than 1024 concurrent / parallel scroll requests? (Considering we have set the limit as 1024).
Also, if the data only exists on the 5 nodes, will the scroll instantly close on the remaining nodes, or will it continue to stay open till the time the entire polling is completed from these 5 nodes as well?
That’s what I expected to happen, but apparently I created 2 scenarios.
Routing with routing partition = 5, and no routing.
In case of routing, each scroll request (containing the routing id) was only opening 5 scrolls, whereas in case of no routing, the scroll request was opening scrolls on all the primary shards.
Also just to mention, this is not associated with the other thread…this is a completely different .
What does that mean? Maybe share/show what you actually did to "apparently create 2 scenarios" ?
It's an entirely different cluster? Just coincidence that the scroll request with routing id targeted only 5 nodes, 5 being the magic number in the other thread?
Basically creating 2 indices and populating them with the same data, however one had routing enabled and the other didn’t have any explicit custom routing enabled.
5 is our routing_partition which we have defined in the configuration, and hence it’s the magic number in both the threads
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.