Hi guys,
We make use of transforms extensively in a production cluster (6 nodes of 20 CPU 32GB RAM, all nodes have all roles) with approximatively 250 transforms running in continuous mode. Some transforms have a frequency of 15m and some others have 1h (these high intervals are enough for our use case).
We experience recurring load spikes at the same moment every hour. When checking the transform stats in Kibana during the spikes I can see lots of (up to 60) transforms in state "indexing" at the same time.
While I totally understand the way the periodic scheduler works, I think it would be nice to support a "nice" mode cluster setting for transforms, which would allow Elasticsearch to automatically re-arrange transforms so that they don't run all at the same time. The workaround for now would be to stop/start transforms manually to restart the scheduler at another moment. This is cumbersome and also totally lost in the case of node restarts, because transforms will be restarted in this case, all at the same time !
What do you think of this idea? Is it worth an improvement issue on Github?
Do you have any insight about the issue we're experiencing?
Thanks,
David