Should I include all Elasticsearch nodes (there are no dedicated masters, and there's one coordinating node) when I specify the list of ES nodes in the output block? I mean in case, I'm horizontally scaling, do I need to update all pipelines on every Logstash node?
I mean I see that this surely isn't something that'd block anything since I'm not doing it, but maybe there's overhead due to that?
It depends. If you are talking about hundreds of thousands of events per second, sure. If you're talking about scaling to store data for longer periods, no.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.