I’m setting up a POC of ELK running on Azure and I’m trying to figure out an architecture that works for the POC but may also work for future uses. Right now I have ELK set up on a single Azure VM and I’m shipping a few application logs via Filebeat. I chose Filebeat over Logstash agents because there seemed to be some guarantee of at-least-once delivery for Filebeat, while I couldn’t find a similar guarantee for Logstash agents. Is that correct?
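For reference, the central side of what I have now is roughly the sketch below; the port and the Elasticsearch address are placeholders rather than my exact config:

```
# Central Logstash pipeline on the Azure VM (sketch; values are placeholders)
input {
  beats {
    port => 5044                      # Filebeat clients ship here
  }
}
output {
  elasticsearch {
    hosts => ["localhost:9200"]       # Elasticsearch on the same VM
  }
}
```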
Ideally, this Azure implementation will work for both our non-Azure applications (that’s currently working, though I’m not sure about future performance since I’m not fronting Logstash with a broker) and our future Azure applications.
Does this seem like a good approach? The Logstash book recommends fronting Logstash with Redis, but because my ELK stack is in Azure and would need to be accessed over the Internet by on-premises applications, Redis doesn't seem like a good option.
I chose Filebeat over Logstash agents because there seemed to be some guarantee of at-least-once delivery for Filebeat, while I couldn’t find a similar guarantee for Logstash agents. Is that correct?
There's no such difference if you use Logstash to ship logs via a lumberjack output/input.
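For example, a Logstash-to-Logstash link over the lumberjack protocol is just an output/input pair, roughly like this (host, port, and certificate paths are placeholders):

```
# Shipper side: Logstash instance running next to the application
output {
  lumberjack {
    hosts => ["elk.example.com"]      # central Logstash host (placeholder)
    port => 5043
    ssl_certificate => "/etc/logstash/lumberjack.crt"
  }
}

# Central side: Logstash instance on the ELK VM
input {
  lumberjack {
    port => 5043
    ssl_certificate => "/etc/logstash/lumberjack.crt"
    ssl_key => "/etc/logstash/lumberjack.key"
  }
}
```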
Does this seem like a good approach? The Logstash book recommends fronting Logstash with Redis, but because my ELK stack is in Azure and would need to be accessed over the Internet by on-premises applications, Redis doesn't seem like a good option.
Redis being a bad option because of the limited authentication options available, or what do you mean?
There's no such difference if you use Logstash to ship logs via a lumberjack output/input.
That is good to know since at some point, I see value in doing some or all of the filtering on client machines instead of having it all done on a central Logstash instance.
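For example, I could see the client-side pipeline eventually carrying a filter block like the one below before events are shipped to the central instance; the Apache pattern is just a stand-in for whatever our logs actually need:

```
# Hypothetical client-side filtering before shipping (pattern is illustrative only)
filter {
  grok {
    match => { "message" => "%{COMBINEDAPACHELOG}" }
  }
}
```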
Redis being a bad option because of the limited authentication options available, or what do you mean?
Essentially, Redis is not recommended for exposure to the public internet. So I'm trying to figure out the best approach for performance and reliability. Right now it doesn't matter much since this is all just a proof of concept, but given our current logging solution, I'm comfortable that the proof will be successful, so I don't want to paint myself into a corner.
If this were all in Azure (or AWS, or local, or wherever), I could easily stand up Redis or RabbitMQ as a broker. I'm just wondering, given the need to ship logs from both Azure and on-premises data centers, whether there is something I should do between the shippers and the central servers to mitigate not having a broker.
If the sources of the logs are on-disk files, you have a natural buffer and the need for a broker for buffering is not so great. Brokers are also useful for distributing load, but you're not there yet. I'd start without a broker and add one if the need arises.
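If you do add a broker later it's mostly an extra output/input pair, e.g. roughly like this with Redis (host and key are placeholders):

```
# Shipper-side pipeline: push events onto a Redis list
output {
  redis {
    host => "broker.example.com"
    data_type => "list"
    key => "logstash"
  }
}

# Indexer-side pipeline: pull events off the same list
input {
  redis {
    host => "broker.example.com"
    data_type => "list"
    key => "logstash"
  }
}
```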