I am new to the ELK stack (Elasticsearch, Logstash, Kibana). I would like to understand how Logstash parses log files and loads them into Elasticsearch. For example, suppose I want to load around 250 GB of log file data from S3 into Elasticsearch using Logstash. How are those log files parsed and sent to Elasticsearch? Does Logstash first store all of the S3 log data locally and then parse it, or does it read the files from S3 line by line without downloading everything? Also, how can I increase the read and write throughput in Logstash? Does it support multithreaded processing?
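For context, the kind of pipeline I have in mind is roughly this (the bucket name, prefix, index name, and grok pattern are just placeholders for my setup):

```
input {
  s3 {
    bucket => "my-log-bucket"     # placeholder bucket name
    region => "us-east-1"
    prefix => "logs/"             # only read objects under this prefix
    # credentials omitted; they can come from the environment or an IAM role
  }
}

filter {
  grok {
    # placeholder pattern; the real one depends on the log format
    match => { "message" => "%{COMBINEDAPACHELOG}" }
  }
}

output {
  elasticsearch {
    hosts => ["http://localhost:9200"]
    index => "s3-logs-%{+YYYY.MM.dd}"
  }
}
```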
or does it read the files from S3 line by line without downloading everything?
The files are downloaded locally, but I don't remember whether they're downloaded one by one (i.e. you'll never have more than one file locally at any point in time) or whether all matching files are downloaded continuously, independent of the processing.
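Either way, the S3 input stages its downloads in a local directory before processing, and as far as I remember you can point that directory (and the bookkeeping file that tracks which objects were already read) at a location with enough space. A sketch, with example paths and a placeholder bucket name:

```
input {
  s3 {
    bucket => "my-log-bucket"                        # placeholder bucket name
    region => "us-east-1"
    temporary_directory => "/data/logstash/s3-tmp"   # where downloaded objects are staged
    sincedb_path => "/data/logstash/s3-sincedb"      # records which objects have been processed
  }
}
```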
Also, how can I increase the read and write throughput in Logstash? Does it support multithreaded processing?
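Throughput is mostly a matter of pipeline workers and batch size: the filter and output stages run on multiple worker threads, while each input runs on its own thread. Something along these lines in logstash.yml is a starting point (the numbers are only examples to tune against your CPU count and heap):

```
# logstash.yml -- illustrative values
pipeline.workers: 8        # threads running the filter/output stages
pipeline.batch.size: 250   # events collected per worker before flushing
```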
Thanks for responding... but if the files are downloaded locally, how is that efficient? The log files in S3 are very large, so how can they be parsed efficiently?
No, that's not what I'm suggesting... I'd like a clearer picture: if all the files, around 250 GB of data, are downloaded into local memory, how could all of that data be parsed from local memory?
Yeah, you are right, but does it download all the log files to disk at once, or one by one from S3? Because if it downloads all the files from S3 at once, the local disk won't be able to hold them all.