Analysing arbitrary log files (newbie)

Hi

I would like to ingest and correlate messages from a number of log files that I have on disk. I have installed Elasticsearch, Kibana and Filebeat. I have configured Filebeat to read the log files from a directory, and I can see some indications of that in the Kibana Discover page.

So, I have signs of life.

There are some basic points that I need guidance on please. I am hoping that there is a suitable tutorial for my use case, i.e. how to ingest and analyse arbitrary log files, that I could be referred to. So, the points below could be answered directly here (thank you), or via a pointer to a tutorial (thanks even more). I know there are lots of tutorials, but none that quite seem to fit my needs.

How do I remove the messages that are showing as a result of previous iterations of changing the Filebeat config, so that I only see the latest messages? As matters stand, for example, I am seeing messages that appear to represent a directory listing, which is probably an artefact of an earlier configuration attempt.

How can I check what kind of data is being read from a given log file, so that I can see whether Filebeat is making sense of the log file format?

Since Filebeat will likely get confused by some of these log files, what do I do about that? For example, should I add a specific content format filter, and, if so, how?

Many thanks

Nathan

Hi mate!!!

First thing that calls my attention is that you don't mention Logstash. I don't know if that's an omission or you really didn't install it. In any case, Logstash is a key piece when it comes to processing your data.

I'm quite new to Elastic myself, so I'll share with you what I've been up to, as I'm quite happy with where it is leading me in terms of knowledge... although I can see that there is a huge way ahead.

Now, to try to answer your questions:

--> What I usually do while testing things is just delete the "old" index altogether and reindex the thing. There are other methods, like using the _reindex API, so you don't need to remove your original index... but in my case, deleting the old index and indexing again is just fine.
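Just to make that concrete, this is the kind of thing I run from Kibana Dev Tools (the index name below is only a made-up placeholder; run GET _cat/indices first to see what Filebeat actually created for you):

```
# List the indices you currently have
GET _cat/indices?v

# Delete the stale one so only freshly ingested messages remain
# (the name below is a made-up example)
DELETE /filebeat-7.17.0-2024.01.01
```

One caveat: Filebeat keeps a registry of files it has already read, so if you want it to re-ship the same files into a fresh index, you may also need to stop Filebeat and clear its registry (under Filebeat's data directory) before restarting it.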

--> I don't quite understand this one, but I think that's because in my case the one making sense of the file format is Logstash. At this time I'm using Filebeat basically just to ship the "raw" information, with the only exceptions being the multiline configuration, which I do at the Filebeat level, and adding one or two tags (a sketch of that is just below). When using Logstash, any filter that fails to apply will usually result in a tag on that doc (line of the log), similar to _grokparsefailure or _dateparsefailure, meaning that that specific doc did not match the respective filter it went through (grok, date, etc.). You can also add tags on failure with your own specific names.
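A minimal sketch of the Filebeat side, assuming a log input reading from a made-up path where each new log entry starts with a date (the path, pattern and tag are just placeholders), shipping everything to Logstash on the default Beats port:

```
filebeat.inputs:
  - type: log
    paths:
      - /var/log/myapp/*.log          # placeholder path
    # Any line that does NOT start with a date is treated as a
    # continuation of the previous line (e.g. stack traces)
    multiline.pattern: '^\d{4}-\d{2}-\d{2}'
    multiline.negate: true
    multiline.match: after
    tags: ["myapp"]                   # placeholder tag, handy for filtering later

output.logstash:
  hosts: ["localhost:5044"]           # assumes Logstash listens on the default Beats port
```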

--> Most of the filters I use with Logstash are "grok" filters: Grok filter plugin | Logstash Reference [8.11] | Elastic
Kibana comes with a "Grok Debugger" that will let you test your grok filters, and it works really great!
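For reference, a minimal sketch of the kind of filter block I mean, assuming log lines that look like "2024-01-31T10:15:00 INFO something happened" (you would work out your own pattern in the Grok Debugger):

```
filter {
  grok {
    # Assumed layout: "<ISO8601 timestamp> <log level> <rest of the message>"
    match => { "message" => "%{TIMESTAMP_ISO8601:timestamp} %{LOGLEVEL:level} %{GREEDYDATA:msg}" }
    # Optional: use your own tag instead of the default _grokparsefailure
    tag_on_failure => ["my_grok_failure"]
  }
  date {
    # Use the parsed timestamp as the event's @timestamp
    match => ["timestamp", "ISO8601"]
  }
}
```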

Good luck!!!

Thank you for following up.

I also thought it was odd not to have Logstash. I followed this guide to have Beats integrated with Elasticsearch, and that seemed to work (for a given definition of "work"):

That article notes that "Elastic has several methods for getting data in to Elasticsearch", of which Beats is one.

I started off here with the hope that I could simply ingest some log file data and start to make sense of it, but it is clearly going to be much more complicated than that. I shall persevere, but I can't help wondering what trick I am missing, as there ought to be a simpler starting point ...

Many thanks also for your other pointers.

Regards

Nathan
