Monitor Logstash for missing data

Hi,

I set up a Logstash pipeline that pulls data from a SQL database through the JDBC connector and inserts it into an Elasticsearch index on a daily basis. The pipeline runs on a cron schedule and syncs with the SQL database on a date field.

My problem is:
How do I find the data that is not being inserted into the index? For example, 20 new rows are inserted or updated in the SQL database, but when the pipeline runs on schedule it only inserts 15 of them. The remaining 5 rows are not inserted, maybe because of an incorrect field value or some other reason.
So how can I track down this missing data?

Thanks in advance

Hey Vikram,

I would suggest adding an extra output to your Logstash conf file to see whether the data is being dropped while it is pulled by the JDBC input, or dropped while it is being ingested into Elasticsearch.
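
For example (just a sketch, the file path is arbitrary), a throwaway file output next to your elasticsearch output would show every event that actually comes out of the jdbc input:

output {
  # Debug-only output: dump every event pulled by the jdbc input to a local file
  # so the count can be compared against the rows in the SQL database.
  file {
    path => "/tmp/jdbc-events-debug.json"
    codec => "json_lines"
  }
}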

If the data is being dropped while ingesting into Elasticsearch, you can try sending such events to a failover index (with the help of an "if tags present" condition, roughly as sketched below),
or try to fix the error that occurs while ingesting the data into Elasticsearch.
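
A minimal sketch of what I mean by the tag condition in the output section (the tag name "_type_mismatch" and the index names are just placeholders, adjust them to your setup):

output {
  # Events that a filter has tagged as problematic go to the failover index,
  # everything else goes to the normal index.
  if "_type_mismatch" in [tags] {
    elasticsearch {
      hosts => "http://localhost:9200"
      index => "your-failover-index"
    }
  } else {
    elasticsearch {
      hosts => "http://localhost:9200"
      index => "your-main-index"
    }
  }
}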

Cheers!

Hi Sumit,

Thanks for the reply.

Yes, it is a case of data being dropped while ingesting into Elasticsearch.
The problem is how to implement the if condition in the output section of the conf file,
so that if some data (a SQL row) is not inserted, it goes to the failover index.
If you have any reference or link, please share it.
Here is my conf file:

input {
  jdbc {
    jdbc_connection_string => "jdbc:mysql://localhost:3306/sqldatabase"
    jdbc_user => "sqluser"
    jdbc_password => "sqlpassword"
    jdbc_driver_library => "/usr/share/logstash/logstash-core/lib/jars/mysql-connector-java-5.1.36.jar"
    jdbc_driver_class => "com.mysql.jdbc.Driver"
    statement => "SELECT * FROM table1"
  }
}

output {
  stdout { codec => "rubydebug" }

  elasticsearch {
    hosts => "http://localhost:9200"
    index => "sqldata"
    user => "XXX"
    password => "XXX"
  }
}

Hey Vikram,

Cool. For now I would suggest finding out why Elasticsearch rejected those specific records. You can do that with journalctl -fu logstash.service after starting the service, or by looking at your rubydebug output.

The most common reason for this is a data type mismatch, which you can fix by updating the index mapping in Elasticsearch,

or, once you know the reason, you can write a condition for that scenario to route the data to a failover index, for example with a filter like the one sketched below.
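
Since it is a type mismatch, here is a rough sketch of how such a tag could be set (the ruby filter and the "price" field are only an illustration, swap in whichever of your columns are mismatching); the output conditional from my earlier reply then routes on that tag:

filter {
  # Hypothetical validation: tag rows whose "price" column did not arrive as a number,
  # so the output section can send them to the failover index instead of dropping them.
  ruby {
    code => '
      value = event.get("price")
      unless value.is_a?(Numeric)
        event.set("tags", (event.get("tags") || []) + ["_type_mismatch"])
      end
    '
  }
}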

I ran into a similar exception while trying to ingest string data into a numeric field. I just updated the index mapping and reloaded the data to fix it.

Cheers!

Hi Sumit,

You are right, it is a data type mismatch. I checked with the journalctl command.
Multiple fields are coming through with mismatched data types.
I am trying to route the mismatched data to a different index.
One way is to implement multiple filter conditions; another is to use the dead letter queue for the missing data (rough sketch at the end of this post). Link: Dead Letter Queues (DLQ) | Logstash Reference [7.12] | Elastic.
Which approach is better when data is missing due to a data type mismatch or some other reason?
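
Roughly what I had in mind for the DLQ option (just a sketch: the DLQ has to be enabled first with dead_letter_queue.enable: true in logstash.yml, and the path below depends on the path.data setting, so it may differ on my machine):

input {
  # Read events that the elasticsearch output rejected (e.g. 400 mapping errors)
  # from the dead letter queue of the main pipeline.
  dead_letter_queue {
    path => "/var/lib/logstash/dead_letter_queue"
    pipeline_id => "main"
    commit_offsets => true
  }
}

output {
  elasticsearch {
    hosts => "http://localhost:9200"
    index => "sqldata-failover"
    user => "XXX"
    password => "XXX"
  }
}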

Hi Vikram,

The ideal way would be to update the data types in Kibana. You can do so with the help of index mappings.

In my opinion, managing rejected data with the DLQ is like healing after an injury. You can avoid the injury in the first place by updating the index mapping :slight_smile:

Hi Sumit,

Thanks for your suggestion. It helps me a lot.


This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.