Comparing data from a RDBMS to Elasticsearch

newbie_here · August 21, 2017, 8:04am

Hi All,

We have a RDBMS source, from where we are indexing the data into indices in elasticsearch.
For reconcillation, we have run few sum and count aggregations on both the source and destination data, but found few discrepancies.

Since the data is in 100 millions, is there a simple way to compare which data-point is missing or where the mistakes are.

P.S : The composite key of the RDBMS is the _id field of the elasticsearch index

abdon · August 25, 2017, 11:41am

There is no simple way, but this blog post may give you some pointers in the right direction: https://www.elastic.co/blog/elasticsearch-verifying-data-integrity-with-external-data-stores

system · September 22, 2017, 11:42am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to re-verify data consistency with external RDBMS source Elasticsearch	5	682	August 10, 2017
Detect difference between elastic search and SQL databse Elastic Community and Ecosystem	2	2751	July 6, 2017
Querying differences between indexes Elasticsearch	2	386	April 17, 2018
RDBMS vs ES for list data Elasticsearch	2	336	January 8, 2019
Verifying data consistency between Oracle and ES Elasticsearch	2	443	July 30, 2018

Comparing data from a RDBMS to Elasticsearch

Related topics