I have a system where I grab lots of XML files throughout the day. I keep a
"sycned.xml" and a load of datestamped files too. Synced.xml is the latest
file which has been sycned with elasticsearch.
I run a program which works out the difference between the latest
timestamped file and "synced.xml" and depending on what has been added or
removed in the latest timestamped file, I update elasticsearch.
This means out of a large XML file, say 500,000 items, I only have to add
the 10,000 or so that have changed.
Can anyone recommend a way that I can check that elasticsearch and my
script is syncronised? In my mind I was thinking of generating a hash key
which I add to elasticsearch and log on my script. And if they don't match
then I know they are out of sync. That said, how do I achieve this?
Thank you for any help.
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to email@example.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/6ea631b5-f5f4-4ff2-9182-4251cd55e387%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.