Duplicates in Dataset when reading from an Elasticsearch index with Spark


I am trying to read an unstructured index from Elasticsearch and load it into a Spark RDD. Sometimes the complete data ends up duplicated in the RDD. Has anybody faced a similar issue? Any recommendations?

Prashant Verma

