Let's say I have two different indexes named index1 and index2.
They both contain a column/field, named C1 and C2. What I want to do is match the documents where C1 and C2 are the same and create a new index from them named matched; where C1 and C2 are not the same, the documents should go into a new index named mismatched.
One more thing I want to add: before the matching process, I also want to remove duplicate entries with the same C1 and C2 values from their respective indexes.
Are the field names C1 and C2 different or the same?
You may have to do this outside of the stack. Maybe use Python or Java Elasticsearch libraries to compare and re-index them? This would probably be a lot easier, but could become bulky in the long run if the index is huge.
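To make the suggestion above a bit more concrete, here is a minimal Python sketch of the compare step only. It is not a definitive implementation and rests on several assumptions: the documents have already been pulled out of index1 and index2 as plain dicts (for example with the `scan` helper from the Python Elasticsearch client), C1 lives in index1 and C2 in index2, the field values are hashable (strings or numbers), and "duplicate" means a repeated value of that field within one index. The function names `dedupe` and `split_matches` are made up for illustration.

```python
def dedupe(docs, field):
    """Keep only the first document seen for each value of `field`."""
    seen = set()
    unique = []
    for doc in docs:
        key = doc.get(field)
        if key not in seen:
            seen.add(key)
            unique.append(doc)
    return unique


def split_matches(docs1, docs2, field1="C1", field2="C2"):
    """Return (matched, mismatched) lists of documents.

    Assumption: a document from index1 "matches" when its field1 value
    also appears as a field2 value somewhere in index2. Matched holds the
    index1 side only; mismatched holds leftovers from both sides.
    """
    docs1 = dedupe(docs1, field1)
    docs2 = dedupe(docs2, field2)

    values1 = {d.get(field1) for d in docs1}
    values2 = {d.get(field2) for d in docs2}
    common = values1 & values2

    matched = [d for d in docs1 if d.get(field1) in common]
    mismatched = [d for d in docs1 if d.get(field1) not in common]
    mismatched += [d for d in docs2 if d.get(field2) not in common]
    return matched, mismatched


# Stand-in data; in practice these would be the _source of each hit.
index1_docs = [{"C1": "a"}, {"C1": "a"}, {"C1": "b"}]
index2_docs = [{"C2": "a"}, {"C2": "c"}]

matched, mismatched = split_matches(index1_docs, index2_docs)
```

The two resulting lists could then be written to the matched and mismatched indexes with the client's `bulk` helper. Since everything is held in memory, this only works as written while both indexes are small enough to fit; for huge indexes the comparison would need to be done in batches.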