Find similar records through MLT from millions records

How can we get all similar/duplicate record/document depending upon certain fields. There are 20 millions of documents. Can we have weightage on these fields collectively. What is the best way in elasticsearch.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.