Find similar records with More Like This (MLT) across millions of records

How can we find all similar or duplicate records (documents) based on certain fields? There are about 20 million documents. Can we assign weights to these fields so that they contribute collectively to the similarity score? What is the best way to do this in Elasticsearch?
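To make the question concrete, here is a minimal sketch of the kind of request I have in mind, not a tested solution. It builds one `more_like_this` clause per field inside a `bool` query and gives each clause its own `boost`, so the fields are weighted and scored together. The index name `records`, the document id `42`, the field names `title` and `description`, the weights, and the helper `build_mlt_query` are all hypothetical, and the term-selection values are placeholders that would need tuning.

```python
import json


def build_mlt_query(index, doc_id, field_weights, size=10):
    """Build a search body that looks for documents similar to an indexed one.

    field_weights maps a field name to a boost (hypothetical values).
    Each field gets its own more_like_this clause so it can carry its own weight.
    """
    should = []
    for field, weight in field_weights.items():
        should.append({
            "more_like_this": {
                "fields": [field],
                # Use an already indexed document as the "like" source.
                "like": [{"_index": index, "_id": doc_id}],
                # Placeholder term-selection settings; tune for the real data.
                "min_term_freq": 1,
                "min_doc_freq": 1,
                "max_query_terms": 25,
                # Per-field weight applied to this clause's score.
                "boost": weight,
            }
        })
    return {
        "size": size,
        "query": {"bool": {"should": should, "minimum_should_match": 1}},
    }


# Hypothetical index, document id, fields, and weights.
body = build_mlt_query("records", "42", {"title": 3.0, "description": 1.0})
print(json.dumps(body, indent=2))
```

The printed JSON could be sent to the `_search` endpoint of the index. Running this once per document over 20 million documents is the part I am unsure about in terms of cost.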

