Hi, I want to index around 10 million web pages (their HTML source) in Elasticsearch.
I then want to pass an HTML document as the search query and get back the indexed documents most similar to it.
So if I submit an HTML page that is already indexed, it should return that exact document; otherwise, it should return the nearest one.
How can I do this with Elasticsearch? Is there any sample code for this?
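
One possible starting point, sketched under assumptions rather than as a definitive answer: Elasticsearch has a `more_like_this` query that takes free text in `like` and returns documents sharing its most significant terms. The snippet below only builds the request body. The field name `content`, the helper names, and the parameter values are hypothetical, and it assumes the pages were indexed as text extracted the same way. Note that `more_like_this` ranks by term overlap, so it does not by itself guarantee that an already indexed page comes back first.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML document and drops the tags.

    Text inside <script> and <style> is kept too; filter it separately if needed.
    """

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.chunks.append(text)


def html_to_text(html):
    """Return the text content of `html`, with chunks joined by single spaces."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def build_similarity_query(html, field="content", size=10):
    """Build a more_like_this search body for the given HTML page.

    `field` is an assumed name for the indexed text field, not something
    taken from a real mapping.
    """
    return {
        "size": size,
        "query": {
            "more_like_this": {
                "fields": [field],
                "like": html_to_text(html),
                # Illustrative values: count terms that occur only once in the
                # input, and use up to 50 terms to form the query.
                "min_term_freq": 1,
                "max_query_terms": 50,
            }
        },
    }
```

The returned dictionary would be sent as the body of a `_search` request against the index holding the pages. If an exact hit for an already indexed page is required, one option (again an assumption) is to also store a hash of the page in a keyword field and look that up first.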