Guys,
let us say i have ElasticSearch index and it contains about 10,000 documents, i did random manual check on the field "building name" for few documents and found there are a lot of documents matching each others based on this field "building name" is there any way i can find all documents that are similar to each others using Elasticsearch capabilities
Look at "more like this".
thanks guys for your feedback, actually our challenge is that we do not have a specific text that we want to compare it ith all documents, each set of documents could be matching each others based on specific text, if we will use the sample below queries we need to identify the text for the search which is not the case.
or if there is no solution we will do a development API work to go through all documents and apply the below queries and then group the documents as per the scoring results.
this is the sample MLT query
{
"query": {
"more_like_this": {
"fields": [
"building"
],
"like_text": "my text will be here",
"min_term_freq": 1,
"max_query_terms": 25
}
}
}
This is sample phonetic-matching query
{
"query": {
"match": {
"building.phonetic": {
"query": "my text will be here",
"operator": "or"
}
}
}