How to prefer a match that covers a larger percentage of the doc over a longer match?

Lea_Shallev · October 11, 2019, 7:35am

Hi,

I’m using Elasticsearch to look up terms detected in a free-text sentence against an internal DB that has been indexed.

I have a problem when my query term appears in the DB with some tokens repeated.
For example, I search for the term “computer” and the DB includes 2 different documents:

1. “computer, dell computer”
2. “computer”

In this case the first doc (“computer, dell computer”) is returned because it contains more matches, but the second doc (“computer”) is the more appropriate response.
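
A rough sketch of why this ranking can happen, assuming the field uses the default BM25 similarity with k1 = 1.2 and b = 0.75 (an assumption, I have not confirmed the index settings). The helper name `bm25_tf_component` and the average field lengths are made up for illustration. IDF is left out because it is identical for both docs.

```python
def bm25_tf_component(tf, doc_len, avg_len, k1=1.2, b=0.75):
    """Textbook BM25 term-frequency factor (IDF omitted, same for both docs)."""
    # Length normalization: longer-than-average fields are penalized.
    norm = k1 * (1 - b + b * doc_len / avg_len)
    return tf * (k1 + 1) / (tf + norm)


# Doc A: "computer, dell computer" -> 3 tokens, "computer" occurs twice
# Doc B: "computer"                -> 1 token,  "computer" occurs once
for avg_len in (2.0, 10.0):  # hypothetical average field lengths
    score_a = bm25_tf_component(tf=2, doc_len=3, avg_len=avg_len)
    score_b = bm25_tf_component(tf=1, doc_len=1, avg_len=avg_len)
    winner = "A" if score_a > score_b else "B"
    print(f"avg_len={avg_len}: A={score_a:.3f} B={score_b:.3f} -> {winner}")
```

Under these assumptions the repeated token wins when the average field length in the index is large, because length normalization then barely penalizes the three-token doc.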

To clarify, there are cases where the repetition also appears in the query, so I can’t just remove duplicates.

I found this solution, but since I’m searching free text I want to combine it with analyzers and fuzziness too.

Is there an option to make sure that the text in the relevant field is fully contained in my query (taking fuzziness into account), or to change the scoring so that Elasticsearch prefers a match covering a larger percentage of the doc over a longer match?
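
To make the desired behaviour concrete, here is a minimal client-side sketch of the "percentage of the doc" idea. It is not an Elasticsearch feature. The names `tokenize`, `fuzzy_equal`, `coverage` and `rerank` are hypothetical, and `difflib.SequenceMatcher` with a 0.8 threshold is only a stand-in for edit-distance fuzziness. Each query token is consumed at most once, so repetition in the query still counts.

```python
import re
from difflib import SequenceMatcher


def tokenize(text):
    # Crude analyzer stand-in: lowercase and split on non-word characters.
    return [t for t in re.split(r"\W+", text.lower()) if t]


def fuzzy_equal(a, b, threshold=0.8):
    # Similarity ratio as a rough substitute for edit-distance fuzziness.
    return SequenceMatcher(None, a, b).ratio() >= threshold


def coverage(query, doc):
    """Fraction of doc tokens matched by a distinct query token."""
    remaining = tokenize(query)
    doc_tokens = tokenize(doc)
    if not doc_tokens:
        return 0.0
    matched = 0
    for tok in doc_tokens:
        for i, q in enumerate(remaining):
            if fuzzy_equal(tok, q):
                del remaining[i]  # each query token may be used only once
                matched += 1
                break
    return matched / len(doc_tokens)


def rerank(query, docs):
    # Highest coverage first; sorted() is stable, so ties keep their order.
    return sorted(docs, key=lambda d: coverage(query, d), reverse=True)


hits = ["computer, dell computer", "computer"]
print(rerank("computer", hits))
```

With this scoring the one-token doc is fully covered by the query, while the three-token doc is only one-third covered, so the shorter doc is ranked first.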

Here is my query:
GET technology/_search/
{
  "explain": true,
  "query": {
    "match": {
      "term": {
        "query": "computer",
        "fuzziness": "AUTO",
        "max_expansions": 2,
        "prefix_length": 1,
        "fuzzy_transpositions": true
      }
    }
  }
}

Thanks!

system · November 8, 2019, 7:35am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
