How to softly exclude ambiguous words?

S-Dragon0302 · July 18, 2025, 4:13am

Because "apple" is hit, not just "apple care".How to implement this query_string?

Tortoise · July 18, 2025, 6:31am

As per the table :

POST /fruits/_doc
{
  "message": ["usa", "apple"]
}

POST /fruits/_doc
{
  "message": ["usa", "apple", "banana"]
}

POST /fruits/_doc
{
  "message": ["usa", "apple", "apple care"]
}

POST /fruits/_doc
{
  "message": ["usa", "apple care"]
}

POST /fruits/_doc
{
  "message": ["usa", "apple", "banana", "apple care"]
}

#ambigous word = None
GET /fruits/_search
{
  "query": {
    "bool": {
      "must": [
        { "match": { "message": "usa" }},
        { "match": { "message": "apple" }}
      ],
      "must_not": [
        { "match": { "message": "banana" }}
      ]
    }
  }
}

#ambigous word = apple care
GET /fruits/_search
{
  "query": {
    "bool": {
      "must": [
        { "match": { "message": "usa" }},
        { "match": { "message": "apple" }}
      ],
      "must_not": [
        { "match": { "message": "banana" }}
      ],
      "filter": {
        "script": {
          "script": {
            "source": """
              def kws = doc['message.keyword'].size() == 0 ? [] : doc['message.keyword'];
              return !kws.contains('apple care') || kws.contains('apple');
            """,
            "lang": "painless"
          }
        }
      }
    }
  }
}

Thanks!!

S-Dragon0302 · July 18, 2025, 6:45am

Using scripts can achieve this, but the performance is too poor. I have hundreds of billions of data entries, and the storage is at the PB level.

Mark_Harwood1 · July 20, 2025, 8:47am

Maybe of interest: in this demo users can choose which interpretation of the ambiguous search term “ice” they want and search for only that. This uses clustering and binary vectors so may require changes to both indexing and user behaviours but does solve this sort of problem.

Topic		Replies	Views
Помогите составить запрос с исключениями Вопросы на русском языке	10	3749	April 5, 2017
None of these words - syntax Elasticsearch	3	579	July 5, 2017
How to force excludes to include plural or possessives Elasticsearch	1	700	March 30, 2019
Whole word search with elasticsearch Elasticsearch	2	2088	July 5, 2017
Strange behavior: some words breaks search Elasticsearch	3	516	January 4, 2017

How to softly exclude ambiguous words?

Related topics