Highlighting with fvh and fuzziness taking long time in Elasticsearch

Nikesh · October 15, 2018, 8:16am

Hi all,
I have indexed a single document with more than 150 metadata, each with mapping:

           "ACTIVE": {
                                "type": "text",
                                "term_vector": "with_positions_offsets",
                                "fields": {
                                    "autocomplete_analyzed": {
                                        "type": "text",
                                        "analyzer": "autocomplete"
                                    },
                                    "keyword": {
                                        "type": "keyword",
                                        "ignore_above": 256
                                    }
                                }
                            }

and with setting,:

    "analysis": {
                    "analyzer": {
                        "autocomplete": {
                            "filter": [
                                "lowercase"
                            ],
                            "tokenizer": "autocomplete"
                        }
                    },
                    "tokenizer": {
                        "autocomplete": {
                            "min_gram": "3",
                            "tokenize_on_chars": [
                                "whitespace",
                                "letter",
                                "digit"
                            ],
                            "type": "edge_ngram",
                            "max_gram": "7"
                        }
                    }
                }

I have used terms _vector to be able to use fast vector highlighting in my query.
My query:

{
  "from": 0,
  "size": 24,
  "query": {
    "bool": {
      
      "should": [
        {
          "multi_match": {
            "query": "current",
            "type": "best_fields",
            "fields": []
          }
        },
        {
          "query_string": {
            "query": "*current*",
            "fields": []
          }
        },
        {
          "multi_match": {
            "query": "current",
            "fuzziness": "1",
            "fields": []
          }
        }
      ],
      "minimum_should_match": 1
    }
  },
  "highlight": {
    "type": "fvh",
    "fields": {
      "*": {}
    }
  }
}

My query demands fuzziness, wildcard and phrase matching.
Fuzziness and wildcard is disabled or enabled depending on my requirement on back end. But on Free text search, I have to enable both of it including highlighting.

With Highlighting, my query takes more than 15000 ms but without highlighting it takes 800ms
Without fuzziness and with highlighting it takes around 1200ms and without fuzziness and highlighting it takes around 500ms.

The slowness in the query is due to fuzziness and highlighting working together.
How Highlighting with terms vector index the document? why is the query running so slowly? Is it due to my query or is it due to indexing data? Because, I will be working on millions of documents.
What's the best way to go about this time problem?

Nikesh · October 16, 2018, 12:25pm

@Mark_Harwood Hi, This is the query I was referring to in the other post

Nikesh · October 18, 2018, 6:49am

@elastic

Topic		Replies	Views
Huge performance hit when using fuzzy and highlights Elasticsearch	4	1118	May 27, 2012
Fast vector highlighter (fvh) making searches slower Elasticsearch	6	1362	December 1, 2021
Highlighting performance issues with stored field and fvh highlighter Elasticsearch	2	518	February 14, 2024
Elasticsearch Highlighting is very slow Elasticsearch	0	1028	December 13, 2018
Have we a way to use highlight and fuzzy together? Elasticsearch	3	1792	July 7, 2014

Highlighting with fvh and fuzziness taking long time in Elasticsearch

Related topics