Multiple tokenizers inside one Custom Analyser in Elasticsearch

pranav24 · September 28, 2018, 6:37am

I am using a custom ngram analyzer that has an ngram tokenizer, and I have also added a lowercase filter. Queries work fine for searches without special characters, but searches for certain symbols fail. With my current tokenizer and lowercase filter, Elasticsearch does not seem to analyze symbols. I know the whitespace tokenizer could help me solve the issue. How can I use two tokenizers in a single analyzer? Below is the mapping:

    {
      "settings": {
        "analysis": {
          "analyzer": {
            "my_analyzer": {
              "tokenizer": "my_tokenizer",
              "filter": "lowercase"
            }
          },
          "tokenizer": {
            "my_tokenizer": {
              "type": "ngram",
              "min_gram": 3,
              "max_gram": 3,
              "token_chars": [
                "letter",
                "digit"
              ]
            }
          }
        }
      },
      "mappings": {
        "_doc": {
          "properties": {
            "title": {
              "type": "text",
              "analyzer": "my_analyzer"
            }
          }
        }
      }
    }

Is there a way I could solve this issue?
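
A possible direction, offered as a sketch rather than a confirmed fix: a custom analyzer accepts exactly one tokenizer, so two tokenizers cannot be chained in `my_analyzer`. Assuming the symbols are being dropped because `token_chars` lists only `letter` and `digit`, the same ngram tokenizer could be widened to also keep the `punctuation` and `symbol` character classes. The settings below reuse the names from the mapping above; only the `token_chars` list changes, and `filter` is written as an array.

```json
{
  "settings": {
    "analysis": {
      "analyzer": {
        "my_analyzer": {
          "tokenizer": "my_tokenizer",
          "filter": ["lowercase"]
        }
      },
      "tokenizer": {
        "my_tokenizer": {
          "type": "ngram",
          "min_gram": 3,
          "max_gram": 3,
          "token_chars": ["letter", "digit", "punctuation", "symbol"]
        }
      }
    }
  }
}
```

Because `whitespace` is still left out of `token_chars`, the tokenizer keeps splitting on spaces, which is roughly what the whitespace tokenizer would have contributed. The resulting tokens can be checked with the `_analyze` API before relying on this, and documents indexed under the old analyzer would likely need to be reindexed.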

