Wildcard search with _ (underscore) is giving no result

RohanKumbhar · January 4, 2018, 5:28am

Hi Team,

I'm facing an issue with wildcard search when the query contains an underscore (_), on Elasticsearch version 6.0.1.
I'm using the word_delimiter filter for the field my_field that is searched in the query below.

I have a document with my_field = document_02.txt, and I want to find it by searching for _02* as in the query below, but this returns zero results.

GET my_index/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "query_string": {
            "query": "my_field:_02*"
          }
        }
      ]
    }
  }
}

The query below returns the expected result when no wildcard is used:

GET my_index/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "query_string": {
            "query": "my_field:_02"
          }
        }
      ]
    }
  }
}

Kindly help!

val · January 4, 2018, 5:40am

Can you show the mapping of your field and the definition of the analyzer you're using for that field?

RohanKumbhar · January 4, 2018, 5:55am

Here are the steps to replicate the issue

PUT my_index
{
    "settings": {
      "index": {
        "analysis": {
          "analyzer": {
            "custom_analyzer": {
              "filter": [
                "word_delimiter",
                "lowercase"
              ],
              "type": "custom",
              "tokenizer": "standard"
            }
          }
        }
      }
    },
    "mappings": {
      "doc": {
        "properties": {
          "my_field": {
            "type": "text",
            "analyzer": "custom_analyzer",
            "fields": {
              "keyword": {
                "ignore_above": 256,
                "type": "keyword"
              }
            }
          }
        }
      }
    }
}

PUT my_index/doc/1 
{
  "my_field":"document_02.txt"
}

GET my_index/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "query_string": {
            "default_field": "my_field",
            "query": "_02*"
          }
        }
      ]
    }
  }
}

val · January 4, 2018, 6:01am

I'm not sure why you're using a word_delimiter token filter. When analyzing document_02.txt, it's going to produce the following tokens (obtained from the _analyze endpoint):

{
  "tokens": [
    {
      "token": "document",
      "start_offset": 0,
      "end_offset": 8,
      "type": "<ALPHANUM>",
      "position": 0
    },
    {
      "token": "02",
      "start_offset": 9,
      "end_offset": 11,
      "type": "<ALPHANUM>",
      "position": 1
    },
    {
      "token": "txt",
      "start_offset": 12,
      "end_offset": 15,
      "type": "<ALPHANUM>",
      "position": 2
    }
  ]
}
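
For anyone reproducing this, the token list above can be obtained with a request along the following lines. It is a sketch that assumes the index name my_index and the analyzer name custom_analyzer from the reproduction steps earlier in the thread.

```console
# Assumes the index and analyzer names from the reproduction steps above.
GET my_index/_analyze
{
  "analyzer": "custom_analyzer",
  "text": "document_02.txt"
}
```

Changing "text" to the string you intend to search for (for example _02) shows which tokens that string produces under the same analyzer.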

If you want to search inside words, what you need is to leverage the ngram token filter.

RohanKumbhar · January 4, 2018, 6:35am

I'm using the word_delimiter filter because I want tokens to be created only from the words and numbers in the value, not from any special characters.

Is there a fix for this issue that doesn't require changing the filter to ngram, or any other workaround?

Kindly help.

val · January 4, 2018, 6:37am

OK, I understand. Because document_02.txt is tokenized and indexed as the three tokens document, 02 and txt, and the underscore is discarded during the analysis process, you can search for 02* instead of _02*.
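
Applied to the reproduction steps above, that suggestion would look roughly like the request below. It is a sketch rather than a tested fix, and it reuses the index and field names from the earlier post.

```console
# Same search as before, but the wildcard term starts at the indexed token "02".
GET my_index/_search
{
  "query": {
    "query_string": {
      "default_field": "my_field",
      "query": "02*"
    }
  }
}
```

This also accounts for the difference between the two original queries: by default query_string does not analyze wildcard terms (analyze_wildcard defaults to false), so _02* is compared against the indexed tokens with its underscore intact, whereas the plain term _02 is run through the field's analyzer, which discards the underscore before matching.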

RohanKumbhar · January 4, 2018, 6:41am

Sure, thanks @val!

system · February 1, 2018, 6:42am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
