Document not matched

Lukas_Tilch · November 13, 2017, 12:57pm

Hello i have a Document indexed with a custom analyzer.

the string is "MKS-Integrity11.1_Software-FAQ.docx"

it gets tokenized into

{
  "tokens": [
    {
      "token": "mks",
      "start_offset": 0,
      "end_offset": 3,
      "type": "<ALPHANUM>",
      "position": 0
    },
    {
      "token": "integrity",
      "start_offset": 4,
      "end_offset": 13,
      "type": "<ALPHANUM>",
      "position": 1
    },
    {
      "token": "11",
      "start_offset": 13,
      "end_offset": 15,
      "type": "<ALPHANUM>",
      "position": 2
    },
    {
      "token": "1",
      "start_offset": 16,
      "end_offset": 17,
      "type": "<ALPHANUM>",
      "position": 3
    },
    {
      "token": "software",
      "start_offset": 18,
      "end_offset": 26,
      "type": "<ALPHANUM>",
      "position": 4
    },
    {
      "token": "faq",
      "start_offset": 27,
      "end_offset": 30,
      "type": "<ALPHANUM>",
      "position": 5
    },
    {
      "token": "docx",
      "start_offset": 31,
      "end_offset": 35,
      "type": "<ALPHANUM>",
      "position": 6
    }
  ]
}

for the Query i use a phrase_prefix with a slop of 50

If i query for anything that contains software i.e. "software faq" i get zero matches.
if i query for "MKS faq" i get the match, "mks software faq" no match.
I wonder why that happens, the term software itself is tokenized correctly.

thanks in advance
Lukas

system · December 11, 2017, 12:57pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Matching every documents tokens Elasticsearch	1	589	July 5, 2017
ElasticSearch Match Phrase Query Not returning expected results Elasticsearch	6	153	April 17, 2024
Multi_match with phrase_prefix is not working although a token has the prefix in it Elastic Search	6	65	August 22, 2024
Match_phrase not matching all terms Elasticsearch	6	3887	January 25, 2019
Obtaining matching tokens with _search Elasticsearch	4	878	April 30, 2019

Document not matched

Related topics