Synonym ÅÄÖ exception


(Benjamin) #1

Hi!
I get an exception when my synonym words contain the Swedish characters Å, Ä, or Ö.

{
  "error" : {
    "root_cause" : [
      {
        "type" : "illegal_argument_exception",
        "reason" : "failed to build synonyms"
      }
    ],
    "type" : "illegal_argument_exception",
    "reason" : "failed to build synonyms",
    "caused_by" : {
      "type" : "malformed_input_exception",
      "reason" : "Input length = 1"
    }
  },
  "status" : 400
}

The problem occurs only when the synonyms are read from a file.
Defining them inline works fine:

{
	"settings": {
		"index": {
			"analysis": {
				"filter": {
					"synonym": {
						"type": "synonym",
						"synonyms": [
							"Äpple, App"
						]
					}
				}
			}
		}
	}
}

But it fails when the synonyms are read from a file:

{
    "settings": {
        "index" : {
            "analysis" : {
                "filter" : {
                    "synonym" : {
                        "type" : "synonym", 
                        "synonyms_path" : "synonym.txt",
                        "tokenizer" : "standard" 
                    }
                }
            }
        }
    }
}

I already read this:

I have already added an asciifolding token filter before the synonym filter in the filter chain, but it still does not work.

   "analysis": {
      "analyzer": {
        "custom_swedish_analyzer": {
          "type": "custom",
          "filter": [
            "asciifolding",
            "lowercase",
            "synonym",
            "swedish_stemmer_filter",
            "swedish_stopwords_filter"
          ],
          "tokenizer": "standard"
        }
      }
   }

(Benjamin) #2

The solution was to save the synonym file with UTF-8 encoding.
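
For anyone hitting the same error: the "malformed_input_exception" most likely means the synonym file contained bytes that are not valid UTF-8, which happens when Å, Ä, or Ö are saved in a single-byte encoding. Below is a minimal Python sketch for checking and re-encoding the file. It assumes the file was originally saved as ISO-8859-1 (Latin-1); that is a guess, so adjust source_encoding to whatever your editor actually used. The helper names is_valid_utf8 and convert_to_utf8 are made up for this example and are not part of Elasticsearch.

```python
from pathlib import Path


def is_valid_utf8(path):
    """Return True if the file's bytes decode cleanly as UTF-8."""
    try:
        Path(path).read_bytes().decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def convert_to_utf8(path, source_encoding="iso-8859-1"):
    """Rewrite the file in place as UTF-8.

    Assumption: the file is currently stored in source_encoding.
    If that guess is wrong, the characters will come out garbled,
    so keep a backup of the original file.
    """
    text = Path(path).read_bytes().decode(source_encoding)
    Path(path).write_bytes(text.encode("utf-8"))
```

Usage would be something like calling convert_to_utf8("synonym.txt") only when is_valid_utf8("synonym.txt") returns False, then recreating the index so the filter reads the file again.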

