I made a custom analyzer based on Persian analyzer to use my custom stopwords. The problem is ES doesn't remove Persian words from the text.
The analyzer:
persian-without-stopwords-analyzer: {
type: "persian",
stopwords: [
"something",
"دبیرستان",
"another"
]
}
And I tested my analyzer using the following code:
GET driq/_analyze
{
"analyzer": "persian-without-stopwords-analyzer",
"text" : "something دبیرستان another"
}
The result is:
{
"tokens": [
{
"token": "دبيرستان",
"start_offset": 10,
"end_offset": 18,
"type": "",
"position": 1
}
]
}
Why it removes 'something' and 'another' but not 'دبیرستان'?