How to use other language stop filter in _analyze?

Hi,

States that it has a
[Stop Token Filter] (disabled by default)
So to test the Standard Analyzer we type:

POST _analyze
{
  "analyzer": "standard",
  "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone."
}

Now I want to add a german stopword filter.

States:

GET /_analyze
{
  "tokenizer": "standard",
  "filter": [ "stop" ],
  "text": "a quick fox jumps over the lazy dog"
}

Sadly the site does not explain how to activate a stopfilter for another language.

How do you do so?

Here is what I have tried:

POST /_analyze
{
  "tokenizer": "standard",
  "filter": [ "german_stop" ],
  "text": "Elasticsearch ist leider sehr schlecht dokumentiert, weshalb ich die Lösung nicht finde."
}

Thanks - Enomine

And why is the first line sometimes

"tokenizer": "standard",

and in other cases

"analyzer": "standard",

?

Is that no difference?

Thanks - Enomine

Hi Michael,
Regarding your second question, you should read Anatomy of an analyzer | Elasticsearch Guide [8.5] | Elastic before.
Regarding testing the stop token filter, it is stated clearly in the configurable parameters, in Stop token filter | Elasticsearch Guide [8.5] | Elastic
So you can use something like

POST /_analyze
{
  "tokenizer": "standard",
  "filter": [ {
          "type": "stop",
          "ignore_case": true,
          "stopwords" : "_german_"
        } ],
  "text":  "Elasticsearch ist ziemlich gut dokumentiert, aber die Konzepte sind manchmal etwas kompliziert. Zum Glück gibt es die Community"
}

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.