Filter order

eprst · July 31, 2015, 9:02am

i found this in documentation "More-specific filters should be placed before less-specific filters in order to exclude as many documents as possible, as early as possible." But in custom language analyzer example we have " "analyzer": {
"english": {
"tokenizer": "standard",
"filter": [
"english_possessive_stemmer",
"lowercase",
"english_stop",
"english_keywords",
"english_stemmer"
]
}
}", where the stemmer is placed before stopwords and keywords. what is the right way?

warkolm · July 31, 2015, 9:03am

A filter is totally different from an analyser though.

eprst · July 31, 2015, 9:05am

what is the order of filters in this analyzer?

eprst · July 31, 2015, 9:06am

Am i right that keywords filtering should be done before stemming?

dantuff · July 31, 2015, 10:10am

This is referring to query filters that you send in a search request, not index token filters.

The english_keywords filter is used if you want to specify a list of keywords that shouldn't be stemmed (it is empty by default), so the order is correct.

eprst · July 31, 2015, 10:53am

What about english_possessive_stemmer? it will be done before keywords. Is it normal?

eprst · July 31, 2015, 11:00am

BTW what is main difference between indexing filters and query filters, in a nutshell?

dantuff · July 31, 2015, 11:03am

Yes, it will remove the possession ('s) from any nouns, it is very unlikely that keywords would include possessive nouns.

eprst · July 31, 2015, 12:50pm

thx, and Should I set "term_vector":"yes" for the fields which use this analyzer?

Topic		Replies	Views
Filter order influences results Elasticsearch	1	306	March 24, 2020
When to apply Stemmer, Stop Word and Synonym Filters Elasticsearch	1	535	April 27, 2022
Exclude specific words from Custom Analyzer Elasticsearch	1	580	April 12, 2019
Elasticsearch Analyzer:Stemmer giving different results Elasticsearch	1	376	February 6, 2019
Constructing custom analyser for full-text queries Elasticsearch	3	336	September 12, 2019

Filter order

Related topics