I didn't know about that feature. is it a new thing?
anyway, I think other considerations regarding how we build the query
before hitting the search server may prevent us for taking that road. but I
gotta take a deeper look into it.
as how we use stopwords, it's pretty standard:
"index_name": {
"index": {
...
...
"analysis": {
"analyzer": {
"default_index": {
"tokenizer": "standard",
"filter": ["standard", "lowercase", "my_stop", ...]
},
"default_search": {
"tokenizer": "standard",
"filter": ["standard", "lowercase", "my_stop", ...]
},
},
"filter": {
...
"my_stop":{
"type": "stop",
"stopwords": ["es"]
},
...
}
}
}
},
do you happen to know where the default stopwords are defined?
On Sunday, May 19, 2013 5:38:13 PM UTC-4, simonw wrote:
you might want to look at the cutoff_frequency
that was added to the
match_query (
Elasticsearch Platform — Find real-time answers at scale | Elastic) It
might enable you to not filter stopwords at all. Another option is to
create a stopword list per customer and add it to the mapping yourself, how
are you creating your stopword filters right now?
simon
On Sunday, May 19, 2013 10:55:06 PM UTC+2, JoeZ99 wrote:
Is there any way of "marking" a word not to be filtered out by the stop
filter, even if it's on the stop words??
I'm looking for something like "keywordmarker" filter, which "marks" a
keyword not to be stemmed afterwards.
Use case. Some customers have a special meaning from just one or two
words belonging to the standard stop words list of their language. Say, for
a spanish customer, he want the word "tenga" to be searchable even if it
belongs to the "es" expanded stopwords.
I know I could just override the whole stopwords set, and provide a
custom stopwords list with "tenga" not in it, and I just did that, but it
would be "cheaper" instead of reespecifying the whole set, to say
"exception"s: ["tenga"]
On a sidenote: where can I find the default list that elasticsearch uses
when specifying ["es"] as stopwords???
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.