Short description: searching by the stem word works ("meet") but using
the word in the source document ("meeting") doesn't. I'm missing
something here that's probably fundamental, but it's not explained
anywhere.
I thought a query of "meeting" would be stemmed to "meet" and then it
would find a document that was analyzed the same way at index time.
elasticsearch.yml:
index:
analysis:
analyzer:
my_analyzer: # duplicate the snowball analyzer
type: custom
tokenizer: standard
filter: [standard, lowercase, stop, snowball]
Short description: searching by the stem word works ("meet") but using
the word in the source document ("meeting") doesn't. I'm missing
something here that's probably fundamental, but it's not explained
anywhere.
I thought a query of "meeting" would be stemmed to "meet" and then it
would find a document that was analyzed the same way at index time.
elasticsearch.yml:
index:
analysis:
analyzer:
my_analyzer: # duplicate the snowball analyzer
type: custom
tokenizer: standard
filter: [standard, lowercase, stop, snowball]
When searching "against a type", matching docs automatically gets
filtered only for that type, but, search_analyzer associated with that type
is not applied. It should be, btw, opened an issue: https://github.com/elasticsearch/elasticsearch/issues/1391. For now, you can
explicitly specify the analyzer, or, specify the analyzer as the default
analyzer for that index (Across types), by simply renaming my_analyzer to
default.
Short description: searching by the stem word works ("meet") but using
the word in the source document ("meeting") doesn't. I'm missing
something here that's probably fundamental, but it's not explained
anywhere.
I thought a query of "meeting" would be stemmed to "meet" and then it
would find a document that was analyzed the same way at index time.
elasticsearch.yml:
index:
analysis:
analyzer:
my_analyzer: # duplicate the snowball analyzer
type: custom
tokenizer: standard
filter: [standard, lowercase, stop, snowball]
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.