Unfortunately, when I set up this index, I didn't realize that I wanted to
turn off tokenization ("index": "not_analyzed"). Now when I try and do
terms aggregation, I get back the tokenized values:
foo
bar (2)
one
Is there any way to do an aggregation on the full value of the field,
ignoring tokenization?
As far as I know, you can not do this, because ES do not store field value by default(It uses ‘_source’). If ES read the value from ‘_source’, it will cost a long time.
Unfortunately, when I set up this index, I didn't realize that I wanted to turn off tokenization ("index": "not_analyzed"). Now when I try and do terms aggregation, I get back the tokenized values:
foo
bar (2)
one
Is there any way to do an aggregation on the full value of the field, ignoring tokenization?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.