have you looked at the token_count datatype? It looks like it might be doing what you are trying to do. In order to retrieve the values calculated for the count field you might need to set it to "store" : true. If you follow the example in the reference, in order to retrieve the values you can use
Hi Christoph,
thank you for your respond. Yeah, I tried the token_count and it works. but it counts the original words count not the one after filtering out the stopwords.
Any idea how to get the count of words after removing stopwords?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.