How to get the result of text analysis for each word

Leonid · August 26, 2014, 9:46am

Hi All,

I have a number of analyzers, one per language. The English analyzer has a simple tokenizer, a stop filter and a stemmer.
When I store a document, I need to know the result of the analyzer: the stem of each word, or "nothing" if the word is a stopword.
The German analyzer has a simple tokenizer, a stop filter, a decompounder and a stemmer. For German, I need the stem of each word, "nothing" if the word is a stopword, and the stems for each compound word.
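
To make the desired output concrete, here is a minimal sketch of the per-word result I am after for the English case. It does not use the real analyzer: the stop list, the toy suffix-stripping "stemmer" and the class name PerWordAnalysisSketch are all hypothetical, made up only to show the shape of the result (each original word paired with its stem, or with "nothing" for a stopword).

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

public class PerWordAnalysisSketch {
    // Hypothetical stop list, for illustration only (not the real stop filter).
    private static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("the", "a", "is", "are", "of"));

    // Toy stemmer: strips a couple of suffixes. This is an assumption standing in
    // for the real stemmer filter and is far cruder than it.
    static String toyStem(String word) {
        if (word.length() > 5 && word.endsWith("ing")) {
            return word.substring(0, word.length() - 3);
        }
        if (word.length() > 3 && word.endsWith("s")) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    // Returns one entry per input word, in order: word -> stem, or word -> "nothing".
    static List<Map.Entry<String, String>> analyze(String text) {
        List<Map.Entry<String, String>> result = new ArrayList<>();
        // Lowercase and split on non-letters, roughly like a simple tokenizer.
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (token.isEmpty()) {
                continue; // split() yields an empty first token if the text starts with a non-letter
            }
            String value = STOP_WORDS.contains(token) ? "nothing" : toyStem(token);
            result.add(new AbstractMap.SimpleEntry<>(token, value));
        }
        return result;
    }

    public static void main(String[] args) {
        for (Map.Entry<String, String> e : analyze("The dogs are barking")) {
            System.out.println(e.getKey() + " -> " + e.getValue());
        }
    }
}
```

For "The dogs are barking" this prints "the -> nothing", "dogs -> dog", "are -> nothing", "barking -> bark". What I need is the same kind of mapping, but produced by the analyzers actually configured on the index.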

Unfortunately, when I store a document with

IndexRequestBuilder request = client.prepareIndex(indexName, typeName).setId(docId).setSource(paramsMap);

it seems like a black box to me: I don't have access to the result of the analyzer.

Regards,