Extract stored field text in an Elasticsearch document using start and end offsets

Hi all, I am looking for some pointers. Given the start and end offsets of a piece of text within a document field (e.g. as returned by the termvectors API), I would like to extract the stored text between those offsets. The text was extracted with the ingest attachment plugin, so it is already present in the index.

My idea is to first fetch the document from Elasticsearch and store it in a variable in a programming language. Then I can read the text from the field and pick out the part I am interested in using the start and end offsets, doing all of that outside Elasticsearch. Is there a way to do this effectively within Elasticsearch itself?
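For reference, the client-side approach described above could look roughly like the Python sketch below. It is only an illustration under some assumptions: the document is assumed to have already been fetched (for example with a GET on the document) and parsed into a dict, the extracted text is assumed to live under `attachment.content` (the attachment processor's default target field), and the offsets are assumed to count UTF-16 code units, as Java strings do. The helper names `get_field` and `slice_by_offsets` are made up for this example.

```python
from typing import Any


def get_field(doc: dict, dotted_path: str) -> Any:
    """Walk a dotted path such as 'attachment.content' inside doc['_source'].

    Hypothetical helper: assumes `doc` is the parsed JSON of a fetched document.
    """
    value: Any = doc["_source"]
    for key in dotted_path.split("."):
        value = value[key]
    return value


def slice_by_offsets(text: str, start: int, end: int) -> str:
    """Return the text between start (inclusive) and end (exclusive).

    Assumption: offsets count UTF-16 code units. Python indexes strings by
    code point, so the slice is taken on the UTF-16 encoding (2 bytes per
    code unit) to stay aligned when the text contains characters outside
    the Basic Multilingual Plane.
    """
    encoded = text.encode("utf-16-le")
    if not 0 <= start <= end or 2 * end > len(encoded):
        raise ValueError(f"offsets out of range: start={start}, end={end}")
    return encoded[2 * start:2 * end].decode("utf-16-le")


# Stand-in for a fetched document; a real one would come from Elasticsearch.
sample_doc = {"_source": {"attachment": {"content": "The quick brown fox"}}}

content = get_field(sample_doc, "attachment.content")
snippet = slice_by_offsets(content, 4, 9)
print(snippet)
```

If the text only contains characters from the Basic Multilingual Plane, a plain `content[start:end]` gives the same result. The UTF-16 detour only matters for characters such as emoji, where the two ways of counting diverge.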
