Automatic summarization of Elasticsearch text content


(Narmatha) #1

I need help summarizing media content in Elasticsearch and storing the summary in a separate field during data ingestion.

Does Elasticsearch have a built-in option for condensing large content into readable text and storing the result?

If Elasticsearch does not offer this, what alternatives are available?


(Mark Harwood) #2

That's a big field.

Solutions will depend on several things, including whether the content carries any markup (e.g. <h1>) that might indicate the structural importance of various sections of the text. Otherwise, you might need to apply more statistical methods to the words used in the text. One possibility is to query with "MoreLikeThisQuery" to match a document against itself and then use a Highlighter to highlight the matched content, which would provide a form of summary.
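
To make the two ideas above more concrete, here is a rough Python sketch, not a definitive implementation. `summarize` is a hypothetical helper that scores sentences by plain word frequency, one of the simplest statistical approaches, and could be run on the client before indexing. `self_mlt_request` is also hypothetical: it only builds a search body for the `more_like_this` query plus a highlight section, on the assumption that the highlighter returns useful fragments for such a query on your mapping. The stopword list, the index name and the field names are placeholders chosen for illustration.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; real content needs a fuller one).
STOPWORDS = {"the", "a", "an", "and", "or", "of", "to", "in", "is", "it", "that",
             "this", "for", "on", "with", "as", "are", "was", "be", "by"}


def summarize(text, max_sentences=2):
    """Return the highest-scoring sentences, kept in their original order."""
    # Naive sentence split on ., ! or ? followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    tokenized = [
        [w for w in re.findall(r"[a-z0-9']+", s.lower()) if w not in STOPWORDS]
        for s in sentences
    ]
    # Document-wide frequency of every non-stopword.
    freq = Counter(w for words in tokenized for w in words)
    # Sentence score = mean frequency of its words (0 for empty sentences).
    scores = [sum(freq[w] for w in words) / len(words) if words else 0.0
              for words in tokenized]
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return " ".join(sentences[i] for i in sorted(top[:max_sentences]))


def self_mlt_request(index, doc_id, field):
    """Build a search body matching a document against itself, with highlighting."""
    return {
        "query": {
            "more_like_this": {
                "fields": [field],
                "like": [{"_index": index, "_id": doc_id}],
                "include": True,      # keep the source document in the results
                "min_term_freq": 1,   # lowered so short documents still yield terms
                "min_doc_freq": 1,
            }
        },
        "highlight": {"fields": {field: {}}},
    }


if __name__ == "__main__":
    text = ("Elasticsearch stores documents. "
            "Elasticsearch documents can be searched quickly. I like tea.")
    # "content" and "summary" are placeholder field names.
    doc = {"content": text, "summary": summarize(text, max_sentences=1)}
    print(doc["summary"])
    print(self_mlt_request("media", "1", "content"))
```

With the first approach, the summary is computed before the document is sent to Elasticsearch and stored in its own field. With the second, nothing extra is stored: the highlighted fragments returned at query time act as the summary.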

