Can i do a full-text search where each word has location metadata?

sinofis · February 12, 2018, 12:04am

I'm trying to build a text search from voice to text, each word should have a metadata position something like start: 54s; end: 57s.

Metadata shouldn't be indexed but only returned with results. Is that possible? I'd appreciate some advice on how to do it.

warkolm · February 12, 2018, 12:17am

You'd need to figure this out before sending to Elasticsearch, there's nothing native that could calculate this.

sinofis · February 12, 2018, 12:41am

Thank you for the replay, currently I can get text and position from audio.

My current idea is to make a regular text search then use a custom highlighter to get the text position. With the position query an SQL database for a location in audio.

I'm wondering if I could do the whole thing natively with Elastic

warkolm · February 12, 2018, 3:20am

Elasticsearch cannot do the sort of merge/join with the words and the timings, you may need to do that in your client.

But you could put both sets of data into Elasticsearch then you've got a quick data store to leverage at least

system · March 12, 2018, 3:21am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Specify metadata per word/term in a string Elasticsearch	4	512	July 6, 2017
Find word position in large text Elasticsearch	3	943	August 13, 2019
Document level metadata in elastic Elasticsearch	1	373	January 8, 2020
Storing custom position information in fields Elasticsearch	1	438	March 31, 2017
Search based on FullText Meta Information Elasticsearch	5	686	November 20, 2017

Can i do a full-text search where each word has location metadata?

Related topics