Match phrase queries to highlighted values

Alexandra · May 26, 2014, 8:35am

Hi,

My use case is the following : I have a text segment and a list o terms and I want to find only the ones that match exactly the text.
For e.g. if my segment text is : "This is a simple text." And my terms are : "texts", "this", "text", I will find and highlight only the terms "this" and "text".

I'm building the query with the Java Api like this ( the segment is indexed ):

BoolQueryBuilder query = QueryBuilders.boolQuery();
for(TermDocument termCandidate : termCandidates) {
query.should(QueryBuilders.matchPhraseQuery(ElasticsearchDocumentField.TEXT_CONTENT.getName(), termCandidate.getTermText()).slop(0).queryName(termCandidate.getId()).analyzer(EN_ANALYZER));
}

If I also highlight the terms ( because in the end I need the offsets ), the will all be highlighted and I don't know which one is which. (e.g. This is a simple text.)

So now, my questions :
1.Is there a way to highlight the terms from the query separately ? And to associate some id to each of them in order to be able to match them back ?
2. Is there a way to receive the token numbers for an indexed text without using the analyze api ? (this is unrelated to the first question).

Topic		Replies	Views
Is it possible to highlight only match_phrase query? Elasticsearch	2	581	July 5, 2017
Highlighting issue with proximity phrase match Elasticsearch	1	577	July 6, 2017
Highlight exact phrases/keywords Elasticsearch	1	503	March 1, 2022
Text Phrase Query Elasticsearch	9	423	July 6, 2017
Strategy for matching unstructured text to phrases in index Elasticsearch	3	200	August 1, 2023

Match phrase queries to highlighted values

Related topics