Alternative approaches to query - Part II

James_Cook · September 12, 2010, 8:40pm

Suppose I am indexing a collection of movie subtitle translations, where
each document is a translation with properties to show the language and the
movie it represents. Some subtitles exist where the locale might be "fr_CA"
or "fr_FR" to indicate the difference between French Canada and French
European. A search term of 'fr' needs to match both 'fr_CA' and 'fr_FR'.

My use case is I want to query for a top five list by locale. For example,
the five most recently added films with french subtitles.

A not-so-nice approach to the problem is performing a query to fetch a
larger number of subtitles than needed and aggregate the result set based on
the film id property. This isn't very good on several fronts.

curl -XPOST 'http://localhost:9200/movies/subtitles/_search' -d '
{
"from" : 0,
"size" : 10,
"query" : {
"term" : {
"locale": "fr"
}
}
}'

Creating a movie's index with a child collection of subtitles is another
approach, but assume the indexing is by subtitle document alone.

Is there some other techniques or projection capability to assist in these
types of aggregated queries?

Topic		Replies	Views
Best practices with localized indices Elasticsearch	3	4079	July 6, 2017
Best practices for indexing documents with alternate names in many languages Elasticsearch	1	305	July 6, 2017
Bets practice for indexing documents of various languages Elasticsearch	3	537	July 19, 2017
I18n searching Elasticsearch	2	996	May 26, 2017
How can i search data from two indexes in Elastic search Elastic Search elastic-app-search , elastic-site-search	7	663	March 15, 2024

Alternative approaches to query - Part II

Related topics