in your case I'd likely use a shingle filter that builds token n-grams for
you. ie. the document "my super media player" would be filtered into "my",
"mysuper", "super", "supermedia", "media", "mediaplayer"
here is a
set max_shingle_size = 2 & output_unigrams = true (that is actually the
this would match for "mediaplayer" as well as "media player" & your dl380
problem woudl be solved as well.
This might create a ton more tokens but should work just fine!
On Tuesday, July 24, 2012 9:42:52 AM UTC+2, maik wrote:
I'm wondering if its possible for elasticsearch to solve the following
- Productname contains "Media Player", Customer searches for
"mediaplayer" (0 hits) or "media player" (lots of hits)
- Productname contains "dl380", Customer searches for "dl 380" (0 hits)
or "dl380" (lots of hits)
As today the name is analyzed with the standard analyzer and its queried
by a ftl query.
How to analyze the string or query the index to get similar results for
both searches, with blank and without it?
I know synonyms, but I hope there is a better, more general solution.
Thanks in advance