There's also Google's "Compact Language Detector", which
seemed very nice to me, with easy wrapping from other languages (eg.
Detecting the language won't be the hard part here, I think. The hard
part is defining a proper mapping for those languages (multifields?,
different properties?, etc). I think multifields should work really
well here, but I don't know of any strategy that would allow
magically analyzing queries with German analyzers and searching German
fields, and vice versa for English/Japanese, etc., without the user
explicitly setting the language, or "trying out" the search against
all the multi fields and relying on the _score to sort them right...
This does not sound as "simple" as Otis' response suggests :), so I
may be missing something.
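
To make the multifield idea a bit more concrete, here is a rough sketch of what I mean, written as plain Python dicts that you would send as JSON. Everything specific in it is an assumption on my part: the field name "body", the sub-field names ("en", "de", "ja"), the use of the built-in "english", "german" and "cjk" analyzers (Japanese would probably want a dedicated analyzer instead), and the newer "fields" mapping syntax, which may not match the version you are running. The query is just the "try all the multi fields and let _score sort it out" approach, not a clever language-aware one.

```python
import json

# Assumed sub-field names and analyzers; adjust to the languages in the forum.
LANGUAGE_ANALYZERS = {
    "en": "english",
    "de": "german",
    "ja": "cjk",  # assumption: stand-in for a proper Japanese analyzer
}


def build_mapping(field="body"):
    """Index the same text once per language, each copy with its own analyzer."""
    sub_fields = {
        lang: {"type": "text", "analyzer": analyzer}
        for lang, analyzer in LANGUAGE_ANALYZERS.items()
    }
    return {"properties": {field: {"type": "text", "fields": sub_fields}}}


def build_query(text, field="body"):
    """Search every language sub-field; each one analyzes the query text
    with its own analyzer, and the best-scoring field decides the ranking."""
    targets = [f"{field}.{lang}" for lang in LANGUAGE_ANALYZERS]
    return {
        "query": {
            "multi_match": {
                "query": text,
                "fields": targets,
                "type": "best_fields",
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(build_mapping(), indent=2))
    print(json.dumps(build_query("Wörterbuch"), indent=2))
```

This avoids asking the user for a language, but it still has the weakness mentioned above: ranking depends entirely on how the scores from the different sub-fields compare.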
On Dec 17, 5:42 pm, James Cook jc...@pykl.com wrote:
I have a use case where several different languages can be used in a forum.
I don't have any indication which language is used by which users, and in
some cases, multiple languages might be used in the same posts.
My document includes the typical properties one might expect for a forum
post along with the body of the message.
What strategies might I employ so users can search for the German posts
using German keywords, Japanese posts using Japanese keywords, etc?