Hi James,
Sounds like you should simply identify the language(s) of documents
before indexing them and then analyze them appropriately.
There is a language identifier that's included in Solr 3.5.0 you could
rip out.
Otis
On Dec 17, 11:42 am, James Cook jc...@pykl.com wrote:
I have a use case where several different languages can be used in a forum.
I don't have any indication which language is used by which users, and in
some cases, multiple languages might be used in the same posts.My document includes the typical properties one might expect for a forum
post along with the body of the message.What strategies might I employ so users can search for the german posts
using german keywords, japanese posts using japanese keywords, etc?