We are trying to support search in multiple languages, but we do not index each document into every language field (which is why we are not using multi-fields).
Instead, we run language detection to decide which language fields a document should be indexed into (there can be more than one).
The problem is that when we send the text in several fields, the HTTP payload grows very quickly, because the full text is duplicated once per language field.
What would be a better way to handle this, especially to avoid the explosion in HTTP request size?
Is it possible to achieve this with a Painless script?
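To make the idea concrete, here is a rough sketch of what I have in mind, not something we have running. It assumes an ingest pipeline with a `script` processor; the field names `text`, `langs`, and `text_<lang>` are hypothetical placeholders for our real fields. The client would send the text once plus the list of detected languages, and the Painless script would copy the text into the matching language fields on the Elasticsearch side. The Python function only mirrors the script locally to show the intended effect and the difference in payload size.

```python
import json

# Hypothetical field names: "text" holds the content once, "langs" lists the
# detected languages, and "text_<lang>" are the per-language analyzed fields.
PAINLESS_SOURCE = """
for (def lang : ctx.langs) {
  ctx['text_' + lang] = ctx.text;
}
ctx.remove('langs');
"""

# Body that would be sent to PUT _ingest/pipeline/<name> (name is up to you).
pipeline_body = {
    "description": "Copy text into the detected language fields (sketch)",
    "processors": [{"script": {"lang": "painless", "source": PAINLESS_SOURCE}}],
}


def expand_locally(doc):
    """Python mirror of the Painless script, only to illustrate its effect."""
    out = {key: value for key, value in doc.items() if key != "langs"}
    for lang in doc["langs"]:
        out["text_" + lang] = doc["text"]
    return out


text = "some long document body " * 50

# What the client would send with the pipeline: the text appears once.
compact = {"text": text, "langs": ["en", "de", "fr"]}

# What we send today: one full copy of the text per language field.
duplicated = {"text_" + lang: text for lang in compact["langs"]}

compact_size = len(json.dumps(compact))
duplicated_size = len(json.dumps(duplicated))
print(compact_size, duplicated_size)
```

Documents would then be indexed with `?pipeline=<name>` on the index request. Note that this would only shrink the request; the stored `_source` would still contain one copy per language field after the pipeline runs.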