How should I deal with 10 TB of data in different languages?

We have almost 10 TB of E-draft data. The E-drafts are written in about 10 different languages (such as zh-CN, en, Arabic, etc.).

  • How should we index and query this amount of data in different languages?

  • Should we create multiple indices?

  • How should we set up shards?

These examples are part of our data:

1. zh_CN:

2. en

3. ar

Thanks for any answers.


I'd start here:
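
Separately, for the shard question, here is a rough back-of-the-envelope sketch in Python. It assumes one index per language and a target primary shard size of about 30 GB; that figure is my assumption, picked from the commonly cited 10-50 GB guideline. The index prefix `edraft`, the helper names, and the per-language sizes are all hypothetical, so measure your real data before relying on any of these numbers.

```python
import math

# Assumption: target primary shard size in GB. 30 is an arbitrary midpoint
# of the commonly cited 10-50 GB range, not a fixed rule.
TARGET_SHARD_GB = 30


def primary_shards(index_size_gb: float, target_gb: float = TARGET_SHARD_GB) -> int:
    """Estimate a primary shard count so each shard holds at most target_gb."""
    return max(1, math.ceil(index_size_gb / target_gb))


def index_name(prefix: str, lang: str) -> str:
    """Build a hypothetical per-language index name (index names must be lowercase)."""
    return f"{prefix}-{lang.lower().replace('-', '_')}"


# Hypothetical split of roughly 10 TB across three of the languages.
# These sizes are made up for illustration only.
sizes_gb = {"zh-CN": 4000, "en": 5000, "ar": 1000}

plan = {index_name("edraft", lang): primary_shards(size)
        for lang, size in sizes_gb.items()}

for name, shards in plan.items():
    print(f"{name}: {shards} primary shards")
```

One index per language lets each index use an analyzer suited to that language, and a query can still target several indices at once. If a single language ends up needing well over a hundred shards, splitting it further (for example by time period) may be worth considering.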

Thanks, I'll read it carefully.
