How should I deal with 10TB data with diferent languages!


(luxile) #1

We have almost 10TB data of E-draft, E-draft have different languages, about 10 languages(such as zh-CN. en. Arabic Language etc.).

  • How should we index and query such amounts of data with different languages.

  • Should we create multiple index?

  • how to set up shards?

These Examples are parial of our data :

1. zh_CN:

2. en

3. ar

Thanks for answers.

12/04/2016


(David Pilato) #2

I'd start here: https://www.elastic.co/guide/en/elasticsearch/guide/current/languages.html


(luxile) #4

Thanks,I'll read it carefully


(system) #5

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.