My index contains 300 millions documents. Every document has the same structure. Each document contains 40 to 50 fields, such as first_name, last_name, nick_name, weight, height, preferred_language, preferred_currency, etc. Averagely, 25 Million documents will be updated every day, some days it might go high. For a specific usecase i have to add one value to existing list in Elasticsearch index. Apart from this reindexing of entire document will happen in parallelly for few 100k every day.
Should i prefer update by query api or reindex entire document by index api?
If it's only 100k on 20 million docs, I'd use update by query.
@dadoonet Can I update 100k on each update by query in synchronously upto 20 million docs every day ?
If I understand correctly your question, yes.
1 Like