Update by query performance question

Hi, I'm quite new to elastic search. My problem is the following:

There are two indexes:

  • index1 - around 18m docs (~40gb of data)
  • index2 - around 22k docs (10mb of data)

I need to update all documents from index1 with values from index2
(they are related by id's and one document from index2 belongs to multiple documents in index1)

I have tried to search scroll each document from index2 and use updateByQuery to match all documents from index1. It takes ages to complete.

What is a better solution for this?

The elasticsearch is running on a i9 / 6 core / 32gb ram machine

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.