Elasticsearch upgrade and migration from 2.3.4 to 6.2

angshuman.ray · September 20, 2018, 1:45pm

Hi,
We are working on upgrading our Elasticsearch 2.3.4 environment to 6.2. Our system has a 6-node Elasticsearch 2.3.4 cluster full of logs in monthly indices, with a high rate of incoming logs.
Since direct migration from 2.3.4 to 6.2 is not possible, we are using a single Elasticsearch 5.6 node as an intermediate step.
In our tests we take snapshots of the monthly indices from Elasticsearch 2.3.4 one by one, restore them on the 5.6 node, and reindex them there. We then take snapshots on Elasticsearch 5.6 and restore them on Elasticsearch 6.2. This upgrade procedure works but takes a very long time; the number of documents in the indices is around 100 million. We observe that most of the time is spent in the reindexing step on the Elasticsearch 5.6 node.
We need help with two points:
(1) Can anyone please help us optimize the reindexing time on Elasticsearch 5.6?

(2) We are also observing that the migrated indices are larger than the original ones. Statistics for a few small indices are given below. Is this normal, or can it be improved?

Regards,
Angshuman

n0othing · September 28, 2018, 2:01pm

Hi Angshuman,

(1) Can anyone please help us optimize the reindexing time on Elasticsearch 5.6?

It'd be more efficient to reindex directly from your 2.3.4 cluster to your new 6.2.x cluster. You can use the reindex from remote feature to accomplish this.

Since you're creating entirely new indices on 6.x, there's no concern about Lucene level compatibility.
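As a rough sketch of what a reindex-from-remote request could look like, the snippet below only builds and prints the JSON body for `POST _reindex` on the 6.2.x cluster; it does not contact any cluster. The host `http://old-es-node:9200`, the index name `logs-2018.08`, and the helper name are placeholders assumed for illustration, not values from this thread. The old cluster's address also has to be listed under `reindex.remote.whitelist` in `elasticsearch.yml` on the destination nodes.

```python
import json


def build_remote_reindex_body(remote_host, source_index, dest_index, batch_size=1000):
    """Build the body for POST _reindex that pulls documents from a remote cluster.

    All argument values are supplied by the caller; nothing here is specific
    to the cluster discussed in this thread.
    """
    return {
        "source": {
            # The old (2.3.4) cluster that the new cluster reads from.
            "remote": {"host": remote_host},
            "index": source_index,
            # Number of documents fetched per scroll batch.
            "size": batch_size,
        },
        "dest": {"index": dest_index},
    }


if __name__ == "__main__":
    # Hypothetical host and index names, for illustration only.
    body = build_remote_reindex_body(
        "http://old-es-node:9200", "logs-2018.08", "logs-2018.08"
    )
    print("POST _reindex")
    print(json.dumps(body, indent=2))
```

The printed body would be sent to the new cluster once per monthly index. A suitable batch size depends on document size and available heap, so the 1000 here is only a starting point.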

(2) We are also observing that the migrated indices are larger than the original ones. Statistics for a few small indices are given below. Is this normal, or can it be improved?

In 5.0, we introduced a mapping change where we create a .keyword multi-field for text fields. If you're using dynamic mapping, this could certainly account for some of the additional footprint. You'll want to use an index template to pre-define your mappings before reindexing to avoid creating fields you don't want/need.
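One possible shape for such a template on 6.x is sketched below: a dynamic template that maps every new string field to plain `text`, so the default `.keyword` sub-field is not created. The template name `logs_no_keyword`, the pattern `logs-*`, the mapping type name `doc`, and the helper name are assumptions made for illustration. Whether `text` or `keyword` is the right target depends on how the fields are queried. The snippet only builds and prints the body for `PUT _template/<name>`.

```python
import json


def build_log_template(index_pattern, doc_type="doc"):
    """Build an index template body that maps dynamic string fields to text only.

    Without a rule like this, dynamic mapping in 5.x/6.x creates a text field
    plus a .keyword sub-field for each new string field.
    """
    return {
        # Indices whose names match this pattern pick up the template.
        "index_patterns": [index_pattern],
        "mappings": {
            # 6.x mappings are still keyed by a mapping type name.
            doc_type: {
                "dynamic_templates": [
                    {
                        "strings_as_text_only": {
                            "match_mapping_type": "string",
                            "mapping": {"type": "text"},
                        }
                    }
                ]
            }
        },
    }


if __name__ == "__main__":
    # Hypothetical template name and index pattern, for illustration only.
    print("PUT _template/logs_no_keyword")
    print(json.dumps(build_log_template("logs-*"), indent=2))
```

The template has to exist on the new cluster before the destination indices are created, since templates are only applied at index creation time.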

I'd also recommend following the tips in Tune for indexing speed to make the reindexing process faster.
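Two settings commonly adjusted for a one-off bulk load are the refresh interval and the replica count. The sketch below builds the bodies for `PUT <index>/_settings` before and after the reindex; the helper names and the index name `logs-2018.08` are hypothetical, and the restore values (`1s`, one replica) are simply the Elasticsearch defaults, which may not match your cluster.

```python
import json


def bulk_load_settings():
    """Settings to apply to the destination index before reindexing."""
    return {
        "index": {
            # -1 disables periodic refresh while the bulk load runs.
            "refresh_interval": "-1",
            # No replicas during the load; they are added back afterwards.
            "number_of_replicas": 0,
        }
    }


def restore_settings(refresh_interval="1s", number_of_replicas=1):
    """Settings to apply once reindexing has finished (defaults assumed)."""
    return {
        "index": {
            "refresh_interval": refresh_interval,
            "number_of_replicas": number_of_replicas,
        }
    }


if __name__ == "__main__":
    # Hypothetical index name, for illustration only.
    print("PUT logs-2018.08/_settings  (before reindex)")
    print(json.dumps(bulk_load_settings(), indent=2))
    print("PUT logs-2018.08/_settings  (after reindex)")
    print(json.dumps(restore_settings(), indent=2))
```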

Hope this helps!

angshuman.ray · October 4, 2018, 12:19pm

Thank you, Robbie.
Let me apply your suggestions; I will update you with the results.

Regards,
Angshuman

system · November 1, 2018, 12:19pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
