Hello,
I am in the process of migrating our opensearch cluster running ES 7.9 as the backend version to an Elasticsearch cluster I built in EC2 that is also running 7.9.3 for the time being. I have the sync running just fine and I am indexing using the doc ID to prevent duplication but the numbers seem to keep growing on the destination index.
For example here are the source OS cluster index stats for contacts-0001:
green open contacts-00001 USm-qjRfTECW1-QKg-LYXQ 6 2 444875573 51669599 450.1gb 148.8gb
Here is the same index on the destination ES side with the sync still running:
green open contacts-00001 stjryAniSvCffEqFjSJXSg 1 1 445715309 62469925 334.3gb 168.4gb
Running some cursory searches in Kibana on specific data points I also have noticed not all data entries have been synced yet.
So my question is, is this normal? Is the destination going to be larger than the source no matter what due to Logstash's handling of the data? How do I tell when the sync is done? This is to be a production cutover so I do not want to point my search to the new instance until I am 100% sure it's accurate and has all entries.
Thanks in advance!