Hi all ,
During _reindex
call, if I specify remote source from multiple indexes, In which order are those indexes going to be processed?
I saw in documentation that there was sort
option in reindex but it is deprecated since 7.6
, and that sort was never guaranteed.
In case above was not clear enough, I'm trying to do something like this:
POST _reindex/?wait_for_completion=false&wait_for_active_shards=all
{
"source": {
"remote": {
"host": "https://example.com:443"
},
"index": "my-index-*",
"size": 5000,
"query": {
"match_all": {}
}
},
"dest": {
"index": "my-index-new"
}
}
And reason why I am asking is because I have quite some duplicate document ids spread-ed over my-index-yyyy-MM-dd
on remote, not all have desired states recorded, but the latest inserted version is the one I want to keep on my my-index-new
, and it "seems" that documents are kinda updated in that order.
So maybe coincidence but from what I am seeing based on results, it looks like execution is done based on timestamp/insertion order of remote indexes? Or there is no guarantee for this?
remote es version: 6.6
destination es version: 7.8