ES 6.0 snapshot index UUIDs

arunachala · November 9, 2018, 12:20pm

Hi All,
In ES 2.x, on creation of a snapshot, the directories created in the repository under the 'indices' directory contained the indices names as it is. Given a snapshot name, it was very easy to get all the files (meta-, snap- and indices) for that snapshot and create a single bundled tar file which can be transferred to an offsite location to copy data. Thus we were able to copy data from one cluster to another unconnected cluster.

But, on upgrading to ES 6.1.2, the directory structure created per snapshot under the repository has changed. Although the meta- and snap- files use the UUID of the snapshot, the files under the indices directory seem to be using a temporary UUID which is NOT exposed in any existing APIs.

Is there anyway to map the files under indices directory to its corresponding snapshot? or any API which exposes the UUIDs/IDs used for these indices in the snapshot?

Regards,
Arun

DavidTurner · November 9, 2018, 1:56pm

Technically the answer is yes, because Elasticsearch does exactly this mapping when restoring from a snapshot, but it varies from version to version and can't be relied upon.

The only reliable way to do this is to tar up the whole repository. You can create a separate repository for this operation if you do not want to transfer all the indices. You can also probably perform an incremental copy of an active repository using something like rsync, as long as you are careful not to attempt to access the destination repository while the copy is in progress.

arunachala · November 9, 2018, 2:03pm

Thanks for reply David.

We are interested in tarballing indices files and the meta&snap files for a particular snapshot and hence the query. This is helpful in the field to move around only the particular data we are interested in.

Assuming ES version 6.1.2, is there an API (or any other method) from which I can get this mapping of indices UUIDs to archived indices for a given snapshot?

-Arun

DavidTurner · November 9, 2018, 2:21pm

There is no API for this. Perhaps the source helps?

github.com

elastic/elasticsearch/blob/v6.1.2/core/src/main/java/org/elasticsearch/repositories/blobstore/BlobStoreRepository.java#L133-L164


      
          *   STORE_ROOT
          *   |- index-N           - list of all snapshot ids and the indices belonging to each snapshot, N is the generation of the file
          *   |- index.latest      - contains the numeric value of the latest generation of the index file (i.e. N from above)
          *   |- incompatible-snapshots - list of all snapshot ids that are no longer compatible with the current version of the cluster
          *   |- snap-20131010 - JSON serialized Snapshot for snapshot "20131010"
          *   |- meta-20131010.dat - JSON serialized MetaData for snapshot "20131010" (includes only global metadata)
          *   |- snap-20131011 - JSON serialized Snapshot for snapshot "20131011"
          *   |- meta-20131011.dat - JSON serialized MetaData for snapshot "20131011"
          *   .....
          *   |- indices/ - data for all indices
          *      |- Ac1342-B_x/ - data for index "foo" which was assigned the unique id of Ac1342-B_x in the repository
          *      |  |- meta-20131010.dat - JSON Serialized IndexMetaData for index "foo"
          *      |  |- 0/ - data for shard "0" of index "foo"
          *      |  |  |- __1 \
          *      |  |  |- __2 |
          *      |  |  |- __3 |- files from different segments see snapshot-* for their mappings to real segment files
          *      |  |  |- __4 |
          *      |  |  |- __5 /
          *      |  |  .....
          *      |  |  |- snap-20131010.dat - JSON serialized BlobStoreIndexShardSnapshot for snapshot "20131010"

This file has been truncated. show original

arunachala · November 10, 2018, 6:51am

Hi David,
Thanks for the pointer.

I could find that index-N files under the repository have data in json format which has the mapping of indices id to their names

Thanks for your help.

-Arun

system · December 8, 2018, 6:51am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Index UUIDs in snapshots Elasticsearch	5	2868	March 31, 2017
What snapshot data should I copy from one cluster to another for a migration? Elasticsearch	3	601	April 15, 2021
Copying the repository directory to a new cluster Elasticsearch	1	438	September 21, 2018
Proper way to dump indices from Elasticsearch and import to another Elasticsearch instance Elasticsearch docker	8	12374	August 30, 2022
Can I take backup of indices from one cluster and restore it to another cluster Elasticsearch	15	1268	June 23, 2023

ES 6.0 snapshot index UUIDs

Related topics