Is this the correct approach of taking a snapshop of an indice which is in production?

Jax_dev · March 17, 2021, 1:55pm

I have 4 different elastic nodes in a cluster.

Want to take a snapshot from one of the server which is having primary shard.

Step 1: sudo systemctl stop elasticsearch

Step 2: add path.repo: ["/data/elasticbackup"] in elasticsearch.yaml file.

Step 3: Give permission to the folder path sudo chmod 777 -R /data/elasticbackup

Step 4: sudo systemctl start elasticsearch

Will it join in the cluster automatically after starting the node ? What is the immediate activity we have to perform if node join fails?

Step 5: PUT request from postman to register snapshot

http://xx.xx.xx.xx:4200/_snapshot/elasticbackup

{
	"type":"fs",
	"settings": {
		"compress" : true,
		"location" : "/data/elasticbackup"
	}
	
}

Step 6: Validate weather snapshot has been registered or not.
GET request from postman:
http://xx.xx.xx:4200/_snapshot/_all

output: 

{
    "elasticbackup": {
        "type": "fs",
        "settings": {
            "compress": "true",
            "location": "/data/elasticbackup"
        }
    }
}

Step 7: Take snapshop ( around 300 GB productioncustomerdata indices )

http://xx.xx.xx.xx:4200/_snapshot/elasticbackup/snapshot_1?wait_for_completion=true

input :
{
	
	 "indices": "productioncustomerdata"
}

Step 8: Delete existing primary indices - productioncustomerdata

DELETE request - indices - http://xx.xx.xx.xx:4200/productioncustomerdata

Are the above steps are correct? Or do we need to perform any other activity ?

Christian_Dahlqvist · March 17, 2021, 2:33pm

Snapshots are cluster wide so the repository need to be made available on all nodes at the same path.

Jax_dev · March 17, 2021, 2:58pm

Can we not take only particular indices eg: productioncustomerdata ? In my case indices ( productioncustomerdata ) primary shard is available in server4 and replica shard is available in server3.

I thought of updating path.repo only for server2 as primary is available there.

Christian_Dahlqvist · March 17, 2021, 3:16pm

You can take only individual indices but the repository still need to be configured on all master and data nodes.

Jax_dev · March 17, 2021, 6:03pm

ok understood. We have traffic 24/7, is there any possibility that we can do the above steps without downtime ?

Christian_Dahlqvist · March 17, 2021, 6:34pm

No, that change requires a restart but you can perform a rolling one. Note that shared storage is required and need to be mounted the same across all nodes.

Jax_dev · March 18, 2021, 6:56am

currently shared storage is not mounted. Every node had their own storage. Is it recommended to have a shared storage for all the nodes in ES ? Any specific reason for this ?

Christian_Dahlqvist · March 18, 2021, 7:21am

Shared storage is a requirement for snapshot repositories. Nodes should however store their own data on local storage.

Jax_dev · March 18, 2021, 3:20pm

Can we provide azure blob ? Our servers are hosted in azure vm’s.

Any documentation available if we want to provide azure blob storage ?

Christian_Dahlqvist · March 18, 2021, 3:25pm

There is an Azure repository plugin that you can install to take snapshots to Azure blob storage.

Jax_dev · March 18, 2021, 10:06pm

I have tested it with azure blob storage as repo and working fine in the development env. I will perform the same in production. But before that,

We are making below api call to take the snapshot,
http://xx.xx.xx.x:4200/_snapshot/elasticbackupazure/e?wait_for_completion=true

Provided query string wait_for_completion=true in the postman, We are taking the snapshot of indice which is having 350GB, will it really wait for the response in postman or will it request timeout? If request is timedout will the background job of taking snapshot continues ?

system · April 15, 2021, 10:06pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Snapshot best practices for staging/production scenario Elasticsearch	4	2030	July 5, 2017
Elasticsearch upgrade Snapshot in cluster nodes Elasticsearch	5	352	September 21, 2020
Proper way to dump indices from Elasticsearch and import to another Elasticsearch instance Elasticsearch docker	8	14377	August 30, 2022
How to take snapshots in cluster Elasticsearch	14	1974	January 17, 2017
Creating/Restoring snapshot from one cluster to another Elasticsearch	3	4422	December 20, 2017

Is this the correct approach of taking a snapshop of an indice which is in production?

Related topics