Replication with filtering

Lavalyte · February 17, 2017, 5:04am

Hi,

I have a use case I am not sure how to cater for.
We have an elasticsearch db that takes a high volume of logged information per second.
We have a requirement to archive some of that volume off.
I am hoping we can archive to a second elasticsearch db.

I know elasticsearch does replication. The question I have is, can it replicate with filtering, to only replicate what we want it to?

dadoonet · February 17, 2017, 6:47am

Replication is not designed for this need.

But, you have other choices:

let say you want to archive by date. Use time based indices and just change index allocation to allocate old indices to archive nodes
use reindex from remote: so reindex with a query from one cluster to another. Then use delete by query to remove old data.

May be there are other ways but it depends on the kind of data you want to archive (what is the type of query you want to run basically)

Lavalyte · February 19, 2017, 10:14pm

We want to backup the content of specific indices. When writing a query on es I believe there is no longer a general creation timestamp on each document we could use to only grab the incremental?

I am looking at
https://www.npmjs.com/package/elasticdump
as a solution atm.

system · March 19, 2017, 10:14pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Replicating one cluster to another cluster Elasticsearch	4	906	July 6, 2017
Cluster-to-Cluster replication using polling Elasticsearch	1	815	July 5, 2017
Replication Strategies Elasticsearch	6	1943	July 6, 2017
Copying Incremental Data Elasticsearch	7	747	June 18, 2020
Indexing by time and deleting indexes by time Elasticsearch	4	372	July 6, 2017

Replication with filtering

Related topics