Is it possible to update snapshot with new lucene segment and restore

Chetana · May 30, 2014, 11:59am

I have a requirment where for some data setting the '_source' in
indexrequest is strightforward but for some huge amount of data I need to
run long running activity to generate data which needs to be indexed. So
frequently I plan to take a snapshot from ES to Hadoop and want to add new
lucene documents (using lucene 4.7x library not elasticsearch library) in
hadoop by running a batch job and finally restore this modified index
repo/snapshot to ES.

Is it possible to update snapshot data and restore? If so, how to get
handle of Lucene (org.apache.lucene.store.Directory) which stored in hdfs
and addDocument using indexwriter
Is there any other better alternative to achieve the above requirement?

Thanks,

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/e374f948-32c9-4c50-b6f1-c62a37c7002a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

ElasticSearch_Users_ · May 30, 2014, 8:15pm

No I don't believe so. The snapshot data is not really a "valid" Lucene
index, per se. It does contain segment files, but they are named and
packaged in a specific manner that it would be best not to mess with them.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/23f40904-3630-4a92-91f3-b0493fad36ee%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Chetana · June 2, 2014, 6:16am

Could please suggest a best option for merging index data stored in HDFS
with the index data stored in ES node

On Saturday, May 31, 2014 1:45:13 AM UTC+5:30, Binh Ly wrote:

No I don't believe so. The snapshot data is not really a "valid" Lucene
index, per se. It does contain segment files, but they are named and
packaged in a specific manner that it would be best not to mess with them.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/21b1f594-a469-4fdd-a141-c12c8730159d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Topic		Replies	Views
Read segments directly from the snapshot Elasticsearch snapshot-and-restore	11	424	July 7, 2023
Restore api vs custom copy script Elasticsearch	1	351	July 6, 2017
Read ES index snapshot in spark without restore Elasticsearch es-hadoop	2	1508	September 1, 2017
Snapshot/restore between clusters not working after upgrade to 1.3.4 Elasticsearch	1	328	July 6, 2017
Snapshot HDFS files encrypted? Elasticsearch es-hadoop	5	1025	July 6, 2017

Is it possible to update snapshot with new lucene segment and restore

Related topics