Is it possible to update snapshot with new lucene segment and restore


(Chetana) #1

I have a requirment where for some data setting the '_source' in
indexrequest is strightforward but for some huge amount of data I need to
run long running activity to generate data which needs to be indexed. So
frequently I plan to take a snapshot from ES to Hadoop and want to add new
lucene documents (using lucene 4.7x library not elasticsearch library) in
hadoop by running a batch job and finally restore this modified index
repo/snapshot to ES.

Is it possible to update snapshot data and restore? If so, how to get
handle of Lucene (org.apache.lucene.store.Directory) which stored in hdfs
and addDocument using indexwriter
Is there any other better alternative to achieve the above requirement?

Thanks,

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/e374f948-32c9-4c50-b6f1-c62a37c7002a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(ElasticSearch Users mailing list) #2

No I don't believe so. The snapshot data is not really a "valid" Lucene
index, per se. It does contain segment files, but they are named and
packaged in a specific manner that it would be best not to mess with them.
:wink:

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/23f40904-3630-4a92-91f3-b0493fad36ee%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Chetana) #3

Could please suggest a best option for merging index data stored in HDFS
with the index data stored in ES node

On Saturday, May 31, 2014 1:45:13 AM UTC+5:30, Binh Ly wrote:

No I don't believe so. The snapshot data is not really a "valid" Lucene
index, per se. It does contain segment files, but they are named and
packaged in a specific manner that it would be best not to mess with them.
:wink:

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/21b1f594-a469-4fdd-a141-c12c8730159d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(system) #4