Hi,
I am currently writing a script that uses update by query.
I am trying to remove a field from all documents older than 7 days, across all indices.
I can see in Kibana that it is working. But I think the information is still stored in Elasticsearch, because I heard that update by query creates a new version of each updated document.
My questions are:
Is this correct?
If yes, how can I remove the older versions of the documents, or how can I update without creating new versions?
Internally, updates do indeed create a new version of the document, which replaces the old one. The old document is marked as deleted and is eventually purged when segments are merged.
Practically, you don't need to worry about this... it's all an internal implementation detail. From the application's point of view, once a document is updated, the old version is no longer accessible. Internally it will be cleaned up and removed at some point.
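
If it helps to see this concretely, here is a rough sketch of the two requests involved. It only builds the request body and the URL path, it does not talk to a cluster. The field name `my_field`, the timestamp field `@timestamp`, the index pattern `my-index-*` and both helper function names are assumptions made up for illustration, so adjust them to your mapping. The script removes a top-level key from `_source`; a field nested inside an object would need a different script.

```python
import json


def build_remove_field_request(field, timestamp_field="@timestamp", older_than="now-7d"):
    """Build a JSON body for POST /<index>/_update_by_query.

    Hypothetical helper: matches documents that still contain `field`
    and whose timestamp is older than `older_than`, then removes the
    field from _source with a Painless script.
    """
    return {
        "query": {
            "bool": {
                "filter": [
                    # Skip documents that no longer have the field.
                    {"exists": {"field": field}},
                    # Only touch documents older than the cutoff.
                    {"range": {timestamp_field: {"lt": older_than}}},
                ]
            }
        },
        "script": {
            "lang": "painless",
            "source": "ctx._source.remove(params.field)",
            "params": {"field": field},
        },
    }


def expunge_deletes_path(index):
    """Path for an optional force merge that only expunges deleted docs.

    Hypothetical helper; normally not needed, because background merges
    clean up deleted documents on their own.
    """
    return f"/{index}/_forcemerge?only_expunge_deletes=true"


if __name__ == "__main__":
    print("POST /my-index-*/_update_by_query")
    print(json.dumps(build_remove_field_request("my_field"), indent=2))
    print("POST " + expunge_deletes_path("my-index-*"))
```

The force merge is only there for the case where you really want the old copies gone from disk sooner. It is an expensive operation, so in most setups it is better to leave the cleanup to the regular background merges.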