[Performance] Which is better update via script or storing the complete object?

Abhishek_Gupta · July 20, 2015, 6:10pm

There are two options to update an object in ES:

using update APIs (via script)
storing the complete object again with the id set

There are many scenarios when choosing among these two options is easy like:

When you want to do conditional updates script is the winner.
In case you in the application logic only have the update part not the complete object than again script is the clear winner.

But I have a situation where I have the complete object in my hand. Out of the complete object I only want to update a very small part. So what should I use, storing the complete object or updating it via script. Which will perform better?

nik9000 · July 20, 2015, 6:31pm

They'll perform mostly the same.The real overhead of an update is indexing the new document and cleaning up the tombstone of the old document on the next time its merged. The tombstones also cost IO during queries and effect the scoring so if you get too many that is trouble eventually.

The way in which the update hits the system isn't a huge part of it. My biggest bit of advice is to avoid noop updates. If you send the whole object there is a flag you can turn on to detect noops but if you send a script then the responsibility is on you.

Oh! If you go the script route make sure its parameterized or else compiling it will be overhead. If its parameterized then it is only compiled once.

Topic		Replies	Views
Script vs Document for partial update Elasticsearch	2	1491	July 5, 2017
How exactly elasticsearch update an document when use update? Elasticsearch painless	4	413	December 2, 2022
Performance Benefits to using Stored Scripts? Elasticsearch	2	870	August 21, 2020
Question about Update API Elasticsearch	5	528	July 5, 2017
Partial update using script and updating other properties Elasticsearch	1	627	February 16, 2018

[Performance] Which is better update via script or storing the complete object?

Related topics