How about the upgrade performance?


(Meng Li) #1

Hi, guys:

We are a heavy user of es for log searching, one log job is now indexing
more than 10,000 lines per minute (just name it as 'A')
We now have another log 'B' which we want to join it into A with the _id ,
and we decide to use update operation.
B is relatively small, 2 million a day and not real-time,( but will go up
in the future)
Anybody here know the performance of update? Or will the cluster's
performance go down facing the frequent update?
Both production environment experience and theoretical explanation are
appreciated.
Thank you.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/4cbfb523-9d57-44e6-a9e5-0f577521dab7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Alexander Reelsen) #2

Hey,

an update operation is nothing else than a reindex operation - which in
turn marks the old document as deleted and creates a new document. Also an
update operations fetches the document from the index first, and then
applies the specified updates from the request. So basically you are just
doing two million more get and two million more index operations per day -
something that should work, depending of your current cluster is already at
its capacity or not. As usual the easiest way is to try it out on your
staging/testing systems...

--Alex

On Wed, Apr 2, 2014 at 5:20 AM, Meng Li remindme2009@gmail.com wrote:

Hi, guys:

We are a heavy user of es for log searching, one log job is now indexing
more than 10,000 lines per minute (just name it as 'A')
We now have another log 'B' which we want to join it into A with the _id
, and we decide to use update operation.
B is relatively small, 2 million a day and not real-time,( but will go up
in the future)
Anybody here know the performance of update? Or will the cluster's
performance go down facing the frequent update?
Both production environment experience and theoretical explanation are
appreciated.
Thank you.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/4cbfb523-9d57-44e6-a9e5-0f577521dab7%40googlegroups.comhttps://groups.google.com/d/msgid/elasticsearch/4cbfb523-9d57-44e6-a9e5-0f577521dab7%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAGCwEM-7E11_%3DfNsGcwwSYwh1%2BbGBWXYO%3DsA4ggNUf0LoSp7UQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(system) #3