Index update performance

Gaurav_gupta · May 16, 2014, 2:14pm

Hi,

We are evaluating Lucene/ES as an embedded search engine inside our
product offering, and the following concerns came up during our analysis.
Could you share your thoughts on how serious or genuine they are?

Read dominant - Data in a Lucene index is intended to be read
predominantly and not updated frequently, although Lucene does support
near real-time search. We might have scenarios where one or more users
concurrently update or delete a few documents in a large index
(250 million documents).

While a Lucene index supports inserting, deleting, and updating
documents, its single-field update capabilities are weak, especially if
the field is indexed as text. Our understanding is that whenever a new
field is added or the contents of an existing field are changed, every
document in the index must be reprocessed in its entirety before the data
is populated for the new field in all documents.
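
To make the single-field update concern concrete, here is a toy Python sketch of the update model as we understand it: changing one field is modelled as marking the old document deleted and re-adding the whole document. The `ToyIndex` class and all of its methods are hypothetical names made up for illustration, not the Lucene or ES API, and the sketch ignores segments, merging, and analysis.

```python
from collections import defaultdict


class ToyIndex:
    """Toy inverted index (hypothetical; not the Lucene or ES API)."""

    def __init__(self):
        self.docs = {}                     # internal id -> stored fields
        self.deleted = set()               # internal ids marked as deleted
        self.postings = defaultdict(set)   # (field, term) -> internal ids
        self.by_key = {}                   # external key -> live internal id
        self.next_id = 0

    def add(self, key, fields):
        # Index every field of the document under a fresh internal id.
        internal = self.next_id
        self.next_id += 1
        self.docs[internal] = dict(fields)
        for field, text in fields.items():
            for term in str(text).lower().split():
                self.postings[(field, term)].add(internal)
        self.by_key[key] = internal
        return internal

    def delete(self, key):
        # Deletion only marks the old entry; postings are left in place.
        internal = self.by_key.pop(key, None)
        if internal is not None:
            self.deleted.add(internal)

    def update_field(self, key, field, value):
        # Assumption: a one-field change re-indexes the whole document
        # (delete the old copy, then add a new copy with the changed field).
        old = self.docs[self.by_key[key]]
        new_fields = {**old, field: value}
        self.delete(key)
        return self.add(key, new_fields)

    def search(self, field, term):
        # Filter out entries that were marked as deleted.
        hits = self.postings.get((field, term.lower()), set())
        return sorted(i for i in hits if i not in self.deleted)


index = ToyIndex()
index.add("a", {"title": "red apple", "body": "fresh fruit"})
index.add("b", {"title": "green pear", "body": "fresh fruit"})
index.update_field("a", "title", "yellow banana")
```

In this toy model the update rewrites all fields of document "a", including the unchanged `body`, while document "b" is not touched. Whether the real cost in Lucene/ES stays at the level of the single document or extends to the rest of the index is exactly what we would like to confirm.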

Thanks
Gaurav

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/93d7c25f-e8c3-4078-9f8c-f2158e26752c%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Gaurav_gupta · May 17, 2014, 12:50pm

Does anybody have any thoughts on the points in my original post?

Also, can anybody point me to a performance comparison between
databases (RDBMS) and a Lucene/ES index for insert/update/delete operations?

Thanks

