Near Realtime Question

James_Cook · November 4, 2010, 7:12pm

We are using ElasticSearch as a datastore in place of a database. Kimchy
will be quick to add that this is not advisable at the moment due to its
beta status. (We also dupe data to MySQL for the present time.)

For most of our use cases, we do fine with ES NRT, but there are a few spots
where we have to issue a refresh before continuing. I was wondering if it
would be possible to add a parameter (similar to op_type) that can be passed
along with a POST or PUT to indicate that the call should not reply until
the document is indexed.

We have a multi-tier system, and this would help us by not requiring two
REST calls from the middle tier to our web services tier where ES lives.

-- jim

kimchy · November 4, 2010, 7:20pm

Hi,

First, the document is always indexed, and added to a transaction log,
before the operation returns. It's the visibility of it for search that is
near real time. And yes, there is a flag for that: the index and delete APIs
accept a refresh parameter that will cause the relevant shard they hit to
refresh before returning a result (note: I found a small problem with it in
0.12, fixed in master).

-shay.banon
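
For readers following along, here is a minimal sketch of what passing the refresh flag on an index call could look like over REST, so that indexing and refreshing happen in a single call. It only builds the request and prints it; nothing is sent. The host, the index name "twitter", the type "tweet", the document body, and the helper name indexRequest are all illustrative assumptions, and the index/type/id path layout is assumed from the REST API of that era.

```java
import java.net.URI;
import java.net.http.HttpRequest;

public class IndexWithRefresh {

    // Hypothetical helper: builds a PUT that indexes one document and asks
    // the shard it lands on to refresh before the call returns.
    // The refresh=true query parameter is the flag described above; the
    // index/type/id path layout is an assumption for illustration.
    static HttpRequest indexRequest(String baseUrl, String index, String type,
                                    String id, String json) {
        URI uri = URI.create(baseUrl + "/" + index + "/" + type + "/" + id
                + "?refresh=true");
        return HttpRequest.newBuilder(uri)
                .header("Content-Type", "application/json")
                .PUT(HttpRequest.BodyPublishers.ofString(json))
                .build();
    }

    public static void main(String[] args) {
        HttpRequest req = indexRequest("http://localhost:9200",
                "twitter", "tweet", "1", "{\"user\":\"jim\"}");
        // Print the request line instead of sending it, so the sketch runs
        // without a cluster.
        System.out.println(req.method() + " " + req.uri());
    }
}
```

To actually send it, the request could be handed to java.net.http.HttpClient. Since the refresh applies to the shard the operation hits, it is probably best reserved for the few spots that really need read-after-write visibility.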

James_Cook · November 4, 2010, 7:51pm

Awesome, didn't know that parameter existed.

Diptamay · November 4, 2010, 8:05pm

Alternatively, with version 0.12.1 you could call
_client.admin().indices().prepareRefresh("index").execute(); since the
refresh boolean parameter isn't being honored in 0.12.1.
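
As a rough REST-side counterpart to the explicit refresh call, the sketch below builds a request against the index's _refresh endpoint, which would be the second of the two calls in the index-then-refresh pattern. It only constructs and prints the request. The host and the helper name refreshRequest are assumptions for illustration, and "index" is the same placeholder index name used in the Java call.

```java
import java.net.URI;
import java.net.http.HttpRequest;

public class ExplicitRefresh {

    // Hypothetical helper: builds a POST to <baseUrl>/<index>/_refresh,
    // asking the index to refresh so recently indexed documents become
    // visible to search.
    static HttpRequest refreshRequest(String baseUrl, String index) {
        URI uri = URI.create(baseUrl + "/" + index + "/_refresh");
        return HttpRequest.newBuilder(uri)
                .POST(HttpRequest.BodyPublishers.noBody())
                .build();
    }

    public static void main(String[] args) {
        HttpRequest req = refreshRequest("http://localhost:9200", "index");
        // Print the request line instead of sending it, so the sketch runs
        // without a cluster.
        System.out.println(req.method() + " " + req.uri());
    }
}
```

Either way, the caller presumably has to wait for the refresh to complete before searching; with the Java client that would mean waiting on whatever execute() returns rather than firing and forgetting.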
