Review request for "Elasticsearch Survival Guide for Developers" blog post

DavidTurner · May 29, 2019, 7:52pm

A few comments from a quick scan:

There is, as far as I can see, no mention of oversharding, and this is pretty much the #1 problem we see with users. I think it's worth a mention and maybe a link to a blog post like this one.

Start first by setting index.translog.durability to async.

Please don't recommend this. It will cause less experienced users to experience data loss. The default durability setting is much safer, and normally performs just fine on good hardware.

In fact I think that whole paragraph on translog tuning is a little misleading for a "survival guide". There are lots of other things I'd look at for performance gains before turning to these settings.
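For anyone who has already followed that advice, here is a rough sketch of how to spot indices where durability was switched to async. It assumes you have fetched the nested JSON from GET /_settings yourself. The function name and the sample data are made up for illustration, and no request is sent.

```python
def indices_with_async_translog(settings_response):
    """Return the names of indices whose translog durability is explicitly 'async'."""
    flagged = []
    for index_name, body in settings_response.items():
        index_settings = body.get("settings", {}).get("index", {})
        durability = index_settings.get("translog", {}).get("durability")
        # An absent setting means the index uses the safer default.
        if durability == "async":
            flagged.append(index_name)
    return sorted(flagged)


# Hypothetical sample, shaped like a GET /_settings response body.
sample = {
    "logs-1": {"settings": {"index": {"translog": {"durability": "async"}}}},
    "logs-2": {"settings": {"index": {"number_of_shards": "1"}}},
}

print(indices_with_async_translog(sample))
```

Any index this reports is worth a second look before trusting it with data you can't afford to lose.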

Adapt index.refresh_interval to your needs.

It might be best to leave this setting unset too. In recent versions, if you're indexing but not searching then there will be no refreshes taking place. From the docs:

If this setting is not explicitly set, shards that haven’t seen search traffic for at least index.search.idle.after seconds will not receive background refreshes until they receive a search request.
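If an index already has an explicit refresh_interval, sending null for it through the update settings API should put it back to the default, which is what lets the search-idle behaviour apply. A minimal sketch that only builds the request follows. The index name and the helper are hypothetical, and nothing is sent to a cluster.

```python
import json


def reset_refresh_interval_request(index_name):
    """Build (method, path, body) for removing an explicit refresh_interval."""
    # None serialises to JSON null, which asks Elasticsearch to restore the default.
    body = {"index": {"refresh_interval": None}}
    return "PUT", f"/{index_name}/_settings", json.dumps(body)


# "my-index" is a placeholder name for illustration.
method, path, body = reset_refresh_interval_request("my-index")
print(method, path, body)
```

Pass the three values to whatever HTTP client you already use.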

Compare-and-swap over _version field is poor man’s transactions

The preferred CAS operation uses _primary_term and _seq_no, since _version has known issues.
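As a rough illustration of the sequence-number approach: read the document, keep the _seq_no and _primary_term from the response, and pass them back as the if_seq_no and if_primary_term query parameters on the write. The helper name, index name and sample response below are made up, and the sketch only builds the request path.

```python
from urllib.parse import urlencode


def conditional_index_path(index_name, doc_id, get_response):
    """Build an index-request path that only succeeds if the doc is unchanged."""
    query = urlencode({
        "if_seq_no": get_response["_seq_no"],
        "if_primary_term": get_response["_primary_term"],
    })
    return f"/{index_name}/_doc/{doc_id}?{query}"


# Hypothetical result of an earlier GET /my-index/_doc/1.
fetched = {"_id": "1", "_seq_no": 362, "_primary_term": 2, "_source": {"count": 5}}

print(conditional_index_path("my-index", fetched["_id"], fetched))
```

If another writer has modified the document in the meantime, the write is rejected with a version conflict (HTTP 409) and the caller can re-read and retry.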
