GlobalCheckpoint syncer only syncs translog when the translog durability is REQUEST

asce0705 · May 8, 2021, 10:06am

Elasticsearch version: 7.10
Currently, IndexShard.maybeSyncGlobalCheckpoint is called in two places:

In the AsyncGlobalCheckpointTask of IndexService;
In the TransportReplicationAction after a write action.
In IndexShard.maybeSyncGlobalCheckpoint, it runs the globalCheckpointSyncer according to the following conditions:

    // only sync if there are no operations in flight, or when using async durability
    final SeqNoStats stats = getEngine().getSeqNoStats(replicationTracker.getGlobalCheckpoint());
    final boolean asyncDurability = indexSettings().getTranslogDurability() == Translog.Durability.ASYNC;
        if (stats.getMaxSeqNo() == stats.getGlobalCheckpoint() || asyncDurability) {
            final ObjectLongMap<String> globalCheckpoints = getInSyncGlobalCheckpoints();
            final long globalCheckpoint = replicationTracker.getGlobalCheckpoint();
            // async durability means that the local checkpoint might lag (as it is only advanced on fsync)
            // periodically ask for the newest local checkpoint by syncing the global checkpoint, so that ultimately the global
            // checkpoint can be synced. Also take into account that a shard might be pending sync, which means that it isn't
            // in the in-sync set just yet but might be blocked on waiting for its persisted local checkpoint to catch up to
            // the global checkpoint.
            final boolean syncNeeded =
                (asyncDurability && (stats.getGlobalCheckpoint() < stats.getMaxSeqNo() || replicationTracker.pendingInSync()))
                    // check if the persisted global checkpoint
                    || StreamSupport
                            .stream(globalCheckpoints.values().spliterator(), false)
                            .anyMatch(v -> v.value < globalCheckpoint);
            // only sync if index is not closed and there is a shard lagging the primary
            if (syncNeeded && indexSettings.getIndexMetadata().getState() == IndexMetadata.State.OPEN) {
                logger.trace("syncing global checkpoint for [{}]", reason);
                globalCheckpointSyncer.run();
            }
        }

One of the condition checks the translog durability, which should be ASYNC, and if the local checkpoint lags, it runs the GlobalCheckpointSyncer, which will then execute GlobalCheckpointSyncAction. This action syncs the translog of the given indexShard when the translog durability is REQUEST.

    private void maybeSyncTranslog(final IndexShard indexShard) throws IOException {
        if (indexShard.getTranslogDurability() == Translog.Durability.REQUEST &&
            indexShard.getLastSyncedGlobalCheckpoint() < indexShard.getLastKnownGlobalCheckpoint()) {
            indexShard.sync();
        }
    }

This condition and the action behavior conflicts. Should we remove the translog durability check int GlobalCheckpointSyncAction?

DavidTurner · May 8, 2021, 11:41am

I don't think so, we don't want to actually sync the translog here if the durability is ASYNC. In that case we just want to run a no-op GlobalCheckpointSyncAction to ensure that the primary's ReplicationTracker is kept up to date.

asce0705 · May 14, 2021, 6:44am

Indeed as you say. By running a no-op GlobalCheckpointSyncAction , the replicas learn about current globalCheckpoint from primary and the primary collects the localCheckpoints from replicas, which may result in globalCheckpoint advance and checkpoints update in ReplicationTracker .

system · June 11, 2021, 6:45am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Confusions about how globlal checkpoint advances in es 7.6 Elasticsearch	9	1036	March 31, 2020
Some questions with regards to index.translog.durability's interaction with replica shards Elasticsearch	5	2257	March 6, 2018
Which will cause the LocalCheckpoint less than GlobalCheckpoint? Elasticsearch	3	364	April 7, 2022
If translog durability is set to request, then flush is required? Elasticsearch	13	254	May 5, 2024
Metrics for replica sync - ES 7.0.1 Elasticsearch	4	453	May 14, 2020

GlobalCheckpoint syncer only syncs translog when the translog durability is REQUEST

Related topics