How the translog works in Elasticsearch

yuswanul · February 18, 2023, 5:23am

Hi everyone,

I have a question about translog. So here is the situation:

I have one index with 1 primary and 1 replica shard, and it is continuously ingesting data. The documentation says that both primary and replica shards write their operations to the translog. Correct me if I'm wrong.

So my question is: if there is an issue, for example the cluster becomes yellow or red and the primary and replica shards become unassigned, how do they recover from the translog so that they have the latest data, given that the translog itself is written by the shard? As the documentation puts it:

"so each shard copy also writes operations into its transaction log known as the translog"

If a shard becomes unassigned, it can't receive the latest data, right? So it recovers through the translog. However, the translog itself is written by the shard, so even recovering via the translog can't give the shard the latest data. If so, how does the shard catch up?

Thanks

Christian_Dahlqvist · February 18, 2023, 6:04am

If a replica shard becomes unavailable, the primary shard will continue to accept writes and write these to the transaction log.

If the primary becomes unavailable and there is a replica shard in place, this will get promoted to primary and continue accepting writes as in the first scenario.

If the primary as well as all replicas of a specific shard become unavailable, the index will go red and no writes will be accepted to any shards in that index. When one of the unavailable shards becomes available, it will become the primary and the index will again accept writes.
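The three scenarios above can be sketched as a toy model. This is only an illustration of the behaviour described in this thread, not Elasticsearch code: the `ShardGroup` class, its methods, and the node names are all hypothetical.

```python
class ShardGroup:
    """Toy model of one shard and its copies (hypothetical, not Elasticsearch code)."""

    def __init__(self, copies):
        self.expected = len(copies)   # primary + replicas configured
        self.active = list(copies)    # active[0] plays the role of the primary

    @property
    def primary(self):
        return self.active[0] if self.active else None

    def fail(self, copy):
        # Removing the primary leaves the next active copy in position 0,
        # which models a replica being promoted.
        self.active.remove(copy)

    def recover(self, copy):
        # A returning copy joins as a replica, unless no copy is active,
        # in which case it becomes the primary.
        self.active.append(copy)

    def health(self):
        if not self.active:
            return "red"
        if len(self.active) < self.expected:
            return "yellow"
        return "green"

    def accepts_writes(self):
        return self.primary is not None


group = ShardGroup(["node-a", "node-b"])   # node-a starts as primary
history = []

group.fail("node-b")                       # replica lost: primary keeps writing
history.append((group.health(), group.primary, group.accepts_writes()))

group.recover("node-b")
group.fail("node-a")                       # primary lost: replica is promoted
history.append((group.health(), group.primary, group.accepts_writes()))

group.fail("node-b")                       # all copies lost: red, no writes
history.append((group.health(), group.primary, group.accepts_writes()))

group.recover("node-a")                    # first copy back becomes primary
history.append((group.health(), group.primary, group.accepts_writes()))
```

In this sketch a copy that returns while another copy is already primary simply joins as a replica.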

yuswanul · March 11, 2023, 9:01am

OK, pretty clear. But I want to know one more thing. If the primary shard becomes unavailable, the replica shard becomes primary and accepts writes. So which translog does the old primary shard recover from? The documentation quoted in my first post says each shard copy (primary and replica) has its own translog. Will it recover using its own translog, or using the translog of the former replica, which obviously has the latest data?

Thanks

Christian_Dahlqvist · March 11, 2023, 9:18am

If the old primary shard comes back, it will become a replica shard (assuming a replica shard still needs to be assigned) and recover from the current primary shard.

yuswanul · March 11, 2023, 9:40am

That means the translog is only used for reassigning the shards, right? After that, the new replica will replicate the data from the current primary shard?

Christian_Dahlqvist · March 11, 2023, 9:48am

The transaction log is a local buffer holding indexed data until a new segment can be created during a flush. Writes are sent to both primary and replica shards and are written to each shard's transaction log.
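As a rough illustration of that description, here is a minimal toy model in which every copy appends each write to its own translog and a flush turns the buffered operations into a segment. The `ShardCopy` class and the `replicate` helper are hypothetical names used only for this sketch. It does not reflect how Elasticsearch or Lucene store data internally.

```python
from dataclasses import dataclass, field


@dataclass
class ShardCopy:
    """Toy shard copy (hypothetical, not Elasticsearch's implementation)."""
    name: str
    translog: list = field(default_factory=list)   # operations not yet flushed
    segments: list = field(default_factory=list)   # one list of operations per flush

    def index(self, op):
        # Every accepted write is appended to this copy's own translog.
        self.translog.append(op)

    def flush(self):
        # A flush turns the buffered operations into a new segment
        # and empties the local translog.
        if self.translog:
            self.segments.append(list(self.translog))
            self.translog.clear()


def replicate(op, copies):
    """Send the same write to the primary and to every replica."""
    for copy in copies:
        copy.index(op)


primary = ShardCopy("primary")
replica = ShardCopy("replica")

for doc in ["doc-1", "doc-2", "doc-3"]:
    replicate(doc, [primary, replica])

primary.flush()   # only the primary has flushed so far
```

After this runs, the primary holds the three documents in one segment with an empty translog, while the replica still holds them in its own translog. Each copy manages its translog independently.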

In more recent versions, writes use sequence numbers, which can be used during recovery to identify what has and has not been written to segments or the local transaction log.

When the failed primary shard comes back in the example we discussed, the cluster can identify that it is behind the current primary shard based on the sequence numbers. It will then recover from the current primary shard. This may involve processing the local transaction log and then receiving the missed writes based on sequence numbers, or simply copying the whole primary shard if it is very far behind.
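The choice between replaying missed writes and copying the whole shard can be sketched as follows. This is a simplified illustration under stated assumptions, not the actual recovery algorithm: `plan_recovery`, its parameters, and the returned labels are hypothetical. The sketch assumes the current primary retains a contiguous tail of recent operations keyed by sequence number.

```python
def plan_recovery(retained_ops, max_seq_no, local_checkpoint):
    """Decide how a returning copy catches up (toy model, hypothetical names).

    retained_ops:     dict of seq_no -> operation still held by the current primary
    max_seq_no:       highest sequence number on the current primary
    local_checkpoint: highest sequence number the returning copy has processed
    """
    if local_checkpoint >= max_seq_no:
        return "up_to_date", []

    needed = range(local_checkpoint + 1, max_seq_no + 1)
    if all(seq_no in retained_ops for seq_no in needed):
        # Every missed write is still available: replay only those.
        return "replay_operations", [retained_ops[s] for s in needed]

    # The copy is too far behind: fall back to copying the whole shard.
    return "copy_whole_shard", []


# The current primary has processed seq_no 0..9 but only retains 5..9.
retained = {s: f"op-{s}" for s in range(5, 10)}

slightly_behind = plan_recovery(retained, max_seq_no=9, local_checkpoint=6)
far_behind = plan_recovery(retained, max_seq_no=9, local_checkpoint=2)
in_sync = plan_recovery(retained, max_seq_no=9, local_checkpoint=9)
```

Here the copy that stopped at sequence number 6 only needs operations 7 to 9, while the copy that stopped at 2 is missing operations the primary no longer retains and has to copy the whole shard.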

yuswanul · March 11, 2023, 10:35am

I see, thanks.

system · April 8, 2023, 10:35am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
