Rollup API Autogenerated Index Strategy

Domenico_Raimondi · July 19, 2018, 1:31pm

Hello,

In my team, we've been enthusiastically using the new Rollup API. It provides the functionality that we were waiting for!

However, once we started using it, we noticed that the autogenerated _id for our new rollup index is a random unsigned 32bit integer. This is different than the autogenerated ID strategy that is indicated in this page.

We're worried that the new Rollup index that is generated would run into the Birthday Paradox problem, and thus possibly overwrite documents.

Could someone please explain the autogenerated index strategy of Rollup indexes?

Thank you and best regards,
Dom

Luca_Belluccini · July 26, 2018, 6:20am

Hello Domenico,
the _id is actually a 32bit CRC at the moment.
It was not possible to use the standard ES GUID as those ids must be deterministic, but it can improved.
An issue has been published for this reason.
Remember the Rollup API is still experimental.

Hope it helps!

Luca

Domenico_Raimondi · July 26, 2018, 8:48am

Thank you @Luca_Belluccini for pushing to get this issue tracked openly.

Indeed in my team we have indices that, when rolled up, can indeed pass over the 200k barrier. As such, Rollup is not usable in its current state until we know for sure that we won't generate any collisions.

We eagerly await a resolution!

Best Regards,
Dom

system · August 23, 2018, 8:48am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Potential Clash of Auto-Generated IDs Elasticsearch	3	1101	March 8, 2018
Updating documents when using auto-generated IDs Elasticsearch	7	2096	August 26, 2020
Autogenerate id of IndexRequest before sending it Elasticsearch	2	734	May 1, 2019
Updating documents with deterministic ID in older index after rollover Elasticsearch ilm-index-lifecycle-management	3	469	May 28, 2020
A rollover (ILM) on rollup indexes Elasticsearch ilm-index-lifecycle-management	8	470	January 9, 2023

Rollup API Autogenerated Index Strategy

Related topics