Does document ID length affect performance/heap?

gskema · February 4, 2019, 1:32pm

Let's say we have 300 million documents. An average document ID looks like this:
aaaaaaaaaaaaaaaa_000000000_0_0_0_0

There are thousands of MGET operations every second for these documents.
We wanted to change the document ID to make it more readable, but our colleague says that the IDs are already too long and should be shorter because length affects performance and heap usage.

Is that true? Is there a resource where we could learn more about it? Thank you.

DavidTurner · February 4, 2019, 4:36pm

The structure of document IDs does indeed affect heap usage. We started to investigate some alternatives to the auto-generated IDs that Elasticsearch creates if you don't give an explicit ID here:

It's not purely a function of length, because the changes discussed in that thread do not affect the lengths of the IDs. As with all performance questions, there is no substitute for careful benchmarking to determine the true effects of any change.
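
As a rough starting point before any benchmark, the raw size of the IDs can be estimated with simple arithmetic. The sketch below is only a back-of-the-envelope calculation using the example ID and document count from the question; `raw_id_bytes` is a hypothetical helper, and the result is the uncompressed byte count of the ID strings, not a measurement of Elasticsearch heap usage, which also depends on the structure of the IDs as noted above.

```python
def raw_id_bytes(sample_id: str, doc_count: int) -> int:
    """Uncompressed UTF-8 size of doc_count IDs the same length as sample_id.

    Hypothetical helper for illustration only: it ignores how the IDs are
    actually stored and indexed, so it says nothing definite about heap.
    """
    return len(sample_id.encode("utf-8")) * doc_count


# Values taken from the question above.
current_id = "aaaaaaaaaaaaaaaa_000000000_0_0_0_0"
doc_count = 300_000_000

total = raw_id_bytes(current_id, doc_count)
per_id = len(current_id.encode("utf-8"))
print(f"{per_id} bytes per ID, {total / 1e9:.1f} GB raw for {doc_count:,} IDs")
```

Running the same calculation on a candidate "more readable" ID gives a first comparison of raw size, but only a benchmark against a real cluster shows the effect on heap and MGET latency.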

system · March 4, 2019, 4:36pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
