Elasticsearch 6.0 _id and size_in_bytes

bCast · December 3, 2017, 7:48pm

I upgraded to 6.0 because of sequential ids. I expected that switching to a sequential _id would save memory.
In my case I have one index per day with lots of events, each with a small number of fields (like ip_src, ip_dst, port_dst). I noticed that _id consumes a lot of memory. Is it possible to optimize the _id field or somehow disable it? If the id is sequential, I would expect that some sort of memory offset could be calculated per search request, so that it would not need to be stored in memory.

This is part of statistics:
{
  "description" : "field '_id' [BlockTreeTerms(seg=_32b terms=531218720,postings=531218720,positions=-1,docs=531218720)]",
  "size_in_bytes" : 78714997,
  "children" : [
    {
      "description" : "term index [FST(input=BYTE1,output=ByteSequenceOutputs]",
      "size_in_bytes" : 78714837
    }
  ]
},
{
  "description" : "field 'ip_dst' [BlockTreeTerms(seg=_32b terms=255005,postings=519813549,positions=-1,docs=519813549)]",
  "size_in_bytes" : 68845,
  "children" : [
    {
      "description" : "term index [FST(input=BYTE1,output=ByteSequenceOutputs]",
      "size_in_bytes" : 68685
    }
  ]
},
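
For a rough sense of scale, the two entries above can be compared by dividing size_in_bytes by the number of terms. This is only a back-of-the-envelope sketch in Python: the numbers are copied from the statistics above, and the variable and function names are made up for illustration.

```python
# Figures copied from the segment statistics above (segment _32b).
stats = {
    "_id": {"terms": 531218720, "size_in_bytes": 78714997},
    "ip_dst": {"terms": 255005, "size_in_bytes": 68845},
}


def bytes_per_term(field):
    """Average in-memory bytes per unique term for the given field."""
    entry = stats[field]
    return entry["size_in_bytes"] / entry["terms"]


def size_in_mib(field):
    """Reported size of the field's terms structure in MiB."""
    return stats[field]["size_in_bytes"] / (1024 * 1024)


# How many times larger the _id entry is than the ip_dst entry.
size_ratio = stats["_id"]["size_in_bytes"] / stats["ip_dst"]["size_in_bytes"]

for field in stats:
    print(f"{field}: {size_in_mib(field):.2f} MiB, "
          f"{bytes_per_term(field):.3f} bytes per term")
print(f"_id / ip_dst size ratio: {size_ratio:.0f}")
```

The per-term cost of _id is small here; the total is large mainly because every document contributes its own unique term, while ip_dst has only 255005 distinct values.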

Christian_Dahlqvist · December 4, 2017, 9:13am

In Elasticsearch 6.0, every operation is assigned a sequence number, which can help speed up recovery. The logic for generating document ids is not affected by this (the ids are not sequential).

bCast · December 4, 2017, 4:15pm

Do you think memory consumption would be lower if _id were sequential? What is the reason _id values are not numeric?

Christian_Dahlqvist · December 4, 2017, 4:19pm

Trying to assign sequential numeric ids automatically generally does not scale or perform well in a distributed, highly concurrent system. If you want to, you can however assign your own ids at the application layer.
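
A minimal sketch of what assigning ids at the application layer could look like, assuming a single writer process and the newline-delimited format of the _bulk API. The index name `events-2017.12.03`, the type name `doc`, the sample events, and the helper `bulk_lines` are all hypothetical; only the field names come from the question. Whether sequential ids actually shrink the term index is something to measure on real data, not something this sketch shows.

```python
import itertools
import json


def bulk_lines(events, index, doc_type="doc", start=0):
    """Yield NDJSON lines for a _bulk request with application-assigned _id values.

    Assumes a single writer: the counter is local to this process, so several
    concurrent writers would need their own coordination to avoid collisions.
    """
    counter = itertools.count(start)
    for event in events:
        # Action line: tells Elasticsearch where to index the document and under which _id.
        action = {"index": {"_index": index, "_type": doc_type, "_id": str(next(counter))}}
        yield json.dumps(action)
        # Source line: the document itself.
        yield json.dumps(event)


# Hypothetical sample events using the field names from the question.
events = [
    {"ip_src": "10.0.0.1", "ip_dst": "10.0.0.2", "port_dst": 443},
    {"ip_src": "10.0.0.3", "ip_dst": "10.0.0.2", "port_dst": 80},
]

# The bulk body must end with a newline.
body = "\n".join(bulk_lines(events, "events-2017.12.03")) + "\n"
print(body, end="")
```

The resulting body can be sent to the _bulk endpoint with any HTTP client. The counter here is process-local, which is exactly the part that becomes hard to scale once many writers index concurrently.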

