Breaking Change: _id is no longer indexed

kimchy · April 19, 2011, 11:54pm

Heya,

In order to improve memory usage per shard, the _id field is no longer indexed by default. No capability is lost except the ability to run search queries against the _id field. For that, a new ids query / filter was introduced, which does not require the _id field to be indexed (and is actually faster). There is a setting to bring the indexed _id back if needed.

Issue: https://github.com/elasticsearch/elasticsearch/issues/868.
Ids Query/Filter: https://github.com/elasticsearch/elasticsearch/issues/865.
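
For anyone converting an existing search on _id, here is a minimal sketch of a request body for the new ids query. The helper name ids_query is hypothetical, and the body shape (an "ids" clause with a "values" list and an optional "type") is an assumption based on the Query DSL described in issue #865, so check it against the version you are running.

```python
import json


def ids_query(ids, doc_type=None):
    """Build a search body that fetches docs by id (hypothetical helper)."""
    # Ids are sent as strings; the optional type narrows the lookup.
    clause = {"values": [str(i) for i in ids]}
    if doc_type is not None:
        clause["type"] = doc_type
    return {"query": {"ids": clause}}


if __name__ == "__main__":
    # "tweet" is only an illustrative type name.
    print(json.dumps(ids_query([1, 4, 100], doc_type="tweet"), indent=2))
```

The resulting dict would be sent as the body of a normal search request; it replaces something like a term query on _id, which no longer matches when _id is not indexed.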

As a side note, the _type mapping can also be set to index : no, with almost all functionality retained (type filtering is a bit slower), and there is a new type filter. This does not, however, improve memory consumption by any noticeable measure.
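
A rough sketch of what the side note describes, under stated assumptions: the mapping fragment ("_type" with "index": "no") and the type filter body ("type" with a "value") follow the syntax I believe this release uses, and the helper names and the "tweet" type are hypothetical.

```python
import json


def type_not_indexed_mapping(doc_type):
    """Mapping fragment that turns off indexing of _type (assumed syntax)."""
    return {doc_type: {"_type": {"index": "no"}}}


def type_filter(doc_type):
    """Body of the new type filter (assumed syntax)."""
    return {"type": {"value": doc_type}}


def search_filtered_by_type(doc_type):
    """Wrap the type filter in a filtered query over all docs."""
    return {
        "query": {
            "filtered": {
                "query": {"match_all": {}},
                "filter": type_filter(doc_type),
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(type_not_indexed_mapping("tweet"), indent=2))
    print(json.dumps(search_filtered_by_type("tweet"), indent=2))
```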

-shay.banon

Sebastian_Gavarini · April 20, 2011, 2:08am

Hi Shay,

I have seen that Lucene's indices also contain an indexed field called "uid",
which concatenates "type#id" and takes a lot of space because the ids are
already unique. Is this still going to be created with this change?

Thanks,
Sebastian.

kimchy · April 20, 2011, 9:50am

Ids are not required to be unique across types, so the uid is needed to identify the doc. The uid will always be created; the _id will not.

Grant_Rodgers · April 23, 2011, 9:58pm

Does this remove the ability to order by _id?

kimchy · April 24, 2011, 6:35am

Yes.

kimchy · April 24, 2011, 6:48am

To add to that: there is a setting that keeps _id indexed when upgrading, and if you are starting fresh you can mark it as indexed. I also plan to add a special sort option for id that will be less memory heavy.
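
For the "starting fresh" case, a minimal sketch of a mapping that marks _id as indexed again. The "index": "not_analyzed" and "store": "yes" values are assumptions based on the mapping syntax of this era, the helper name and the "tweet" type are hypothetical, and the exact upgrade setting is described in issue #868 rather than reproduced here.

```python
import json


def indexed_id_mapping(doc_type, store=False):
    """Mapping fragment that indexes _id again (assumed syntax)."""
    # not_analyzed keeps the id as a single term so it can be searched and sorted on.
    id_field = {"index": "not_analyzed"}
    if store:
        id_field["store"] = "yes"
    return {doc_type: {"_id": id_field}}


if __name__ == "__main__":
    print(json.dumps(indexed_id_mapping("tweet"), indent=2))
```

With _id indexed this way, searching and sorting on _id should behave as before the change, at the cost of the extra memory the change was meant to save.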
