Why ES create duplicated field for single indexed and doc value field?

howardhuang · July 6, 2018, 1:11pm

Hi All,

Recently, I am debugging ES 6.3 indexing flow through source code. I found that ES will new two objects for handling keyword type field. One is Field object for revert index store, and another one is SortedNumericDocValueField for handling doc value.

My question is that why ES create two fields? In my opinion, we could just create one Field object, and set FieldType with both indexed and doc value type options. In lucene level, it only just check field's type has these two options then handle them separately.

I also try to modify KeywordFieldMapper.java and this is my pseudocode:

Field field = new Field()

FieldType ft = new FieldType();
ft.setDocValueType(SORT_SET);
ft.setIndexOptions(IndexOptions.DOCS);
ft.setStored(true);
ft.setDimensions(2, 16);
...

field.setFieldType(ft);

And I saw Lucene could store this field correctly with doc values and revert index. Any one please help me？

Thanks,
Howard

system · August 3, 2018, 1:11pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
What happens when the field of a document changes from "long" to "double"? Elasticsearch	1	492	February 22, 2017
Multiple fields with different values in a same document Elasticsearch	6	593	July 5, 2017
Single index for different sources Elasticsearch	5	1077	July 5, 2017
Different document types with shared field names Elasticsearch	2	351	July 6, 2017
If I have multiple doctypes for an index, how does it impact searching and indexing performance? Elasticsearch	2	924	July 6, 2017

Why ES create duplicated field for single indexed and doc value field?

Related topics