I have resources that I need to index; they look like the excerpt
below. I index them into the index "managedobject" with type "host",
using ../managedobject/host
My question is: to get the best performance, should I keep the
document structure the same or change this representation?
Also, the document as posted is not legal JSON in practice, because key
names should not be repeated within an object. Elasticsearch will
actually index it just fine, but if you are retrieving the _source
documents from Elasticsearch, there is a good chance that any JSON
module you use will keep only the last value for a repeated key.
"base:Identifier":"tag:@domain.com,
2011:host:Host::Host-960a8b54ecb34475a3d9b508738cc1fa",
"base:Identifier":"Host-960a8b54ecb34475a3d9b508738cc1fa",
"base:Identifier":"Host-960a8b54ecb34475a3d9b508738cc1fa.lss.domain.com",
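The repeated-key behavior mentioned in the reply can be checked with
Python's standard json module. This is only a minimal sketch: the
fragment is cut down to two of the "base:Identifier" entries from the
post, and the helper name collect_duplicates is made up for
illustration. It shows the default parse keeping only the last value,
and one possible way to gather the repeated values into a single array
before indexing. Whether an array is the best representation for
search performance is not something this sketch settles.

```python
import json

# Cut-down fragment with the repeated "base:Identifier" key from the post.
raw = """{
  "base:Identifier": "Host-960a8b54ecb34475a3d9b508738cc1fa",
  "base:Identifier": "Host-960a8b54ecb34475a3d9b508738cc1fa.lss.domain.com"
}"""

# Default parsing: a later duplicate silently overwrites the earlier one.
last_only = json.loads(raw)


def collect_duplicates(pairs):
    """Hypothetical helper: merge repeated keys into a list of values.

    Assumes the original values are not lists themselves.
    """
    merged = {}
    for key, value in pairs:
        if key not in merged:
            merged[key] = value
        elif isinstance(merged[key], list):
            merged[key].append(value)
        else:
            merged[key] = [merged[key], value]
    return merged


# object_pairs_hook receives every key/value pair, duplicates included.
all_values = json.loads(raw, object_pairs_hook=collect_duplicates)

# Re-serialized form: one key whose value is a JSON array.
fixed = json.dumps(all_values, indent=2)
print(fixed)
```

A document in the re-serialized form has a single "base:Identifier"
key holding an array, so it survives a round trip through an ordinary
JSON parser without losing values.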