What is the layout of nested documents inside Lucene?

colings86 · September 7, 2016, 8:34am

As explained at the link, nested documents are indexed as separate Lucene documents that sit in the same segment, right next to their parent document. That adjacency is what lets them be identified as nested documents of that parent, so conceptually the index looks like this:

# First nested object
{ 
  "comments.name":    [ john, smith ],
  "comments.comment": [ article, great ],
  "comments.age":     [ 28 ],
  "comments.stars":   [ 4 ],
  "comments.date":    [ 2014-09-01 ]
}
# Second nested object
{ 
  "comments.name":    [ alice, white ],
  "comments.comment": [ like, more, please, this ],
  "comments.age":     [ 31 ],
  "comments.stars":   [ 5 ],
  "comments.date":    [ 2014-10-22 ]
}
# The root or parent document
{ 
  "title":            [ eggs, nest ],
  "body":             [ making, money, work, your ],
  "tags":             [ cash, shares ]
}

So in the index there are in fact 3 physical documents but only 1 logical document.
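
To make the "3 physical, 1 logical" point concrete, here is a rough Python sketch of the flattening. It is only an illustration of the conceptual layout above, not how Elasticsearch or Lucene actually implement it: `flatten_nested` is a hypothetical helper, the sample `article` is an assumed input reconstructed from the tokens shown above, and no text analysis (tokenizing, lowercasing) is performed.

```python
def flatten_nested(doc, nested_field):
    """Return the block of physical documents for one logical document.

    Nested objects come first and the root document comes last, mirroring
    the conceptual layout shown above. Hypothetical helper, not a real API.
    """
    block = []
    for obj in doc.get(nested_field, []):
        # Each nested object becomes its own document with prefixed field names.
        block.append({f"{nested_field}.{key}": value for key, value in obj.items()})
    # The root document keeps every field except the nested one.
    block.append({key: value for key, value in doc.items() if key != nested_field})
    return block


# Assumed sample input; the values are illustrative only.
article = {
    "title": "Nest eggs",
    "body": "Making your money work",
    "tags": ["cash", "shares"],
    "comments": [
        {"name": "John Smith", "comment": "Great article",
         "age": 28, "stars": 4, "date": "2014-09-01"},
        {"name": "Alice White", "comment": "More like this please",
         "age": 31, "stars": 5, "date": "2014-10-22"},
    ],
}

block = flatten_nested(article, "comments")
for physical_doc in block:
    print(physical_doc)
```

One logical document with two nested comments comes out as a block of three physical documents, with the root document as the last entry.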

If you still have questions, maybe you could rephrase your question to be more specific about what you want to understand?
