Is it possible to link two documents so that one can be found by searching on the other's fields?
For example:
I have a child document that is derived from a parent document. I want the child to be searchable by the parent's information, but I do not want to copy the parent's complete metadata into the child to achieve that. If there is a way to link documents, it would avoid storing redundant data.
I am talking about millions of documents here. Any information would be helpful.
There can be thousands of child documents for one parent, and replicating the same data across all the children takes a lot of space. I have millions of parent documents and millions of child documents.
You can do a virtual join yourself with a field.
Have the parent hold the data, and give each child doc a field, say "parent_id", that holds the "_id" of its parent.
That way you search the children, and for each child in the result you fetch the exact parent doc via its _id.
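
As a rough illustration of that virtual join, here is a minimal sketch in plain Python. It uses in-memory dicts and lists as stand-ins for an index, so none of it is a real search engine API; the sample documents, the field names other than "parent_id" and "_id", and the helper functions are all hypothetical. The last helper shows the reverse direction (finding children by a parent field), which is an assumption about how the same link could be used and costs a second lookup.

```python
# Hypothetical in-memory stand-ins for the indexed documents.
# Parent data is stored once; each child only carries "parent_id".
parents = {
    "p1": {"_id": "p1", "title": "Annual report", "author": "alice"},
    "p2": {"_id": "p2", "title": "Design spec", "author": "bob"},
}
children = [
    {"_id": "c1", "parent_id": "p1", "text": "revenue summary"},
    {"_id": "c2", "parent_id": "p1", "text": "staffing overview"},
    {"_id": "c3", "parent_id": "p2", "text": "revenue model"},
]


def search_children(term):
    """Step 1: search the child docs (here, a naive substring match)."""
    return [c for c in children if term in c["text"]]


def parent_of(child):
    """Step 2: fetch the exact parent by _id using the child's parent_id."""
    return parents[child["parent_id"]]


def children_by_parent_field(field, value):
    """Reverse direction (assumed): find matching parents first,
    then filter children by the collected parent ids."""
    parent_ids = {p["_id"] for p in parents.values() if p.get(field) == value}
    return [c for c in children if c["parent_id"] in parent_ids]


hits = search_children("revenue")
joined = [(c["_id"], parent_of(c)["title"]) for c in hits]
by_author = [c["_id"] for c in children_by_parent_field("author", "alice")]
```

The trade-off is that every joined search needs two lookups instead of one, in exchange for not duplicating the parent metadata across thousands of children.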