I have a hierarchical data structure in which every element can link to n further elements. (It is similar to a tree, but an object can have multiple roots, and links only point downwards.) The data is spread across 4-5 hierarchical levels.
I want to store this data in Elasticsearch, query the dependencies there, and create visualizations in Kibana. What's the best way to store this data? If I denormalized it, it would take about 20 times the disk space, because there are many relations between the objects. Are joins at query time not possible?
Are there application-side ways to join the data in Kibana?
I would recommend fully denormalizing your data and indexing a good amount of it to see exactly how much extra space it takes up on disk, as this is the type of data Kibana works best with. Your estimate may be accurate, but the overhead could also turn out to be smaller.
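As a rough first check before indexing anything, the denormalization can be sketched in plain Python. This is only an illustration: the sample objects, the field names (`id`, `status`, `linked_id`, `linked_status`) and the one-document-per-pair layout are all assumptions, not something prescribed by Elasticsearch or Kibana.

```python
import json

# Hypothetical sample data: each object has a status and links to lower-level objects.
objects = {
    "obj1": {"status": "ok", "links": ["obj2", "obj3"]},
    "obj4": {"status": "ok", "links": ["obj3", "obj5"]},
    "obj2": {"status": "warn", "links": []},
    "obj3": {"status": "ok", "links": []},
    "obj5": {"status": "error", "links": []},
}


def descendants(obj_id, graph):
    """Return every object reachable from obj_id by following links downwards."""
    seen = []
    stack = list(graph[obj_id]["links"])
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.append(current)
            stack.extend(graph[current]["links"])
    return seen


def denormalize(graph):
    """Build one flat document per object, plus one per (object, reachable object) pair."""
    docs = []
    for obj_id, obj in graph.items():
        docs.append({"id": obj_id, "status": obj["status"]})
        for linked_id in descendants(obj_id, graph):
            docs.append({
                "id": obj_id,
                "status": obj["status"],
                "linked_id": linked_id,
                "linked_status": graph[linked_id]["status"],
            })
    return docs


def json_size(items):
    """Total length of the items when serialized as JSON (a crude size proxy)."""
    return sum(len(json.dumps(item)) for item in items)


docs = denormalize(objects)
normalized_size = json_size([{"id": key, **value} for key, value in objects.items()])
ratio = json_size(docs) / normalized_size
print(f"{len(docs)} documents, about {ratio:.1f}x the raw JSON size")
```

The JSON size ratio is only a crude proxy. The space actually used on disk depends on the mapping and on compression, so indexing a representative sample remains the real test.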
So my approach would be the following: (obj1: status, status2, status3) and (obj4: status, status5). Is this the right way? And how would you name the status fields of all connected elements?
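A minimal sketch of that layout, to make the question concrete. The sample data and the `status_<id>` naming are hypothetical, chosen only to mirror the (obj1: status, status2, status3) shape above.

```python
# Hypothetical sample data matching the two objects described above.
objects = {
    "obj1": {"status": "ok", "links": ["obj2", "obj3"]},
    "obj4": {"status": "ok", "links": ["obj5"]},
    "obj2": {"status": "warn", "links": []},
    "obj3": {"status": "ok", "links": []},
    "obj5": {"status": "error", "links": []},
}


def build_doc(obj_id, graph):
    """One document per object: its own status plus one status field per linked element."""
    doc = {"id": obj_id, "status": graph[obj_id]["status"]}
    for linked_id in graph[obj_id]["links"]:
        # Assumed naming scheme: "status_" followed by the linked object's id.
        doc[f"status_{linked_id}"] = graph[linked_id]["status"]
    return doc


doc_obj1 = build_doc("obj1", objects)
doc_obj4 = build_doc("obj4", objects)
print(doc_obj1)
print(doc_obj4)
```

With this scheme the number of distinct field names grows with the number of distinct linked objects, which is worth keeping in mind when there are many of them.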