Datastructure relational data

michael_9 · May 27, 2020, 2:18pm

I have a hierarchical data structure in which every element can link to n further elements. (It is similar to a tree, but an object can have multiple roots, and links only point downwards.) The data is spread over 4-5 hierarchical levels.

I want to store this data in Elasticsearch, query the dependencies there, and create visualizations in Kibana. What's the best way to store this data? If I denormalized the data, it would take 20x the disk space, as there are many relations between the objects. Are joins in the query not possible?

Are there application side ways to join the data in Kibana?

Christian_Dahlqvist · May 28, 2020, 5:21am

No, joins are not possible.

Not that I am aware of. Kibana works best with flattened data.

Is this an estimate or the result of a test?

michael_9 · May 28, 2020, 6:12am

Is this an estimate or the result of a test?

The disk space is an estimation.

Which data structure would you recommend?

Christian_Dahlqvist · May 28, 2020, 6:34am

I would recommend trying to fully denormalize your data and index a good amount of it to see exactly how much extra space it takes up on disk, as this is the type of data Kibana works best with. It could be that your estimate is accurate, but it could also be less.
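As a rough illustration of that test, here is a minimal Python sketch that flattens a small linked hierarchy into one document per (element, related element) pair and compares the raw JSON size with the normalized form. The sample data, the field names (`element_id`, `related_id`, and so on) and the helper functions are all hypothetical. JSON byte counts are only a crude proxy, since on-disk size in Elasticsearch also depends on mappings and compression, so indexing a real sample is still the way to get a reliable number.

```python
import json

# Hypothetical sample data: element id -> its own fields.
elements = {
    "a": {"status": "ok"},
    "b": {"status": "ok"},
    "c": {"status": "failed"},
    "d": {"status": "ok"},
}
# Downward links only: element id -> ids of the elements it links to.
links = {"a": ["c"], "b": ["c"], "c": ["d"]}


def descendants(start, links):
    """Return every element reachable from `start` by following links downwards."""
    seen = set()
    stack = [start]
    while stack:
        node = stack.pop()
        for child in links.get(node, ()):
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


def flatten(elements, links):
    """Build one flat document per element and per (element, descendant) pair."""
    rows = []
    for elem_id, fields in elements.items():
        # Include the element itself so that leaves are not dropped.
        for rel_id in [elem_id] + sorted(descendants(elem_id, links)):
            rows.append({
                "element_id": elem_id,
                "element_status": fields["status"],
                "related_id": rel_id,
                "related_status": elements[rel_id]["status"],
            })
    return rows


def json_bytes(docs):
    """Total size of the documents when serialized as JSON (a crude size proxy)."""
    return sum(len(json.dumps(d).encode("utf-8")) for d in docs)


normalized = [{"id": k, **v, "links": links.get(k, [])} for k, v in elements.items()]
flat = flatten(elements, links)
expansion = json_bytes(flat) / json_bytes(normalized)

print(f"{len(normalized)} normalized docs -> {len(flat)} flat docs")
print(f"raw JSON expansion factor: {expansion:.1f}x")
```

Running the same comparison on a representative slice of the real data gives a first sanity check on the 20x estimate before any indexing is done.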

michael_9 · May 28, 2020, 7:18am

Thanks a lot, I'll try that and tell you the results.
You think I should fully denormalize the data? Why not use nested objects?

Christian_Dahlqvist · May 28, 2020, 7:33am

Kibana does not support nested documents well and having very large deeply nested structures can be quite inefficient.

michael_9 · May 28, 2020, 2:42pm

Ok, so how should I denormalize this data structure? For example, I want to query the status of all elements related to obj1:

( obj1: status)  -----
                       (obj3: status)
(obj2: status) -------


(obj4: status) -------- (obj5: status)

So my approach would be the following: (obj1: status, status2, status3) and (obj4: status, status5). Is this the right way? And how would you name the status fields of all connected elements?

michael_9 · May 30, 2020, 4:28am

I'm wondering what's the best way to name the fields. It isn't possible to have the same name, but if you have different ones, it's hard to search.
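One possible way around the naming problem, which was not proposed in this thread and is only a sketch, is to index one flat document per relation instead of one wide document per root. Every document then uses the same field names, so a single filter finds all elements related to obj1. The field names (`root_id`, `related_status`, and so on) and the status values below are made up; the ids and links follow the diagram above. Only direct links are handled here, and the term query assumes `root_id` is mapped as a `keyword` field.

```python
# Status values are placeholders; ids and links follow the diagram above.
status = {
    "obj1": "green",
    "obj2": "red",
    "obj3": "yellow",
    "obj4": "green",
    "obj5": "red",
}
links = {"obj1": ["obj3"], "obj2": ["obj3"], "obj4": ["obj5"]}


def related_docs(status, links):
    """Build one flat document per (root, directly linked element) pair."""
    docs = []
    for root_id, children in links.items():
        for related_id in children:
            docs.append({
                "root_id": root_id,
                "root_status": status[root_id],
                "related_id": related_id,
                "related_status": status[related_id],
            })
    return docs


def search_local(docs, field, value):
    """Local stand-in for an exact-match filter on one field."""
    return [d for d in docs if d[field] == value]


docs = related_docs(status, links)

# Query body for the same filter in Elasticsearch
# (assumption: root_id is mapped as a keyword field).
query = {"query": {"term": {"root_id": "obj1"}}}

hits = search_local(docs, "root_id", "obj1")
for hit in hits:
    print(hit["related_id"], hit["related_status"])
```

Because obj3 appears once under obj1 and once under obj2, its status is duplicated, which is the disk-space cost of denormalizing discussed earlier in the thread.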

