Parent-Child Relationship performance

sanders_2345 · April 3, 2022, 8:17am

We've one usecase to implement where we want to maintain a relationship between two types of documents. We tried to put in the same document but the document size exceeding 200kb. So we're exploring parent-child relation model, where 1parent will map to 1 child. Read through the documentation regarding the performance issue when using the joins, but want to check any room for more improvement here.

Es version - 7.12
Cluster - 250+ mn document parent & 250+ mn child
200+ shards, writes are not frequent, mostly read-only cluster.
After indexing, field data is size 2gb, single query with has_child is taking almost 2.5 - 5 secs.

_stats/fielddata?human&fields=join_field#parent

primaries: {
 fielddata:{
   memory_size: "2.1gb",
   memory_size_in_bytes: 2309747432
 }
}

Is there a way to improve these metrics?

system · May 1, 2022, 8:18am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Parent/Child query performance in version 1.1.2 Elasticsearch	7	450	July 6, 2017
Implementing a parent-child relationship and changing parent of a child document Elasticsearch	3	406	July 29, 2021
Has_parent query performance Elasticsearch	1	259	June 29, 2022
Performance penalty for has_child queries Elasticsearch	15	1898	July 6, 2017
Parent-child v/s multiple indexes - Elasticsearch 6.0+ Elasticsearch	1	2753	June 11, 2018

Parent-Child Relationship performance

Related topics