How to implement join-fields with Spring Data Elasticsearch 4.0.0

matt4 · March 18, 2021, 1:02pm

Hi,

I am using Spring Data Elasticsearch 4.0.0 and I would like to establish a parent-child-relationship between my entities. I found out that @JoinTypeRelation comes with version 4.1.X, but unfortunately, I am stuck with 4.0.0. The official documentation does not have any information about how to implement join-fields, but I hope there is still a way to do it.

In order to give some more detailed information: Since types are no longer supported in ES 7.6.2, I merged my two entities, parent and child, into a single class which holds either parent or child information, but never both.

@Document(indexName = "my_index")
public class ParentOrChild {

    @Id
    private String _id;

    @Field(type = FieldType.Keyword)
    private String someParentProperty;

    @Field(type = FieldType.Keyword)
    private String someChildProperty;

    // getters and setters
}

Now I would like to create a join-field so that entities that represent a child can reference another entitiy that represents a parent. My goal is to later find parent entities by searching for properties of their children like this:

GET my_index/_search
{
    "query": {
        "has_child" : {
            "type" : "_doc",
            "query" : {
                "fuzzy" : {
                    "someChildProperty" : "value"
                }
            }
        }
    }
}

I appreciate any hints you can give me.

dadoonet · March 19, 2021, 3:17am

Welcome!

I'm not going to comment on the spring side as I don't know how this is implemented in the next version and if there's a workaround.
The only workaround I can imagine is by providing manually the mapping and writing manually the queries.

That being said, before going further, are you sure that you must use a relationship model in elasticsearch? Is there anything that prevents you of denormalizing your data and avoid doing joins?

IMO joins should be used only when nothing else is possible. At least if you want the application to be the fastest as possible.

matt4 · March 19, 2021, 7:50am

Thank you for your answer,

I guess it would not be a problem to just duplicate the parent data in the child, since it already has the fields anyway. The only problem I see is when a parent gets updated. Then I would have to find all of the children and update them aswell.

That seems like a costly operation. Do you think it is still better than having joins?

EDIT: I talked to my colleagues and in our scenario updates to parents are very rare whereas the search for childen by attributes of the parent are very common. So denormalization seems like a very good idea. Thank you very much, David.

system · April 16, 2021, 7:51am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Parent child implementation using join fields Elasticsearch	1	439	March 27, 2018
Combining two data sources into one or making a parent and child relation with two different sources Elasticsearch	3	232	August 18, 2021
Parent-Child relationship using join Elasticsearch	1	587	December 12, 2017
Could someone help me ? --How to implement join datatype by Java API Elasticsearch	1	469	October 15, 2018
Parent/child join approach? Elasticsearch	5	1375	July 5, 2017

How to implement join-fields with Spring Data Elasticsearch 4.0.0

Related topics