How to call _refresh

jspooner · July 14, 2016, 1:50pm

I'm using the parent-child relationship in Elasticsearch, and I believe I need to call _refresh before bulk loading the child documents.

I currently have this code, which works fine with a small test dataset. When processing the real dataset, however, the child documents are not loaded. Is there a way to call the _refresh method via EsSpark?

EsSpark.saveToEsWithMeta(parentRDD, "myindex/parent")
EsSpark.saveToEsWithMeta(childRDD, "myindex/child")

james.baiera · July 19, 2016, 4:19am

Hello there!

You can index child documents without having any parents present. The only difference you'll see in that case is in search results, since the children won't have parents yet, but the parents can be added at any time afterward.

There is a configuration property that controls whether the _refresh endpoint is called during the job. It's on by default, which means the connector should already be calling that endpoint for you.
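For illustration, here is a minimal sketch of passing that setting explicitly. It assumes the property in question is `es.batch.write.refresh` (the es-hadoop setting that refreshes the target index once a bulk write finishes, `true` by default) and reuses `parentRDD` and `childRDD` from the question. The name `writeCfg` is just a placeholder.

```scala
import org.elasticsearch.spark.rdd.EsSpark

// Assumption: "es.batch.write.refresh" is the property meant above.
// It already defaults to "true", so this only makes the default explicit.
val writeCfg = Map("es.batch.write.refresh" -> "true")

// parentRDD and childRDD are the pair RDDs from the original question.
EsSpark.saveToEsWithMeta(parentRDD, "myindex/parent", writeCfg)
EsSpark.saveToEsWithMeta(childRDD, "myindex/child", writeCfg)
```

Since the default is already `true`, setting it this way is unlikely to change behavior. It mainly helps rule out a configuration elsewhere in the job that turned the refresh off.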

It's possible that something else is amiss?
