Can not query when inserting data

D_Tu · April 16, 2024, 4:52pm

I am a newbie to elastic and use spring boots to interact with elastic.
I have a scenario as below:

inserting a document like A
inserting a document like B ( and this action takes 5m to complete) and I also insert other documents like C and D after but quickly
then I performed searching but I got only A because the B was not completed
after 5m I queried again and got all A B C and D.
it seems that I can't query new data C and D when performing an action insert or update B
so, could you tell me why and how can I fix this case by config or something else?
Thanks.

dadoonet · April 16, 2024, 5:23pm

Welcome!

Most likely you need to call the _refresh API.

But please read the warning on that page:

Refreshes are resource-intensive. To ensure good cluster performance, we recommend waiting for Elasticsearch’s periodic refresh rather than performing an explicit refresh when possible.

If your application workflow indexes documents and then runs a search to retrieve the indexed document, we recommend using the index API's refresh=wait_for query parameter option. This option ensures the indexing operation waits for a periodic refresh before running the search.

D_Tu · April 16, 2024, 5:50pm

Thank you @dadoonet
But when I am inserting perhaps there is a transaction to lock the index, I called _refresh but it didn't work immediately and took a long time and it waited until the insert action finished then it worked

this is my insert code

  elasticsearchClient.index(i ->
                        i.index(elasticSearchConfig.fileIndex)
                                .id(finalInput.getId().toString())
                                .document(finalInput)

                );

query code


        SearchHits<FileDocumentContentDto> searchResponse = elasticsearchOperations.search(
                searchQuery,
                FileDocumentContentDto.class,
                IndexCoordinates.of(elasticSearchConfig.fileIndex));

Christian_Dahlqvist · April 16, 2024, 6:08pm

This sounds incredibly long. What is the size and configuration of your cluster? Where is it running?

What is the size of your document? What type of data does it contain?

D_Tu · April 16, 2024, 6:14pm

I parsed a PDF 180M to a text and then stored it in the index. is that ok?

I used a default configuration like that

#----------------------- BEGIN SECURITY AUTO CONFIGURATION -----------------------
#
# The following settings, TLS certificates, and keys have been automatically
# generated to configure Elasticsearch security features on 16-04-2024 14:12:48
#
# --------------------------------------------------------------------------------

# Enable security features
xpack.security.enabled: true

xpack.security.enrollment.enabled: true

# Enable encryption for HTTP API client connections, such as Kibana, Logstash, and Agents
xpack.security.http.ssl:
  enabled: true
  keystore.path: certs/http.p12

# Enable encryption and mutual authentication between cluster nodes
xpack.security.transport.ssl:
  enabled: true
  verification_mode: certificate
  keystore.path: certs/transport.p12
  truststore.path: certs/transport.p12
#----------------------- END SECURITY AUTO CONFIGURATION -------------------------

system · May 14, 2024, 6:15pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch[7.7] Does not update data or takes time to update the data Elasticsearch	8	523	March 13, 2024
Search api returns no documents Elasticsearch	4	113	May 10, 2024
Data is imported into ES in batches, and it can be queried after four minutes Elasticsearch	2	207	June 13, 2022
Newly inserted document not visible in searches Elasticsearch	3	743	March 28, 2021
When is data actually indexed in Elasticsearch? Elasticsearch	1	474	July 6, 2017

Can not query when inserting data

Related topics