Get all of the documents for user in elastic enterprise search

CTO_Servefast · June 19, 2024, 7:49am

I would like to get all of the documents for a user in Elastic Enterprise Search using the @elastic/enterprise-search npm package.

The only way I have had some success is calling client.app.search() with an empty string as the query, but it seems unreliable and I am looking for another way to do it.

The Elastic version currently in use is 8.12.1. I can upgrade if needed.

Thanks in advance

nfeekery · June 19, 2024, 9:28am

Hi @CTO_Servefast

The only way I have had some success is calling client.app.search() with an empty string as the query, but it seems unreliable

Can you expand on what makes it seem unreliable? The App Search search query should return all docs ordered by ID by default.

If docs are being added, deleted, updated, etc. during the search operation and the search result is paginated, there could be some inconsistencies in the results. You can resolve that by using the point in time API, but to do this you'd need to use the Elasticsearch JS client to send the requests to Elasticsearch instead of Enterprise Search.
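A minimal sketch of the point in time approach, not a definitive implementation. It assumes `client` is an @elastic/elasticsearch v8 Client instance, that `index` is the engine's underlying index, and that documents carry a `user_id` field; the function name `fetchAllUserDocs` is hypothetical.

```javascript
// Sketch: page through every document for one user using a point in time (PIT),
// so that concurrent writes do not change the result set between pages.
// Assumptions: `client` is an @elastic/elasticsearch v8 Client, and the
// documents have a `user_id` field.
async function fetchAllUserDocs(client, index, userId, pageSize = 1000) {
  // Open a PIT so every page is read from the same snapshot of the index.
  const pit = await client.openPointInTime({ index, keep_alive: '1m' });
  let pitId = pit.id;
  const docs = [];
  let searchAfter;
  try {
    while (true) {
      const res = await client.search({
        pit: { id: pitId, keep_alive: '1m' },
        size: pageSize,
        query: { match: { user_id: userId } },
        // _shard_doc gives a stable order for search_after paging within a PIT.
        sort: [{ _shard_doc: 'asc' }],
        ...(searchAfter ? { search_after: searchAfter } : {}),
      });
      // Elasticsearch may return an updated PIT id; use the latest one.
      pitId = res.pit_id ?? pitId;
      const hits = res.hits.hits;
      if (hits.length === 0) break;
      docs.push(...hits.map((hit) => hit._source));
      // Continue after the sort values of the last hit on this page.
      searchAfter = hits[hits.length - 1].sort;
    }
  } finally {
    // Always release the PIT, even if a search call fails.
    await client.closePointInTime({ id: pitId });
  }
  return docs;
}
```

The client is passed in rather than constructed inside the function, so the same code works with however the Elasticsearch connection is already configured.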

CTO_Servefast · June 19, 2024, 10:07am

What I was actually wondering is whether there is an easier way to find all documents by a property. Elasticsearch has deleteByQuery; can we use that in conjunction with Elastic Enterprise Search?

nfeekery · June 19, 2024, 10:40am

Anything that is officially supported in the HTTP Search API will work in the body field for client.app.search(). If there's nothing there that can be used to find the specific docs for a user based on your setup, then perhaps the Elasticsearch client will be more suited.
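As a rough sketch of what that could look like, the App Search Search API accepts `filters` and `page` in the request body, so documents can be narrowed to one user and paged through. This assumes `client` is an @elastic/enterprise-search Client, that `user_id` is a field in the engine schema, and that a page size of 100 is acceptable; the function name `fetchUserDocsViaAppSearch` is hypothetical.

```javascript
// Sketch: collect all documents for one user through client.app.search().
// Assumptions: `client` is an @elastic/enterprise-search Client and the
// engine schema contains a `user_id` field that can be used as a filter.
async function fetchUserDocsViaAppSearch(client, engineName, userId, pageSize = 100) {
  const results = [];
  let current = 1;
  let totalPages = 1;
  do {
    const res = await client.app.search({
      engine_name: engineName,
      body: {
        query: '', // empty query matches every document
        filters: { user_id: userId }, // restrict to this user's documents
        page: { current, size: pageSize },
      },
    });
    results.push(...res.results);
    // meta.page reports how many pages exist for this query.
    totalPages = res.meta.page.total_pages;
    current += 1;
  } while (current <= totalPages);
  return results;
}
```

App Search limits how deep pagination can go, so for very large per-user result sets the Elasticsearch client may still be the better fit.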

But first, going back to the original question:

I would like to get all of the documents for a user in elastic enterprise search

How are the documents differentiated per user? Is this who uploaded the doc, or is there a user id associated to the doc, or something else?

CTO_Servefast · June 19, 2024, 10:56am

@nfeekery there is userId defined in the document.

Regarding the Elasticsearch client, I am having issues using deleteByQuery because I can't find the index that my engine uses, or maybe I am doing something wrong.

nfeekery · June 19, 2024, 12:44pm

The index of the engine should be .ent-search-engine-documents-<engine_name>. Is that the index you're trying to use deleteByQuery against?

EDIT: if the engines are considerably older they may be .app-search-<engine_name>

nfeekery · June 19, 2024, 12:48pm

Also, you can use the search explain API which should show you the exact queries Enterprise Search will run against Elasticsearch. This will also show the index name for you.
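A hedged sketch of calling the search explain endpoint over plain HTTP. It assumes Node 18+ (global `fetch`), that `baseUrl` is the Enterprise Search base URL, and that `apiKey` is a private API key; the function name `explainSearch` and the injectable `fetchImpl` parameter are hypothetical conveniences.

```javascript
// Sketch: ask App Search to explain the Elasticsearch query it would run.
// Assumptions: `baseUrl` is the Enterprise Search base URL, `apiKey` is a
// private API key, and `fetchImpl` defaults to the global fetch (Node 18+).
async function explainSearch(baseUrl, engineName, apiKey, fetchImpl = fetch) {
  const url = `${baseUrl}/api/as/v0/engines/${encodeURIComponent(engineName)}/search_explain`;
  const res = await fetchImpl(url, {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      Authorization: `Bearer ${apiKey}`,
    },
    // Same body shape as a normal search request; an empty query matches all docs.
    body: JSON.stringify({ query: '' }),
  });
  if (!res.ok) {
    throw new Error(`search_explain failed with HTTP ${res.status}`);
  }
  // The parsed response is expected to describe the Elasticsearch request,
  // including the index it targets.
  return res.json();
}
```

Logging the returned object once should be enough to read off the index name for the engine.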

CTO_Servefast · June 19, 2024, 3:47pm

This is perfect, thank you. I will try it. It would also be great if we could get the index name for the engine.

CTO_Servefast · June 20, 2024, 10:34am

@nfeekery I have used it like this and it works really well. I have one follow-up question, though it might belong in a separate thread.

If we are deleting thousands or even tens of thousands of documents, can the request time out? How would we know, and what should we do in that case? Should we split the query into smaller batches somehow?

Sample code:

    const index = `.ent-search-engine-documents-${engineName}`;
    const deleteResponse = await elasticSearchClient.deleteByQuery({
      index: index,
      query: {
        match: {
          user_id: userId,
        },
      },
    });

Best Regards

nfeekery · June 21, 2024, 7:53am

@CTO_Servefast glad to hear that it's working!

If you're concerned about timeouts for large payloads, you can run deleteByQuery asynchronously by using the flag wait_for_completion=false. The API will then return a task id that you can use to check the status. Here's the documentation for that.
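A minimal sketch of that asynchronous flow, assuming `client` is an @elastic/elasticsearch v8 Client and the documents have a `user_id` field as in the sample code earlier in the thread. The function name `deleteUserDocsAsync` and the polling interval are hypothetical choices.

```javascript
// Sketch: start deleteByQuery as a background task, then poll the Tasks API.
// Assumptions: `client` is an @elastic/elasticsearch v8 Client and documents
// have a `user_id` field.
async function deleteUserDocsAsync(client, index, userId, pollMs = 2000) {
  // With wait_for_completion=false the call returns immediately with a task id.
  const { task } = await client.deleteByQuery({
    index,
    query: { match: { user_id: userId } },
    wait_for_completion: false,
  });
  while (true) {
    const status = await client.tasks.get({ task_id: task });
    if (status.completed) {
      if (status.error) {
        throw new Error(`deleteByQuery task ${task} failed: ${JSON.stringify(status.error)}`);
      }
      // `response` holds the usual deleteByQuery summary once the task is done.
      return status.response;
    }
    // Wait before polling again so the cluster is not flooded with status calls.
    await new Promise((resolve) => setTimeout(resolve, pollMs));
  }
}
```

Because the HTTP request returns as soon as the task is created, the client-side request timeout no longer depends on how many documents are deleted.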

system · July 19, 2024, 7:53am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
