How to get results over 10K in App search

pradeepjanga · January 17, 2024, 9:42pm

Hi,

We have 27K documents ingested into our engine, and we are trying to make a REST API call that gathers the distinct values of one field across all the documents, which would be 27K IDs. How can we get that array of IDs in a response? Currently I can get only 10K results. How do I get the other 17K IDs?

When I use current 11 and size 1000, it returns empty results. How can this be achieved in App Search?

{
    "query": "",
    "facets": {
        "u_number": [
            {
                "type": "value",
                "name": "u_number",
                "sort": {
                    "count": "desc"
                },
                "size": 1000
            }
        ]
    },
    "page": {
        "size": 1000,
        "current": 10
    },
    "result_fields": {
        "id": {
            "raw": {}
        }
    }
}

Thanks
Pradeep

Kathleen_DeRusso · January 17, 2024, 9:57pm

Hi @pradeepjanga thanks for your question.

Unfortunately 10,000 is a hard limit in App Search.

You may be able to get what you want using the Elasticsearch API. Note that by default you'll still run into a max result window of 10,000 but this can be configured in Elasticsearch using index.max_result_window. As noted in the documentation, scroll may be more efficient to get large document sets.

pradeepjanga · January 17, 2024, 10:54pm

@Kathleen_DeRusso But if I want to compare the data between two engines to see whether specific fields exist in one engine and not in the other, I need to view all the documents, right? Are you saying there is no way to view all the documents in the engine via a REST API call? In my scenario, I just want to view them for analytics purposes, so performance doesn't matter to me.

pradeepjanga · January 17, 2024, 11:14pm

All I want to see is the array of values of the field u_number across the whole engine. Is that possible?

 "facets": {
    "u_number": [
      {
        "type": "value",
        "name": "u_number",
        "sort": { "count": "desc" }
      }
    ]
  },

Kathleen_DeRusso · January 18, 2024, 10:08pm

You can't do that without performing multiple API calls and evaluating the responses.

pradeepjanga · January 18, 2024, 10:22pm

I tried multi search, but how can we tell it not to search in results that were already returned?

Could you provide an example where I can get all the results, to display the IDs in the whole engine?

{"queries": [
    {
        "query": "",
        "page": {
            "size": 1000,
            "current": 10
        },
        "result_fields": {
            "id": {
                "raw": {}
            }
        }
    },
    {
        "query": "",
        "page": {
            "size": 1000,
            "current": 10
        },
        "result_fields": {
            "id": {
                "raw": {}
            }
        }
    }
]
}

pradeepjanga · January 18, 2024, 10:22pm

The above query still returns only the first 10K. It doesn't return anything over 10K.
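
Both queries in the multi search body above ask for the same page (current: 10), so they return the same documents. The following is a minimal sketch of building one request body per distinct page instead. It assumes the 10,000-result hard limit mentioned earlier in this thread, and `page_requests` is a hypothetical helper, not part of any App Search client. Paging this way still stops at 10,000 results; it only avoids fetching the same page twice.

```python
import math

# Hard limit on reachable results, as stated earlier in this thread.
APP_SEARCH_RESULT_CAP = 10_000


def page_requests(total_docs, page_size=1000, cap=APP_SEARCH_RESULT_CAP):
    """Build one App Search request body per reachable page (hypothetical helper)."""
    reachable = min(total_docs, cap)
    pages = math.ceil(reachable / page_size)
    return [
        {
            "query": "",
            "page": {"size": page_size, "current": current},
            "result_fields": {"id": {"raw": {}}},
        }
        for current in range(1, pages + 1)
    ]


bodies = page_requests(27_000)
print(len(bodies))                      # 10 pages of 1000 results each
print(bodies[-1]["page"]["current"])    # 10, the last page under the cap
```

With 27,000 documents this produces pages 1 through 10 only, which matches the behavior described above: page 11 falls beyond the cap.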

pradeepjanga · January 19, 2024, 4:54pm

@Kathleen_DeRusso
Is there any other way? Please let me know. This is to compare something outside of the code. We are only making some REST API calls, so there is no need for me to worry about performance.

Kathleen_DeRusso · January 22, 2024, 1:15pm

The only way you can do this is to:

1. Increase your index.max_result_window value to a value above the 27K documents you are looking for.
2. Get the raw Elasticsearch query that App Search generates via the explain API.
3. Run the Elasticsearch query with an updated from/size using the Elasticsearch search API.

This is not a use case that App Search was optimized for, so expect very slow performance and I can't guarantee there won't be timeouts or errors doing this based on your data. If you run into errors the only other option is to use the scroll/ES queries I noted above.
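
As a rough illustration of the third step, the sketch below takes an Elasticsearch query body and produces one copy per page with from/size filled in. It is only a sketch under stated assumptions: `paged_bodies` is a hypothetical helper, and the `match_all` query is a placeholder for whatever query the explain API returns for your engine. Each body still has to be sent to the Elasticsearch search API, and from + size must stay within index.max_result_window.

```python
import copy


def paged_bodies(es_query, total_docs, page_size=1000):
    """Yield copies of es_query with from/size set to cover total_docs hits.

    Hypothetical helper: it only builds request bodies, it does not send them.
    """
    for offset in range(0, total_docs, page_size):
        body = copy.deepcopy(es_query)
        body["from"] = offset
        body["size"] = min(page_size, total_docs - offset)
        yield body


# Placeholder query; in practice, use the query returned by the explain API.
es_query = {"query": {"match_all": {}}}

bodies = list(paged_bodies(es_query, 27_000))
print(len(bodies))                                   # 27 requests
print(bodies[-1]["from"] + bodies[-1]["size"])       # 27000
```

The last request reaches from + size = 27,000, which is why the window has to be raised above the document count before this works.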

Bikash_Hutait · January 30, 2024, 6:31pm

Elasticsearch, which App Search is built on, limits the number of results a single query can return in order to prevent excessive resource usage. By default, the index.max_result_window setting caps from + size at 10,000.

To work around this limitation and retrieve more than 10,000 results, you can consider the following options:

Pagination:
- Use the size and from parameters to paginate through the results.
- For example, to retrieve results 10,001 to 20,000, set size to 10,000 and from to 10,000. This only works once index.max_result_window is at least 20,000, because from + size must not exceed that setting. The match_all query below stands in for your own query.

GET /your_index/_search
{
  "size": 10000,
  "from": 10000,
  "query": {
    "match_all": {}
  }
}

- Keep in mind that deep pagination can be resource-intensive, and performance may degrade as you go deeper.
Scroll API:
- Use the Scroll API to retrieve large result sets.
- The Scroll API keeps a "search context" open so that you can continue retrieving results until all documents are processed.
- This is more efficient than using from for deep pagination. The match_all query below stands in for your own query.

POST /your_index/_search?scroll=5m
{
  "size": 1000,
  "query": {
    "match_all": {}
  }
}

- The response to the initial request includes a scroll ID (_scroll_id). Use this ID to retrieve the next set of results, and repeat until a response contains no hits.

POST /_search/scroll
{
  "scroll": "5m",
  "scroll_id": "your_scroll_id"
}
Increase index.max_result_window:
- The index.max_result_window setting controls the maximum value of from + size for searches against an index.
- Be cautious with this approach, as setting it too high might lead to increased memory usage.

PUT /your_index/_settings
{
  "index.max_result_window": 20000
}

- After adjusting this setting, you can use the regular query with a larger size parameter, as long as from + size stays within the new limit.

Remember to consider the performance implications of your chosen method, and choose the approach that best fits your use case and infrastructure.
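
For the Scroll API option, the request loop could look roughly like the sketch below. It makes several assumptions: `scroll_all_ids` is a hypothetical helper, `post(path, body)` is a function you supply that sends a POST request to your Elasticsearch cluster and returns the decoded JSON response, and the Elasticsearch `_id` of each hit is the value you want. If you need a document field instead, read it from each hit's `_source`.

```python
def scroll_all_ids(post, index, page_size=1000, keep_alive="5m"):
    """Collect the _id of every document in `index` using the Scroll API.

    `post(path, body)` is caller-supplied: it must send a POST request to
    Elasticsearch and return the parsed JSON response as a dict.
    """
    # Initial search opens the scroll context and returns the first batch.
    response = post(
        f"/{index}/_search?scroll={keep_alive}",
        {"size": page_size, "query": {"match_all": {}}, "_source": False},
    )
    ids = []
    # Keep requesting batches until a response comes back with no hits.
    while response["hits"]["hits"]:
        ids.extend(hit["_id"] for hit in response["hits"]["hits"])
        response = post(
            "/_search/scroll",
            {"scroll": keep_alive, "scroll_id": response["_scroll_id"]},
        )
    return ids
```

Passing the transport in as `post` keeps the sketch independent of any particular HTTP or Elasticsearch client library. Once the IDs from two engines are collected this way, comparing them is a plain set difference, for example `set(ids_a) - set(ids_b)`.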

oppocreaty · January 30, 2024, 8:28pm

The above query still returns only the first 10K.

system · February 27, 2024, 8:28pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
