I have an index ('jenkins_job_logs') that stores console logs from Jenkins jobs. The index fields include the actual console data plus a unique ID for the Jenkins job ('task_id'). There are 30k+ documents in the index, so I cannot extract more than 10k in one request
(and I am not going to modify the value of index.max_result_window).
So I am using scrolling instead - but am hitting an issue.
Here is the initial query
elk>cat testquery2.json
{
"_source": {
"includes" : [ "task_id" ],
"excludes" : [ "console_data", "testsuite", "os_name", "host_vm_version", "guest_vm_version" ]
},
"size": 5000,
"query": {
"match_all" : {}
}
}
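For anyone driving this from a script rather than a file on disk, the same request body can be built programmatically. A sketch in Python (the dict mirrors testquery2.json above exactly; nothing here is specific to my setup beyond the field names):

```python
import json

# Same body as testquery2.json, built as a Python dict (sketch).
query = {
    "_source": {
        "includes": ["task_id"],
        "excludes": ["console_data", "testsuite", "os_name",
                     "host_vm_version", "guest_vm_version"],
    },
    "size": 5000,                # page size per scroll batch
    "query": {"match_all": {}},
}

# Serialize for use as the request body (what -d @testquery2.json sends).
body = json.dumps(query)
print(body)
```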
When I run the command below I get 5000 hits back, plus a scroll ID (I have shortened the ID here, as it is ~3600 chars long):
elk>curl -XGET 'http://localhost:9200/jenkins_job_logs-2020.*/_search?scroll=1m&pretty' -H "Content-Type: application/json" -d @testquery2.json
{
"_scroll_id" : "DnF1ZXJ5VGhlbk..<3622 characters long>..ZldGNoVwAAAAAA=="
"took" : 17087,
"timed_out" : false,
"_shards" : {
"total" : 87,
"successful" : 87,
"skipped" : 0,
"failed" : 0
},
"hits" : {
"total" : 32314,
"max_score" : 1.0,
"hits" : [
{
"_index" : "jenkins_job_logs-2020.07.28",
"_type" : "doc",
"_id" : "1419",
"_score" : 1.0,
"_source" : {
"task_id" : 187873
}
},
{
"_index" : "jenkins_job_logs-2020.07.28",
"_type" : "doc",
"_id" : "273",
"_score" : 1.0,
"_source" : {
"task_id" : 186542
}
},
...
I then plug the scroll ID above into a new scroll request (as per the docs) and get the error below:
elk>curl -XPOST 'http://localhost:9200/_search/scroll?pretty' -H "Content-Type: application/json" -d '{"scroll" : "1m", "scroll_id" : "DnF1ZXJ5VGhlbk..<3622 characters long>..ZldGNoVwAAAAAA=="}'
{
"error" : {
"root_cause" : [
{
"type" : "search_context_missing_exception",
"reason" : "No search context found for id [1910847]"
},
{
"type" : "search_context_missing_exception",
"reason" : "No search context found for id [1910856]"
},
...
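For reference, the loop I am trying to implement is the standard scroll pattern: run the initial search, then keep feeding the most recent _scroll_id back in, with each call landing inside the keep-alive window. A minimal sketch of that control flow (initial_search and continue_scroll are hypothetical stand-ins for the two curl calls above; each returns the parsed JSON response):

```python
# Sketch of the scroll loop. `initial_search` and `continue_scroll` are
# hypothetical stand-ins for the two HTTP calls shown above; each returns
# the parsed JSON response body.
def scroll_all(initial_search, continue_scroll):
    resp = initial_search()          # GET .../_search?scroll=1m
    hits = []
    while resp["hits"]["hits"]:      # an empty page means we are done
        hits.extend(resp["hits"]["hits"])
        # Always pass back the *latest* scroll ID -- it can change between
        # pages, and each call must arrive before the keep-alive expires.
        resp = continue_scroll(resp["_scroll_id"])
    return hits
```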