Python API for Elastic Search - Getting 10000 in response every time

jayanth5 · December 27, 2019, 3:54pm

client = Elasticsearch([host1, host2], http_auth=(user, password), scheme="http", port=port)
response = client.search(index="complats*", body={"from": 0, "size": 10000, "query": {
            "bool": {
                "must": [
                    {
                        "query_string": {
                            "query": "tags:\"prod\" AND severity:\"INFO\" AND service:\"abc-service\" AND msg:\"* is processed\"",
                            "fields": [],
                            "type": "best_fields",
                            "default_operator": "or",
                            "max_determinized_states": 10000,
                            "enable_position_increments": "true",
                            "fuzziness": "AUTO",
                            "fuzzy_prefix_length": 0,
                            "fuzzy_max_expansions": 50,
                            "phrase_slop": 0,
                            "escape": "false",
                            "auto_generate_synonyms_phrase_query": "true",
                            "fuzzy_transpositions": "true",
                            "boost": 1.0
                        }
                    },
                    {
                        "range": {
                            "@timestamp": {
                                "from": "now-{}s".format((now.minute + 1) * 60),
                                "to": "now",
                                "include_lower": "true",
                                "include_upper": "true",
                                "boost": 1.0
                            }
                        }
                    }
                ],
                "adjust_pure_negative": "true",
                "boost": 1.0
            }
        }})
value = response['hits']['total']['value']
print(value)

The above query is successfully connecting to elasticsearch but returning an incorrect value or 10000 every time. What could be wrong here? I've read somewhere that elasticsearch module in python has a bug where it maxes out at 10000. Changing "max_determinized_states": 10000 isn't helping and it is still returning the same value. Anyone else faced this problem? If yes how did u resolve it? Thanks in advance!

pjanzen · December 27, 2019, 5:12pm

If you have more results then the search size (you have set it to 10000) you need to scroll over the result set.

Here is an example.

system · January 24, 2020, 5:12pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch search query returning "empty" response using elasticsearch python API Elasticsearch	9	46	September 12, 2024
Need help with scan/scroll using elasticsearch-py client Elasticsearch	2	11291	April 11, 2017
Elasticsearch api returning empty response (python) Elasticsearch	4	722	November 9, 2023
Python3 Elastic Query returns large amount of results.. am i doing this right? Elasticsearch	1	344	December 29, 2020
HighLevelRest java client : Search query returning zero hits despite the equivalent HTTP curl call returning results (hits> 0) Elasticsearch	2	906	May 18, 2018

Python API for Elastic Search - Getting 10000 in response every time

Related topics