I am trying to understand the correct way of doing "Connection Pooling" with the Python Client 8.10 API.
In the old 7.x docs I found a little information about this, but it doesn't seem to exist in the 8.x docs (at least I couldn't find it).
My current thinking is that if I simply pass a list of connection nodes to the Elasticsearch constructor, it manages node selection and dead connections for me?
from elasticsearch import Elasticsearch

connection_nodes = [
    "https://elasticsearch-dev-1:500",
    "https://elasticsearch-dev-2:500",
    "https://elasticsearch-dev-3:500",
]

# Does this mean that if the "elasticsearch-dev-1" and "elasticsearch-dev-2"
# nodes are down for whatever reason, then "elasticsearch-dev-3" is selected
# from the pool?
es = Elasticsearch(hosts=connection_nodes)
You can read more about the different ways to connect to multiple nodes, with examples, in the docs here:
You can use dictionaries when referencing the hosts if you need per-node parameters, want to turn on sniffing, or use different authentication methods. The hosts argument must be a list of dictionaries or of host[:port] strings, which are translated to dictionaries automatically, as in these examples:
es = Elasticsearch(['localhost:443', 'other_host:443'])

es = Elasticsearch([
    {'host': 'localhost'},
    {'host': 'othernode', 'port': 443, 'url_prefix': 'es', 'use_ssl': True},
])
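Note that those dictionary examples come from the 7.x-style API; in 8.x the client generally takes full URLs plus keyword arguments instead. A minimal 8.x-style sketch, where the hostnames, password, and certificate path are placeholders you would replace with your own:

from elasticsearch import Elasticsearch

# 8.x style: nodes are given as full URLs, and sniffing is enabled
# via constructor flags rather than dictionary options.
es = Elasticsearch(
    hosts=[
        "https://elasticsearch-dev-1:9200",
        "https://elasticsearch-dev-2:9200",
        "https://elasticsearch-dev-3:9200",
    ],
    basic_auth=("elastic", "<password>"),  # or api_key=...
    ca_certs="/path/to/http_ca.crt",       # CA for the cluster's TLS cert
    sniff_on_start=True,                   # discover cluster nodes at startup
    sniff_on_node_failure=True,            # refresh the node list when a node fails
)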
The transport layer will create an instance of the selected connection class per node and keep track of the health of individual nodes: if a node becomes unresponsive (throws exceptions while connecting to it), it is put on a timeout by the ConnectionPool class and only returned to circulation after the timeout is over (or when no live nodes are left). By default, nodes are randomized before being passed into the pool, and a round-robin strategy is used for load balancing.
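In the 8.x client, that retry and dead-node timeout behavior can, to my knowledge, be tuned through constructor options that are passed down to the transport's node pool. A hedged sketch, where the node URLs and numeric values are illustrative only:

from elasticsearch import Elasticsearch

# A sketch of tuning retry and dead-node behavior (values are illustrative):
es = Elasticsearch(
    hosts=[
        "https://elasticsearch-dev-1:9200",
        "https://elasticsearch-dev-2:9200",
    ],
    retry_on_timeout=True,         # retry the request on another node after a timeout
    max_retries=3,                 # give up after this many retries
    dead_node_backoff_factor=1.0,  # base of the exponential "dead" timeout, in seconds
    max_dead_node_backoff=30.0,    # cap on how long a node stays benched, in seconds
)

With options like these, a failing elasticsearch-dev-1 would be benched for exponentially increasing timeouts while requests round-robin across the remaining live nodes, which is the behavior the question is asking about.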