Rally dump existing cluster data while 429 occur

JohnPhilip · June 9, 2021, 7:29am

when I try to create a track from data in an existing cluster, I found that error occurred like following

Extracting documents for index [compress_ratio_ac...     1650000/50000001 docs [3.3% done][ERROR] Cannot create-track. TransportError(429, 'circuit_breaking_exception', '[parent] Data too large, data for [<http_request>] would be [1702314100/1.5gb], which is larger than the limit of [1690094796/1.5gb], real usage: [1702313936/1.5gb], new bytes reserved: [164/164b], usages [request=0/0b, fielddata=0/0b, in_flight_requests=154112090/146.9mb, accounting=0/0b]').

how can I solve this problem if I don't set ES config indices.fielddata.cache.size, rally can do something to avoid this situation?

danielmitterdorfer · June 9, 2021, 8:58am

Hi,

extracting all data from a cluster puts a heavier burden than usual operation on the cluster. Looking at the circuit breaker exception I guess that you limit heap size to ~ 2GB. I suggest you temporarily allocate more heap memory to Elasticsearch when extracting data.

Daniel

JohnPhilip · June 9, 2021, 9:29am

thanks Daniel,
maybe the cluster has other pressure of writing data when I extracting data from it, which cause the JVM memory to be insufficient, i'll try later, but there is no way to solve it by configuring rally right? perhaps only configure ES can avoid this if i have limited JVM memory

Dzp

danielmitterdorfer · June 9, 2021, 11:30am

Hi,

no, it's unfortunately not configurable. Internally, Rally uses the Python client's scan helper which fetches documents in batches of size 1000.

Daniel

JohnPhilip · June 9, 2021, 1:06pm

thanks again Daniel,

it's so helpful to me, glad to talk with you

system · July 7, 2021, 1:06pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
CircuitBreaker: [parent] Data too large, data for [<transport_request>] Elasticsearch	2	1612	September 5, 2019
Best practices: TransportError(429, 'circuit_breaking_exception', '[parent] Data too large, data for [<transport_request>] Elasticsearch	1	1769	April 15, 2020
Error code 429 - circuit_breaking_exception Elasticsearch	10	6794	November 8, 2019
Data too large, data for [<transport_request>] Elasticsearch es-hadoop	17	2196	January 25, 2021
Data too large for response [parent] Elasticsearch	2	281	November 28, 2023

Rally dump existing cluster data while 429 occur

Related topics