Large Drops in IOPS on AWS Elasticsearch Service & 504 Transport Error

blahblahdev · October 4, 2017, 2:49pm

We have an AWS Elasticsearch Service cluster that we (are trying to) continually push data to. To me, it isn't a crazy amount of data, maybe 12M documents a day. We use the bulk endpoint to push through ~500 documents at a time.

We have periods where we are able to push about 1 million documents through in an hour. This runs successfully for some period of time, sometimes days. Then, it stops with nothing changing on our end. IOPS go way down, throughput goes way down, and we no longer can make any bulk requests to the cluster. AWS Elasticsearch Service does not allow a timeout value larger than 60s, and essentially all of our bulk pushes begin to receive a 504 transport error once this drop-off occur.

All of our metrics that we can see seem fine.

Stats about the cluster/index:
ES 5.1
~350 GB free
~60 GB used
5 r4.xlarge nodes
1000 Provisioned IOPS
index has 5 shards
pushing to the cluster using elasticsearch-py in AWS Lambda functions

Any help/guidance would be greatly appreciated. I made a post in the AWS forums, but no one has replied.
This is the link to it:
https://forums.aws.amazon.com/message.jspa?messageID=806068

Thanks

warkolm · October 6, 2017, 5:30am

I am not sure we can help given the restrictions that AWS provide on the service.

Do you have any monitoring in place that might show resource usage at the time this happens?

blahblahdev · October 9, 2017, 6:10pm

Hi warkolm,

We have not set up any other monitoring besides the default CloudWatch monitoring.

If there are any stats in particular that you are interested in, please let me know.

I think that it may be due to the Elastic load balancer that (likely) sits between the cluster and the requests. Unfortunately, we can't access that ELB or any stats about it, so I have no real way of knowing.

blahblahdev · October 19, 2017, 5:05pm

We ended up switching to Elastic Cloud since we couldn't figure out why this was happening, nor get a response from AWS.

So far, we haven't run into any issues at all. We're definitely very happy we made the switch

system · November 16, 2017, 5:05pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch bulk insert is too slow on AWS Elasticsearch Service Elasticsearch	6	1672	January 22, 2018
Elasticsearch on aws ec2 index very slow Elasticsearch	3	1864	July 5, 2017
Performance issues when pushing data using the elastic API Elasticsearch	5	406	December 2, 2020
Elasticsearch bulk insertion issue: 403 request throttled due to too many requests Elasticsearch	17	3102	April 16, 2021
Aws ES Attempted to send a bulk request to elasticsearch' but Elasticsearch appears to be unreachable or down! Elasticsearch	3	3758	January 4, 2019

Large Drops in IOPS on AWS Elasticsearch Service & 504 Transport Error

Related topics