Elasticsearch scroll

lonzodc · November 19, 2018, 1:29pm

Hi All,

I would like to ask the best approach to extract thousands of records in Elasticsearch? Also is it possible to perform 1000 concurrent request using scroll api to extract the data from the elasticsearch index?

Thank you,

Christian_Dahlqvist · November 19, 2018, 1:31pm

How large is your cluster? How much data are you looking to extract? How many indices and shards is this spread across?

lonzodc · November 19, 2018, 1:43pm

Hi Christian,

We are using AWS ES m4.large having 8GB memory and 300GB EBS. We are planning to extract 200 thousands of records. We also have 1000 plus indices having 5 shards each indices.

Thank you,

Christian_Dahlqvist · November 19, 2018, 1:44pm

How many nodes in the cluster? Just 1?

lonzodc · November 19, 2018, 1:46pm

We have 7 nodes in total. The details are as follows 3 master nodes and 4 datanodes

Thank you,

Christian_Dahlqvist · November 19, 2018, 1:50pm

The first thing I would like to point out is that you have far too many indices and shards for a cluster that size. This can be very inefficient. I recommend you read this blog post about shards as it provides some practical guidelines.

Given the relatively low amount of heap available I would recommend running a few school queries at a time so you can determine how much the cluster can handle. I would not be surprised if you are suffering from heap pressure given the number of shards in the cluster. Performing 1000 requests in parallel would likely make it fall over.

system · December 17, 2018, 2:00pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
What's the quickest way to extract a LARGE amount of records out of ES? Best practices for scroll API are welcome Elasticsearch	2	3163	July 5, 2017
How to get data more than 10000 in elasticsearch Elasticsearch	27	21618	January 17, 2018
How to create parallel cursors on my dataset in Elastic search Elasticsearch	1	734	August 15, 2018
Query Millions of records in Elasticsearch Elasticsearch	2	947	July 6, 2017
I can't retrieve all data from index Elasticsearch	15	3211	September 18, 2017

Elasticsearch scroll

Related topics