ES indexing takes a huge amount of time

v.kumar · September 1, 2016, 8:18am

Hi All,

We are using ES 2.1. I am not sure whether this is related to ES or not. The first part of my job builds a huge list, like this:

for (Object object : objects) {
    Result result = compute(object);
    list.add(result);
}

The list is then iterated again: for each item I fetch its details and then index it. Right now, fetching the details for each result means making a service call and waiting for the response, and only then does indexing start. So indexing takes a lot of time overall.

My question is: if I run this in parallel with multiple threads, would that be a good solution?
Would it affect Elasticsearch indexing time? I know this is more of a Java question than an ES one. Looking forward to your inputs.

magnusbaeck · September 1, 2016, 8:30am

Start by switching to using the bulk API. Do you have full understanding of what's taking time? Is it the indexing requests or obtaining the information to index? Could the latter be done more efficiently, e.g. by not looking up items one by one?
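If the detail service can only return one item per call, one possible way to shorten the lookup step is to issue the calls concurrently from a small, bounded thread pool. The following is only a minimal sketch under that assumption: `ParallelDetailFetcher`, `fetchAll`, and the `lookup` function are hypothetical names standing in for the real service call, not anything from the original code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Function;

// Hypothetical helper; none of these names come from the original post.
class ParallelDetailFetcher {

    // Runs `lookup` (a stand-in for the remote service call) for every item,
    // using at most `threads` concurrent calls. Results keep the input order.
    static <T, R> List<R> fetchAll(List<T> items, Function<T, R> lookup, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            // Submit all lookups; the fixed pool size caps how many run at once.
            List<CompletableFuture<R>> futures = new ArrayList<>();
            for (T item : items) {
                futures.add(CompletableFuture.supplyAsync(() -> lookup.apply(item), pool));
            }
            // Collect results in submission order; join() waits for each lookup
            // and rethrows a failure as an unchecked CompletionException.
            List<R> results = new ArrayList<>();
            for (CompletableFuture<R> future : futures) {
                results.add(future.join());
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }
}
```

A call might look like `fetchAll(results, r -> detailService.get(r), 8)`, where `detailService` is again a placeholder. The pool size is worth keeping modest, since every thread is an extra concurrent request against the service being called.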

v.kumar · September 1, 2016, 8:48am

I am already using the bulk API.
I think most of the time is spent obtaining the information to index. My question is: if I use multiple threads to index, will it affect Elasticsearch?

magnusbaeck · September 1, 2016, 8:53am

I am already using the bulk API.

That wasn't clear from your question.

I think most of the time is spent obtaining the information to index. My question is: if I use multiple threads to index, will it affect Elasticsearch?

I'm not sure exactly what you're asking, but issuing concurrent bulk indexing requests should be fine as long as the concurrency doesn't become too great (at which point ES will start rejecting requests when its thread pools are exhausted).
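To illustrate what keeping the concurrency bounded could look like on the client side, here is a rough sketch that splits the documents into batches and never has more than a fixed number of bulk calls in flight. It deliberately does not use the Elasticsearch client: `BoundedBulkSender`, `partition`, `sendAll`, and the `sendBulk` callback are hypothetical names, and `sendBulk` is where the real bulk request would go.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Consumer;

// Hypothetical helper; `sendBulk` stands in for an actual bulk indexing call.
class BoundedBulkSender {

    // Splits docs into consecutive batches of at most batchSize elements.
    static <D> List<List<D>> partition(List<D> docs, int batchSize) {
        List<List<D>> batches = new ArrayList<>();
        for (int i = 0; i < docs.size(); i += batchSize) {
            int end = Math.min(i + batchSize, docs.size());
            batches.add(new ArrayList<>(docs.subList(i, end)));
        }
        return batches;
    }

    // Sends every batch through sendBulk with at most maxInFlight concurrent
    // calls; the fixed-size pool is what enforces the limit.
    static <D> void sendAll(List<D> docs, int batchSize, int maxInFlight,
                            Consumer<List<D>> sendBulk) {
        ExecutorService pool = Executors.newFixedThreadPool(maxInFlight);
        try {
            List<CompletableFuture<Void>> pending = new ArrayList<>();
            for (List<D> batch : partition(docs, batchSize)) {
                pending.add(CompletableFuture.runAsync(() -> sendBulk.accept(batch), pool));
            }
            // Wait for all batches; a failed batch surfaces as CompletionException.
            for (CompletableFuture<Void> future : pending) {
                future.join();
            }
        } finally {
            pool.shutdown();
        }
    }
}
```

Starting with a small `maxInFlight` and raising it only while no requests are being rejected is one cautious way to find a workable level.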

v.kumar · September 1, 2016, 8:59am

Well, yes. My question is about how ES handles multiple parallel threads for indexing. Is there any documentation? How can I control that, or debug it?

mainec · September 1, 2016, 9:09am

Not exclusively about parallel indexing, but the following blog post should provide you with more detail on indexing performance:
