Low performance on bulk insert with custom mapping

0011001011 · October 31, 2018, 5:36pm

Hi all,

I have run into a strange problem with ES 6.2.4. Bulk inserts into my ES nodes succeed, but they are slow (~10 s per 1000 documents).
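For reference, here is a minimal sketch of how such a bulk request body can be built and how the response can be read. It is not the exact client code used here. The helper names and the `_doc` type name are assumptions made for illustration; the newline-delimited body format and the `took`, `items`, and `error` response fields are the ones the `_bulk` API uses.

```python
import json


def build_bulk_body(docs, index, doc_type="_doc"):
    """Serialize docs into the newline-delimited JSON the _bulk endpoint expects."""
    lines = []
    for doc in docs:
        # Action line: tells ES which index/type the following source line targets.
        lines.append(json.dumps({"index": {"_index": index, "_type": doc_type}}))
        # Source line: the document itself.
        lines.append(json.dumps(doc))
    # The bulk body must be terminated by a newline.
    return "\n".join(lines) + "\n"


def summarize_bulk_response(response):
    """Return (server-side time in ms, number of items that failed)."""
    failed = sum(
        1
        for item in response.get("items", [])
        for result in item.values()
        if "error" in result
    )
    return response.get("took"), failed
```

Comparing `took` with the wall-clock time measured on the client helps separate time spent inside ES from time spent on the network or in the client.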

I run it with 4 GB of RAM, CPU usage is around 24%, the index has 3 shards, and refresh is disabled.

If I do not specify a mapping and let ES infer it dynamically, indexing a bulk of 1000 docs takes only 1 s. Fair enough: my mapping is complex and involves a lot of preprocessing.

What is strange is that if I retrieve the mapping ES generated dynamically (through GET /my-index/) and then use this exact same mapping on a new instance, I also get the bad performance (~10 s per bulk), while it should be the same as in the dynamic case.
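One way to double-check that the two mappings really are identical is to diff them field by field. The sketch below is only an illustration with hypothetical helper names. It assumes each argument is the type-level mapping taken from the `GET /my-index/` response (the object holding `properties`), and it does not look at index settings or dynamic templates.

```python
def flatten_properties(properties, prefix=""):
    """Map each dotted field path to its definition, excluding nested properties."""
    flat = {}
    for name, definition in properties.items():
        path = prefix + name
        # Keep everything about the field (type, analyzer, multi-fields, ...)
        # except its sub-fields, which get their own dotted paths.
        flat[path] = {k: v for k, v in definition.items() if k != "properties"}
        if "properties" in definition:
            flat.update(flatten_properties(definition["properties"], path + "."))
    return flat


def diff_mappings(left, right):
    """Return {path: (left_def, right_def)} for every field that differs."""
    a = flatten_properties(left.get("properties", {}))
    b = flatten_properties(right.get("properties", {}))
    return {
        path: (a.get(path), b.get(path))
        for path in sorted(set(a) | set(b))
        if a.get(path) != b.get(path)
    }
```

An empty result means the field definitions match, which would point towards settings or templates rather than the mapping itself.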

Any ideas? I feel like I am missing something here, and it's kind of frustrating.

Regards

Christian_Dahlqvist · October 31, 2018, 5:41pm

What type of hardware are you running on? What type of storage? What is the size of your documents?

0011001011 · October 31, 2018, 5:50pm

Running on a conventional HDD, Xeon E5-2680 (2.40 GHz).

One document has 50 fields, and each field is "short" (less than 50 characters).

dadoonet · October 31, 2018, 6:03pm

That should work the same way. Are you 100% sure that you don't have something else, like an index template, that changes the mapping?
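A quick way to rule that out is to list the templates with `GET /_template` and see which ones apply to the index name. The following is a rough sketch with a hypothetical helper name. It assumes `templates` is the parsed JSON of that response, and it uses Python's `fnmatch` as an approximation of ES pattern matching (ES only treats `*` as a wildcard).

```python
from fnmatch import fnmatchcase


def matching_templates(templates, index_name):
    """Return names of templates whose patterns match index_name, lowest order first."""
    hits = []
    for name, body in templates.items():
        # 6.x templates use "index_patterns"; templates created on older
        # versions may carry a single "template" string instead.
        patterns = body.get("index_patterns") or [body.get("template", "")]
        if any(fnmatchcase(index_name, pattern) for pattern in patterns):
            hits.append((body.get("order", 0), name))
    return [name for _, name in sorted(hits)]
```

If this returns anything for the index in question, those templates contribute settings or mappings at index creation time.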

0011001011 · October 31, 2018, 6:07pm

Yep, I am positive about that.

0011001011 · October 31, 2018, 6:08pm

By the way, for what it's worth, I tried creating the index with an empty mapping:
{ myindexname : {} }
And with that I lost the performance as well...

dadoonet · October 31, 2018, 6:23pm

Can you try with 6.4.2?

0011001011 · October 31, 2018, 6:56pm

I am currently away from work, and I don't know if it is available through the proxy/Artifactory. I will check ASAP. Is it a known bug?

0011001011 · November 5, 2018, 9:28am

Any insights? Unfortunately, I cannot test with 6.4.2.

system · December 3, 2018, 9:28am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
