How to run multiple indexing threads on a single machine?

Niels_Basjes · July 31, 2022, 3:08pm

I have written an Elasticsearch plugin ( analyzes the UserAgent string ) that works the way I have in mind when I put it in a pipeline. (see Elastic Search | Yauaa - Yet Another UserAgent Analyzer )

Now on my laptop I want to put a rather large dataset (~100M records) in Elasticsearch via this plugin and check the results in Kibana.
I have put together some scripting ( yauaa/devtools/analysis at main · nielsbasjes/yauaa · GitHub ) that starts Elasticsearch with the plugin installed using Docker on my Ubuntu machine.
I then define the pipeline and load the data.

Functionally this works.

The problem I have is that when I do this I have been unable to make this pipeline run in multiple threads and thus reduce the time I have to wait. At the moment it is only using ~2-3 cpu cores where my laptop has 12 (6+hyperthreading).
I am doing 8 bulk updates at a time with ~ 100000 records in each batch.

What config setting can I change to get ES to actually use multiple threads for this pipeline so that it uses ~10 CPU cores?

warkolm · August 1, 2022, 10:33pm

The only way I can think to influence this would be to run multiple nodes on the host.

Maybe someone else can comment though?

Niels_Basjes · August 3, 2022, 8:44am

This clarifies a lot.
I'm going to try a different approach now.
Thanks!

system · August 31, 2022, 8:45am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
CPU affinity when running multiple nodes on a single host Elasticsearch	5	1215	June 5, 2020
Running multiple instances of Elasticsearch on the same host - Allocated processors setting Elasticsearch	0	71	July 2, 2024
Should I deploy elasticsearch in docker on one machine? Elasticsearch docker	14	415	January 16, 2024
Multiple ES run on same machine Elasticsearch	2	366	April 22, 2019
[ES2.2] how to run multiple elasticsearch nodes as processes on the same server Elasticsearch	8	4130	July 5, 2017

How to run multiple indexing threads on a single machine?

Related topics