Elasticsearch cluster / nodes / shards conf

DavidTurner · January 15, 2019, 4:58pm

It will take some experimentation and tuning to be sure that your setup performs the way you need it to. You might like to try importing increasingly large subsets of your log file first, both to see how indexing and search performance scale with data size and to make sure that your mappings are set up suitably for the searches you want to perform.

If the index will eventually be 400GB then this article suggests you will want to split it into around 10-20 shards. However, if you do not need all the fields to be indexed then you might find that your index ends up much smaller than the source data, in which case you will be able to work with correspondingly fewer shards.
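To make the arithmetic concrete, here is a rough sketch in Python. It assumes a target shard size of roughly 20-40GB, which is only what the 400GB and 10-20 shard figures above imply; the function name and defaults are made up for illustration, and you should measure your real index size before relying on it.

```python
import math

def shard_count_range(index_size_gb, min_shard_gb=20, max_shard_gb=40):
    """Return (fewest, most) shards that keep each shard within the target size.

    The 20-40GB default is an assumption inferred from "400GB -> 10-20 shards",
    not a hard Elasticsearch limit.
    """
    fewest = math.ceil(index_size_gb / max_shard_gb)  # largest allowed shards
    most = math.ceil(index_size_gb / min_shard_gb)    # smallest allowed shards
    return fewest, most

print(shard_count_range(400))  # 400GB of indexed data
print(shard_count_range(100))  # e.g. if the index is much smaller than the source
```

The input here is the size of the index on disk, not the size of the source log file, which is why importing a subset first is useful.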

If your searches will be filtering on time ranges (common for searches of log data) then you might want to consider splitting the data into time-based indices rather than putting it into a single index with lots of shards. Using time-based indices allows Elasticsearch to completely avoid searching any shards that it knows in advance cannot contain documents matching the time range specified in the search.
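As an illustration of the idea, the sketch below assumes one index per day with a hypothetical `logs-YYYY.MM.DD` naming scheme (the prefix and both helper names are invented for this example). It only shows how a time range maps onto a small set of indices; Elasticsearch does the actual shard skipping itself.

```python
from datetime import date, timedelta

def daily_index_name(day, prefix="logs"):
    """Name of the daily index holding documents for `day` (hypothetical scheme)."""
    return f"{prefix}-{day:%Y.%m.%d}"

def indices_for_range(start, end, prefix="logs"):
    """List the daily indices covering start..end inclusive."""
    days = (end - start).days
    return [daily_index_name(start + timedelta(days=i), prefix)
            for i in range(days + 1)]

# A three-day search only needs to touch three indices,
# however many months of logs exist in total.
print(indices_for_range(date(2019, 1, 14), date(2019, 1, 16)))
```

Whether daily, weekly or monthly indices make sense depends on how much data arrives per period, since each index should still end up with reasonably sized shards.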
