Elastic and Kibana Load Balancing

I have 2 physical machines that I run Elasticsearch and Kibana on:

120 - Master Node and a Data Node
121 - Data Node

Each machine has 8 cores, 64GB of RAM, and 4TB of hard drive space.

The cluster is set up and the indices have replicated to both 120 and 121.

My issue is with Kibana. Should Kibana run on both 120 and 121, or on just one of them? I also get 30000ms timeout errors because I have approximately 3 billion transactions in Elasticsearch that people run searches and dashboards against. Quite often, especially when 2 or more users are doing something at the same time, the system times out. How do I solve this?

I'd run it on 121, as you have the master using memory on 120

What size is your heap?

Heap is set at 28GB. Also note that data ONLY gets added to Elastic once a day! About 10 million records inserted.

So you're mainly an RO use case - merge your segments to a reasonable degree after each load. Merging down to a single segment will probably take too long with 64GB RAM and an HDD.
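For example, something along these lines after the daily load (the index name is just a placeholder, and max_num_segments is a value to tune - pick one that finishes within your load window rather than forcing a single segment):

```
# Merge the freshly loaded index down to at most 10 segments per shard
POST /my-index/_forcemerge?max_num_segments=10
```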

You could increase the search timeout (if acceptable)
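If the errors you see are Kibana's "Request Timeout after 30000ms", that limit is Kibana's elasticsearch.requestTimeout setting (default 30000 ms), so a minimal sketch of this option is raising it in kibana.yml (the value is just an example):

```
# kibana.yml - give long-running dashboard queries more than the default 30 s
elasticsearch.requestTimeout: 120000
```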

If not, then you need to either optimise your mappings/indices or add more hardware

How large are your indices? How many indices and shards do you have in the cluster?

What's an RO? And I do have SSDs, sorry for not clarifying that earlier.

My seed index is about 800GB. It's the largest one and has 1.1 billion transactions. The rest hold around 10 million records each per day and are about 10GB in size.

At the moment I only have 2 indices in total. The shards setting is at the default, so I'm not entirely sure.

Can you provide the output of the _cat/indices API? It sounds like you have quite large shards, which can impact query performance.

What type of queries are you running? What does CPU usage, disk I/O and iowait look like while you are querying?
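For reference, something like the following gives that overview from the cluster itself (the column lists are only suggestions; for iowait you would still need to watch the OS, e.g. with iostat, while a query runs):

```
# Index, shard and size overview
GET _cat/indices?v&h=index,pri,rep,docs.count,pri.store.size,store.size

# Per-node heap, CPU and load while queries are running
GET _cat/nodes?v&h=name,heap.percent,cpu,load_1m,load_5m,load_15m
```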

Please don't post images of text as they are hardly readable and not searchable.

Instead, paste the text and format it with the </> icon. Check the preview window.

I can see that you have one index with a single primary shard of 894.1GB; the replica is the same size. This is likely going to be very slow to query, as each query is single-threaded against each shard. I would recommend you reindex this into a new index with e.g. 30 primary shards. When you do so, I would also recommend setting number_of_routing_shards to e.g. 120 so that you can later use the split index API if needed. This assumes you are on a reasonably recent version.
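A rough sketch of that, with placeholder index names (you would also copy your existing mappings into the new index before starting the reindex):

```
# New index with 30 primaries; number_of_routing_shards allows later splits
PUT /seed-index-v2
{
  "settings": {
    "number_of_shards": 30,
    "number_of_routing_shards": 120,
    "number_of_replicas": 1
  }
}

# Copy the data across as a background task
POST _reindex?wait_for_completion=false
{
  "source": { "index": "seed-index" },
  "dest": { "index": "seed-index-v2" }
}
```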

Will post in a minute. Just to play with some settings, I am trying to reindex and change the default to 5 shards and 2 replicas. I have 2 physical nodes and want to see if the performance changes once the index is sharded. I understand that 1TB is fairly large and has to be reindexed.

We are on the latest and greatest 7.2!

Then you MAY be able to use the split index API without reindexing.
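With an index created on 7.x the flow is roughly this (index names are placeholders; the source has to be made read-only first, and with the default number_of_routing_shards you can only split by factors of 2, so e.g. 32 rather than exactly 30):

```
# Block writes on the source index so it can be split
PUT /seed-index/_settings
{
  "settings": {
    "index.blocks.write": true
  }
}

# Split the single primary shard into 32
POST /seed-index/_split/seed-index-split
{
  "settings": {
    "index.number_of_shards": 32
  }
}
```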

Wow 30 primaries?

Yes, that will get you down to 30GB per shard. You could go a bit bigger, but that will allow you to grow for a while.

So you would just change "number_of_shards" : 50 and "number_of_replicas" : 2 (is 2 enough?)

We split daily indexes into smaller bits and they are generally 10GB per day, so 30 or 50 shards would mean ~500MB shards - would that be an issue?

First check what the index settings of the large index are.
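i.e. something like this (index name is a placeholder), then look at number_of_shards, number_of_replicas and, if set, number_of_routing_shards in the output:

```
GET /seed-index/_settings
```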

The split is for the really large index only. You want to avoid having lots of really small indices, so try to keep the shard size above 10GB for time-based indices.
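If it helps, a sketch of keeping the daily indices at a single primary shard via a legacy index template (template name and pattern are just examples):

```
# Keep the ~10GB daily indices at one primary shard each
PUT _template/daily-transactions
{
  "index_patterns": ["daily-transactions-*"],
  "settings": {
    "number_of_shards": 1,
    "number_of_replicas": 1
  }
}
```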