I know there are many threads opened for this but I haven't been able to fix my issue with them.
I am new to this.
Our ES cluster has 4 data nodes, 1 master node and 1 ingest node.
The indexing rate is currently about 35k documents per minute. I tried to increase the indexing speed by adding an additional data node, but that didn't help.
I have configured my OS as per the official documentation and done some tuning for faster indexing (index.refresh_interval = 30s etc.). Could you please help identify the bottleneck(s), if any?
ES version: 7.8.1
Data is sent to ES by multiple Filebeat agents, using the default Filebeat index template.
I have removed all unnecessary nested fields but there has hardly been any impact.
Even though it is most likely not related to your performance problems, I would like to point out that having a single master-eligible node is risky. You should always aim for at least 3 master-eligible nodes in a cluster, as that adds resiliency and reduces the risk of catastrophic failure.
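To make the resiliency point concrete, here is a rough sketch of the majority arithmetic behind master elections. It assumes a plain majority quorum and ignores how ES 7.x adjusts its voting configuration in practice; the function names are just illustrative.

```python
def quorum(master_eligible: int) -> int:
    """Smallest number of master-eligible nodes that forms a majority."""
    return master_eligible // 2 + 1


def tolerated_failures(master_eligible: int) -> int:
    """How many master-eligible nodes can be lost while a majority remains."""
    return master_eligible - quorum(master_eligible)


# Compare a single master-eligible node with two and three.
for n in (1, 2, 3):
    print(f"{n} master-eligible: quorum={quorum(n)}, "
          f"tolerated failures={tolerated_failures(n)}")
```

Under this simplification, one or two master-eligible nodes tolerate no failures, while three tolerate the loss of one.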
Some additional information about your cluster is required though. What is the hardware profile of the different nodes? What type of storage are you using? What do CPU and memory usage look like on the nodes? Is there any evidence of slow or long GC in the logs on any of the nodes?
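If it helps to gather those numbers, below is a minimal sketch that summarises CPU, heap and old-generation GC figures from the node stats API (`GET _nodes/stats/os,jvm`). The base URL is an assumption (an unsecured local node), the helper names are made up for illustration, and the field paths are the ones I would expect in a 7.x response, so please verify them against your own output.

```python
import json
from urllib.request import urlopen


def summarize_node_stats(stats: dict) -> list:
    """Extract per-node CPU, heap and old-GC figures from a node stats response."""
    rows = []
    for node in stats.get("nodes", {}).values():
        old_gc = node["jvm"]["gc"]["collectors"]["old"]
        count = old_gc["collection_count"]
        millis = old_gc["collection_time_in_millis"]
        rows.append({
            "name": node["name"],
            "cpu_percent": node["os"]["cpu"]["percent"],
            "heap_used_percent": node["jvm"]["mem"]["heap_used_percent"],
            "old_gc_count": count,
            # Average pause per old-generation collection; 0.0 if none ran yet.
            "old_gc_avg_ms": millis / count if count else 0.0,
        })
    return rows


def fetch_node_stats(base_url: str = "http://localhost:9200") -> dict:
    """Fetch OS and JVM stats; base_url is an assumed, unsecured endpoint."""
    with urlopen(f"{base_url}/_nodes/stats/os,jvm") as resp:
        return json.load(resp)
```

Calling `summarize_node_stats(fetch_node_stats())` a few times during heavy indexing should show whether any node is pinned on CPU, close to its heap limit, or spending a long time in old-generation GC.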
@Anirbaan_Chowdhury Maybe it's not an indexing speed issue but rather an ingestion speed issue. It probably depends on the complexity of the ingest pipeline, but I know that in my production stack, 1 ingest node is not enough.
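One way to check this is to look at how much time each ingest pipeline spends per document, using `GET _nodes/stats/ingest`. The sketch below only processes an already-parsed response; the function name is hypothetical and the field paths (`ingest.pipelines.<name>.count` and `time_in_millis`) are what I would expect from 7.x, so treat them as assumptions to verify.

```python
def pipeline_ms_per_doc(stats: dict) -> dict:
    """Average ingest time per document for each pipeline, summed over nodes."""
    totals = {}  # pipeline name -> [document count, total milliseconds]
    for node in stats.get("nodes", {}).values():
        pipelines = node.get("ingest", {}).get("pipelines", {})
        for name, p in pipelines.items():
            entry = totals.setdefault(name, [0, 0])
            entry[0] += p["count"]
            entry[1] += p["time_in_millis"]
    # Skip pipelines that have not processed any documents yet.
    return {
        name: millis / count
        for name, (count, millis) in totals.items()
        if count
    }
```

If one pipeline shows a noticeably high average, or the single ingest node is CPU-bound while the data nodes are idle, the pipeline rather than indexing is the more likely bottleneck.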