I've been working with ES 5.2.2 for the past few months and I can't seem to make searches run fast enough. They even timeout quite a bit. My set up:
I have a
1 dedicated Logstash node
3 master eligible ES nodes (8GB available RAM, I set half of it, 4GB, in the JVM file)
1 coordinating node with kibana on it
My sharding is 1 primary with 2 replicas.
So when I search past 12 hours, I get a timeout which seems ridiculous because I only have about 70k documents at 160mb each day/index so you'd think things would be going fast enough but they aren't.
I've done a ton of reading on how to make things faster but nothing seems to work. As a side question, is m_lockall automatically set in the YAML file in ES 5.X, or I need to explicitly set it?
What type of queries are you running? How much CPU does each node have? What kind of storage do you use? Do you have monitoring installed so you can get some clues from that?
A basic search all (*) query is what I'm referring to, but it's only a few
10k of documents that it's searching through... sadly i don't have
monitoring installed and can't install it because I've already used up the
trial. These are virtual machines using ssds and about half of the storage
is used up by past indexes. Can those past indices have an impact on
searches that don't even look at those shards? and I'll have to check on
the CPU.. what is a lower limit of the cpus? Does it have that strong of an
impact on performace?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.