Es-diag: request for checks

Dotan_Nahum · June 17, 2012, 4:00pm

Hi guys,

I previously announced the es-diag tool,
https://github.com/jondot/es-diag, a readiness/health-check toolkit for
Elasticsearch.

After reviewing parts of the mailing list and the GitHub issues for
specific best practices, I found it hard to pick out anything I could mark
as an obvious pitfall or best practice. I therefore think it is
best to ask the community directly (other ideas are welcome).

So, if anyone has best practices, pitfalls, checklists, or things they wrote
on the back of a napkin that they can share,
feel free to describe them in words, and I'll take care of adding them to the
tool as proper 'checks'. You could then pick up the
latest version of the tool and run it yourself.

The main things I'd need in a good description are:

- the source of the data (can be anything, really)
- the conditions that are considered bad
- how to fix it, in general
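
As a rough illustration of those three parts, a check could be modelled in plain Ruby along these lines. This is only a sketch: `Check`, `source`, `bad_if`, the sample numbers, and the "half of RAM" threshold are all made up for the example and are not the actual es-diag DSL.

```ruby
# Hypothetical structure for illustration only; not the es-diag DSL.
Check = Struct.new(:title, :source, :bad_if, :how_to) do
  # Fetch the data, then report whether the bad condition holds.
  def run
    data = source.call
    { title: title, ok: !bad_if.call(data), how_to: how_to }
  end
end

heap_check = Check.new(
  "Recommended JVM Heap Size",
  -> { { heap_max_mb: 1024, ram_mb: 8192 } },    # stand-in for real node data
  ->(d) { d[:heap_max_mb] < d[:ram_mb] / 2 },    # assumed threshold, for illustration
  "Raise ES_MAX_MEM."
)

result = heap_check.run
puts "#{result[:title]}: #{result[:ok] ? 'ok' : 'needs attention'}"
puts "How to fix: #{result[:how_to]}" unless result[:ok]
```

Keeping the data source, the condition, and the remedy as separate fields means a description written in words maps onto a check almost one to one.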

Just to give a feel for how such a check looks in code, I've already written one
example based on Shay's recommendation for JVM heap settings. It shipped in 0.0.3,
and you can view it here:

https://github.com/jondot/es-diag/blob/master/lib/checks/local_node_jvm_heap.rb

title "Recommended JVM Heap Size"


how_to """
    Passing JVM level configuration (such as -X parameters) should be set 
    within the elasticsearch.conf file.

    Use the ES_MIN_MEM and ES_MAX_MEM environment variables to set the minimum 
    and maximum memory allocation for the JVM (set in mega bytes). It defaults
    to 256 and 1024 respectively.
"""


#
# for more detail about es_local_node data bag, see your own node
# http://localhost:9200/_cluster/nodes/_local
#
# for technical detail, see how fetching is implemented in
# /contexts/es_local_node.rb
#

This file has been truncated.

The idea is to have a coded 'spec' of checks that anyone can run, either
before setting up Elasticsearch or while it is running,
to make sure the server stays properly configured to host it. Beyond that, it
could make troubleshooting and support far more straightforward.
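
To make the "spec anyone can run" idea a bit more concrete, here is a minimal sketch of a runner that applies a list of checks to some node data and prints a remedy for each failure. Everything in it (`CHECKS`, `run_checks`, the field names, and the two conditions) is hypothetical and chosen for illustration. The only values taken from the tool are the 256/1024 MB defaults quoted in the excerpt above.

```ruby
# Hypothetical check list; names and conditions are illustrative, not es-diag's.
CHECKS = [
  { title: "JVM max heap raised above the 1024 MB default",
    bad_if: ->(d) { d[:heap_max_mb] <= 1024 },
    how_to: "Set ES_MAX_MEM to a larger value." },
  { title: "Min and max heap are equal",
    bad_if: ->(d) { d[:heap_min_mb] != d[:heap_max_mb] },
    how_to: "Set ES_MIN_MEM and ES_MAX_MEM to the same value." }
].freeze

# Returns one human-readable line per failed check.
def run_checks(checks, data)
  checks.select { |c| c[:bad_if].call(data) }
        .map { |c| "#{c[:title]}: FAILED. #{c[:how_to]}" }
end

node = { heap_min_mb: 256, heap_max_mb: 1024 }   # the defaults, as sample input
failures = run_checks(CHECKS, node)
puts failures.empty? ? "All checks passed" : failures
```

Because the runner only sees a plain hash, the same checks could in principle be fed from a config file before installation or from a live node afterwards.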

Thanks!