Hi everyone,
I am considering moving one or several elasticsearch clusters to production.
Although Elasticsearch's documentation and community is great, I am
strongly startled not to find any complete use-case story stretching from
application(s) needs and data considerations to hardware ones.
Indeed, I understand why "what/how much hardware / configuration /
sharding" questions are systematically replied with both "it depends"
followed by "test".
But then, what about a few complete descriptions, out of so many
elasticsearch users, from data use case to cluster's internals, along with
a few performance and nodes stats?
So here are questions, before moving to production :
Are there any complete use cases around? Could you share some? By
complete I mean including at least some of the following :
- Application needs and scope
-
Indexing Data indications : data volume, documents mapping,
documents / indexes volume -
Searching Data indications : different applications, queries, use
of facets - filters - aggregations, concurrent indexing -
Cluster Hardware : machines' hardware (RAM, Disks/SSD -
DAS-JBOD/SAN/NAS), JVM heap / OS Cache, nb of machines, back office network -
Cluster Configuration : one or several indexes, sharding,
replication, master nodes, data nodes, use of over-sharding at start-up,
use of re-indexing - *Benchmaks *: queries response times, QPS, with or without concurrent
indexing, memory heap sweet spot, nodes stats
For those interested, here are the (not complete) best-among-very-few
exemples I've stumbled upon so far :
- The very best (perfs with hardware and query description) :
http://fr.slideshare.net/charliejuggler/lucene-solrlondonug-meetup28nov2014-solr-es-performance - Hardware and master nodes heap :
https://groups.google.com/forum/?fromgroups#!searchin/elasticsearch/sizing/elasticsearch/V5BtrCGOqoU/l7x6vqMEx5YJ -
6th slide - Hardware and storage with number of documents (well,
without indexes and documents storage volume nor RAM consumption) :
https://speakerdeck.com/bhaskarvk/scaling-elasticsearch-washington-dc-meetup
With JBOD / SAN storage discussion in "To Raid or not to Raid":
- Usual heap considerations in a real case :
https://codeascraft.com/2014/12/04/juggling-multiple-elasticsearch-instances-on-a-single-host/
Do not forget Elasticsearch awesome docs for moving to production
considerations :
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/administration.html
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/deploy.html
- http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/hardware.html - http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/heap-sizing.html
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/heap-sizing.html
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/57c967ea-8bf0-4dce-a7ca-4a746ee21250%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.