Two weeks to start.
Otis
using 28 nodes with1 daily index what would be the best shard and replication scheme?
On Wed, Mar 6, 2013 at 3:39 AM, Otis Gospodnetic otis.gospodnetic@gmail.com wrote:
Hi,
0 replicas is risky.  1 node dies and you are missing ~1/28th of your data. So repl > 0 is definitely better.
Otis
ELASTICSEARCH Performance Monitoring - Sematext Monitoring | Infrastructure Monitoring Service
On Wednesday, March 6, 2013 1:56:53 AM UTC-5, B wrote:
Clinton, if i keep to the distributed model of a 28 node cluster with all using the same index. Would shards=14 , repl=1 be a wise choice to keep all nodes doing something?
Right now testing shards=28 and repl=0 which is working great and the cluster is balanced with data and load but thinking long term of node failures and adding nodes in the future.
On Feb 27, 2013 11:43 PM, "Brian Harris" beha...@gmail.com wrote:
ok, just playing around with 28 nodes I set shards=28 and replicas=0.
I now have 28 shards all of my nodes now which is balanced.
This is definitely an alternate setup that will work but I will still to try to find out how to keep logs local to the rack using the templates you listed below.
Perhaps I don't need to keep logs per rack as this seems to balance out the storage pretty well.
What mechanism does elasticsearch use to keep the data balanced across all nodes?
28 shards all reported on all nodes.
logstash-2013.02.28 27 p STARTED 2972868 2.5gb 2753226040 node4
..
logstash-2013.02.28 0 p STARTED 2972863 2.5gb 2764377828 node19
On Wed, Feb 27, 2013 at 2:58 AM, Clinton Gormley cl...@traveljury.com wrote:
On Wed, 2013-02-27 at 00:16 -0800, B wrote:
Great, I checked out the docs and can came up with following config to
enable rack awareness for each instance that would look something like
this.
node:
name: node1
rack_id: rack1
cluster:
name: elasticsearch
routing:
allocation:
awareness:
attributes: rack_id
Since logstash creates an index per day(logstash-2013.02.26) per node,
would I have to do anything special to make sure that an index and
it's shards are created for each node individually?
If you want 30 indices on 30 different nodes, then you need to create
each index with a different name, and set the allocation on each index
to tie it to a single node.
Elasticsearch Platform — Find real-time answers at scale | Elastic
For instance, you can use index templates to say: if the index name
matches "node_1_*" then set index.routing.allocation.include.rack_id to
"node_1"
Elasticsearch Platform — Find real-time answers at scale | Elastic
clint
Currently only 5 shards are allocated in my shard=5 rep=0 cluster
receiving logs for +1k servers under low load. The goal is to have all
30 nodes to have an index started and receiving logs for its rack to
handle high load.
es shards -v
index               shard pri/rep      state          docs    size
bytes node
logstash-2013.02.26     0 p       STARTED    28031927  22.9gb
24677604861 n7
logstash-2013.02.26     1 p       STARTED    26853297    22gb
23641399741 n18
logstash-2013.02.26     2 p       STARTED    28035826  22.9gb
24686606451 n21
logstash-2013.02.26     3 p       STARTED    28033599  22.9gb
24695469792 n24
logstash-2013.02.26     4 p       STARTED    28037600  22.9gb
24687686161 n5
Any recommendations on how to configure elasticsearch rack awareness
and routing to handle this.
On Tuesday, February 26, 2013 10:40:36 AM UTC-8, Clinton Gormley
wrote:
    > If I use the distributed system, can I force data to stay
    local to the
    > rack?
    Yes. Look for rack awareness in the docs
    >
    >
    >
    > On Tuesday, February 26, 2013 1:41:46 AM UTC-8, Clinton
    Gormley wrote:
    >         Hiya
    >
    >         OK, the picture is slowly evolving :)
    >
    >         On Mon, 2013-02-25 at 16:23 -0800, beha...@gmail.com
    wrote:
    >         > So I have 30 racks at a colo and have 1 es
    instance per
    >         rack.
    >         > The 1 es instance per rack is used to index all
    logs for
    >         that rack
    >         > only.
    >         >
    >         > Is there anyway for each instance to join a 30
    node cluster
    >         as a
    >         > (client only) and not replicate or shard data
    between them?
    >
    >         A "client" in Elasticsearch terminology doesn't hold
    any
    >         data.  Hence
    >         part of the confusion.  I think what you're asking
    is: Can I
    >         have an
    >         index on a single node in the cluster?
    >
    >         The answer is yes: you can create 30 indices, and
    specify rack
    >         awareness
    >         for each index, so that each index sits in a single
    rack.
    >         >
    >         > The purpose of the client only cluster would allow
    me to
    >         search one
    >         > instance and have es query all members for data.
    >
    >         Yes, you can connect to any node in the cluster and
    query one
    >         or more
    >         indices.  It will forward queries to all relevant
    nodes.
    >
    >         Note: I don't recommend this setup.  Especially with
    30 nodes,
    >         the
    >         chances of one of them going down is pretty high.
     Hardware
    >         fails.  With
    >         your current setup (esp if you don't have any
    replicas) then
    >         you run a
    >         good chance of losing data.
    >
    >         Why not just use Elasticsearch as the distributed
    system that
    >         it is
    >         intended to be?
    >
    >         clint
    >
    >
    >
    > --
    > You received this message because you are subscribed to the
    Google
    > Groups "elasticsearch" group.
    > To unsubscribe from this group and stop receiving emails
    from it, send
    > an email to elasticsearc...@googlegroups.com.
    > For more options, visit
    https://groups.google.com/groups/opt_out.
    >
    >
--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearc...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--
You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/1ZUNmPHD8wY/unsubscribe?hl=en-US.
To unsubscribe from this group and all its topics, send an email to elasticsearc...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--
You received this message because you are subscribed to a topic in the Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/elasticsearch/1ZUNmPHD8wY/unsubscribe?hl=en-US.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.