We have a write-heavy cluster that shows only spiky activity during
bulk loads. The incoming write rate is ~1500 documents per second, but the
indexing rate is far lower, ~1500 per MINUTE, and CPU utilization is very
spiky (i.e. high CPU and network I/O for a short time, followed by no CPU
or network activity for a short time, then repeat). The data is being
written to ~10 object_types across 2 indices. Any ideas on how to
smooth this out and optimize for this volume of writes?
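To put a number on the gap, here is a quick back-of-the-envelope sketch in Python. It uses only the two rates quoted above; the variable names are made up for illustration.

```python
# Figures quoted in the post.
incoming_per_sec = 1500   # documents arriving per second
indexed_per_min = 1500    # documents actually indexed per minute

# Put both rates on the same time base.
indexed_per_sec = indexed_per_min / 60
slowdown = incoming_per_sec / indexed_per_sec

# Documents that pile up each minute if both rates hold steady.
backlog_per_min = incoming_per_sec * 60 - indexed_per_min

print(f"indexing rate: {indexed_per_sec:.0f} docs/s")
print(f"incoming is {slowdown:.0f}x faster than indexing")
print(f"backlog grows by {backlog_per_min} docs per minute")
```

So the cluster is indexing at roughly 1/60th of the incoming rate, assuming both figures are sustained averages rather than peaks.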
The loaders are on a separate cluster (4 Amazon c1.xlarges) and are
configured to be transport nodes only (no data or HTTP).
The indices themselves are stored on a 6-node elasticsearch cluster on
(Amazon) m1.xlarges (these are the machines showing the spiky load behavior).
This is what the index settings look like:
"settings" : {
  "index.number_of_shards" : "12",
  "index.number_of_replicas" : "2",
  "index.version.created" : "190899",
  "index.gateway.snapshot_interval" : "1200s"
}
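For a sense of what those shard and replica counts mean for write load, here is a rough sketch in Python. It assumes both indices use the settings shown, that shards are allocated evenly across the 6 data nodes, and that every replica indexes each document itself; the variable names are illustrative only.

```python
indices = 2
primaries_per_index = 12   # index.number_of_shards
replicas = 2               # index.number_of_replicas
data_nodes = 6
incoming_per_sec = 1500    # documents per second from the loaders

# Each document is indexed once on its primary and once per replica.
copies_per_doc = 1 + replicas

primary_shards = indices * primaries_per_index
shard_copies = primary_shards * copies_per_doc

# Assumes an even allocation of shard copies across data nodes.
copies_per_node = shard_copies / data_nodes

# Total indexing operations the cluster must perform to keep up.
index_ops_per_sec = incoming_per_sec * copies_per_doc
index_ops_per_node = index_ops_per_sec / data_nodes

print(f"{shard_copies} shard copies, {copies_per_node:.0f} per node")
print(f"{index_ops_per_sec} index ops/s cluster-wide, "
      f"{index_ops_per_node:.0f} per node")
```

Under those assumptions, keeping up with the loaders means about three times the incoming document rate in actual indexing work.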
and the configuration file (with some private info removed):
# ElasticSearch config file

# File paths
path:
  home: /usr/local/share/elasticsearch
  conf: /etc/elasticsearch
  logs: /var/log/elasticsearch

# http://www.elasticsearch.com/docs/elasticsearch/modules/node/
node:
  data: true
  master: true

# http://www.elasticsearch.com/docs/elasticsearch/modules/http/
http:
  enabled: true
  port: 9200-9300
  max_content_length: 100mb

cluster:
  routing:
    allocation:
      node_initial_primaries_recoveries: 4
      concurrent_recoveries: 5

# http://groups.google.com/a/elasticsearch.com/group/users/browse_thread/thread/1f3001f43266879a/06d62ea3ceb4db30?lnk=gst&q=translog#06d62ea3ceb4db30
indices:
  cache:
    filter:
      size: 20%
  memory:
    index_buffer_size: 10%

index:
  number_of_shards: 12
  number_of_replicas: 2
  translog:
    flush_threshold_ops: 5000
    flush_threshold_size: 200mb
    flush_threshold_period: 60s
  merge:
    policy:
      max_merge_at_once: 10
      segments_per_tier: 10
      use_compound_file: false
      floor_segment: 2.7mb
  refresh_interval: 1s
  shard:
    recovery:
      concurrent_streams: 7
  engine:
    robin:
      term_index_interval: 1024
  gateway:
    snapshot_interval: 10s
    snapshot_on_close: true

# http://www.elasticsearch.org/guide/reference/api/admin-cluster-nodes-shutdown.html
action:
  disable_shutdown: false

# http://www.elasticsearch.com/docs/elasticsearch/modules/transport/
transport:
  tcp:
    port: 9300-9400
    connect_timeout: 2m
    compress: true

# http://www.elasticsearch.com/docs/elasticsearch/modules/jmx/
jmx:
  create_connector: true
  port: 9400-9500
  domain: elasticsearch

monitor.jvm.gc.ParNew.warn: 1000ms
monitor.jvm.gc.ParNew.info: 700ms
monitor.jvm.gc.ParNew.debug: 400ms
monitor.jvm.gc.ConcurrentMarkSweep.warn: 10s
monitor.jvm.gc.ConcurrentMarkSweep.info: 5s
monitor.jvm.gc.ConcurrentMarkSweep.debug: 2s
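As a rough check on the translog settings in the config above, this Python sketch estimates how long one shard would take to reach flush_threshold_ops. It assumes the threshold is counted per shard and that writes are spread evenly over the 24 primary shards (2 indices x 12 shards); both are assumptions, and the function name is hypothetical.

```python
flush_threshold_ops = 5000       # index.translog.flush_threshold_ops
flush_threshold_period_s = 60    # index.translog.flush_threshold_period
primary_shards = 2 * 12          # 2 indices x 12 shards each

def seconds_to_ops_threshold(docs_per_sec):
    """Seconds for one shard to accumulate flush_threshold_ops operations,
    assuming documents are spread evenly across all primary shards."""
    per_shard_rate = docs_per_sec / primary_shards
    return flush_threshold_ops / per_shard_rate

at_incoming_rate = seconds_to_ops_threshold(1500)       # rate from the loaders
at_observed_rate = seconds_to_ops_threshold(1500 / 60)  # rate actually indexed

print(f"at incoming rate: {at_incoming_rate:.0f} s to reach the ops threshold")
print(f"at observed rate: {at_observed_rate:.0f} s to reach the ops threshold")
print(f"period threshold: {flush_threshold_period_s} s")
```

If those assumptions hold, the 60s period would be reached before the ops threshold at either rate, so the time-based setting may be the one actually driving flushes here.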