Distribution of work

orenmazor · April 4, 2012, 7:49pm

hey guys,

I've been on/off dealing with some ES issues, and I think I've
resolved a LOT of them. but there's still one thing that bugs me.

My structure:

40gb index
two nodes, ten shards, one replica.
each node is a quad-core machine with 32gb of ram. JVM heap min=max=16gb
our data is extremely uniform, and extremely small. but comes in at a
high rate of around 100 index actions per second.
ES is at 0.18.6 (I know, we'll be updating this week)

currently, all of the shards are marked as primary on node one.

node 1:

uptime is 9 hours.
bigdesk shows 11399 garbage collection actions, and 2000 merges.
cpu has been uniform at under 5% for pretty much forever.
memory is at used=30gb, actual=14gb

node 2:

uptime is 3 hours
bigdesk shows 104000 gc actions, and 800 merges.
cpu has been uniform at 80-90% for pretty much forever.
memory is at used=21gb, actual=13gb

note:

node 1 is also running a few python scripts (server density, mongo
sync tool, geckoboard agent)

my questions:

why is cpu so high on node 2?

I was thinking that the 'primary' shards are the ones that do the
merging, and the 'slaves' just receive the complete index, but
in both cases the lucene indices have to do a merge. not so much a
question as making sure I'm understanding everything.

my data is routed, which in theory should mean that some shards
would be doing a lot more work than others. does ES account for 'work'
and move shards from machine to machine? or does it just account for
shard size?

thanks!
Oren
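
On the routing question: a routed document goes to the shard chosen by hashing its routing value modulo the number of shards, so a handful of heavy routing values can pile work onto a few shards. Below is a toy sketch of that effect. It assumes a stand-in hash (`zlib.crc32`, not the hash Elasticsearch actually uses) and made-up customer keys, so the shard numbers it prints say nothing about a real cluster.

```python
import zlib
from collections import Counter

NUM_SHARDS = 10  # matches the ten shards described in the post


def shard_for(routing_value: str, num_shards: int = NUM_SHARDS) -> int:
    """Pick a shard by hashing the routing value (crc32 is only a stand-in hash)."""
    return zlib.crc32(routing_value.encode("utf-8")) % num_shards


def docs_per_shard(routing_values, num_shards: int = NUM_SHARDS) -> Counter:
    """Count how many documents land on each shard."""
    return Counter(shard_for(value, num_shards) for value in routing_values)


# Hypothetical workload: one very busy routing key plus many small ones.
workload = ["big-customer"] * 900 + [f"customer-{i}" for i in range(100)]
counts = docs_per_shard(workload)
hot_shard, hot_count = counts.most_common(1)[0]
print(f"hottest shard {hot_shard} holds {hot_count} of {len(workload)} docs")
```

Every document for the busy key lands on the same shard, so that shard does most of the indexing no matter how evenly the remaining keys spread out.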

Lukas_Vlcek1 · April 4, 2012, 9:12pm

Hi,

I probably do not have answers to all your questions but let me try to
address some of them.

As for the high CPU, could it be caused by connecting your ES client only
to the second node, so that this node also has to handle query execution?
You did not tell us how you query the cluster, so that could be one of the
reasons.

ES replicates documents, not Lucene index segments. For more on the
rationale behind this approach, you can check Shay's talk from BBUZZ 2011:
http://vimeo.com/26710663
Slides can be found here:
http://2011.berlinbuzzwords.de/sites/2011.berlinbuzzwords.de/files/elasticsearch-bbuzz2011.pdf
There is also an older draft of concept pages based on this talk sitting
in a pull request. Specifically, the replication part is here:
https://github.com/lukas-vlcek/elasticsearch.github.com/blob/cd92cb1cfaa238d2de6844599d3c913de06de313/guide/concepts/scaling-lucene/replication/index.textile

To my knowledge, ES does not move shards around the cluster based on
query or work load. But there are options for setting up and updating shard
allocation, even in real time.
So it should be possible to implement an external system that does this
based on the load of the nodes (and the load can be checked using the ES
admin API, which is the API that bigdesk uses to create all its charts).

Regards,
Lukas
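
The external load check suggested above could start from something like the sketch below. It is only an illustration: it assumes a node stats response shaped like `{"nodes": {id: {"name": ..., "os": {"cpu": {"percent": ...}}}}}`, and those field names differ between Elasticsearch versions, so treat them as placeholders. The sample data is hand-written from the numbers in this thread, not real API output, and `max_gap` is an arbitrary threshold.

```python
from typing import Dict, List, Tuple


def cpu_by_node(stats: Dict) -> List[Tuple[str, int]]:
    """Return (node name, cpu percent) pairs, busiest node first."""
    pairs = [
        (node["name"], node["os"]["cpu"]["percent"])
        for node in stats["nodes"].values()
    ]
    return sorted(pairs, key=lambda pair: pair[1], reverse=True)


def is_imbalanced(stats: Dict, max_gap: int = 30) -> bool:
    """True when the busiest and idlest nodes differ by more than max_gap points."""
    loads = [cpu for _, cpu in cpu_by_node(stats)]
    return bool(loads) and (loads[0] - loads[-1]) > max_gap


# Hand-written sample mirroring the CPU figures in the post (assumed field names).
sample = {
    "nodes": {
        "id-1": {"name": "node1", "os": {"cpu": {"percent": 5}}},
        "id-2": {"name": "node2", "os": {"cpu": {"percent": 85}}},
    }
}
print(cpu_by_node(sample))
print(is_imbalanced(sample))
```

A real tool would fetch the stats over HTTP and then decide which shards to move; this sketch only covers the comparison step.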

kimchy · April 4, 2012, 9:35pm

Since the number of replicas is 1 and you have 2 nodes, it is strange
that one node has high CPU load compared to the other. Is the
OS the same on both? Are you running on a virtualized / AWS env? Is the JVM
the same on both? Can you gist a couple of jstack results from the node
with the high load?
(http://docs.oracle.com/javase/1.5.0/docs/tooldocs/share/jstack.html)

orenmazor · April 5, 2012, 5:05am

these are two RS machines, identical in every way, except that the busy one
(node 2) holds only non-primary shards. all of the queries are going via
node 1, the one with the low cpu. node 2 (high cpu) is never actually
queried, which makes this even weirder.

if replication is basically just mirroring of requests, then the merge
activity should be identical on both, no?

jstack with no special flags:

https://gist.github.com/orenmazor/483d2cbc73c29056ac44

2012-04-05 01:00:47
Full thread dump Java HotSpot(TM) 64-Bit Server VM (20.6-b01 mixed mode):

"Attach Listener" daemon prio=10 tid=0x0000000040ad6000 nid=0x1051 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"elasticsearch[cached]-pool-1-thread-2249" daemon prio=10 tid=0x0000000040d74800 nid=0x1042 waiting on condition [0x00007f2e566b7000]
   java.lang.Thread.State: TIMED_WAITING (parking)
	at sun.misc.Unsafe.park(Native Method)
	- parking to wait for  <0x0000000400150848> (a java.util.concurrent.SynchronousQueue$TransferStack)

kimchy · April 5, 2012, 3:25pm

On Thu, Apr 5, 2012 at 8:05 AM, Oren Mazor oren@wildbit.com wrote:

these are two RS machines, identical in every way other than the busy one
(node 2), is all non-primaries. all of the queries are going via node 1,
the one with the low cpu. node 2 (high cpu) is never actually queried,
which makes this even weirder.

Even if you don't send explicit search requests to node 2, its shards are
still searched, because elasticsearch distributes each search across the
shard copies on both nodes.

if replication is basically just mirroring of requests, then the merge
activity should be identical on both, no?

First, note that the merge stats only count from the time the node was
started. And no, even if both are started at the same time, they will
differ because they are not completely in sync (doc-wise they are, but the
timing of when things like merges kick off might differ).

jstack with no special flags:
https://gist.github.com/orenmazor/483d2cbc73c29056ac44

Can you gist another set of jstack? Can you also include one from node1?
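
When comparing dumps from the two nodes, a quick count of thread states can show whether one node has many more RUNNABLE threads than the other. The sketch below is one possible way to do that, not a standard tool: it counts the `java.lang.Thread.State:` lines in a dump, using the excerpt from the gist quoted earlier in this thread as sample input. The function name `thread_states` is made up for this example.

```python
import re
from collections import Counter

# Matches lines such as "java.lang.Thread.State: TIMED_WAITING (parking)".
STATE_RE = re.compile(r"java\.lang\.Thread\.State:\s+([A-Z_]+)")


def thread_states(dump: str) -> Counter:
    """Count threads per java.lang.Thread.State in a jstack dump."""
    return Counter(STATE_RE.findall(dump))


# Excerpt copied from the gist quoted in this thread.
excerpt = '''
"Attach Listener" daemon prio=10 tid=0x0000000040ad6000 nid=0x1051 waiting on condition [0x0000000000000000]
   java.lang.Thread.State: RUNNABLE

"elasticsearch[cached]-pool-1-thread-2249" daemon prio=10 tid=0x0000000040d74800 nid=0x1042 waiting on condition [0x00007f2e566b7000]
   java.lang.Thread.State: TIMED_WAITING (parking)
'''
print(thread_states(excerpt))
```

Running it over several dumps taken a few seconds apart on each node gives a rough picture of which threads stay busy.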

Oren_Mazor · April 17, 2012, 7:20pm

so, a follow up:

we migrated to 0.19.2 and I'm seeing the same thing. the node that takes
the "point" of the requests has the cpu pegged again.

but with the new version of ES comes a new version of bigdesk (I
can't love this tool enough), and in this one I noticed that that node
has over 100k http connections, while the second node has only 20. going
to look into balancing this out first.
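
One simple way to balance client traffic is to rotate requests across the node addresses instead of always hitting the same one. The sketch below shows only the rotation logic. The class name and the node URLs are hypothetical, and it sends no actual requests.

```python
from collections import Counter
from itertools import cycle


class RoundRobinHosts:
    """Hand out node URLs in turn so no single node takes every request."""

    def __init__(self, hosts):
        hosts = list(hosts)
        if not hosts:
            raise ValueError("at least one host is required")
        self._hosts = cycle(hosts)

    def next_host(self) -> str:
        """Return the next node URL in the rotation."""
        return next(self._hosts)


# Hypothetical node addresses; substitute the real ones.
picker = RoundRobinHosts(["http://node1:9200", "http://node2:9200"])
sent = Counter(picker.next_host() for _ in range(1000))
print(sent)
```

A connection count as lopsided as 100k versus 20 may also mean the client opens a new connection for every request, so reusing connections could be worth checking alongside the rotation.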

Oren_Mazor · April 21, 2012, 7:15pm

I've noticed another thing. the nodes have different data. is this a
function of re-play of index/delete actions? is there a way to change
how often that happens? it looks like it's a day or two behind…

kimchy · April 25, 2012, 3:31pm

What do you mean by a day or two behind?

Oren_Mazor · April 27, 2012, 3:54pm

Let's say I index a new document every minute. with replicas turned on,
it means that one of my nodes is going to be missing an entire day's
worth of indexing activity.

I'm currently running without replicas (but with multiple nodes) and
it's running great. as soon as I turn on replicas, I can get two
different results for the same search query.

let me know what debug info I can get for you on this.

kimchy · April 29, 2012, 4:51pm

Replication is synchronous: once you index a document, it is also
indexed on the replica. Are you changing the refresh interval, by any
chance?
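
The refresh question matters because a document that has been indexed is not searchable until the shard copy refreshes, and each copy refreshes on its own schedule. The toy model below illustrates how two copies holding the same documents can briefly return different search results. It is a deliberately simplified sketch with made-up class and method names, not how Elasticsearch is implemented.

```python
class ShardCopy:
    """Toy model of one shard copy: indexed docs become searchable on refresh."""

    def __init__(self):
        self._indexed = []     # everything written to this copy
        self._searchable = []  # what a search can see right now

    def index(self, doc):
        self._indexed.append(doc)

    def refresh(self):
        # Make everything indexed so far visible to searches.
        self._searchable = list(self._indexed)

    def search(self):
        return list(self._searchable)


primary, replica = ShardCopy(), ShardCopy()

# Synchronous replication: every write reaches both copies.
for doc in ["a", "b", "c"]:
    primary.index(doc)
    replica.index(doc)

primary.refresh()  # only the primary has refreshed so far
before = (primary.search(), replica.search())
replica.refresh()  # the replica catches up on its own schedule
after = (primary.search(), replica.search())
print(before, after)
```

With the default one-second refresh interval such a gap should be short, so a gap of a day or more would suggest either a changed refresh setting or a different problem.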

