Limit the amount of data generated by Marvel with marvel.agent.interval?


(Logan) #1

I'm managing a pretty badass 11 node Elasticsearch cluster that is powering
a customer facing dashboard reporting platform. 20 cores per node, 64GB
RAM, SSDs, Dual 10 GbE of awesome. I evaluated Marvel while we were still
in development on the new platform and I found it to be a very valuable
tool. At first Marvel was indexing to the same cluster we were monitoring
and this was okay while we were in development as there were plenty of
extra cycles in the cluster to handle the load but now that we are in
production it doesn't make sense to burden the cluster with this. The
nature of our reporting system requires us to to have an index for each
customer so we're currently at 328 indexes and over 10,000 shards total.
The amount of data indexed by Marvel increases dramatically as the number
of indices increases so once we got over 300 indices in the system the
daily marvel index ended up at around 400 GB replicated and was indexing
around 2,000 documents a second by itself.

What I want to do is have Marvel index to a not as awesome 2 node
Elasticsearch monitoring cluster. 12 cores, 64 GB RAM and spinning disks.
But in practice these 2 nodes are unable to keep up with the load and get
completely bogged down. I'm thinking I can sacrifice redundancy and buy
myself some cycles by not using any replicas on the Marvel index. My other
idea is to set marvel.agent.interval from the default 10s to something like
30s on the assumption that this will cut the amount of data generated by a
third. Does this sound sane or do you have anyone have other ideas on what
I can try to limited the load?

marvel.agent.interval

Controls the interval between data samples. Defaults to 10s. Set to -1 to
temporarily disable exporting.

This setting is update-able via the Cluster Update Settings API.

Thanks -Logan-

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Mark Walkom) #2

That's pretty sane. I believe the newest version of marvel increased the
default from 5s to 10s.

But be aware, you are breaking the license for Marvel with that number of
nodes - http://www.elasticsearch.org/overview/marvel/

Regards,
Mark Walkom

Infrastructure Engineer
Campaign Monitor
email: markw@campaignmonitor.com
web: www.campaignmonitor.com

On 1 May 2014 06:52, Logan Hardy loganbhardy@gmail.com wrote:

I'm managing a pretty badass 11 node Elasticsearch cluster that is
powering a customer facing dashboard reporting platform. 20 cores per node,
64GB RAM, SSDs, Dual 10 GbE of awesome. I evaluated Marvel while we were
still in development on the new platform and I found it to be a very
valuable tool. At first Marvel was indexing to the same cluster we were
monitoring and this was okay while we were in development as there were
plenty of extra cycles in the cluster to handle the load but now that we
are in production it doesn't make sense to burden the cluster with this.
The nature of our reporting system requires us to to have an index for each
customer so we're currently at 328 indexes and over 10,000 shards total.
The amount of data indexed by Marvel increases dramatically as the number
of indices increases so once we got over 300 indices in the system the
daily marvel index ended up at around 400 GB replicated and was indexing
around 2,000 documents a second by itself.

What I want to do is have Marvel index to a not as awesome 2 node
Elasticsearch monitoring cluster. 12 cores, 64 GB RAM and spinning disks.
But in practice these 2 nodes are unable to keep up with the load and get
completely bogged down. I'm thinking I can sacrifice redundancy and buy
myself some cycles by not using any replicas on the Marvel index. My other
idea is to set marvel.agent.interval from the default 10s to something like
30s on the assumption that this will cut the amount of data generated by a
third. Does this sound sane or do you have anyone have other ideas on what
I can try to limited the load?

marvel.agent.interval

Controls the interval between data samples. Defaults to 10s. Set to -1 to
temporarily disable exporting.

This setting is update-able via the Cluster Update Settings API.

Thanks -Logan-

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.comhttps://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAEM624b_tp-8afb-okJSkWQ76KKbzFf9gaa97RJheLCx8-Zg0Q%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(Logan) #3

Thanks Mark. I'm aware of the licensing and have been in contact with
Elasticsearch about this. I just need to make sure we can use Marvel with
our two monitoring nodes before we commit to buying licenses. I'm
disinclined to use Marvel if I have to buy licenses and buy a bunch
additional monitoring machines. I really like Marvel but if I can't make it
work for a reasonable price I think I go back to using SPM.

On Wednesday, April 30, 2014 5:39:47 PM UTC-6, Mark Walkom wrote:

That's pretty sane. I believe the newest version of marvel increased the
default from 5s to 10s.

But be aware, you are breaking the license for Marvel with that number of
nodes - http://www.elasticsearch.org/overview/marvel/

Regards,
Mark Walkom

Infrastructure Engineer
Campaign Monitor
email: ma...@campaignmonitor.com <javascript:>
web: www.campaignmonitor.com

On 1 May 2014 06:52, Logan Hardy <logan...@gmail.com <javascript:>> wrote:

I'm managing a pretty badass 11 node Elasticsearch cluster that is
powering a customer facing dashboard reporting platform. 20 cores per node,
64GB RAM, SSDs, Dual 10 GbE of awesome. I evaluated Marvel while we were
still in development on the new platform and I found it to be a very
valuable tool. At first Marvel was indexing to the same cluster we were
monitoring and this was okay while we were in development as there were
plenty of extra cycles in the cluster to handle the load but now that we
are in production it doesn't make sense to burden the cluster with this.
The nature of our reporting system requires us to to have an index for each
customer so we're currently at 328 indexes and over 10,000 shards total.
The amount of data indexed by Marvel increases dramatically as the number
of indices increases so once we got over 300 indices in the system the
daily marvel index ended up at around 400 GB replicated and was indexing
around 2,000 documents a second by itself.

What I want to do is have Marvel index to a not as awesome 2 node
Elasticsearch monitoring cluster. 12 cores, 64 GB RAM and spinning disks.
But in practice these 2 nodes are unable to keep up with the load and get
completely bogged down. I'm thinking I can sacrifice redundancy and buy
myself some cycles by not using any replicas on the Marvel index. My other
idea is to set marvel.agent.interval from the default 10s to something like
30s on the assumption that this will cut the amount of data generated by a
third. Does this sound sane or do you have anyone have other ideas on what
I can try to limited the load?

marvel.agent.interval

Controls the interval between data samples. Defaults to 10s. Set to -1 to
temporarily disable exporting.

This setting is update-able via the Cluster Update Settings API.

Thanks -Logan-

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.comhttps://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/77324f57-837d-4ba8-8b39-2c11d3103f98%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Logan) #4

Just a follow up on my results with this. I was able t get the daily marvel
index down from hundreds of GBs per day replicated to about 22GB per day by
setting replicas to 0 on the marvel index and marvel.agent.interval: to
30s.

I also noticed that after upgrading my production Elasticsearch cluster
from 0.90.12 to 1.1.1 (now the same version as the monitoring cluster) the
daily marvel index went down to around 2GB per day. Hooray!

-Logan-

On Saturday, May 3, 2014 6:00:51 PM UTC-6, Logan Hardy wrote:

Thanks Mark. I'm aware of the licensing and have been in contact with
Elasticsearch about this. I just need to make sure we can use Marvel with
our two monitoring nodes before we commit to buying licenses. I'm
disinclined to use Marvel if I have to buy licenses and buy a bunch
additional monitoring machines. I really like Marvel but if I can't make it
work for a reasonable price I think I go back to using SPM.

On Wednesday, April 30, 2014 5:39:47 PM UTC-6, Mark Walkom wrote:

That's pretty sane. I believe the newest version of marvel increased the
default from 5s to 10s.

But be aware, you are breaking the license for Marvel with that number of
nodes - http://www.elasticsearch.org/overview/marvel/

Regards,
Mark Walkom

Infrastructure Engineer
Campaign Monitor
email: ma...@campaignmonitor.com
web: www.campaignmonitor.com

On 1 May 2014 06:52, Logan Hardy logan...@gmail.com wrote:

I'm managing a pretty badass 11 node Elasticsearch cluster that is
powering a customer facing dashboard reporting platform. 20 cores per node,
64GB RAM, SSDs, Dual 10 GbE of awesome. I evaluated Marvel while we were
still in development on the new platform and I found it to be a very
valuable tool. At first Marvel was indexing to the same cluster we were
monitoring and this was okay while we were in development as there were
plenty of extra cycles in the cluster to handle the load but now that we
are in production it doesn't make sense to burden the cluster with this.
The nature of our reporting system requires us to to have an index for each
customer so we're currently at 328 indexes and over 10,000 shards total.
The amount of data indexed by Marvel increases dramatically as the number
of indices increases so once we got over 300 indices in the system the
daily marvel index ended up at around 400 GB replicated and was indexing
around 2,000 documents a second by itself.

What I want to do is have Marvel index to a not as awesome 2 node
Elasticsearch monitoring cluster. 12 cores, 64 GB RAM and spinning disks.
But in practice these 2 nodes are unable to keep up with the load and get
completely bogged down. I'm thinking I can sacrifice redundancy and buy
myself some cycles by not using any replicas on the Marvel index. My other
idea is to set marvel.agent.interval from the default 10s to something like
30s on the assumption that this will cut the amount of data generated by a
third. Does this sound sane or do you have anyone have other ideas on what
I can try to limited the load?

marvel.agent.interval

Controls the interval between data samples. Defaults to 10s. Set to -1 to
temporarily disable exporting.

This setting is update-able via the Cluster Update Settings API.

Thanks -Logan-

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearc...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.comhttps://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d320e16b-e6ee-44ec-b084-96ec51e637d2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Boaz Leskes) #5

Logan,

It sounds like you also made the move from version 1.0 of marvel to 1.1 -
we had great data reductions there. I would also recommend you upgrade your
monitoring cluster to ES 1.2.0 released yesterday. From my measurements it
saves around 30%

Cheers,
Boaz

On Thursday, May 22, 2014 7:36:06 PM UTC+2, Logan Hardy wrote:

Just a follow up on my results with this. I was able t get the daily
marvel index down from hundreds of GBs per day replicated to about 22GB per
day by setting replicas to 0 on the marvel index and marvel.agent.interval:
to 30s.

I also noticed that after upgrading my production Elasticsearch cluster
from 0.90.12 to 1.1.1 (now the same version as the monitoring cluster) the
daily marvel index went down to around 2GB per day. Hooray!

-Logan-

On Saturday, May 3, 2014 6:00:51 PM UTC-6, Logan Hardy wrote:

Thanks Mark. I'm aware of the licensing and have been in contact with
Elasticsearch about this. I just need to make sure we can use Marvel with
our two monitoring nodes before we commit to buying licenses. I'm
disinclined to use Marvel if I have to buy licenses and buy a bunch
additional monitoring machines. I really like Marvel but if I can't make it
work for a reasonable price I think I go back to using SPM.

On Wednesday, April 30, 2014 5:39:47 PM UTC-6, Mark Walkom wrote:

That's pretty sane. I believe the newest version of marvel increased the
default from 5s to 10s.

But be aware, you are breaking the license for Marvel with that number
of nodes - http://www.elasticsearch.org/overview/marvel/

Regards,
Mark Walkom

Infrastructure Engineer
Campaign Monitor
email: ma...@campaignmonitor.com
web: www.campaignmonitor.com

On 1 May 2014 06:52, Logan Hardy logan...@gmail.com wrote:

I'm managing a pretty badass 11 node Elasticsearch cluster that is
powering a customer facing dashboard reporting platform. 20 cores per node,
64GB RAM, SSDs, Dual 10 GbE of awesome. I evaluated Marvel while we were
still in development on the new platform and I found it to be a very
valuable tool. At first Marvel was indexing to the same cluster we were
monitoring and this was okay while we were in development as there were
plenty of extra cycles in the cluster to handle the load but now that we
are in production it doesn't make sense to burden the cluster with this.
The nature of our reporting system requires us to to have an index for each
customer so we're currently at 328 indexes and over 10,000 shards total.
The amount of data indexed by Marvel increases dramatically as the number
of indices increases so once we got over 300 indices in the system the
daily marvel index ended up at around 400 GB replicated and was indexing
around 2,000 documents a second by itself.

What I want to do is have Marvel index to a not as awesome 2 node
Elasticsearch monitoring cluster. 12 cores, 64 GB RAM and spinning disks.
But in practice these 2 nodes are unable to keep up with the load and get
completely bogged down. I'm thinking I can sacrifice redundancy and buy
myself some cycles by not using any replicas on the Marvel index. My other
idea is to set marvel.agent.interval from the default 10s to something like
30s on the assumption that this will cut the amount of data generated by a
third. Does this sound sane or do you have anyone have other ideas on what
I can try to limited the load?

marvel.agent.interval

Controls the interval between data samples. Defaults to 10s. Set to -1 to
temporarily disable exporting.

This setting is update-able via the Cluster Update Settings API.

Thanks -Logan-

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearc...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.comhttps://groups.google.com/d/msgid/elasticsearch/5884045a-49f7-48d4-a3cb-93a5f70c53cf%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/5c65cb1a-5d09-4e97-b6b4-5f8ada8b95e6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(system) #6