We are running on AWS and index our data on the fastest nodes we can get,
then go back and reduce the nodes to smaller/slower instances to save
money. We don't scale the number of nodes, just the speed/cost of the
nodes. If you're using your own hardware, that would be tougher to do.
The other shoe may be to look at how to index data more efficiently so that
you don't require so many nodes.
Thanks Craig... Are you not lowering the number of nodes because that
is difficult/impossible, or because it's just trivial on AWS?
and thanks for the other shoe idea.
I don't know ES just yet... and I am basically asking: If I use say 10
nodes, I assume my data would end up being distributed more or less evenly
of all 10... Is there a process in ES by which one can move "everything"
(data/index) off of one node, so as to remove it afterwards? even a manual
process?
Thanks a lot,
Mohamed.
On Monday, September 24, 2012 3:43:51 PM UTC-4, Craig Brown wrote:
We are running on AWS and index our data on the fastest nodes we can get,
then go back and reduce the nodes to smaller/slower instances to save
money. We don't scale the number of nodes, just the speed/cost of the
nodes. If you're using your own hardware, that would be tougher to do.
The other shoe may be to look at how to index data more efficiently so
that you don't require so many nodes.
Craig
On Mon, Sep 24, 2012 at 1:37 PM, Mohamed Lrhazi <ml...@georgetown.edu<javascript:>
wrote:
I was hoping my question would get a quick RTFM reply... Maybe it is not
possible to scale down? Or am I thinking about my problem in the wrong way?
Thank you so much,
Mohamed.
On Sunday, September 23, 2012 11:08:29 AM UTC-4, Mohamed Lrhazi wrote:
Hello,
I have not used elasticsearch before, but am considering it for a
project. One question I have is this:
Can you start off with a big number of nodes, in order to speed up the
indexing of legacy data, then scale down for normal operation afterwards?
I really haven't played around with reducing the number of nodes in the
cluster. I'm not sure of a way to really do that right now, though someone
else may have a better idea. I know there is an upcoming release where you
can manually move shards to particular nodes. The cluster will try to
rebalance the remaining shards across the cluster, but that can be turned
off.
I assume that with the new release you could turn off rebalancing, manually
specify the new shard placements, remove the nodes/nodes, then turn on
rebalancing and have the cluster get back to green status, but I don't know
for sure. Shay would probably be the best one to answer that. I'm not sure
if that release will be 0.19.10 or 0.20.1.
Thanks Craig... Are you not lowering the number of nodes because that
is difficult/impossible, or because it's just trivial on AWS?
and thanks for the other shoe idea.
I don't know ES just yet... and I am basically asking: If I use say 10
nodes, I assume my data would end up being distributed more or less evenly
of all 10... Is there a process in ES by which one can move "everything"
(data/index) off of one node, so as to remove it afterwards? even a manual
process?
Thanks a lot,
Mohamed.
On Monday, September 24, 2012 3:43:51 PM UTC-4, Craig Brown wrote:
We are running on AWS and index our data on the fastest nodes we can get,
then go back and reduce the nodes to smaller/slower instances to save
money. We don't scale the number of nodes, just the speed/cost of the
nodes. If you're using your own hardware, that would be tougher to do.
The other shoe may be to look at how to index data more efficiently so
that you don't require so many nodes.
Very useful info for me, I'll read about the newer releases too.
Thanks a lot,
Mohamed,
On Monday, September 24, 2012 4:13:11 PM UTC-4, Craig Brown wrote:
I really haven't played around with reducing the number of nodes in the
cluster. I'm not sure of a way to really do that right now, though someone
else may have a better idea. I know there is an upcoming release where you
can manually move shards to particular nodes. The cluster will try to
rebalance the remaining shards across the cluster, but that can be turned
off.
I assume that with the new release you could turn off rebalancing,
manually specify the new shard placements, remove the nodes/nodes, then
turn on rebalancing and have the cluster get back to green status, but I
don't know for sure. Shay would probably be the best one to answer that.
I'm not sure if that release will be 0.19.10 or 0.20.1.
Craig
On Mon, Sep 24, 2012 at 2:04 PM, Mohamed Lrhazi <ml...@georgetown.edu<javascript:>
wrote:
Thanks Craig... Are you not lowering the number of nodes because that
is difficult/impossible, or because it's just trivial on AWS?
and thanks for the other shoe idea.
I don't know ES just yet... and I am basically asking: If I use say 10
nodes, I assume my data would end up being distributed more or less evenly
of all 10... Is there a process in ES by which one can move "everything"
(data/index) off of one node, so as to remove it afterwards? even a manual
process?
Thanks a lot,
Mohamed.
On Monday, September 24, 2012 3:43:51 PM UTC-4, Craig Brown wrote:
We are running on AWS and index our data on the fastest nodes we can
get, then go back and reduce the nodes to smaller/slower instances to save
money. We don't scale the number of nodes, just the speed/cost of the
nodes. If you're using your own hardware, that would be tougher to do.
The other shoe may be to look at how to index data more efficiently so
that you don't require so many nodes.
Very useful info for me, I'll read about the newer releases too.
Thanks a lot,
Mohamed,
On Monday, September 24, 2012 4:13:11 PM UTC-4, Craig Brown wrote:
I really haven't played around with reducing the number of nodes in the
cluster. I'm not sure of a way to really do that right now, though someone
else may have a better idea. I know there is an upcoming release where you
can manually move shards to particular nodes. The cluster will try to
rebalance the remaining shards across the cluster, but that can be turned
off.
I assume that with the new release you could turn off rebalancing,
manually specify the new shard placements, remove the nodes/nodes, then
turn on rebalancing and have the cluster get back to green status, but I
don't know for sure. Shay would probably be the best one to answer that.
I'm not sure if that release will be 0.19.10 or 0.20.1.
Thanks Craig... Are you not lowering the number of nodes because that
is difficult/impossible, or because it's just trivial on AWS?
and thanks for the other shoe idea.
I don't know ES just yet... and I am basically asking: If I use say 10
nodes, I assume my data would end up being distributed more or less evenly
of all 10... Is there a process in ES by which one can move "everything"
(data/index) off of one node, so as to remove it afterwards? even a manual
process?
Thanks a lot,
Mohamed.
On Monday, September 24, 2012 3:43:51 PM UTC-4, Craig Brown wrote:
We are running on AWS and index our data on the fastest nodes we can
get, then go back and reduce the nodes to smaller/slower instances to save
money. We don't scale the number of nodes, just the speed/cost of the
nodes. If you're using your own hardware, that would be tougher to do.
The other shoe may be to look at how to index data more efficiently so
that you don't require so many nodes.
Very useful info for me, I'll read about the newer releases too.
Thanks a lot,
Mohamed,
On Monday, September 24, 2012 4:13:11 PM UTC-4, Craig Brown wrote:
I really haven't played around with reducing the number of nodes in the
cluster. I'm not sure of a way to really do that right now, though someone
else may have a better idea. I know there is an upcoming release where you
can manually move shards to particular nodes. The cluster will try to
rebalance the remaining shards across the cluster, but that can be turned
off.
I assume that with the new release you could turn off rebalancing,
manually specify the new shard placements, remove the nodes/nodes, then
turn on rebalancing and have the cluster get back to green status, but I
don't know for sure. Shay would probably be the best one to answer that.
I'm not sure if that release will be 0.19.10 or 0.20.1.
Thanks Craig... Are you not lowering the number of nodes because that
is difficult/impossible, or because it's just trivial on AWS?
and thanks for the other shoe idea.
I don't know ES just yet... and I am basically asking: If I use say 10
nodes, I assume my data would end up being distributed more or less evenly
of all 10... Is there a process in ES by which one can move "everything"
(data/index) off of one node, so as to remove it afterwards? even a manual
process?
Thanks a lot,
Mohamed.
On Monday, September 24, 2012 3:43:51 PM UTC-4, Craig Brown wrote:
We are running on AWS and index our data on the fastest nodes we can
get, then go back and reduce the nodes to smaller/slower instances to save
money. We don't scale the number of nodes, just the speed/cost of the
nodes. If you're using your own hardware, that would be tougher to do.
The other shoe may be to look at how to index data more efficiently so
that you don't require so many nodes.
On Mon, Sep 24, 2012 at 1:38 PM, Mohamed Lrhazi <ml...@georgetown.edu<javascript:>
wrote:
Very useful info for me, I'll read about the newer releases too.
Thanks a lot,
Mohamed,
On Monday, September 24, 2012 4:13:11 PM UTC-4, Craig Brown wrote:
I really haven't played around with reducing the number of nodes in the
cluster. I'm not sure of a way to really do that right now, though someone
else may have a better idea. I know there is an upcoming release where you
can manually move shards to particular nodes. The cluster will try to
rebalance the remaining shards across the cluster, but that can be turned
off.
I assume that with the new release you could turn off rebalancing,
manually specify the new shard placements, remove the nodes/nodes, then
turn on rebalancing and have the cluster get back to green status, but I
don't know for sure. Shay would probably be the best one to answer that.
I'm not sure if that release will be 0.19.10 or 0.20.1.
Thanks Craig... Are you not lowering the number of nodes because that
is difficult/impossible, or because it's just trivial on AWS?
and thanks for the other shoe idea.
I don't know ES just yet... and I am basically asking: If I use say 10
nodes, I assume my data would end up being distributed more or less evenly
of all 10... Is there a process in ES by which one can move "everything"
(data/index) off of one node, so as to remove it afterwards? even a manual
process?
Thanks a lot,
Mohamed.
On Monday, September 24, 2012 3:43:51 PM UTC-4, Craig Brown wrote:
We are running on AWS and index our data on the fastest nodes we can
get, then go back and reduce the nodes to smaller/slower instances to save
money. We don't scale the number of nodes, just the speed/cost of the
nodes. If you're using your own hardware, that would be tougher to do.
The other shoe may be to look at how to index data more efficiently
so that you don't require so many nodes.
Craig
On Mon, Sep 24, 2012 at 1:37 PM, Mohamed Lrhazi <ml...@georgetown.edu
wrote:
I was hoping my question would get a quick RTFM reply... Maybe it is
not possible to scale down? Or am I thinking about my problem in the wrong
way?
Thank you so much,
Mohamed.
On Sunday, September 23, 2012 11:08:29 AM UTC-4, Mohamed Lrhazi
wrote:
Hello,
I have not used elasticsearch before, but am considering it for a
project. One question I have is this:
Can you start off with a big number of nodes, in order to speed up
the indexing of legacy data, then scale down for normal operation
afterwards?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.