Please provide suggestions on this

banu · September 30, 2016, 1:42pm

How will data be distributed when we have a single index (1 shard and 1 replica) stored on a 2-node cluster? The primary will be stored on one node and the replica on the other. What happens when the disk reaches its maximum capacity and we add a new node to avoid failure? Will new data for the index be loaded onto the newly added node, or can it not be? If it can be loaded onto the new node, will that not impact the relevance score?

Can we move one index to another node if we have 2 indices in the scenario above?
During the initial data load, if we have 10 indices, is there any way to control which node the primary index is assigned to among multiple nodes?

warkolm · October 1, 2016, 11:32am

Please format your question a little better. You have a single massive line that people need to scroll through just to see what you are asking.

banu · October 3, 2016, 5:27am

How will data be distributed when we have a single index (1 shard and 1 replica) stored on a 2-node cluster?
The primary will be stored on one node and the replica on the other.
What happens when the disk reaches its maximum capacity and we add a new node to avoid failure? Will new data for the index be loaded onto the newly added node, or can it not be? If it can be loaded onto the new node, will that not impact the relevance score?

warkolm · October 3, 2016, 5:55am

Yes.

Adding another node after a node gets to 100% disk use is no good. Do it before.

banu · October 3, 2016, 6:15am

Will it not impact the relevance score?

warkolm · October 3, 2016, 6:18am

Scoring is calculated against docs in the same shard. So no.
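To illustrate the point about per-shard scoring, here is a toy sketch (not Elasticsearch code). It assumes Lucene's classic TF-IDF inverse document frequency, `1 + ln(numDocs / (docFreq + 1))`, and uses made-up document counts to show that the same term can receive a different IDF on two shards with different local statistics. With a single primary shard there is only one set of statistics, so adding a node should not change how documents are scored.

```python
import math


def classic_idf(num_docs: int, doc_freq: int) -> float:
    """Classic Lucene-style IDF, computed from one shard's local statistics."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Hypothetical per-shard statistics for the same search term.
shard_a = {"num_docs": 1000, "doc_freq": 9}
shard_b = {"num_docs": 1000, "doc_freq": 99}

idf_a = classic_idf(**shard_a)  # term is rare on shard A, so IDF is higher
idf_b = classic_idf(**shard_b)  # term is common on shard B, so IDF is lower

print(f"shard A idf={idf_a:.3f}, shard B idf={idf_b:.3f}")
```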

banu · October 3, 2016, 6:20am

Can we move one index to another node if we have 2 indices in the scenario above?

warkolm · October 3, 2016, 6:23am

Maybe. You'd really need to test, because ES generally doesn't like full disks.

banu · October 3, 2016, 6:46am

During the initial data load, if we have 10 indices, is there any way to control which node the primary index is assigned to among multiple nodes?

warkolm · October 3, 2016, 6:47am

Nope. It shouldn't matter.

banu · October 3, 2016, 7:02am

Then how do we handle it when the maximum disk space is reached?

warkolm · October 3, 2016, 7:04am

Prevention is better than cure in this case.
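As a rough sketch of what "prevention" could look like, the helper below classifies a node's disk usage against Elasticsearch's disk-based allocation watermarks. The 85% and 90% values are the documented defaults for `cluster.routing.allocation.disk.watermark.low` and `cluster.routing.allocation.disk.watermark.high`; the function name, the status strings, and the exact comparison at the boundary are assumptions made for illustration.

```python
LOW_WATERMARK = 85.0   # default cluster.routing.allocation.disk.watermark.low
HIGH_WATERMARK = 90.0  # default cluster.routing.allocation.disk.watermark.high


def disk_status(used_bytes: int, total_bytes: int) -> str:
    """Classify disk usage relative to the allocation watermarks (hypothetical helper)."""
    used_pct = 100.0 * used_bytes / total_bytes
    if used_pct >= HIGH_WATERMARK:
        # Elasticsearch tries to relocate shards away from such a node.
        return "above_high"
    if used_pct >= LOW_WATERMARK:
        # Elasticsearch stops allocating new shards to such a node.
        return "above_low"
    return "ok"


print(disk_status(used_bytes=870, total_bytes=1000))
```

Per-node disk usage can be read from `GET _cat/allocation?v`, so a check like this could run well before a disk fills up and prompt adding capacity in time.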

banu · October 3, 2016, 7:50am

Hey Mark, I need a more detailed description of this scenario. I have 1 shard, 1 replica and 2 nodes. If I am indexing millions of documents and the disk reaches its maximum capacity, can we increase the node size or the shard size? Which one is recommended?

Christian_Dahlqvist · October 3, 2016, 8:03am

If you have a single index with just 1 shard and 1 replica, there is little you can do to redistribute data if the disk gets full, as shards cannot be split. You can, however, provision nodes with more disk space and move the shard over there in order to scale up, but you will not be able to scale horizontally. If you had more than 1 shard, you could add nodes to the cluster and scale out horizontally, as Elasticsearch would relocate a portion of the shards from the full node and free up space.
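Moving a shard to a larger node can be requested explicitly with the cluster reroute API (`POST /_cluster/reroute`) and its `move` command. The sketch below only builds the request body; the index name `my-index` and the node names `small-node` and `big-node` are hypothetical, and sending the request is left to whatever HTTP client you use.

```python
import json


def build_move_command(index: str, shard: int, from_node: str, to_node: str) -> dict:
    """Build the body for POST /_cluster/reroute that moves one shard copy."""
    return {
        "commands": [
            {
                "move": {
                    "index": index,
                    "shard": shard,
                    "from_node": from_node,
                    "to_node": to_node,
                }
            }
        ]
    }


# Hypothetical names: move shard 0 of "my-index" to a node with more disk.
body = build_move_command("my-index", 0, "small-node", "big-node")
print(json.dumps(body, indent=2))
```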

banu · October 3, 2016, 9:16am

Thanks for your suggestion, Christian. I do have another query: during the initial data load, if we have 10 indices, is there any way to control which node the primary index is assigned to among multiple nodes?

Christian_Dahlqvist · October 3, 2016, 9:28am

I assume you are referring to primary shards as there are no primary indices, and if this is the case the answer is no, as this is not possible. Primaries and replicas however do the same amount of work, so you should not need to be concerned with this.

banu · October 3, 2016, 9:43am

If I have 5 nodes, the 5th node is named "testnode", and the index is named "testindex", is it possible to assign "testindex" to "testnode"?

Christian_Dahlqvist · October 3, 2016, 9:55am

Yes, that should be possible through shard allocation filtering.
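One mechanism for pinning an index to a named node is index-level shard allocation filtering, using the `index.routing.allocation.require._name` setting. The sketch below builds the body for `PUT /testindex/_settings`, reusing the node name from the question; the helper function is hypothetical and the request itself is not sent.

```python
import json


def build_allocation_filter(node_name: str) -> dict:
    """Build an index settings body that requires shards to live on one named node."""
    return {"index.routing.allocation.require._name": node_name}


# Body for: PUT /testindex/_settings
settings_body = build_allocation_filter("testnode")
print(json.dumps(settings_body))
```

Note that a primary and its replica are never allocated to the same node, so with 1 replica and a single required node the replica would be expected to stay unassigned.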

banu · October 3, 2016, 10:07am

Thanks for your suggestion, Christian.

banu · October 3, 2016, 11:09am

If we have 2 shards on one node, will the data be split between the shards?
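For context on how documents end up in a shard: Elasticsearch routes each document with `hash(_routing) % number_of_primary_shards`, where `_routing` defaults to the document ID, regardless of which nodes hold the shards. The sketch below imitates that idea with `zlib.crc32` as a stand-in hash (Elasticsearch itself uses Murmur3, so real shard numbers will differ) and made-up document IDs.

```python
import zlib


def pick_shard(routing_key: str, num_primary_shards: int) -> int:
    """Toy routing: stand-in hash of the routing key, modulo the primary shard count."""
    return zlib.crc32(routing_key.encode("utf-8")) % num_primary_shards


# Distribute 1000 hypothetical document IDs over an index with 2 primary shards.
counts = [0, 0]
for doc_id in range(1000):
    counts[pick_shard(str(doc_id), 2)] += 1

print(counts)
```

Each document lands in exactly one primary shard, so with 2 shards the data is divided between them rather than duplicated.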
