Separate data and metadata

Oleg_Ruchovets · September 14, 2020, 1:20pm

Hello.
In Hadoop echo system and some other solution: data and metadata are stored separately. For Hadoop, there are master nodes to store metadata.

Question: does Elasticsearch have similar architecture? I mean is there metadata and data separation or ES hold data and metadata as part of the same JSON document?

Thanks.

DavidTurner · September 14, 2020, 1:52pm

In Elasticsearch the master-eligible nodes are responsible for the cluster metadata, whereas the data nodes are responsible for the data. A single node can do both roles, or you can separate them. See these docs for more information.

Oleg_Ruchovets · September 14, 2020, 2:16pm

Ok, Thank @DavidTurner for the quick answer.

Is it a best practice to separate the master node from data nodes?
In case of a master crashed elastic will elect a new master, right? what will be in this case with metadata? Should master machine configuration be more memory / CPU / disk intensive comparing to the data node?

Thanks

warkolm · September 14, 2020, 10:17pm

Yes.

Yes. All nodes in the cluster hold the cluster state.

No, you might even be able to go with less.

system · October 12, 2020, 10:17pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Three questions about master/data node failover Elasticsearch	9	3355	July 5, 2017
Where cluster metadata is stored? Elasticsearch	13	7253	July 5, 2017
Lost index metadata and overwriting pre-existing index files Elasticsearch	3	1551	July 6, 2017
Design question Elasticsearch	7	371	July 6, 2017
Is it possible to use same data storage for nodes? Elasticsearch	2	2607	December 20, 2018

Separate data and metadata

Related topics