Bad recovery after cluster restart

Ludovic · November 22, 2011, 12:16pm

Hi,

We are running ES 0.17.6 on 2 servers with 4 ES nodes.
After a cluster restart, we have lost all indices.
We can still see them in the work folders, and the cluster state is green
(cluster health: green (4, 0)), but the indices are not recovered.

Here is the gist with the configuration and log:

https://gist.github.com/anonymous/1385395

deltaA-Cluster.log

[2011-11-22 11:54:03,494][INFO ][node                     ] [Grasshopper I&II] {elasticsearch/0.17.6}[30927]: initializing ...
[2011-11-22 11:54:03,508][INFO ][plugins                  ] [Grasshopper I&II] loaded [], sites []
[2011-11-22 11:54:05,886][INFO ][node                     ] [Grasshopper I&II] {elasticsearch/0.17.6}[30927]: initialized
[2011-11-22 11:54:05,887][INFO ][node                     ] [Grasshopper I&II] {elasticsearch/0.17.6}[30927]: starting ...
[2011-11-22 11:54:05,973][INFO ][transport                ] [Grasshopper I&II] bound_address {inet[/0:0:0:0:0:0:0:0:9301]}, publish_address {inet[/10.117.202.47:9301]}
[2011-11-22 11:54:09,015][INFO ][cluster.service          ] [Grasshopper I&II] new_master [Grasshopper I&II][fKDL-rikT4q-ePDOVFsPsA][inet[/10.117.202.47:9301]], reason: zen-disco-join (elected_as_master)
[2011-11-22 11:54:09,022][INFO ][discovery                ] [Grasshopper I&II] deltaA/fKDL-rikT4q-ePDOVFsPsA
[2011-11-22 11:54:09,030][INFO ][http                     ] [Grasshopper I&II] bound_address {inet[/0:0:0:0:0:0:0:0:9201]}, publish_address {inet[/10.117.202.47:9201]}
[2011-11-22 11:54:09,030][INFO ][node                     ] [Grasshopper I&II] {elasticsearch/0.17.6}[30927]: started
[2011-11-22 11:54:09,033][INFO ][gateway                  ] [Grasshopper I&II] recovered [0] indices into cluster_state

(log truncated; the full file is in the gist)

elasticsearch.yml

# Cluster Settings
cluster:
  name: deltaA 

# Gateway Settings
gateway:
 type: fs
 fs:
  location: /elasticsearch/index/work
#  recover_after_nodes: 1

(configuration truncated; the full file is in the gist)

Any ideas how to force recovery?

Thank you.

kimchy · November 22, 2011, 12:27pm

I am a bit confused about your setup. You are using the shared fs
gateway: is the path you configured it with (the ES work directory???)
mounted on both servers, so that both servers see the same file system?
Why are you using the shared fs gateway and not the local gateway?
Why are you starting two nodes on a single server?
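
One way to answer the first question is to drop a marker file into the gateway path on one server and look for it on the other. The snippet below is only a sketch of that check: the helper names and the marker file name are made up for illustration, and it assumes the gateway path is writable by the user running it.

```python
import os
import uuid


def write_marker(gateway_path):
    """Run on the first server: create a uniquely named marker file."""
    token = uuid.uuid4().hex
    # Hypothetical file name; anything unique works.
    marker = os.path.join(gateway_path, f"shared-check-{token}")
    with open(marker, "w") as fh:
        fh.write(token)
    return token


def marker_visible(gateway_path, token):
    """Run on the second server: True only if the first server's marker exists here."""
    return os.path.exists(os.path.join(gateway_path, f"shared-check-{token}"))
```

Call `write_marker("/elasticsearch/index/work")` on one server, then `marker_visible("/elasticsearch/index/work", token)` on the other with the returned token. If it returns `False`, the two servers are not looking at the same file system. Delete the marker file afterwards.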

Ludovic · November 22, 2011, 2:06pm

Hi Shay

> I am a bit confused about your setup. You are using the shared fs
> gateway: is the path you configured it with (the ES work directory???)
> mounted on both servers, so that both servers see the same file system?

Each node has its own "work" directory.

> Why are you using the shared fs gateway and not the local gateway?

We have only 2 servers, and they do not have enough local disk space, so
our only option is a disk storage bay. That is why we used the shared fs
gateway.

> Why are you starting two nodes on a single server?

The indices we want to use have 1 replica. We want to test load balancing
with 4 nodes on only 2 servers. Our understanding, which may be wrong, is
that 4 nodes will be better than 2.

best regards

kimchy · November 22, 2011, 3:38pm

On Tue, Nov 22, 2011 at 4:06 PM, Ludovic superglu07@gmail.com wrote:

> Hi Shay
>
>> I am a bit confused about your setup. You are using the shared fs
>> gateway: is the path you configured it with (the ES work directory???)
>> mounted on both servers, so that both servers see the same file system?
>
> Each node has its own "work" directory.

Then this does not work. The shared fs gateway requires the same mount to
be visible from all servers. But you don't really need it; use the
default, recommended local gateway.

That explains, btw, why you "lost" your data. The master node stores, in
the presumed shared gateway location, the fact that indices were created.
But because your location is not shared, when you restarted and a master
node was elected on another server, it went to its own "shared" fs
location, and there is no metadata there.
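
The failure described above can be reproduced with a toy model. This is not Elasticsearch code, only a sketch under simplifying assumptions: a plain dict stands in for each gateway directory, the index name `myindex` is invented, and the master is simply assumed to change from `node0` to `node1` across the restart.

```python
class Node:
    def __init__(self, name, gateway_dir):
        self.name = name
        # A dict standing in for the directory this node treats as the gateway.
        self.gateway_dir = gateway_dir


def create_index(master, index_name):
    # The master records index metadata in what it believes is the shared gateway.
    master.gateway_dir.setdefault("indices", set()).add(index_name)


def recover(master):
    # After a full restart, the elected master reads metadata from its gateway.
    return sorted(master.gateway_dir.get("indices", set()))


def simulate(shared):
    common = {}
    # shared=True: every node sees the same directory.
    # shared=False: every node gets its own private directory.
    nodes = [Node(f"node{i}", common if shared else {}) for i in range(4)]
    create_index(nodes[0], "myindex")  # node0 is master before the restart
    return recover(nodes[1])           # node1 is elected after the restart


print(simulate(shared=True))   # ['myindex']
print(simulate(shared=False))  # []
```

The second result corresponds to the `recovered [0] indices into cluster_state` line in the posted log: the newly elected master looks in its own directory and finds no metadata, even though the index files still exist elsewhere.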

>> Why are you using the shared fs gateway and not the local gateway?
>
> We have only 2 servers, and they do not have enough local disk space, so
> our only option is a disk storage bay. That is why we used the shared fs
> gateway.

The local gateway also works using the "file system". I suggest you use
it. You will need to reindex the data to move from the shared fs gateway
to the local gateway.

>> Why are you starting two nodes on a single server?
>
> The indices we want to use have 1 replica. We want to test load balancing
> with 4 nodes on only 2 servers. Our understanding, which may be wrong, is
> that 4 nodes will be better than 2.

99% chance you are wrong; use one node per machine.

> best regards
