On Mon, Mar 4, 2013 at 12:55 PM, Jörg Prante joergprante@gmail.com wrote:
Hej Ivan, good catch, I hope I can include the parsing of Elasticsearch
files into my skywalker plugin for 0.90 ...
I'm not sure that having two clusters in a red state is a good catch.
My rough understanding is, if the index is busted, folders on shard level
may be gone or deleted, e.g.
The issue was not that the index was busted (it was not), but the cluster
state was incorrect. This state was persisted to disk and I would love to
have more insight on where exactly is the state. The issue went away by
deleting an index, so it appears that the index (not global) state was
incorrect.
data//nodes/**/indices//<shard-**id>/
and if the files can not be read, the ES will ask the gateway to react
with a recovery of the shard.
I understand the organization of the Lucene shards, it is the gateway
files I am looking for info on. If I leave the Lucene files and translogs
intact, but delete the state, will the gateway recover only the state or
will it invalidate the shard. I am assuming the latter.
There are also lock files, only present while runtime, and checksum files.
In the _state files, there is redundant information of the cluster states,
so, if these files are gone, ES will look at other places in the hope there
will be the missing state information. I have to explore the code to find
out hese "other places", how an ES node recovers and matches cluster states
recovered from disk. Right now I would assume, the master node cluster
state simply overrides conflicting information...
The issue is there are two master nodes in a split-brain cluster. I figured
out the Java classes for most of the state files (IndexMetaData, MetaData),
but I failed to find where the master state is persisted. Only spent a few
minutes looking.
I guess what I am looking for are two things:
-
Where is the master/routing state persisted?
-
What classes are responsible for reading the various files? I figured
most of this part out, but I was hoping for a more authoritative answer.
This second part would be a great addition to the skywalker plugin, but
this is data at the file-system level and not discoverable via the API.
With one cluster meltdown, the node would not even start correctly. Not
sure if site plugins still worked.
Cheers,
Jörg
Cheers,
Ivan
Am 04.03.13 18:01, schrieb Ivan Brusic:
I am hoping to perhaps leverage the existing classes and write a simple
utility class to parse the state into a human-readable format (read-only
sanity check).
--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@**googlegroups.comelasticsearch%2Bunsubscribe@googlegroups.com
.
For more options, visit https://groups.google.com/**groups/opt_outhttps://groups.google.com/groups/opt_out
.
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.