I'm having issues with my cluster. The cluster has 3 master nodes and 15 other nodes: 13 are data nodes and 2 are edge nodes. The cluster was working before, until I made a change to one of the templates that we use.
The original config looks like this:
{
  "template" : "servers-",
  "settings" : {
    "index.refresh_interval" : "5s",
    "index.routing.allocation.require.node_type": "hot",
    "analysis" : {
      "analyzer" : {
        "default" : {
          "type" : "standard",
          "stopwords" : "none"
        }
      }
    }
  },
  "mappings" : {
    "default" : {
      "_all" : {"enabled" : true},
      "dynamic_templates" : [
        {
          "string_fields" : {
            "match" : "*",
            "match_mapping_type" : "string",
            "mapping" : {
              "type" : "multi_field",
              "fields" : {
                "{name}" : {"type": "string", "index" : "analyzed", "omit_norms" : true, "index_options" : "docs"}
              }
            }
          }
        } ],
      "properties" : {
        "@version": { "type": "string", "index": "not_analyzed" },
        "geoip" : {
          "type" : "object",
          "dynamic": true,
          "properties" : {
            "ip" : { "type": "ip" },
            "location" : { "type" : "geo_point" }
          }
        },
        "tags": { "type": "string", "index": "not_analyzed" },
        "pid": { "type": "long", "index": "not_analyzed" },
        "priority": { "type": "integer", "index": "not_analyzed" },
        "severity": { "type": "integer", "index": "not_analyzed" },
        "facility": { "type": "integer", "index": "not_analyzed" },
        "syslog_severity_code": { "type": "integer", "index": "not_analyzed" },
        "syslog_facility_code": { "type": "integer", "index": "not_analyzed" }
      }
    }
  }
}
My changes:
{
  "template" : "servers-",
  "settings" : {
    "index.refresh_interval" : "5s",
    "index.routing.allocation.require.node_type": "hot",
    "analysis" : {
      "analyzer" : {
        "default" : {
          "type" : "standard",
          "stopwords" : "none"
        }
      }
    }
  },
  "mappings" : {
    "default" : {
      "_all" : {"enabled" : true},
      "dynamic_templates" : [
        {
          "beat" : {
            "path_match" : "beat.",
            "match_mapping_type" : "string",
            "mapping" : {
              "type" : "string",
              "index" : "not_analyzed",
              "omit_norms" : true,
              "index_options" : "offsets"
            }
          }
        },
        {
          "string_fields" : {
            "match" : "*",
            "match_mapping_type" : "string",
            "mapping" : {
              "type" : "multi_field",
              "fields" : {
                "{name}" : {"type": "string", "index" : "analyzed", "omit_norms" : true, "index_options" : "docs"}
              }
            }
          }
        } ],
      "properties" : {
        "@version": { "type": "string", "index": "not_analyzed" },
        "geoip" : {
          "type" : "object",
          "dynamic": true,
          "properties" : {
            "ip" : { "type": "ip" },
            "location" : { "type" : "geo_point" }
          }
        },
        "tags": { "type": "string", "index": "not_analyzed" },
        "pid": { "type": "long", "index": "not_analyzed" },
        "priority": { "type": "integer", "index": "not_analyzed" },
        "severity": { "type": "integer", "index": "not_analyzed" },
        "facility": { "type": "integer", "index": "not_analyzed" },
        "syslog_severity_code": { "type": "integer", "index": "not_analyzed" },
        "syslog_facility_code": { "type": "integer", "index": "not_analyzed" }
      }
    }
  }
}
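To make the difference between the two versions explicit: the only thing I added is the "beat" entry in dynamic_templates, placed before the existing "string_fields" entry. Here is a minimal Python sketch that shows this. The variable and function names are my own, and the dictionaries are trimmed copies that keep only the dynamic template names and their match_mapping_type, so this is an illustration rather than the full templates.

```python
# Trimmed copies of the two template versions (assumption: only the
# dynamic template names and match_mapping_type are kept for brevity).
original = {
    "dynamic_templates": [
        {"string_fields": {"match_mapping_type": "string"}},
    ]
}
changed = {
    "dynamic_templates": [
        {"beat": {"match_mapping_type": "string"}},
        {"string_fields": {"match_mapping_type": "string"}},
    ]
}


def template_names(mapping):
    """Return the dynamic template names in the order they are declared."""
    return [name for entry in mapping["dynamic_templates"] for name in entry]


def added_dynamic_templates(old, new):
    """Return names present in the new mapping but not in the old one."""
    old_names = set(template_names(old))
    return [name for name in template_names(new) if name not in old_names]


print(added_dynamic_templates(original, changed))
```

Everything else in the template (settings, properties) is identical between the two versions.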
Since then the cluster has been degraded to the point that I have had to restart it a few times, but it never recovers completely. Is there a way I can restore my cluster? I can't delete the new indices or perform any other kind of action, because I always get the following error:
"error": {
  "root_cause": [
    {
      "type": "master_not_discovered_exception",
      "reason": null
    }
  ],
  "type": "master_not_discovered_exception",
  "reason": null
},
"status": 503
The cluster seems to be extremely busy trying to recover indices or doing something else. Any suggestions on what I can do? We are talking about 1000+ indices, some of them as big as 160GB.