Logstash is not creating new index in elastic cloud

sbade · October 12, 2017, 5:16pm

I' ve got a pretty simple configuration to setup. what I need to do is create a new index for each of my servers in my elastic cloud setup.

my output is
elasticsearch {
hosts => "https://XXXXXXXXX.us-east-1.aws.found.io:9243"
user => "elastic"
password => "XXXXXXXXXXXE"
index => "server1indes"
}

I don't see any new index being created in the cloud, if i remove the index line from the output, i get my data in the default logstash-YYYY.MM.DD index but this does not let me split my indicies on a perserver basis (not to mention that we don't want to have a new index for each day).

theuntergeek · October 12, 2017, 5:36pm

I cannot see any reason why the defined index would not be created. Are there any errors in your Logstash logs? Technically speaking, Logstash does not create indices. It sends a batch of documents to Elasticsearch and says, "these should go in an index named (whatever is defined in index =>)," and then Elasticsearch handles the bulk request and creates the index.

On a parallel note, though:

How are you planning on handling data retention? Doing a delete_by_query is very taxing to Elasticsearch for time-series data. You're actually much better off doing rollover indices of some kind, whether named, or with the Rollover API.

Also, what is the benefit of splitting indices by server name? Is the data completely different, such that the mappings will also differ significantly? It's simple to filter by server name in a query, so unless there is a mapping difference due to differing data, there's no other compelling reason to do this. More shards and indices is just more overhead for the cluster to manage. It will be more performant if you can reduce this to the least amount required.

sbade · October 12, 2017, 5:52pm

I have turned on full debugging log levels on my logstash and see no errors in logstash, i do know that the outputs are getting invoked because i added a file output to make sure that logstash was pulling from the files. Is there another way (you can probably guess i'm pretty new to elk) that i can create the indicies?.

I can't go into the reasons why we need to split the indices, there actually is a good reason (i'm using per server in an abstract sense), but yes there is a reason why we can't split by a simple query by "server name".

theuntergeek · October 12, 2017, 5:57pm

Are new entries continually being added to these files? If so, then the data is not being rejected by Elasticsearch. Logstash, at least in the current release, writes to all outputs in a given pipeline at the same time. If one of them puts up back-pressure (e.g. Elasticsearch is not accepting the output for any reason), then all of the outputs in that pipeline will also cease.

Fair enough. But that doesn't rule out data retention planning by dropping old indices based on the age of the contents (naming the indices server_name-YYYY.MM.dd, for example, or using the creation_date of the index). It's still not best practices to use delete_by_query to empty out indices for time-series data.

sbade · October 12, 2017, 6:06pm

yes i continually add data to the files. I've got the methodology for how we will handle retention, i'm simplifying the situation to focus on the one issue that I can;'t seem to get new indicies to be created based on what I believe should be correct in my configuration.

Christian_Dahlqvist · October 12, 2017, 6:09pm

Are you using the default elastic user, which is a superuser?

theuntergeek · October 12, 2017, 6:09pm

I mean, is Logstash continually outputting to the file outputs you defined? I wasn't able to infer the answer from your response. If the data is continually streaming into those output files, then the data must be in Elasticsearch somewhere, otherwise Logstash would have stopped sending data, and would have log entries about retries (429 code) and such.

sbade · October 12, 2017, 6:12pm

yes the file output that i defined continues to get the data in it as the files that i'm using as inputs get data added to them. yes i'm using the default elastic user

Christian_Dahlqvist · October 12, 2017, 6:19pm

This is odd. Could you provide the output of the cluster stats API? I just want to check if there are any issues with the cluster that could be causing this.

sbade · October 12, 2017, 6:24pm

I will have to do that tomorrow, as i'm not in the environment to reproduce this for the rest of the day. I'll have to figure out how to get to that api, so i'll work on that today

sbade · October 12, 2017, 8:43pm

I'm not sure how to use the cluster stats API I looked at the Paramedic console and it says my cluster is "yellow" but i have no idea what that means. Since I'm on the evaluation version of elastic.co cloud, maybe i should just delete/remove my cluster and start over

Christian_Dahlqvist · October 12, 2017, 8:47pm

Log in to Kibana and go to Dev Tools. This will open up Console, which allows you to run queries. Here you can enter and run GET _cluster/stats on the left side. Results will show up on the right.

sbade · October 12, 2017, 8:49pm

{
"_nodes": {
"total": 1,
"successful": 1,
"failed": 0
},
"cluster_name": "a9192f774f2b14812bc746b2e38ddb67",
"timestamp": 1507841365821,
"status": "yellow",
"indices": {
"count": 8,
"shards": {
"total": 8,
"primaries": 8,
"replication": 0,
"index": {
"shards": {
"min": 1,
"max": 1,
"avg": 1
},
"primaries": {
"min": 1,
"max": 1,
"avg": 1
},
"replication": {
"min": 0,
"max": 0,
"avg": 0
}
}
},
"docs": {
"count": 19438,
"deleted": 1
},
"store": {
"size_in_bytes": 14603099,
"throttle_time_in_millis": 0
},
"fielddata": {
"memory_size_in_bytes": 848,
"evictions": 0
},
"query_cache": {
"memory_size_in_bytes": 0,
"total_count": 0,
"hit_count": 0,
"miss_count": 0,
"cache_size": 0,
"cache_count": 0,
"evictions": 0
},
"completion": {
"size_in_bytes": 0
},
"segments": {
"count": 27,
"memory_in_bytes": 236094,
"terms_memory_in_bytes": 194787,
"stored_fields_memory_in_bytes": 14152,
"term_vectors_memory_in_bytes": 0,
"norms_memory_in_bytes": 10624,
"points_memory_in_bytes": 2175,
"doc_values_memory_in_bytes": 14356,
"index_writer_memory_in_bytes": 2375256,
"version_map_memory_in_bytes": 13175,
"fixed_bit_set_memory_in_bytes": 1440,
"max_unsafe_auto_id_timestamp": -1,
"file_sizes": {}
}
},
"nodes": {
"count": {
"total": 1,
"data": 1,
"coordinating_only": 0,
"master": 1,
"ingest": 1
},
"versions": [
"5.6.3"
],
"os": {
"available_processors": 32,
"allocated_processors": 2,
"names": [
{
"name": "Linux",
"count": 1
}
],
"mem": {
"total_in_bytes": 257942405120,
"free_in_bytes": 25348136960,
"used_in_bytes": 232594268160,
"free_percent": 10,
"used_percent": 90
}
},
"process": {
"cpu": {
"percent": 0
},
"open_file_descriptors": {
"min": 520,
"max": 520,
"avg": 520
}
},
"jvm": {
"max_uptime_in_millis": 116670947,
"versions": [
{
"version": "1.8.0_144",
"vm_name": "Java HotSpot(TM) 64-Bit Server VM",
"vm_version": "25.144-b01",
"vm_vendor": "Oracle Corporation",
"count": 1
}
],
"mem": {
"heap_used_in_bytes": 1555244888,
"heap_max_in_bytes": 2130051072
},
"threads": 64
},
"fs": {
"total_in_bytes": 111669149696,
"free_in_bytes": 111653335040,
"available_in_bytes": 111653335040
},
"plugins": [
{
"name": "ingest-user-agent",
"version": "5.6.3",
"description": "Ingest processor that extracts information from a user agent",
"classname": "org.elasticsearch.ingest.useragent.IngestUserAgentPlugin",
"has_native_controller": false
},
{
"name": "ingest-geoip",
"version": "5.6.3",
"description": "Ingest processor that uses looksup geo data based on ip adresses using the Maxmind geo database",
"classname": "org.elasticsearch.ingest.geoip.IngestGeoIpPlugin",
"has_native_controller": false
},
{
"name": "repository-s3",
"version": "5.6.3",
"description": "The S3 repository plugin adds S3 repositories",
"classname": "org.elasticsearch.repositories.s3.S3RepositoryPlugin",
"has_native_controller": false
},
{
"name": "x-pack",
"version": "5.6.3",
"description": "Elasticsearch Expanded Pack Plugin",
"classname": "org.elasticsearch.xpack.XPackPlugin",
"has_native_controller": true
},
{
"name": "found-elasticsearch",
"version": "5.6.3",
"description": "Elasticsearch plugin for Found",
"classname": "org.elasticsearch.plugin.found.FoundPlugin",
"has_native_controller": false
}
],
"network_types": {
"transport_types": {},
"http_types": {}
}
}
}

sbade · October 12, 2017, 8:56pm

so i went in and deleted the logstash-* indices and ran my generator that will put records out, and the data is getting put into the logstash-* index.. I wonder if this is related to templating or some other thing that I don't fully understand

Christian_Dahlqvist · October 12, 2017, 8:58pm

As far as I can see that all looks fine. Can you try indexing a record into this index manually, e.g. through the Console?

sbade · October 12, 2017, 9:15pm

sure I think i can do that.

sbade · October 12, 2017, 9:34pm

I took the "accounts.json" example in this https://www.elastic.co/guide/en/kibana/current/tutorial-load-dataset.html and ran the appropriate curl command as documented at the page, and it created the index no problem (i'm relying on logstash to convert my csv into json, so i need to tweak my config to get my actual data, which i'm going to try next

sbade · October 12, 2017, 10:15pm

Ok I'm not sure I'm doing this right. I did
I did a query on the logstash-2017.10.12 index and got all the records. I copied one record. and did
PUT /BluVector_Bro
{
"_index": "logstash-2017.10.12",
"_type": "BRO_httplog",
"_id": "AV8SXb1_UJ-u3V4Cn7CH",
"_score": 1,
"_source": {
"path": "/var/log/bro/current/http.log",
"@timestamp": "2017-10-12T20:54:10.178Z",
"@version": "1",
"host": "bvesx25.vm",
"message": """1507841650.178409 Cd4IHe2auw3y1WU2Vi 227.173.242.206 1056 159.154.119.78 80 1 GET diggstatistics.com /flash/dialog_header_red.jpg http://diggstatistics.com/ Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1) 0 7197 200 OK - - - (empty) - - - - - F7vXEf3FIgfyod7DU3 image/jpeg""",
"type": "BRO_httplog",
"ts": """1507841650.178409 Cd4IHe2auw3y1WU2Vi 227.173.242.206 1056 159.154.119.78 80 1 GET diggstatistics.com /flash/dialog_header_red.jpg http://diggstatistics.com/ Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1) 0 7197 200 OK - - - (empty) - - - - - F7vXEf3FIgfyod7DU3 image/jpeg""",
"tags": [
"_geoip_lookup_failure"
]
}
}

The data is lines from bro logs that are filtered in logstash to create a csv formatted file. I get
{
"error": {
"root_cause": [
{
"type": "illegal_argument_exception",
"reason": "unknown setting [index._id] please check that any required plugins are installed, or check the breaking changes documentation for removed settings"
}
],
"type": "illegal_argument_exception",
"reason": "unknown setting [index._id] please check that any required plugins are installed, or check the breaking changes documentation for removed settings"
},
"status": 400
}

I started removing the items that caused this exception, but pretty much everything ended up having to be removed. I'm wondering if this is because I don't have a template/mapping for this index (although I would expect to find something in the logstash logs that elastic threw an exception - but maybe i'm wrong)

So tomorrow, I'm going to figure out how to do a mapping and see if that improves anything.

theuntergeek · October 12, 2017, 10:28pm

That's a terrific idea. This blog post may be of some assistance.

system · November 9, 2017, 10:28pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Logstash doesn't create new indexes Logstash	5	6287	August 26, 2017
Logstash does not create index in es Logstash	5	1535	November 13, 2017
Logstash is not creating Indexes - SOLVED Logstash	21	46291	July 6, 2017
Logstash failed to push data (create indices) to elasticsearch Logstash	1	413	June 7, 2018
Logstash not pushing logs to ElasticSearch Logstash	27	5817	August 28, 2019

Logstash is not creating new index in elastic cloud

Related topics