Does Redis as a broker really help with loading the raw logs?


#1

Hi, can anyone help answer this question? It takes 2 hours or even longer for the server to load one day's Apache access logs from a single file via my Logstash config. Is that normal? Any suggestions?

Thanks!!!


(Christian Dahlqvist) #2

What does your Logstash config look like? How much data is loaded? What is the specification of your Elasticsearch cluster? Which version of Logstash and Elasticsearch are you using?


#3

Thanks for the answers.

First of all, all versions are up to date: Logstash 2.1.0 and Elasticsearch 2.1.

The data size is around 1-3 GB per day (raw Apache logs).

As for the cluster specification, I'm afraid I have no idea.

Most importantly, the Logstash configs look like this:

1. logstash_indexer.conf

input {
  redis {
    host => "127.0.0.1"
    port => 6379
    type => "redis-input"
    data_type => "list"
    key => "logstash-2015.12.09"
    codec => json
    #threads => 5
  }
}

filter {
  grok {
    match => { "message" => "%{COMBINEDAPACHELOG}" }
  }
  date {
    match => [ "timestamp" , "dd/MMM/yyyy:HH:mm:ss Z" ]
  }
  geoip {
    source => "clientip"
    target => "geoip"
    database => "D:/dev/elastic/GeoLite2-City.mmdb"
    add_field => [ "[geoip][coordinates]", "%{[geoip][longitude]}" ]
    add_field => [ "[geoip][coordinates]", "%{[geoip][latitude]}" ]
  }
  mutate {
    convert => [ "[geoip][coordinates]", "float"]
  }
}

output {
  elasticsearch { hosts => ["127.0.0.1:9200"] }
  stdout { codec => rubydebug }
}

2. logstash_shipper.conf

input {
  file {
    path => "D:/dev/elastic/logs//"
    start_position => "beginning"
    #sincedb_path => "/dev/null"
  }
}

output {
  redis {
    host => "127.0.0.1"
    port => 6379
    data_type => "list"
    key => "logstash-%{+yyyy.MM.dd}"
  }
  stdout { codec => rubydebug }
}


(Magnus Bäck) #4

Are the CPUs saturated? If not, increase the number of filter workers (the -w startup option). You'll most likely also want to increase threads and/or batch_count for the redis input. Maybe a handful of threads that each fetch a few hundred messages at a time?
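For illustration, the redis input from the indexer config above could be extended like this (the threads and batch_count values are starting-point guesses, not recommendations — tune them for your hardware):

```conf
input {
  redis {
    host        => "127.0.0.1"
    port        => 6379
    data_type   => "list"
    key         => "logstash-2015.12.09"
    codec       => json
    threads     => 4    # illustrative: parallel fetchers pulling from the list
    batch_count => 250  # illustrative: events fetched per Redis round trip
  }
}
```

With several threads each fetching larger batches, the input stage is less likely to be the bottleneck than the grok/geoip filters.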


#5

Thanks for the great input. Could you show me a more detailed example? I'm confused about the filter workers, especially the -w startup option, as well as threads and batch_count. Is it this filter?

filter {
  grok {
    match => { "message" => "%{COMBINEDAPACHELOG}" }
  }
  date {
    match => [ "timestamp" , "dd/MMM/yyyy:HH:mm:ss Z" ]
  }
  geoip {
    source => "clientip"
    target => "geoip"
    database => "D:/dev/elastic/GeoLite2-City.mmdb"
    add_field => [ "[geoip][coordinates]", "%{[geoip][longitude]}" ]
    add_field => [ "[geoip][coordinates]", "%{[geoip][latitude]}" ]
  }
  mutate {
    convert => [ "[geoip][coordinates]", "float"]
  }
}

Thanks again!!!


(Magnus Bäck) #6

batch_count and threads are options to the redis input; see the documentation. If you're running Logstash as a daemon, you can usually alter the startup options via /etc/default/logstash or /etc/sysconfig/logstash, or perhaps directly in the init script — it depends on how you run Logstash. If you're starting it from a shell you can, obviously, just add that option. But I think batch_count is going to have the biggest effect on performance.
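For example, assuming a package-based install on Linux (exact file paths vary by distribution):

```shell
# Daemon: add the option to the defaults file the init script reads, e.g.
#   /etc/default/logstash   (Debian/Ubuntu)
#   /etc/sysconfig/logstash (RHEL/CentOS)
# by appending something like:
#   LS_OPTS="-w 4"

# Shell: pass the option directly when starting Logstash:
bin/logstash -w 4 -f logstash_indexer.conf
```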


(Christian Dahlqvist) #7

As you are running Logstash 2.1, you may not need to worry about the filter workers. While this defaulted to 1 before version 2.0, it now adjusts to the number of cores on the host. Start by adjusting the redis input parameters as Magnus suggested and see what improvement that gives.

