Here is an example:
Logs are being processed through two pathways.
ruby-app --> TCP --> OldLogstash --> OldKibana
ruby-app --> /var/log/json/file.log --> filebeats --> kafka --> NewLogstash --> NewKibana
Tracing one log instance (Tonya-Ca....) from two log files, a metrics log and a request log:
In this kibana screen you can see the double entries for each ruby activity:
My focus is on the Tonya-Ca... that appears mid screen.
Here is the same time period from the NewKibana:
Note the identical timestamp 16:36:30.619 on the Tonya-Cole......
Note that in the OldKibana, there are many lines missing between James-Tow... and Tonya-Cole.
We are seeing about 8 in 10 lines lost, which approximately coincides with the screenshots.
In Kafka, there is only one entry recorded:
@timestamp":"2016-10-14T23:36:31.084Z","beat":{"hostname":"ip-172-31-51-10","name":"ip-172-31-51-10","version":"6.0.0-alpha1"},"fields":{"App":"masscache","Availzone":"us-west-2a","Beat":"masscache","Env":"production","Imageid":"ami-e5875985","Insttype":"c4.xlarge","Ipaddr":"172.31.51.10","Role":"appserver"},"input_type":"log","message":"{\"name\":\"masscache\",\"environment\":\"production\",\"hostname\":\"ip-172-31-51-10\",\"pid\":2814,\"level\":30,\"message\":\"GET /Tonya-ColXXXXX 200 HIT\",\"req_id\":\"ec9b2204-8221-4c88-ba5e-a305b9b5f00c\",\"agent\":\"Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)\",\"clientip\":\"207.46.13.163\",\"verb\":\"GET\",\"origin\":\"http://www.XXXX.com\",\"request\":\"/Tonya-ColXXXXX\",\"page_type\":\"name\",\"band_segment\":8,\"query\":{},\"response\":200,\"cache_status\":\"HIT\",\"cache_key\":\"cd014d74703d89f0d8ee06c170db1d7db94b0f5a\",\"totalms\":27.26,\"program\":\"masscache\",\"metrics\":{},\"timestamp\":\"2016-10-14 23:36:30.619\",\"msg\":\"\",\"time\":\"2016-10-14T23:36:30.619Z\",\"v\":0}","offset":2871070136,"source":"/var/log/json/request.log","type":"json"}