Hi, has anyone used the fingerprint plugin with MURMUR3? So far I find it has quite high collision rate. Even with just a few hundred thousands records managed to get 20 collisions.
Testing with sha256 over 2million records and no collisions so far. I'm ok with a some collisions. But not what MURMUR3 produced. Just wondering if this article should be updated: https://www.elastic.co/blog/logstash-lessons-handling-duplicates
In both scenarios I'm using the message and the kafka offset as the fields to hash.