Hi, i'm working on a ELK project to extract data from twitter to elasticsearch and build a sentimental analyses engin.
I need to know how can I filter spammed tweets with logstash ? Is that possible or not ?
for example, if i want to ignore all tweets that contain the word 'freebitcoin' or 'bitcoin giveaway', how can i do it?
Thank you.
Have a look at conditionals (https://www.elastic.co/guide/en/logstash/current/event-dependent-configuration.html) and the drop filter.
That's what i did, but when i visualized my data with kibana, i still found tweets that contain the keyword 'giveaway'
I think my Regular Expression is false
Perhaps \bgiveaway\b
would be enough?
Well, you obviously need the surrounding slashes. My example was only the expression itself.
Yes, excuse my mistake.
\bgiveaway\b doesnt work .
This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.