thank you for your welcoming.
i am capturing my network data through packetsbeats. after that i am preprocessing the domain name to detect wether it's malicious packet or not. i have already done all that.
but because i am reindexing the i am only getting previous data.
like if i am indexing at 3:01 pm i am only data from 3:01 pm and previous.
i want to index live data and continuous. how can i do that
I am taking the packetsbeats data (specifically domain name) and take n-gram of the name through painless script then i am putting the script through the pipeline so i can do inference the model.
i have made some work around i have created a python script that runs every 5 min to index the last 5 min.
but i know this isn't really a solution as in production it will have a big impact on the system.
Ok, this is outside the scope of what Packetbeat does at the moment.
However it's a great idea, and I would encourage you to raise a feature request in GitHub to see if there might be something that can be built in in future.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.