@Badger thanks for the response. This is exactly what I did. Here I am facing another issue, I put scheduler to pick from EL every 1 minute and write it to Kafka, now the problem is I need to manually update each document as processed(some flag) so that it won't be picked next batch. I need to add a new flag every time I push to Kafka ( two output writes)
I have another problem If I update the new document(duplicate) in the first pipeline the flag will be overridden and it will be again picked by the second pipeline.
This can be avoided by using action => create
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.