Hello,
We are trying to use a "Document_id" within ES output configuration (using multiple field columns from the Oracle Database table for example:
document_id => %{field1}%{field2}
Reason being to be able to have only one unique record in ES with the value field (field3) updated every minute with the latest near real-time value. (without creating duplicate records for each unique value)
However we noted that during a consequent run scheduled every minute using JDBC input, the record vanishes from the index if that document_id does not exists.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.