Does the ES JDBC river/feeder have a mechanism to feed data from a database
into ES via JDBC incrementally, in a stepwise fashion?
I have billions of records to import into ES from MySQL, and a select *
from my table would take a long time and be interrupted by a DB timeout.
I need to do select * from my_table where my_id>start_id and my_id<end_id
and iterate through the ranges somehow. The feeder/river doesn't seem to
provide such a mechanism, or am I missing something?
The JDBC plugin supports MySQL streaming mode out of the box:
If it does not work, I'm available to help find alternatives.
I would love to know more about the observed DB timeout. The MySQL timeouts
can be changed in the configuration, but I am unsure whether streaming mode
can result in timeouts. If it does, that would be a defect at the MySQL JDBC
driver level.
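If streaming turns out not to be enough, the range-based iteration described in the question could be driven from a small script outside the plugin. The following is only a rough sketch of that idea, not a feature of the JDBC river/feeder: the helper names (id_windows, fetch_in_windows, demo) and the payload column are hypothetical, and an in-memory SQLite table stands in for MySQL so the example runs on its own. Note that it uses half-open windows (my_id >= start AND my_id < end), because strict bounds on both sides, as in the query from the question, would skip the boundary rows between adjacent windows.

```python
import sqlite3
from typing import Iterator, List, Tuple


def id_windows(min_id: int, max_id: int, step: int) -> Iterator[Tuple[int, int]]:
    """Yield half-open windows [start, end) covering min_id..max_id inclusive."""
    if step <= 0:
        raise ValueError("step must be positive")
    start = min_id
    while start <= max_id:
        end = min(start + step, max_id + 1)
        yield start, end
        start = end


def fetch_in_windows(conn: sqlite3.Connection, step: int) -> Iterator[List[tuple]]:
    """Read my_table one id window at a time instead of one huge SELECT."""
    lo, hi = conn.execute("SELECT MIN(my_id), MAX(my_id) FROM my_table").fetchone()
    if lo is None:  # empty table, nothing to iterate
        return
    for start, end in id_windows(lo, hi, step):
        rows = conn.execute(
            "SELECT my_id, payload FROM my_table "
            "WHERE my_id >= ? AND my_id < ? ORDER BY my_id",
            (start, end),
        ).fetchall()
        if rows:  # windows that fall into id gaps are skipped
            yield rows


def demo() -> List[List[int]]:
    """Build a tiny stand-in table and return the ids seen per batch."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE my_table (my_id INTEGER PRIMARY KEY, payload TEXT)")
    conn.executemany(
        "INSERT INTO my_table VALUES (?, ?)",
        [(i, f"row-{i}") for i in range(1, 11)],
    )
    batches = [[row[0] for row in batch] for batch in fetch_in_windows(conn, 4)]
    conn.close()
    return batches
```

Each batch returned by fetch_in_windows could then be sent to ES with whatever bulk indexing client is in use. Recording the last completed window would also let an interrupted import resume from that point rather than from the beginning.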