Does anyone know a good Java OCR library that I could use to add the OCR
feature [1] to the attachment plugin?
I searched for the same thing some time ago, but didn't come up with
anything useful that was free (there seem to be some commercial ones
that are OK to good).
In the open source world, there is Tesseract, possibly (however, it is a
binary, so you would have to spawn a process).
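For what it's worth, spawning the process from Java could look roughly like the sketch below. It assumes a `tesseract` executable is on the PATH and that the installed version accepts `stdout` as the output base, so the recognized text is written to standard output. The class and method names (`TesseractOcr`, `buildCommand`, `extractText`) are made up for illustration and are not part of any plugin.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Path;
import java.util.List;

// Hypothetical helper: wraps the tesseract command-line binary.
final class TesseractOcr {

    // Builds the command line: tesseract <image> stdout -l <language>
    static List<String> buildCommand(Path image, String language) {
        return List.of("tesseract", image.toString(), "stdout", "-l", language);
    }

    // Runs tesseract as a child process and returns the recognized text.
    // Assumption: the binary is installed and reachable via PATH.
    static String extractText(Path image, String language)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(image, language))
                // Discard stderr so a chatty child cannot block on a full pipe.
                .redirectError(ProcessBuilder.Redirect.DISCARD)
                .start();
        // Drain stdout fully before waiting for the process to exit.
        String text = new String(
                process.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("tesseract exited with code " + exitCode);
        }
        return text;
    }
}
```

Calling `TesseractOcr.extractText(Path.of("scan.png"), "eng")` would then return whatever text Tesseract recognized in the image. Each call starts a new OS process, so this is only a starting point if you need to OCR many documents.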
Keep me posted about progress in case you find something cool. I need
this for my home paperwork (I never find stuff when I search for it in
real life).
However, I don't think that running OCR on the ES side is a good approach.
OCR consumes a lot of resources, so you may create a bottleneck on the ES
node. Some OCR engines also dig into embedded images, which will probably
be even more expensive for the machine.