When i am trying to run FSCrawler job for Images, It is showing me below error:
07:05:40,428 DEBUG [f.p.e.c.f.t.TikaInstance] OCR is activated.
07:05:40,428 DEBUG [f.p.e.c.f.t.TikaInstance] But Tesseract is not installed so we won't run OCR.
Please don't use the citation icon but the code icon </> to format your code.
Most of time, on Windows, I found that having a space in the path can cause some troubles. Program Files here might be the problem.
Could you try with something like:
path: "/Progra~1/Tesseract-OCR/tesseract.exe"
If I recall correctly the syntax.
Otherwise, could you add C:/Program Files/Tesseract-OCR/ in your windows system path?
I cannot change the folder name of program Files to Program~1. Renaming of folder is disable on our server.
I have added windows system path to environment variable window.
How to mention that in OCR path?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.