Elastic search and fscrawler


(Rahul Shendge) #1

I am working on elastic search and fscrawler. I am trying to index pdf, docx, jpeg file on elsatic search using fscrawlers. On my local jpeg file text content read successfully but on my server it not read text content. My local machine is ubuntu and server is centos. Please help me.


(David Pilato) #2

So you are indexing an image which contains text, that's right?
Did you install Tesseract for OCR?


(Rahul Shendge) #3

Yes I indexing an image which contains text and I installed Tesseract.


(David Pilato) #4

Could you make sure that tesseract is available on the default PATH of your machine?
Otherwise configure https://fscrawler.readthedocs.io/en/latest/user/tips.html#ocr-path


(Rahul Shendge) #5

Resolved issue. Below libs are missing:

libjpeg-dev  libpng-dev  libtiff4-dev

Thanks.


(system) #6

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.