Fscrawler ocr question

ocr:
    language: "eng"
    enabled: false
    pdf_strategy: "ocr_and_text"
  follow_symlinks: false

If I enable OCR in _settings.yaml, will all files be OCR'd - even those which already contain indexable text - or just those for which there is currently no indexable text ?

Also, does "follow_symlinks" mean that url's will be hyperlinked and made clickable ?

It will extract both text and images.
This test shows it:

It reads a PDF document which has an image plus some text. Both are extracted.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.