FSCrawler Index Each Page as a Separate Document


Is it possible to changes settings for a job in FSCrawler to index each page of a PDF as a separate document? My understanding is that the entire content of the PDF is stored into one JSON field, can we break it up so that one query for a term will return multiple pages within the same PDF book?

If it's not possible, can we make changes in ElasticSearch to show the multiple occurrences of a term within the same document?


You are looking for this:

It's not implemented. :wink:

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.