I would like to use FSCrawler to extract data page by page, or split by title hierarchy (headings), instead of getting all of a document's content in a single row.
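To make the desired output concrete, here is a minimal sketch of the "one row per page" shape I have in mind. It is not FSCrawler functionality. It assumes the text has already been extracted and that pages are separated by a form feed character (`\f`), which some extractors emit but which is only an assumption here. The function name `split_into_page_records` and the field names `file`, `page`, and `content` are hypothetical.

```python
import json


def split_into_page_records(filename, text, page_separator="\f"):
    """Return one record (dict) per page instead of one per document.

    Assumption: pages in `text` are delimited by `page_separator`.
    """
    records = []
    for number, content in enumerate(text.split(page_separator), start=1):
        content = content.strip()
        if not content:
            continue  # skip empty pages, e.g. after a trailing separator
        records.append({"file": filename, "page": number, "content": content})
    return records


if __name__ == "__main__":
    # Hypothetical extracted text with three pages.
    sample = "Intro text\fChapter 1 text\fChapter 2 text\f"
    for record in split_into_page_records("report.pdf", sample):
        print(json.dumps(record))
```

Each record could then be indexed as its own document, so a search hit points to a page rather than to the whole file.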