I am playing around with the crawler and it works great. However, I am unable to find a way to configure a crawl to scrape only a specific set of URLs and nothing else. I already have the list of URLs.