I am looking for documentation about how the Elasticsearch Crawler is handling the "lastmod" properties in website sitemaps. But so far, I am not able to find IF ES is actually taking the lastmod in consideration and actually crawl/update only the page is a lastmod newer than the previous crawl.
The web crawler does not process optional metadata defined by the standard.
The web crawler extracts a list of URLs from each sitemap and ignores all other information.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.