How to get index contents of a website

How can I index and perform search operations on the contents of a website of a given web link like discuss.elastic.co using Java 1.8 and Elasticsearch 7.12.0?

I think you already asked for a similar question at Is Elasticsearch webcrawler an open source feature or it is paid?.

I believe it would be better to keep the discussion in one place.