Hi,
I have been testing the recently released Open Crawler project for a few days, and I was wondering whether it supports stop words. In other words, I don't want words like "the", "and", "a", etc. to affect my search results. Does the Open Crawler have a feature that handles these words for me when it creates the index and stores the website content in the Elasticsearch index?
Thanks
Hey! Skipping stop words doesn't necessarily need to be done in the crawler logic. You can set up the destination Elasticsearch index with an analyzer that filters out stop words, so the crawled content is analyzed without them, which should achieve your desired result. You can refer to the "Stop token filter" page in the Elasticsearch Guide [8.14]. Let me know if it helps!
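For illustration, here is a minimal sketch of what the index settings for such an index might look like, built as a Python dict and printed as the JSON body of a create-index request. The names `build_index_body`, `english_stop`, and `content_no_stopwords` are made up for this example, and the field name `body` is an assumption; check which field your crawler output actually uses for page content. The index would need to exist with these settings before the crawler writes to it.

```python
import json


def build_index_body(field="body", stopwords="_english_"):
    """Build a create-index request body with a stop-word-filtering analyzer.

    `field` is a hypothetical name for the text field holding page content;
    `_english_` is Elasticsearch's built-in English stop word list.
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    # A stop token filter; the name "english_stop" is arbitrary.
                    "english_stop": {"type": "stop", "stopwords": stopwords}
                },
                "analyzer": {
                    # Custom analyzer: tokenize, lowercase, then drop stop words.
                    "content_no_stopwords": {
                        "type": "custom",
                        "tokenizer": "standard",
                        "filter": ["lowercase", "english_stop"],
                    }
                },
            }
        },
        "mappings": {
            "properties": {
                # Apply the analyzer to the content field (assumed name).
                field: {"type": "text", "analyzer": "content_no_stopwords"}
            }
        },
    }


if __name__ == "__main__":
    # Print the JSON to send as the body of the create-index request.
    print(json.dumps(build_index_body(), indent=2))
```

The `lowercase` filter is placed before the stop filter so that capitalized forms such as "The" are lowercased first and then removed. A custom word list can be passed instead of `_english_`, for example `build_index_body(stopwords=["the", "and", "a"])`.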