Crawler Regex help

ansamHox · January 26, 2022, 3:56pm

I need help creating Regex rules for Crawler, because it seems that regex is not working as it should.

I don't want to crawl anything in /wp-content/uploads folders, (images, media, etc.).

Without any luck, it's all being processed again. How can I achieve this?

Sean_Story · February 1, 2022, 9:46pm

Hi @ansamHox ,

Take a look over the crawl rules documentation. Specifically, these regexes must follow the Ruby Regex syntax. When I added your rules to https://rubular.com/, it immediately identified some syntax issues.

For your image regex specifically, also note this block from the docs:

The rule matches when the path pattern matches the beginning of the path (which always begins with / ).

Which means you probably need a preceding .*

Topic		Replies	Views
Regex search Elasticsearch	2	557	July 5, 2017
RegEx Query in Discover Kibana	3	5372	August 6, 2019
Regexp matches when it shouldn't, doesn't match when it should Elasticsearch	4	885	August 21, 2017
Search using complex regex is not working on Kibana 6 Kibana	4	5388	March 14, 2018
Issues with Regex in Kibana Kibana	3	1498	February 28, 2019

Crawler Regex help

Related topics