Tokenizing a Google Merchant Centre category field

I'm a new Elastic Search user.

I've been presented with a problem where I need to set up a tokenizer for a storefront category property that conforms to google merchant center category text strings

For example:
Apparel & Accessories > Clothing > Dresses

I need to tokenize this string such we can search by category and sub-category.

I initially thought to tokenize the above category text to the following keyword tokens for searching:
Apparel & Accessories
Apparel & Accessories > Clothing
Apparel & Accessories > Clothing > Dresses

I've been struggling to get this right. Is this possible?

Have you looked at the path hierarchy tokenizer?

1 Like

That is perfect! Thank you. I'd missed that.