We are planning to develop a site search type functionality to replace some
aging and painful nutch infrastructure. We are obviously not looking to
rebuild google, but assuming we have a decent spider, does anyone
have analyzer/mapping suggestions for a site search index?
Off the top of my head, the mapping would include fields like:
description (from html or tika)
I was wondering if anyone had any experiences or analyzer / mapping
definitions they'd be willing to share?