I signed up for the Elastic Cloud trial. I don't know much about Elastic, but I've worked with website and enterprise search platforms.
I'm attempting a proof of concept for an e-commerce client who needs help with their onsite search and filtering. They use a homegrown solution right now, and we'd benefit from getting a more robust solution in place.
I've set up an index and configured it to crawl their product XML feed.
Their HTML is a bit messy, so I'd like to use their product schema to pull in product details.
I'd imagine I need to store the full HTML, since the schema is JSON embedded in the HTML head.
Can anyone point me to more details on how to achieve this? Would it be through an ingest pipeline or content extraction?
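To make the goal concrete, here is a rough sketch of the extraction I have in mind, written in plain Python outside of Elastic. It assumes the product schema is JSON-LD inside a `<script type="application/ld+json">` tag, which I haven't confirmed for every page. The class name, function name, and sample product values are made up for illustration, and this is not an Elastic API.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        # The type attribute may be missing or valueless, so fall back to "".
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.blocks.append("".join(self._buffer))


def extract_products(html):
    """Return every JSON-LD object whose @type is "Product" (assumed schema)."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    products = []
    for raw in parser.blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # skip malformed blocks instead of failing the whole page
        items = data if isinstance(data, list) else [data]
        for item in items:
            if isinstance(item, dict) and item.get("@type") == "Product":
                products.append(item)
    return products


# Hypothetical page; real product pages will carry more fields.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Product",
 "name": "Example Widget", "sku": "W-100",
 "offers": {"@type": "Offer", "price": "19.99", "priceCurrency": "USD"}}
</script>
</head><body><h1>Messy markup here</h1></body></html>
"""

if __name__ == "__main__":
    for product in extract_products(SAMPLE_HTML):
        print(product["name"], product["offers"]["price"])
```

If something like this has to run before indexing, I could do it in a small script and push the resulting fields into the index, but I'd prefer to do the equivalent inside Elastic if that's possible.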
I've done SEO consulting for many years. Before that, I was a UX designer and built out enterprise search for a leading medical association's website. We began with htDig in the very early days, then moved to Google Search Appliance and Omniture Search&Promote. I love this stuff!