I was checking "Google" but I didn't find too much info about it, so any help would be greatly appreciated.
I was working with Google Search Appliance before but it was decommissioned, and Elastic seems to be the best option on the market right now.
I need to:
Crawl & index a number of websites (around 50)
Serve them as an XML to a web application
The current version of Elasticsearch does not have a crawler? I need to install something else?
Thanks Mark!
Swiftype is ok, but unfortunately we need to have more or less a real-time crawling (every 5 minutes) and their solution (the cheap one with <100$/month) offers only 1 crawling every 3 days or so.
I was checking now some other solutions like: 80legs.com and if I find something I will post here.
Can ElsticSearch be used for this type of live crawling, indexing & serving?
@warkolm Yes.. I was a bit shocked to see that there is no official crawler (at least on the Cloud version).
It's like selling only the engine and some other parts of a car , but you need to find the wheels by your own.
I guess I was too accustomed with the Google Search Appliance.
I will keep researching and post here whatever solution I find!
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.