Crawling web content

Does anybody have any experience crawling web content and indexing it with
Elasticsearch?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Perhaps this could help. I have never used it myself.

http://manifoldcf.apache.org/release/release-1.1.1/en_US/included-connectors.html

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr | @scrutmydocs

On 20 Feb 2013, at 10:41, Ammar Yahia <yahia.ammar.info@gmail.com> wrote:

Does anybody have any experience crawling web content and indexing it with Elasticsearch?


Hi,

Have a look at Nutch and ES:
http://search-lucene.com/?q=elasticsearch&fc_project=Nutch&fc_type=issue

Otis

On Wednesday, February 20, 2013 4:41:04 AM UTC-5, Ammar Yahia wrote:

Does anybody have any experience crawling web content and indexing it with
Elasticsearch?


There is an Elasticsearch indexer for Nutch:

https://github.com/ctjmorgan/nutch-elasticsearch-indexer

Jörg

On Thursday, February 21, 2013 7:48:40 AM UTC+1, Otis Gospodnetic wrote:

Hi,

Have a look at Nutch and ES:
http://search-lucene.com/?q=elasticsearch&fc_project=Nutch&fc_type=issue

Otis

