How to index my website


(venkatnehatha) #1

Hi All,

I am new to search engines in general and to ElasticSearch in particular. I want to use ElasticSearch for my website. I have finished the installation and indexed the twitter example, and it works fine.
Now, to integrate it with my website, what is the best way to store the indexes? I mean, what is the best JSON format to use? And do I need to keep the entire content of each web page as a JSON document for indexing?

Thanks in advance.

-Nehatha


(venkatnehatha) #2

Can anybody please reply?


(Karussell) #3

Either use the underlying database of your website to feed ES, or, if you don't have access to this DB, use a crawler (like Nutch) or write your own simple parser.
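As a rough illustration of the "write your own simple parser" option, here is a minimal sketch using only the Python standard library. The class name `PageTextParser`, the function `parse_page`, and the returned field names are hypothetical choices made for this example, not anything ES prescribes. It only pulls the title and the visible text out of HTML you have already fetched; crawling and link following are left out.

```python
from html.parser import HTMLParser


class PageTextParser(HTMLParser):
    """Collects the <title> and the visible text of one HTML page."""

    # Content of these tags is not visible text, so it is skipped.
    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.title_parts = []
        self.text_parts = []
        self._in_title = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth:
            return
        chunk = data.strip()
        if not chunk:
            return
        if self._in_title:
            self.title_parts.append(chunk)
        else:
            self.text_parts.append(chunk)


def parse_page(url, html):
    """Turn one fetched page into a plain dict that can be serialized to JSON."""
    parser = PageTextParser()
    parser.feed(html)
    parser.close()
    return {
        "url": url,
        "title": " ".join(parser.title_parts),
        "text": " ".join(parser.text_parts),
    }
```

Calling `parse_page(url, html)` for each page gives one small dict per page, so the whole HTML markup does not have to be stored, only the parts worth searching.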

"what is the best JSON format to use"

What do you mean here? The mapping? url + title + text should be the
minimum, I guess :)
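To make the url + title + text idea concrete, here is a minimal sketch of what one page could look like as a JSON document and how the HTTP request to index it might be built. Several things are assumptions for illustration only: the node address `http://localhost:9200`, the index name `website`, the helper name `build_index_request`, and the use of a SHA-1 hash of the URL as the document id. The `_doc` path segment is what recent ES versions expect; older releases put a type name in that position, as in the twitter example's `twitter/tweet/1`.

```python
import hashlib
import json
import urllib.request


def build_index_request(url, title, text,
                        base_url="http://localhost:9200",  # assumed local node
                        index="website"):                  # hypothetical index name
    """Build (but do not send) a PUT request that indexes one web page."""
    # The minimal document: one JSON object per page.
    doc = {"url": url, "title": title, "text": text}

    # Assumption: derive the id from the URL so that re-indexing the
    # same page overwrites the old document instead of duplicating it.
    doc_id = hashlib.sha1(url.encode("utf-8")).hexdigest()

    return urllib.request.Request(
        f"{base_url}/{index}/_doc/{doc_id}",
        data=json.dumps(doc).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )
```

With a node running, `urllib.request.urlopen(build_index_request(url, title, text))` would send it. More fields (for example a last-modified date) can be added to the dict later without changing the approach.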

Regards,
Peter.




(venkatnehatha) #5

Thank you for the reply, Karussell.

Can you suggest the best PHP/Java API for feeding ES, one that takes care of index names and so on?

Thanks,
Nehatha


(Karussell) #6

The Java API is already included in ES. For other clients, see:

http://www.elasticsearch.org/guide/appendix/clients.html


(venkatnehatha) #7

Thank you, Karussell. I need generic PHP/Java scripts to crawl websites and feed ES.

Can you please suggest one?

