Regarding 2), here is the first thing that came to mind for me:
Some of these NoSQL datastores are MapReduce friendly, meaning there
are mechanisms and tools for processing the stored data using MapReduce.
Elasticsearch doesn't have that, and I'm not sure if that's on the
road map.
You could also ask your question 2) the other way around. Why use ES
(or any other search solution for that matter) instead of just using
one of those data stores? I'm answering some interview questions
today, and below is a snippet from my answer to a related question:
"Of course, just as people are learning this, a new breed of
non-relational databases has been gaining in popularity for the last few
years. These databases tend to have better support for full-text
search, though that support is commonly provided by simply integrating
the database with an existing and proven search library like Lucene at
the lower level, so that to the application using this database, the
search functionality appears to come from the database itself."
Concretely, ES and Terrastore seem to be great friends - exactly as
described above, I believe: Google Code Archive - Long-term storage for Google Code Project Hosting.
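To make the pattern in that quote a bit more concrete, here is a minimal sketch in Python. It is not how Terrastore, ES, or Lucene are actually implemented: a toy in-memory inverted index stands in for the search library, and every name in it (SearchableStore, put, get, search) is made up for illustration. The only point is that the index is updated on every write, so to the caller the search appears to come from the store itself.

```python
import re
from collections import defaultdict


class SearchableStore:
    """Toy key-value store that keeps an inverted index in step with writes.

    Hypothetical illustration only; the index is a stand-in for a real
    search library such as Lucene.
    """

    def __init__(self):
        self._docs = {}                  # key -> document text
        self._index = defaultdict(set)   # term -> keys containing that term

    @staticmethod
    def _terms(text):
        # Very naive analysis: lowercase, split on non-alphanumerics.
        return set(re.findall(r"[a-z0-9]+", text.lower()))

    def put(self, key, text):
        # Drop stale postings first if the key is being overwritten.
        if key in self._docs:
            for term in self._terms(self._docs[key]):
                self._index[term].discard(key)
        self._docs[key] = text
        # Index at write time, so searches never have to build anything.
        for term in self._terms(text):
            self._index[term].add(key)

    def get(self, key):
        return self._docs.get(key)

    def search(self, query):
        # Return the keys of documents containing every query term.
        terms = self._terms(query)
        if not terms:
            return []
        hits = set.intersection(*(self._index.get(t, set()) for t in terms))
        return sorted(hits)
```

With this sketch, calling store.put("b", "Elasticsearch is a search engine") followed by store.search("search engine") returns the key "b" without the caller ever touching the index directly. A real integration would also have to deal with durability, distribution, and keeping the store and the index consistent when a write fails, none of which is shown here.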
Otis
Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch
On Jul 7, 9:26 am, Ted Karmel ted.kar...@gmail.com wrote:
First off, I just want to thank Shay from the bottom of my heart, and
whoever else is behind the Elasticsearch project. It sounds like the
answer to my dreams! I get almost tearful just reading the general
overview! Really...
I am trying to learn as much as possible now... reading all the blog
posts etc. But two general conceptual questions persist in my mind
which I'm hoping the mailing list can provide some guidance for.
1) Latency and the Searching and Indexing Process
I was just wondering generally how the data is indexed. Is it indexed
incrementally whenever a search is requested (a la CouchDB)? Is it
indexed whenever data is PUT in? In other words, how is relatively
low latency ensured when those beautifully simple HTTP search requests
are made?
2) Integration with other NoSQL Projects
I am currently leaning towards Riak. But, unfortunately, it does not
yet have a searching and indexing feature. Elasticsearch seems to act
as a datastore as well. So what is the impetus to use Riak or other
NoSQL projects if ES can take care of the datastore aspect as well
(especially now in light of the Amazon EBS and S3 modules)?