Hi
Would appreciate if any of you can share your experience / thoughts on below
questions:
REST Api vs Java api - Have read that Java api is much faster as it works
at a lower level protocol. Do you guys have any comparison?
What approach do you suggest for the below mentioned use case:
I plan to index a stream of short messages (like Tweets) into ES.
Now I don't want to keep say more than a month old data. How do I flush it?
If I create 3 index files say a, b, c. How do I tell ES to search on all
these indexes?
ES seems to have good shard support. Is there a way to control these
shards on capacity?
Hi
Would appreciate if any of you can share your experience / thoughts on
below questions:
REST Api vs Java api - Have read that Java api is much faster as it
works at a lower level protocol. Do you guys have any comparison?
It depends on what exactly you measure and also on your use case. First, as
of writing the REST API can be used via
HTTPhttp://www.elasticsearch.com/docs/elasticsearch/modules/http/or
Memcachedhttp://www.elasticsearch.com/docs/elasticsearch/modules/memcached/protocols.
Memcached protocol should be faster then HTTP (and has also some
minor downsides) but it depends on your client implementation (e.g. client
can be using slow implementation of HTTP client module under the hood). When
using Java API there different
optionshttp://www.elasticsearch.com/docs/elasticsearch/java_api/client/:
TransportClient or NodeClient. TransportClient is slower then NodeClient;
however, NodeClient joins directly the cluster while TransportCient does
not. Both (Java) clients use optimized binary protocol so they are faster
then HTTP and Memcached protocols.
What approach do you suggest for the below mentioned use case:
I plan to index a stream of short messages (like Tweets) into ES.
Now I don't want to keep say more than a month old data. How do I flush it?
Hi
Would appreciate if any of you can share your experience / thoughts on
below questions:
REST Api vs Java api - Have read that Java api is much faster as it
works at a lower level protocol. Do you guys have any comparison?
It depends on what exactly you measure and also on your use case. First, as
of writing the REST API can be used via HTTPhttp://www.elasticsearch.com/docs/elasticsearch/modules/http/or
Memcachedhttp://www.elasticsearch.com/docs/elasticsearch/modules/memcached/protocols. Memcached protocol should be faster then HTTP (and has also some
minor downsides) but it depends on your client implementation (e.g. client
can be using slow implementation of HTTP client module under the hood). When
using Java API there different optionshttp://www.elasticsearch.com/docs/elasticsearch/java_api/client/:
TransportClient or NodeClient. TransportClient is slower then NodeClient;
however, NodeClient joins directly the cluster while TransportCient does
not. Both (Java) clients use optimized binary protocol so they are faster
then HTTP and Memcached protocols.
What approach do you suggest for the below mentioned use case:
I plan to index a stream of short messages (like Tweets) into ES.
Now I don't want to keep say more than a month old data. How do I flush it?
Hi
Would appreciate if any of you can share your experience / thoughts on
below questions:
REST Api vs Java api - Have read that Java api is much faster as it
works at a lower level protocol. Do you guys have any comparison?
It depends on what exactly you measure and also on your use case. First,
as of writing the REST API can be used via HTTPhttp://www.elasticsearch.com/docs/elasticsearch/modules/http/or
Memcachedhttp://www.elasticsearch.com/docs/elasticsearch/modules/memcached/protocols. Memcached protocol should be faster then HTTP (and has also some
minor downsides) but it depends on your client implementation (e.g. client
can be using slow implementation of HTTP client module under the hood). When
using Java API there different optionshttp://www.elasticsearch.com/docs/elasticsearch/java_api/client/:
TransportClient or NodeClient. TransportClient is slower then NodeClient;
however, NodeClient joins directly the cluster while TransportCient does
not. Both (Java) clients use optimized binary protocol so they are faster
then HTTP and Memcached protocols.
What approach do you suggest for the below mentioned use case:
I plan to index a stream of short messages (like Tweets) into
ES. Now I don't want to keep say more than a month old data. How do I flush
it?
just a note regarding the HTTP vs. the native transport performance. I can
get really good performance with HTTP as well compared to the native one
using Java, it really depends on the http lib used by your programming
language of choice.
Hi
Would appreciate if any of you can share your experience / thoughts on
below questions:
REST Api vs Java api - Have read that Java api is much faster as it
works at a lower level protocol. Do you guys have any comparison?
It depends on what exactly you measure and also on your use case. First,
as of writing the REST API can be used via HTTPhttp://www.elasticsearch.com/docs/elasticsearch/modules/http/or
Memcachedhttp://www.elasticsearch.com/docs/elasticsearch/modules/memcached/protocols. Memcached protocol should be faster then HTTP (and has also some
minor downsides) but it depends on your client implementation (e.g. client
can be using slow implementation of HTTP client module under the hood). When
using Java API there different optionshttp://www.elasticsearch.com/docs/elasticsearch/java_api/client/:
TransportClient or NodeClient. TransportClient is slower then NodeClient;
however, NodeClient joins directly the cluster while TransportCient does
not. Both (Java) clients use optimized binary protocol so they are faster
then HTTP and Memcached protocols.
What approach do you suggest for the below mentioned use case:
I plan to index a stream of short messages (like Tweets) into
ES. Now I don't want to keep say more than a month old data. How do I flush
it?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.