I am trying to build a ES based service. The idea is to have 16 ES
processes in the same machine (one per core) that do not communicate with
each other.
I will redirect the PUT inserts to the 16 servers using the first letter of
the md5 of {index/type/id} so I can have 16 independent ingestion
processes.
I will take care of the mapReduce operation in an upper layer.
The question is how must I configure each single ES service so they don't
speak with each other and they don't share/shard the documents.
I am trying to build a ES based service. The idea is to have 16 ES processes in the same machine (one per core) that do not communicate with each other.
I will redirect the PUT inserts to the 16 servers using the first letter of the md5 of {index/type/id} so I can have 16 independent ingestion processes.
I will take care of the mapReduce operation in an upper layer.
The question is how must I configure each single ES service so they don't speak with each other and they don't share/shard the documents.
Le 8 août 2012 à 18:35, Jesús Sanz Marcos <je...@estudiocotacero.es<javascript:>>
a écrit :
Hi,
I am trying to build a ES based service. The idea is to have 16 ES
processes in the same machine (one per core) that do not communicate with
each other.
I will redirect the PUT inserts to the 16 servers using the first letter
of the md5 of {index/type/id} so I can have 16 independent ingestion
processes.
I will take care of the mapReduce operation in an upper layer.
The question is how must I configure each single ES service so they
don't speak with each other and they don't share/shard the documents.
Sure he can. But if you have on the same LAN different versions of ES, let's say
0.17.5 and 0.19.8, you will have often exception because nodes try to connect to
each other but they can't.
I have already seen it in the past so I often disable multicast and define all
my nodes in the unicast config.
Le 8 août 2012 à 18:35, Jesús Sanz Marcos < je...@estudiocotacero.es> a
écrit :
Hi,
I am trying to build a ES based service. The idea is to have 16 ES
processes in the same machine (one per core) that do not communicate
with each other.
I will redirect the PUT inserts to the 16 servers using the first
letter of the md5 of {index/type/id} so I can have 16 independent
ingestion processes.
I will take care of the mapReduce operation in an upper layer.
The question is how must I configure each single ES service so they
don't speak with each other and they don't share/shard the documents.
Hi!
Thanks for your answers. I have configured the instances with separate folders and binaries, with configuration file like:
http.port: 90XX. /// from 9000 to 9015
index.number_of_shards: 1
discovery.zen.ping.multicast.enabled: false
But I still find documents from other instances. So if I PUT a doc in instance at port 9010, I can find it when I POST a search to instance at port 9005, for example. So servers are still speaking to each other...
Hi!
Thanks for your answers. I have configured the instances with separate folders and binaries, with configuration file like:
http.port: 90XX. /// from 9000 to 9015
index.number_of_shards: 1
discovery.zen.ping.multicast.enabled: false
But I still find documents from other instances. So if I PUT a doc in instance at port 9010, I can find it when I POST a search to instance at port 9005, for example. So servers are still speaking to each other...
Hi!
Thanks for your answers. I have configured the instances with separate folders
and binaries, with configuration file like:
http.port: 90XX. /// from 9000 to 9015
index.number_of_shards: 1
discovery.zen.ping.multicast.enabled: false
But I still find documents from other instances. So if I PUT a doc in instance
at port 9010, I can find it when I POST a search to instance at port 9005, for
example. So servers are still speaking to each other...
And now the documents are isolated in each instance. THANKS
The EC2 machine has 8 cores and I am using 16 ES instances. Each instance
is receiving PUT requests from a separate c++ application running in the
same machine. It hardly gets to 25% of CPU. Do you think that the I/O is
the bottleneck here or I could send more requests in parallel to each
instance. Have you ever seen ES at 100% during ingestion?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.