"Routing ensures that all documents with the same routing value will locate to the same shard"
Now in my configuration I have 20 shards and 4 document type. It is possible to route a document type toward 5 shards, a second document type toward another 5 shards, etc ?
Because I don't want to route a document type toward just one shard and get others document in the 19 shards left or inside the first shard, according the blog:
"Routing ensures that documents of a particular routing value all go to the same shard…but that doesn’t mean that other documents aren’t routed to the shard too."
I have 20 shards because I have 6 nodes in my cluster, it will be possible to increase the node number to 10.
So the goal is to have 2 shards per node (2 processes running on each node).
Logs commonly have timestamp. A good practice is to create rolling indices. One per timeframe you wish:
day, month, year, 10d... Whatever.
It means that you can easily drop old data with just a DELETE oldindex call. You can create an alias on top of them so you will query all the indices as if there was only one.
Well. Then if you feel that you need more horse power, just start new nodes and let elasticsearch do its job.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.