During my deployment,
My system will be accessing ES using REST API. My typical document (for storing file information) will be as follows
thumbnails (10KB max, as attachment or simple base64 string)
Hardware I am planning to use are 2x Xeon 2.4GHz with 8 GB RAM machines with 250GB HDD (2 machine for fault tolerance) with ubuntu 10.04 64 bit server (Read one message from Shay that 10.10 proved better for one bug, will drill that more)
I will interested to run queries for files of grouped for specific types like images, videos etc. for specific users.
- 10k-100k users
- each with 20K-50K file information documents.
will this be hardware configuration mentioned be sufficient infrastructure? (any past experience will be good to know)
- Any changes to code/configuration to make sure that index is stored locally as well as periodic backup on S3?
- Any suggestions to optimal memory? (I can purchase 16GB memory in place of 8 GB with initial discount from service provider)
Any pointers will be BIG help.