Hi
new to ES, i have 12 node cluster and its purpose is to capture all of the logs from apps in our 14 env - lets call them dev1-14. each env has 6 apps server and 2 rp and 2 geodes and jmp box - so 11 servers. on the app servers there might be 6-8 apps running. on the others maybe 2-4 apps.
its the logs from those app that I want to keep
I have setup filebeat and logstash. But having run into index limits I have had to re think how I do thinks.
What I have is
data view per env, so I think i need atleast a datastream per env.
I have also created a datastream per app - why some apps are rather chatty - or developers turn on debug mode and produce 100G of debug text. I would like to delete just that apps documents - i figure having a datastream per app gives me that.
Originally I had I also split that done to the app server - but I found I ended up with too many indices and I don't think it helps me.
Next I created an ilm
hot 6 days - roll at 50G - roll at 5 days
warm mark as read only, I am hoping to save resource by doing this - do i ?
then after 15 days it gets deleted.
My questions
- do I really need all of those datastreams I'm thinking all i really is a datastream per env
- how do I delete very large number of document easily if I had to so if I have a field that the app name how can i / can I say delete from this date to this date all documents that have the field == 'app name' << this would give me a process to handle apps that spew out lots of shit stuff
- how to I manage ilm - in theory the data stream should only be written to nothing should be edited - I do have to handle if the server is down - so old data might need to be inserted because it was never read to begin with - so I think 5 days in warm is enough - maybe could push it to 1 week
- should I use warm / read only does it help
- what if any advance ilm stuff should i use on these - shrink/merge ??? not sure it would help
- after say 15 or 20 or ... day I want to delete the indices - i am not using ES for long term storage. if i set it to 15 days is it from the last document written to it or is it from the day the index was created
Basically I want to keep a sliding window of say 14 / 21 days of documents and purge anything older
I don't think I can get the ilm to do that so ?
Should I roll at 1 day - but will i end up with to many indices.
Thanks