I need to store historical data from my indexes, exactly the number of request per minute that my webservices has along time. Its possible to store only this information? If i create a rollup job using only the fileds i need to extract this information, the index size becomes so big depending on the days that the job process. I only want numerical statistics. Its possible to do that?
Rollup stores extra meta information in order to provide rollup search, if you do not need rollup search, transform might be an option for you, it only stores what you are asking for and in case you want to further compress, you can tweak the mappings to use smaller data types.
Whether rollup or transform, the reduction should mainly depend on the bucket size you choose, in your case the date histogram interval.
Transform is what i need! I created a transform to extract only the information i need, but when i go to Discover and use the new index i created, theres no option to filter by time. What can i do to filter by time?
that's a current limitation, see this issue. The workaround is to either create the index pattern yourself, not using the transform wizard or you delete the already existing index pattern and create a new one. The limitation in the management UI has its own issue (contains a 3rd option: manually update the index pattern).
One more question. If i want to filter by time in Discover, Its required in the transform to group index data by Timestamp? Is there another way to do it?
I think in the group_by it makes the most sense. You can of course have time fields in aggregations, too. E.g. a last_updated field. Still if you do not group_by time, it will not result in a time series.
I am not sure I am getting your question, can you explain what you want to do?
I´m sending all traffic from my F5 load balancer to Elastic. At the moment just for 5 services, wich are indentified by the virtual_ip field. For each service, there are to many diferent requests, wich i differentiate by the http_path field. All i want to do is a transform to store in an index, total request for an specific virtual_server, total requests for an specific http_path of that virtual_server, average response time and also if its possible, status of each request. Heres my transform code:
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.