Per default transform creates buckets after the bucket is complete, that's it waits 5 minutes, if your date_histogram is configured with 5 minutes interval. This improves performance, because transform does not need to update documents. You can change this using the setting align_checkpoints. It is default true and can be set to false. This will tell transform to process incomplete buckets for the price of more updates and therefore some performance penalty. You find this setting in the docs
Note that you will still have a waiting time of at least 1 minute if your sync delay is set to 60s, because transform will only query for data that is at least 1 minute old. This setting compensates ingest delays and data coming in in different order. If you know that your configured timestamp is guaranteed to reach elasticsearch earlier, you can decrease this setting to further optimize the time to trigger the alert. An even better approach which will compensate any problem on the data ingestion is the use of an ingest timestamp as explained here. By using an ingest timestamp you can decrease the setting for sync to e.g. 5s (You can't decrease it to 0s, because the refresh interval of a lucene index per default is 1s, so I think 2s should be the minimum).
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.