99% of the items are not enriched, nor tagged with _elasticsearch_lookup_failure and have the fields empty.
Is it possible due to the fact the @timestamp difference between Start and End items is < 20ms that the lookup made by End item will not find the Start item and thus not working ?
If you are enriching events and sending them to Elasticsearch, and then trying to look up the enrichments using an Elasticsearch filter, then you need to understand that events are processed in batches. A batch of 125 events (by default) is processed through each filter before being sent to the next filter and then the output.
If two events in one batch are related then the lookup of the first will fail, because it has not been sent to Elasticsearch.
You may be able to use an aggregate filter. See Example 1. Make sure you read the restrictions around pipeline.workers and pipeline.ordered.
I have switch to Aggregate and managed to reach ~70% of items to be enriched with the total duration.
In order to have more I tried combining the two but it doesnt seem it makes Lookup on the ones he cannot aggregate (I assume due to not being in the same 125 event batch):
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.