It would be very difficult to parse all the data I've been given and to be able to do that.
I was using wildcard query before but once I reached a certain number of emails the node timeout from the searches.
I was told earlier that a reverse token filter and a suffix query would increase the search significantly, but I'm a bit puzzled trying to get it to work.
I'm using Elasticsearch-PHP to handle the searches.
Why do you say that? Possibly tools like Logstash and Node ingest could help there.
Indeed a reverse token filter could help replace loading wildcards with suffix queries, but it will require reindexing too, like the extraction of the user and domain fields.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.