I actually looked into it. I was able to use the path tokenizer so that a string such a a.b.c would create [a,a.b, a.b.c] terms. But I am not sure if that would be able to address the kind of search query I was looking for.
I looked at terms aggregation which could help with this kinda data but with lots of data, terms aggregation is not able to support this kind of real time query. For example just to find out the root token, it has to do an aggregation at that level. Imagine having a millions of documents and doing aggregation at that level.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.