I have a facet that counts shingles. Now I want to filter the facet results to
exclude some shingles, based on a simple rule that I express as a regex in a script:
There is a wildcard: notice the (.) at the beginning of the regex. Since I
want to exclude 2-fold shingles, the regex matches only the second
term. I would have preferred (\s) instead of (.), but it does not work.
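
To make the rule concrete, here is a minimal Python sketch of filtering 2-term shingles by their second term. The exclusion list and the sample shingles are made up for illustration, and Python's `re` module stands in for whatever regex engine the facet script uses. As for (\s): in a JSON string `\s` is not a valid escape sequence, so the backslash has to be doubled (`\\s`). Whether that is what went wrong here is only an assumption.

```python
import re

# Hypothetical second terms that make a 2-term shingle uninteresting.
EXCLUDED_SECOND_TERMS = ["of", "the"]

# One first term, exactly one whitespace character, then an excluded second term.
alternatives = "|".join(map(re.escape, EXCLUDED_SECOND_TERMS))
EXCLUDE = re.compile(r"\S+\s(?:" + alternatives + r")")


def keep(shingle: str) -> bool:
    """Return True when the shingle should stay in the facet results."""
    return EXCLUDE.fullmatch(shingle) is None


shingles = ["state of", "of the", "the art", "art", "new york"]
kept = [s for s in shingles if keep(s)]
```

Swapping `\s` for `.` would still reject the same shingles here, because `.` also matches a space, but it would accept any other separator character as well, which makes the rule looser than intended.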
On Thursday, June 27, 2013 2:21:58 PM UTC-4, AlexR wrote:
Why not use the terms facet exclude parameter, since you do not seem to have
any wildcard in your regexp, or a facet_filter if you need more complex logic?
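
For reference, a request body using the terms facet `exclude` parameter could look roughly like the sketch below, built as a Python dict and serialized to JSON. The field name `title.shingles`, the facet name `shingle_counts`, and the excluded terms are hypothetical. Only the general shape of the terms facet is assumed.

```python
import json

# Hypothetical field name and terms; adjust to the actual mapping.
FIELD = "title.shingles"
EXCLUDED = ["state of", "of the"]

request_body = {
    "query": {"match_all": {}},
    "facets": {
        "shingle_counts": {
            "terms": {
                "field": FIELD,
                "size": 20,
                "exclude": EXCLUDED,  # exact terms to drop from the facet
            }
        }
    },
}

# JSON payload to send as the body of the search request.
payload = json.dumps(request_body, indent=2)
```

Note that `exclude` takes exact terms, so it only fits when the unwanted shingles can be listed up front.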
Faceting on an analyzed field (such as a shingle field) increases the chance of
running out of memory at some point, because you are basically un-inverting
the Lucene inverted index and putting it into memory. Maybe there is another
solution to your problem, if you can tell us a bit more about what you are trying
to achieve (in a broader context).
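
As a rough picture of what "un-inverting" means, the toy Python sketch below turns a term-to-documents map into a document-to-terms map. The data is made up, and this is only an illustration of the idea, not how Lucene or Elasticsearch actually lay out field data.

```python
from collections import defaultdict

# Toy inverted index: term -> ids of documents containing it (made-up data).
inverted = {
    "new york": [1, 2],
    "york city": [1],
    "state of": [2, 3],
}


def uninvert(index):
    """Build doc id -> terms, the per-document view a facet needs in memory."""
    by_doc = defaultdict(list)
    for term, doc_ids in index.items():
        for doc_id in doc_ids:
            by_doc[doc_id].append(term)
    return dict(by_doc)


field_data = uninvert(inverted)
# Total (document, term) pairs that have to be held in memory.
entries = sum(len(terms) for terms in field_data.values())
```

A shingle filter emits many more distinct terms per document than a plain tokenizer, so the number of entries held in memory grows accordingly.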
--Alex
On Thu, Jun 27, 2013 at 10:04 PM, Santiago Gaviria sgaviria@gmail.com wrote: