Percolation Regular Expressions and Hit Locations

I need to search extremely large documents using a large number of regular expressions. Someone recommended that I look into the Elasticsearch percolator to accomplish this. I've searched the documentation here and Googled extensively, but I couldn't find anything that addresses these concerns:

  • Is there a limit to the size of a document or field that can be percolated (e.g., gigabytes)?
  • Is there a limit to the number of search queries that can be executed simultaneously (e.g., thousands)?
  • Can percolation match on regular expressions?
  • Can percolation return the character offsets of every match in the document instead of highlighting? For example, if I search a field containing "blah hello yellow world" for "llo", is there a flag that makes percolation return something like [7, 10], [13, 16]? As long as the format is consistent I can parse it, and I expect potentially hundreds of such intervals to be returned.
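
To make the last point concrete, here is a minimal Python sketch of the output I'm after, computed client-side with the standard `re` module. `match_spans` is a hypothetical helper of my own, not an Elasticsearch or percolator feature, and I'm assuming half-open `[start, end)` character offsets.

```python
import re


def match_spans(text, pattern):
    """Return [start, end) character offsets of every non-overlapping match."""
    return [[m.start(), m.end()] for m in re.finditer(pattern, text)]


spans = match_spans("blah hello yellow world", r"llo")
print(spans)  # [[7, 10], [13, 16]]
```

Ideally percolation would hand back intervals like these for each matching query, so I wouldn't have to rescan the document myself.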
