I have a need to search extremely large documents using large numbers Regular Expressions. Someone has recommended that I look into Elastic Search percolator to accomplish this. I've searched the documentation here and Googled extensively, but I couldn't find anything that discusses these concerns:
- Is there a limit to the size of a document / field that can be percolated (gigabytes?)?
- Is there a limit to the number of simultaneous search queries that can be executed (ie: 1000s?)?
- Can percolation search on Regular Expressions?
- Can Percolation return the indices of every search match in the document instead of highlighting (ie: if searching this field of a document "blah hello yellow world" for "llo", are there flags to make percolation return something like [7, 10], [13, 16] (as long as the format is consistent, I can parse it out and I expect potentially hundreds of such intervals to be returned)] ?