Hi,
Some of the documents that we need to index are very large (possibly up to 2 GB). Is there a recommended way to index documents of this size so that each one is still easily searchable as a unit? So far, I have split each large document into smaller documents, each carrying an id field that identifies the original document it came from. However, for queries that combine terms with AND, the terms may match in different pieces of the same original document, so I have to split the query into one query per term and then group the results by the id field to find matches against the entire document. Is there a better way to do this?
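To make the current workaround concrete, here is a minimal sketch in plain Python rather than in any particular search engine's API. The names split_into_chunks, search_term, search_and, and the parent_id field are hypothetical, and the in-memory list of chunks only stands in for the real index.

```python
def split_into_chunks(doc_id, text, chunk_size):
    """Split a document into word chunks, each tagged with its parent id."""
    words = text.split()
    return [
        {
            "parent_id": doc_id,  # hypothetical field linking a chunk to its source
            "chunk_no": i // chunk_size,
            "text": " ".join(words[i:i + chunk_size]),
        }
        for i in range(0, len(words), chunk_size)
    ]


def search_term(chunks, term):
    """Return the parent ids that have at least one chunk containing the term."""
    term = term.lower()
    return {c["parent_id"] for c in chunks if term in c["text"].lower().split()}


def search_and(chunks, terms):
    """AND over whole documents: one query per term, then intersect parent ids."""
    result = None
    for term in terms:
        ids = search_term(chunks, term)
        result = ids if result is None else result & ids
    return result or set()


# Toy stand-in for the index: two documents split into two-word chunks.
chunks = (
    split_into_chunks("A", "apple banana cherry date", 2)
    + split_into_chunks("B", "apple fig grape kiwi", 2)
)

# "apple" and "date" sit in different chunks of A, so a per-chunk AND would miss it.
matches = search_and(chunks, ["apple", "date"])
```

The intersection step is the "group by id" part: it has to happen outside the index, which is the overhead I would like to avoid.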
Thanks!