I'm interested in using ElasticSearch to compare large text documents. So,
essentially, I want to index documents that contain a large-ish 'text'
field (on average, 25,000+ characters or 3,500 words). I then want to
search this index using a query parameter on this same field with a value
of a similar size.
Is this an appropriate use-case for ElasticSearch? If so, what type of
query (e.g. match, query_string) would you recommend using? At first
glance, a match query seems best since it won't "parse" the query string.
Just playing around, I tried the match query (with the default "or"
operator) and received this error:
pyes.exceptions.ElasticSearchException: TooManyClauses[maxClauseCount is
set to 1024];
Is it simply a matter of increasing the maxClauseCount to support my
requirements?
On Thursday, September 26, 2013 4:08:45 PM UTC-4, Shawn O'Banion wrote:
Hello,
I'm interested in using Elasticsearch to compare large text documents. So,
essentially, I want to index documents that contain a large-ish 'text'
field (on average, 25,000+ characters or 3,500 words). I then want to
search this index using a query parameter on this same field with a value
of a similar size.
Is this an appropriate use-case for Elasticsearch? If so, what type of
query (e.g. match, query_string) would you recommend using? At first
glance, a match query seems best since it won't "parse" the query string.
Just playing around, I tried the match query (with the default "or"
operator) and received this error:
pyes.exceptions.ElasticSearchException: TooManyClauses[maxClauseCount is
set to 1024];
Is it simply a matter of increasing the maxClauseCount to support my
requirements?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.