Hello,
I'm working on an index on which I want to do plain-text search in several strings :
for example, for 2 documents
- Doc 1 : strings "This is an example" and "another example"
- Doc 2 : strings "This is an test" and "another test"
if I search with the string "an", I found the two documents.
if I search with the string "amp" I found Doc 1
if I search with the string "tes" I found Doc 2
if I search with the string "anoo" no document is found
To do that I use ngram_tokenizer and for a document it can have a large set of string where to search. Is it a good solution ? Is there a better one ?
When I put documents in the index several errors occur :
-
In logstash :
[2020-05-18T00:16:59,550][INFO ][logstash.outputs.elasticsearch][main] retrying failed action with response code: 429 ({"type"=>"es_rejected_execution_exception", "reason"=>"rejected execution of processing of [27859947][indices:data/write/bulk[s][p]]: request: BulkShardRequest [[test_nouv_structure_es_on_demand_index-2020.05.15][0]] containing [34] requests, target allocation id: HuXyeaExRJOhKUGOVOE5Fg, primary term: 1 on EsThreadPoolExecutor[name = l4g-centoselk02/write, queue capacity = 200, org.elasticsearch.common.util.concurrent.EsThreadPoolExecutor@158523e4[Running, pool size = 4, active threads = 4, queued tasks = 200, completed tasks = 3246756]]"}) -
In Elasticsearch :
[2020-03-17T18:18:21,949][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [l4g-centoselk02] fatal error in thread [elasticsearch[l4g-centoselk02][write][T#1]], exiting
java.lang.OutOfMemoryError: Java heap space
at org.apache.lucene.util.BytesRefHash.rehash(BytesRefHash.java:398) ~[lucene-core-8.2.0.jar:8.2.0 31d7ec7bbfdcd2c4cc61d9d35e962165410b65fe - ivera - 2019-07-19 15:05:56]
at org.apache.lucene.util.BytesRefHash.add(BytesRefHash.java:309) ~[lucene-core-8.2.0.jar:8.2.0 31d7ec7bbfdcd2c4cc61d9d35e962165410b65fe - ivera - 2019-07-19 15:05:56]
at org.apache.lucene.index.TermsHashPerField.add(TermsHashPerField.java:151) ~[lucene-core-8.2.0.jar:8.2.0 31d7ec7bbfdcd2c4cc61d9d35e962165410b65fe - ivera - 2019-07-19 15:05:56]
...
Have you information on these errors ?
Thank you for your help.