Out of memory error and duplicate rows

Is it possible that one of your documents is over 1 GB? A few years ago the maximum Java String length effectively dropped from about 2 billion characters (the length is a 32-bit int) to about 1 billion for strings that need UTF-16. That is a side effect of compact strings, which back a String with a byte array rather than a char array, so each UTF-16 character takes 2 bytes of an array that is itself capped at roughly 2 GB (2^31 - 1 bytes / 2 bytes per character ≈ 1 billion characters).

Is the pipeline getting restarted over and over again because of this exception? If so, perhaps set the document_id in the elasticsearch output to the customer_id, so that re-processed events just overwrite the existing document instead of creating duplicates.
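
Roughly something like this in the output section (a sketch only: the hosts URL and the "customers" index name are placeholders, and it assumes your events actually carry a customer_id field):

```
output {
  elasticsearch {
    hosts       => ["http://localhost:9200"]   # placeholder host
    index       => "customers"                 # placeholder index name
    document_id => "%{customer_id}"            # reuse the same _id so retries overwrite instead of duplicating
  }
}
```

With a fixed document_id the pipeline becomes idempotent: Elasticsearch indexes each customer once and simply updates that document on every restart, so the repeated runs stop producing duplicate rows.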