Large zip file 2gb content read by tika was not indexing. Indexing is hanging at bulk request indexing. Smaller zip has no issues. Ingest attachment is used for indexing. Is there any limit in ES per document?

Large zip content

dadoonet (David Pilato) December 25, 2017, 1:28pm 2

There I memory limit of the JVM, http network limit (100mb IIRC), ...

Indexing too big binary documents in elasticsearch is not a good idea IMHO.

You should do the extraction of metadata and text outside elasticsearch. FSCrawler project could help.

Topic		Replies	Views
Request Entity Too Large when index file json has size large 100mb Elasticsearch	5	1868	November 6, 2019
FSCrawler - Indexing mix of Big and small files - HTTP Entity too large error Elasticsearch	9	250	February 28, 2024
Elasticsearch considerations for ingesting large files Elasticsearch	7	2590	May 9, 2020
ElasticSearch 2.2.0 - File Too Large while bulk indexing Elasticsearch	3	1473	July 5, 2017
Unable to index a file (Word document) greater than 45 MB Elasticsearch	6	614	June 3, 2021