Fscrawler - update existing record

Andrej_K · December 13, 2021, 7:37am

Hello,
I need to search documents by its content and some custom customer data. So my idea is to use fscrawler to extract content from documents and keep it with custom customer data in one index for easy searching.

So the process is:

index document with custom customer data and ID which will use fscrawler for file processing
copy file to dedicated folder to be processed by fscrawler
fscralwer extracts content of the file and puts it to the index (pairing document by ID)

Problem here is, that fscrawler overwrites document I have created removing all my custom data. Is it possible to configure somehow fscrawler to keep these custom data? To just update the document, not completely overwrite it.

I can achieve complete data in one index by moving point 1 to the end and using 'UpdateDocument' instead of 'IndexDocument'. But then I need to wait for fscrawler data, so I would like to avoid this approach.

system · January 10, 2022, 7:37am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
FSCrawler Question Elasticsearch	7	3086	March 17, 2017
[FSCrawler] Add data in upload files Elasticsearch	4	662	July 24, 2018
Elastic Search FSCrawler Elasticsearch	4	376	December 18, 2018
Does FSCRAWLER update rate include all documents or only those modified? Elasticsearch	5	627	June 16, 2020
FSCrawler large document and indexing based on content Elasticsearch	4	2373	December 28, 2017

Fscrawler - update existing record

Related topics