Should FS crawler be used on the same server where Elasticsearch is installed?

Edu · December 1, 2018, 1:07pm

Dear All,

I started working with Elasticsearch last week, so I apologize if I am asking something too obvious. I have to develop a fast solution that lets users search inside MS Office files.

I have two Docker containers: one with the Django backend, which can access the users' files, and another running elasticsearch-6, with host=elasticsearch:9200, where I have installed the "ingest-attachment" plugin.

I am using elasticsearch_dsl and django_elasticsearch_dsl on the Django server. However, I could not find a clear explanation of how I could upload a file to the elasticsearch-6 (and I stress the SIX) server for indexing (only indexing the content, not storing it). The explanations that I found used the "Attachment" type, which works only with Elasticsearch 5.

Then I found this nice project, FS crawler, with the promise that I could use a single API call like: curl -F "file=@test.txt" -F "id=my-test" "http://127.0.0.1:8080/fscrawler/_upload".

However, it is still not clear to me how it works.
Is FS crawler a plugin for Elasticsearch or a stand-alone program just to create indexes?
Should I install FS crawler in the Elasticsearch Docker container where I want the indexes to live?
Should I install FS crawler in the Django Docker container where the files are accessible?

I would like to simply call the endpoint from my Django server:
curl -F "file=@test.txt" -F "id=my-test" "http://elasticsearch:8080/fscrawler/_upload"
and then search against "http://elasticsearch:9200/" using the elasticsearch_dsl methods. Is that possible?

Any help would be appreciated, I am a little lost here...
Best regards
Ed

dadoonet · December 1, 2018, 3:27pm

It's a stand-alone application which should run within its own container.
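
To make that concrete, here is a minimal sketch of how the Django side could send a file to FS crawler when it runs in its own container. It builds the same multipart POST as the curl command in the question. The hostname "fscrawler" is an assumption (a hypothetical container name), as is FS crawler's REST service being enabled on port 8080; the helper names are made up for illustration. Searching would still go through elasticsearch:9200 with elasticsearch_dsl, since FS crawler writes the extracted content to Elasticsearch.

```python
import uuid
import urllib.request

# Assumption: FS crawler runs in its own container, reachable as "fscrawler"
# (hypothetical hostname), with its REST service listening on port 8080.
FSCRAWLER_UPLOAD_URL = "http://fscrawler:8080/fscrawler/_upload"


def build_upload_request(filename, data, doc_id, url=FSCRAWLER_UPLOAD_URL):
    """Build a multipart/form-data POST equivalent to:
    curl -F "file=@<filename>" -F "id=<doc_id>" <url>
    """
    boundary = uuid.uuid4().hex
    body = b"".join([
        # Form field "id": the document id to use in the index.
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="id"\r\n\r\n'
         f"{doc_id}\r\n").encode(),
        # Form field "file": the raw bytes of the document.
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
         f"Content-Type: application/octet-stream\r\n\r\n").encode(),
        data,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    headers = {"Content-Type": f"multipart/form-data; boundary={boundary}"}
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def upload(filename, data, doc_id):
    """Send the file to FS crawler; only works while that container is running."""
    request = build_upload_request(filename, data, doc_id)
    with urllib.request.urlopen(request) as response:
        return response.read().decode()
```

From a Django view you would call something like upload("test.txt", file_bytes, "my-test"). Building the request separately from sending it keeps the multipart logic easy to check without a running FS crawler.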

