Localhost path to folder to automatically ingest existent files

marius03 · July 15, 2022, 7:07am

Hi,

I have installed Elasticsearch with Kibana on Localhost on Linux Mint, everything works perfectly but I can't seem to set it to automatically ingest files from certain folders.

I want it to automatically ingest all files in the folders (path):

/media/marius/2-TB-Volume/Google Drive Linux/Bug Map
/media/marius/2-TB-Volume/Google Drive Linux/Log Map

Please specify step-by-step as for a beginner (this is my first day using Elasticsearch).

So:

Which file should I edit?
How do I add the path to my folders?
Can Elasticsearch import the files in those folders if the path is to the 2nd Hard Drive?

Thanks in advance!

Sean_Story · July 15, 2022, 3:46pm

Hi @marius03 ,

Elasticsearch does not ingest files from disk on its own, it it primarily a HTTP service capable of being passed JSON documents via REST APIs. However there are a large variety of tools that Elastic has available that pair with Elasticsearch in order to index your data.

Filebeat - if your files are "plain text" (extensions like .txt, .csv, .log, .xml, etc) you can use Filebeat to read from a filesystem and index into Elasticsearch
Network Drives Connector - if your files are binary documents (.pdf, .doc, .docx, .ppt, .xls, etc) and you are interested in using Elastic Workplace Search, the Network Drives Connector Package may be what you're looking for. Note that this feature is in Beta.
FsCrawler - this is a community-built, open-source project, that has become quite popular for indexing documents into Elasticsearch. It can also index into Workplace Search.
Google Drive Connector - I notice that your example paths say Google Drive. If what you're really wanting is to index documents from Google Drive, check out the Workplace Search Google Drive Connector.
Language clients - if none of these are quite what you're looking for, there are are a wide variety of language clients to help you index data into Elasticsearch. You can write your own code to traverse and transform your files the way you want, and then ship the resulting data to Elasticsearch for storage and search.

Serena_Chou · July 18, 2022, 4:18pm

@marius03 would love to know which strategy you are employing to ingest your documents here!

system · August 15, 2022, 4:18pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to index a file with elasticsearch 5.5.1 Elasticsearch	22	7945	September 1, 2017
Index PDF in ES Elasticsearch	14	9096	April 24, 2017
Search a PDF file using its content Elasticsearch	9	15680	February 11, 2019
Question on what Elasticsearch considers data vs logs Elasticsearch	2	359	March 6, 2021
Can we index .zip file using ingest attachment plugin? Elasticsearch	13	3611	April 25, 2019

Localhost path to folder to automatically ingest existent files

Related topics