How can Index a Filesystem?

pumacy112 · June 22, 2015, 12:21pm

In elasticsearch 1.5 are Rivers deprecated. The FSRiver Plugin is not working with ElasticSearch V. 1.6.
How is a good way to Index a File System (example: C.\temp) without the FSRiver?

eperry · June 22, 2015, 12:39pm

You will have to use Logstash and File input to grab your files

pumacy112 · June 22, 2015, 12:48pm

It gives a Sample for Index a File System with Logstash?

dadoonet · June 22, 2015, 12:52pm

It won't work until we publish a Tika codec plugin IMO.

eperry · June 22, 2015, 12:54pm

The simplest would be

input{
file {
   path=>["c:\temp\*"]
}
}
output{
stdout {
codec => "rubydebug"
}
}

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-file.html

pumacy112 · June 22, 2015, 12:59pm

It is also possible to write the file datas to a own Mapping?

pumacy112 · June 22, 2015, 1:06pm

Is there a good way to do indexing a file system about the Java API from Elasticsearch?

eperry · June 22, 2015, 1:08pm

Sure you just need to issue a "PUT/POST" request

via any language or even do it with CURL
https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-index_.html

dadoonet · June 22, 2015, 1:13pm

But what kind of files are you expecting?

pumacy112 · June 22, 2015, 1:16pm

All kind of files are possible. I will scan a File Url like c:\temp. And with Tika Parser parse all file datas and write it to my Index with own Mapping. And i will find out the best method how to do it.
Its no Documented how can be automatically

poonam · July 10, 2015, 5:48am

HI,
I have a requirement where I am reading file path from database and fetching files from file server.
Like KBid=1 KBfilepath="filepath" and KBstatus="ready".
Now I need to feed file and also other contents like KBID to elasticsearch.
How do I do this logtash or with river.am using ES1.5.2

dadoonet · July 10, 2015, 6:19am

You can't at this moment.
You need to write your own code IMO.

Or wait for LS tika codec / filter. But it's not there yet.

OpenSemanticSearch · February 19, 2016, 1:17pm

You can use the files connector of Open Semantic Search: http://www.opensemanticsearch.org/etl/elasticsearch

Topic		Replies	Views
How to Index file system Elasticsearch	11	10473	July 5, 2017
Index River Alternate Elasticsearch	6	1672	July 6, 2017
[ANN] Filesystem River for Elasticsearch 0.0.1 Elasticsearch	5	386	July 6, 2017
Index Db content and linked Filesystem content Elasticsearch	3	669	September 11, 2017
[ANN] Elasticsearch File System River Plugin 1.3.1 released Elasticsearch	3	383	July 6, 2017

How can Index a Filesystem?

Related topics