[ANN] FsRiver 0.4.0 (breaking change)


(David Pilato) #1

I am pleased to announce the fsriver-0.4.0 release!

FS River Plugin offers a simple way to index local files into Elasticsearch.
See full documentation on Github: https://github.com/dadoonet/fsriver

Changes in this version include:

New features:
o Option to not delete documents when files are removed. Issue: https://github.com/dadoonet/fsriver/issues/35 .
o Store origin URL in documents. Issue: https://github.com/dadoonet/fsriver/issues/37 .
o Add option to store original file as binary. Issue: https://github.com/dadoonet/fsriver/issues/40 .

Fixed Bugs:
o Check that server/login are defined when using ssh. Issue: https://github.com/dadoonet/fsriver/issues/29 .
o Index is not updated when files are deleted on windows platform. Issue: https://github.com/dadoonet/fsriver/issues/32 . Thanks to Fabien Gaujous.

Changes:
o Update to Elasticsearch 0.90.7. Issue: https://github.com/dadoonet/fsriver/issues/36 .
o Replace mapper-attachment plugin by Tika. Issue: https://github.com/dadoonet/fsriver/issues/38 .
o New json document mapping for docs. (Breaking) Issue: https://github.com/dadoonet/fsriver/issues/39 .

Note that generated documents by the river changed with a fancier structure like:
{
"file":{
"filename":"test.odt",
"last_modified":1386855978000,
"indexing_date":"2013-12-12T13:50:58.758Z",
"content_type":"application/vnd.oasis.opendocument.text",
"url":"file:///tmp/testfs_metadata/test.odt",
"indexed_chars":100000,
"filesize":8355
},
"path":{
"encoded":"bceb3913f6d793e915beb70a4735592",
"root":"bceb3913f6d793e915beb70a4735592",
"virtual":"",
"real":"/tmp/testfs_metadata/test.odt"
},
"meta":{
"author":"David Pilato",
"title":"Mon titre",
"date":"2013-04-04T15:21:35",
"keywords":[
"fs",
"elasticsearch",
"river"
]
},
"content":"Bonjour David\n\n\n"
}
It also means that you are not forced anymore to store the original source file as a binary BASE64 encoded field but it still available as an option

See documentation at: https://github.com/dadoonet/fsriver/

Comments, issues, PR are warmly welcomed! :slight_smile:

Have fun!

--
David

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.52b83f16.98a3148.111%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/groups/opt_out.


(system) #2