Yes. It indexes the documents available on your local hard drive.
--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr
On 8 January 2014 at 12:42:56, HongXuan Ji (hxuanji@gmail.com) wrote:
OK, I will post the issue later.
About the river,
The first line: "This river plugin helps to index documents from your local file system and using SSH."
Does it mean that if I store a bunch of PDF files in a local directory, I can search those files by using the river plugin?
In fact, I started studying Elasticsearch this week and I am not sure what "file system" means here.
Thanks a lot.
Ivan
On Wednesday, 8 January 2014 at 19:32:17 UTC+8, David Pilato wrote:
Mapper attachment does not support extra field extraction. Maybe you could open an issue there? https://github.com/elasticsearch/elasticsearch-mapper-attachments
About FSRiver, I guess everything is described here: https://github.com/dadoonet/fsriver#filesystem-river-for-elasticsearch
Is there something you don't understand?
--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr
On 8 January 2014 at 12:24:11, HongXuan Ji (hxu...@gmail.com) wrote:
Hi David,
I only got the ALBUM field by using Solr's endpoint, HOST/solr/update/extract?extractOnly=true.
So it seems the mapper attachment does not support extra field extraction, right?
BTW, can you point me to a tutorial on fsriver? I am also curious what the plugin is for. What is its purpose?
Best,
Ivan
On Wednesday, 8 January 2014 at 18:23:03 UTC+8, David Pilato wrote:
I would recommend not using the mapper attachment and managing that on your side instead.
For example, I removed the mapper attachment from the fsriver project to have finer control (see https://github.com/dadoonet/fsriver/issues/38).
BTW, I'm not aware of how you can get the ALBUM field using Tika. Any pointers? It could be nice to add it to fsriver as well.
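
To make "manage that on your side" concrete, here is a minimal sketch of extracting the album on the client and sending it as a plain JSON field. It does not use Tika: it swaps in a hand-written reader for the ID3v1 tag (the fixed 128-byte block at the end of some MP3 files), so files that only carry ID3v2 tags will yield no album. The function names `read_id3v1` and `build_document` and the output field names are assumptions made for illustration.

```python
import json


def read_id3v1(data: bytes) -> dict:
    """Parse an ID3v1 tag from the last 128 bytes of an MP3 file, if present."""
    tag = data[-128:]
    if len(tag) < 128 or not tag.startswith(b"TAG"):
        return {}  # no ID3v1 tag found

    def text(raw: bytes) -> str:
        # Fields are fixed-width and padded with NUL bytes or spaces.
        return raw.split(b"\x00", 1)[0].decode("latin-1").strip()

    # ID3v1 layout: "TAG", title (30), artist (30), album (30), year (4),
    # comment (30), genre (1).
    return {
        "title": text(tag[3:33]),
        "artist": text(tag[33:63]),
        "album": text(tag[63:93]),
        "year": text(tag[93:97]),
    }


def build_document(filename: str, data: bytes) -> str:
    """Build the JSON body to index; field names here are hypothetical."""
    doc = {"filename": filename, "size": len(data)}
    doc.update(read_id3v1(data))
    return json.dumps(doc, sort_keys=True)
```

The resulting JSON can be indexed like any other document, so the album becomes an ordinary searchable field with whatever mapping you choose.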
--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr
On 8 January 2014 at 10:49:47, HongXuan Ji (hxu...@gmail.com) wrote:
Thanks for the reply.
Besides the six standard fields, I also want to get the extra fields. For example, in Solr we can extract the "album" field from an MP3 file.
Is this also supported in Elasticsearch? I just tested it: I posted an MP3 file into ES, but the indexed document contains only the six fields.
Ideas?
Thanks a lot.
On Wednesday, 8 January 2014 at 16:34:07 UTC+8, David Pilato wrote:
Have a look at https://github.com/elasticsearch/elasticsearch-mapper-attachments/blob/master/src/main/java/org/elasticsearch/index/mapper/attachment/AttachmentMapper.java#L376
You will see that mapper attachment reads:
Metadata.DATE
Metadata.TITLE
Metadata.AUTHOR
Metadata.KEYWORDS
Metadata.CONTENT_TYPE
Metadata.CONTENT_LENGTH
Does it help?
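
As a rough illustration of the consequence: whatever the extractor finds in the file, only those six entries are carried over, so a field such as the album is dropped. The sketch below models that with a plain dictionary. The lowercase key names are hypothetical stand-ins for the six Tika `Metadata` constants listed above, not the actual strings used by Tika or by the plugin.

```python
# Hypothetical key names standing in for the six Metadata constants above.
SUPPORTED_FIELDS = (
    "date",
    "title",
    "author",
    "keywords",
    "content_type",
    "content_length",
)


def keep_supported(extracted: dict) -> dict:
    """Keep only the supported metadata entries; everything else is dropped."""
    return {key: extracted[key] for key in SUPPORTED_FIELDS if key in extracted}


if __name__ == "__main__":
    # Made-up extraction result for an MP3 file.
    extracted = {"title": "Song", "album": "Record", "content_type": "audio/mpeg"}
    print(keep_supported(extracted))
```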
--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr
On 8 January 2014 at 05:05:10, HongXuan Ji (hxu...@gmail.com) wrote:
Hi all,
I am wondering which metadata fields of an MP3 file are available when I post the file into Elasticsearch using the mapper-attachment plugin.
In Solr we can see the field information through the endpoint SOLR_HOST/update/extract?extractOnly=true,
but is there any way to get this information in Elasticsearch? And besides MP3 files, what about doc files?
I know Elasticsearch uses Tika to support these operations. Can you give me an example of fetching a specific field from a specific file format?
Regards,
Ivan
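
For context, posting a file through the mapper-attachment plugin roughly means declaring a field of type `attachment` and sending the file content base64-encoded in that field. The sketch below only builds the two JSON bodies and does not talk to a cluster. The type name `doc`, the field name `file`, and the helper function names are assumptions chosen for illustration.

```python
import base64
import json


def attachment_mapping(type_name: str, field: str) -> dict:
    """Mapping body declaring `field` as an attachment (names are hypothetical)."""
    return {type_name: {"properties": {field: {"type": "attachment"}}}}


def attachment_document(field: str, raw: bytes) -> dict:
    """Document body carrying the raw file content as a base64 string."""
    return {field: base64.b64encode(raw).decode("ascii")}


if __name__ == "__main__":
    # Placeholder bytes; a real caller would read them from an MP3 or doc file.
    raw = b"example file content"
    print(json.dumps(attachment_mapping("doc", "file"), indent=2))
    print(json.dumps(attachment_document("file", raw), indent=2))
```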
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/99916f15-9e40-406b-be92-a64dd3b4b55d%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.