Attachment mapping without any meta data


(js) #1

Hi,
first of all, I really appreciate the work of elasticsearch. It is the
second application where we use elasticsearch for the search und
storage part. This time we wanted to use the attachment plugin as we
do have to store mainly text documents which shall also be searchable.
What works so far is the mapping and storing of documents, mainly .pdf
and .docs. But what is not working is the automatically adding of meta
data on the attachment. I checked it with Tika itself by simply
calling the java command line and Tika shows the meta data of a pdf
document. But when I map the pdf via elasticsearch only the content of
the file is stored. Is there any known bug which I have overseen? Or
is the mapping file wrong?
{
"myindextype" : {
"_all" : {"enabled" : true},
"properties" : {
"document" : {
"properties" : {
"content" : {
"type" : "attachment",
"store":"yes",
"fields" : {
"file" : { "store" : "yes",
"index" : "not_analyzed" },
"date" : { "store" : "yes",
"index" : "not_analyzed" },
"author" : { "store" : "yes",
"index" : "not_analyzed" },
"keywords" : { "store" : "yes",
"index" : "analyzed" }
}
}
}
}
}
}
}
Or do I have to do anything else in order to get this running?

Thank you.


(system) #2