Hi all, this is with ref. to this now closed thread.
in a nutshell, I am trying to build a filesearch solution using fscrawler but faced a problem of explosion of meta.* fields.
First the good news, simply putting raw_metadata to 'false' in _settings json, takes care of the metadata issue.
However, I am trying to restrict the indexing to specific filetypes, something like
-
-
"includes" : [ "*.txt" ],
"includes" : [ "*.doc" ],
"includes" : [ "*.docx" ],
"includes" : [ "* .pdf" ],
"includes" : [ "*.rtf" ],
-
-
I find that the index only has files of the last file.extension (in this case *.rtf). I tried this multiple times with same result. is this a bug or am I doing something wrong ?
edit : hmm, the forum SW has removed all the ["*.filetype"] except the last one, I wonder why.