Hi
I'm using the Elasticsearch ingest attachment plugin to ingest HTML files into Elasticsearch 6.1.3. When I look at the extracted attachment content, some HTML files are parsed correctly (the HTML tags are stripped), but for other HTML files the content is stored as-is, with the tags still present.
Why does this happen?
I tried changing the mapping and using different analyzers, but the results are the same.
This is the attachment mapping I'm using right now:
"attachment": {
  "properties": {
    "content": {
      "type": "text",
      "analyzer": "some_anaylzer",
      "term_vector": "with_positions_offsets",
      "index_options": "offsets",
      "index": true
    }
  }
},
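
To help narrow this down, here is a minimal sketch of how a request body for the ingest simulate API (POST _ingest/pipeline/_simulate) could be built in Python. Running one file that gets parsed and one that does not through it should make it possible to compare the attachment.content_type reported for each. The helper name build_simulate_request and the field name data are hypothetical, chosen only for illustration, and I have not confirmed that the detected content type is the cause.

```python
import base64
import json


def build_simulate_request(html: str, field: str = "data") -> dict:
    """Build a body for POST _ingest/pipeline/_simulate.

    Assumption: the pipeline has a single attachment processor reading
    base64-encoded content from `field` (a hypothetical field name).
    """
    # The attachment processor expects the file content as base64.
    encoded = base64.b64encode(html.encode("utf-8")).decode("ascii")
    return {
        "pipeline": {
            "description": "Extract attachment content (sketch)",
            "processors": [
                {"attachment": {"field": field}}
            ],
        },
        "docs": [
            {"_source": {field: encoded}}
        ],
    }


# Sample document standing in for one of the real HTML files.
sample_html = "<html><body><p>Hello world</p></body></html>"
body = build_simulate_request(sample_html)

# Print the JSON so it can be pasted into Kibana Dev Tools or sent with curl.
print(json.dumps(body, indent=2))
```

The simulate response should include the attachment fields for each document, so the content and content_type of a stripped file and an unstripped file can be compared side by side without reindexing anything.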