Is there any way to ingest an HTML file into Elasticsearch? So far I have only seen commands for ingesting JSON files.
I have a file that contains HTML content with various tags. The requirement is to store some of the content as a plain string and to keep some tags as they are in Elasticsearch.
Thanks @dadoonet
I read the plugin documentation and installed it, but I could not understand how the file should be provided to that API.
In the documentation it is shown as:
PUT my_index/my_type/my_id?pipeline=attachment
{
"data": "e1xydGYxXGFuc2kNCkxvcmVtIGlwc3VtIGRvbG9yIHNpdCBhbWV0DQpccGFyIH0="
}
How can I specify my file? Do I need to specify the file path somewhere?
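For reference, the `data` field in the request above carries the file's content encoded as Base64, not a file path. A minimal sketch of building such a request body from a local file might look like the following; the helper name `build_attachment_body` is hypothetical and not part of any Elasticsearch client.

```python
import base64
import json
from pathlib import Path


def build_attachment_body(path):
    """Return a JSON request body whose "data" field holds the file as Base64.

    Hypothetical helper for illustration; the field name "data" matches
    the example request shown in the plugin documentation.
    """
    raw = Path(path).read_bytes()                     # read the file as bytes
    encoded = base64.b64encode(raw).decode("ascii")   # Base64 text, safe inside JSON
    return json.dumps({"data": encoded})
```

The returned string can then be sent as the body of the `PUT ...?pipeline=attachment` request with whatever HTTP client is already in use.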
Thanks @dadoonet: encoding with Base64 worked. It stores the actual text in the index. Is there any way to store it with the actual HTML tags? I want to keep a few tags, such as the table tag, as they are in the index, while the rest of the data can be stored with the HTML tags removed.
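One possible approach, sketched here as an assumption rather than something confirmed in this thread, is to preprocess the HTML on the client before indexing: strip every tag except the table-related ones, then index the resulting string in an ordinary text field. The class and function names below are hypothetical, and the sketch uses only Python's standard `html.parser` module.

```python
from html.parser import HTMLParser

# Tags kept verbatim (without attributes); every other tag is dropped.
# This set is an assumption; adjust it to the tags you need to preserve.
TABLE_TAGS = {"table", "thead", "tbody", "tfoot", "tr", "th", "td"}


class TableKeepingStripper(HTMLParser):
    """Collect text content, re-emitting only the table-related tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in TABLE_TAGS:
            self.parts.append(f"<{tag}>")   # attributes are intentionally dropped

    def handle_endtag(self, tag):
        if tag in TABLE_TAGS:
            self.parts.append(f"</{tag}>")

    def handle_data(self, data):
        # Text is kept as-is; character references arrive already decoded.
        self.parts.append(data)


def strip_except_tables(html):
    """Return the text of `html` with only table markup preserved."""
    parser = TableKeepingStripper()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)
```

For example, `strip_except_tables("<p>Hello <b>world</b></p><table><tr><td>1</td></tr></table>")` keeps the table markup and reduces the paragraph to its text. Note that this sketch also keeps the text inside `script` and `style` elements, so those may need separate handling.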