I have found that Vietnamese has a parsing error in logStash
I want to know how to avoid errors.
test.xml
Tập 12 - Mười Tội Ác - The Ten Deadly Sins 2016
error message
Trouble parsing xml with XmlSimple {:source=>"message", :value=>"e>\n", :exception=>#<REXML::ParseException: #<REXML::ParseException: Missing end tag for '' (got "xml-fragment")
We parse more than 100 xml files at a time. Then an error occurs and Incorrect data is entered.
Note that we delete files stored in Elastic Search DB after 7 days.
It is difficult to find the cause of the error. There is nothing in common.
and LogStash has a bug that does not work if there are more than 5000 files to process.
so we delete the files stored in Elastic Search DB
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.