I'm trying to import some json data that is in a column in redshift. Some of the records have a field that includes invalid json escape sequences, such as this:
"work_title": "The Discrete and Semi-continuous Fr\'echet Distance with Shortcuts via Approximate Distance Counting and Selection Techniques"
"work_title": "On the \(\partial\overline{\partial}\)-Lemma and Bott-Chern cohomology"
I can remove the first one with logstash using this:
The logstash configuration compiler will always interpret a backslash before the end of a double quoted string as escaping the double quote.
The standard trick is to use a single occurrence of a character group with a single member, which is equivalent to the single character in the group: "[\\]"
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.