Since they are both the same author with different format in the document field.
Is there any way I can only extract the email from this field and update this field. I have 1.5 Million documents having this field.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.