It looks like you have periods at the beginning and ending of your regex: .BDT[0-9]{4}. -- is this intentional? If so, what is the role they're supposed to play? I believe the ending period is what's causing this regex to match BDT12345, because it indicates that any character is acceptable following the [0-9]{4} match. The 1234 section matches that part of the pattern and then the ending 5 is matched by the period. Does this help?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.