Is there a way to combine lists of tokenizers? Say that I'd like to generate "by product", "by-product" and "byproduct" from "by Product". I can configure tokenizers to generate each variant on its own, but not all of them at once.
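
To make the desired output concrete, here is a minimal sketch in plain Python. It is not Lucene or Solr code; the function name `joined_variants` and its `separators` parameter are hypothetical and only illustrate the set of tokens I would like a single analysis chain to emit.

```python
def joined_variants(phrase, separators=(" ", "-", "")):
    """Lowercase the phrase, split it on whitespace, and join the words
    once per separator, so every variant is produced in a single pass."""
    words = phrase.lower().split()
    return [sep.join(words) for sep in separators]


print(joined_variants("by Product"))
# prints ['by product', 'by-product', 'byproduct']
```

The point is that all three forms come out of one pass over the input, rather than needing a separately configured tokenizer per form.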