An alternative approach might be to split this data at indexing time in separate fields, so that you can actually query it as well as easily access it.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.