That does not work. For an input like this we want either one event containing the entire stations XML, or two events, each containing an entire station.
What we get is the first station ("<station>\n<id>1</id>\n</station>") but not the second. There is no third station to trigger the end of the second event, and EOF does not trigger emission of an event.
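To make the failure mode concrete, here is a rough Python sketch of the grouping behavior described above. It is not Logstash code: `group_events`, the `flush_at_eof` flag, and the sample input are all made up for illustration, and the sample assumes each tag sits on its own line with no wrapping element.

```python
import re


def group_events(lines, start_pattern, flush_at_eof=False):
    """Group lines into events; a line matching start_pattern begins a new event.

    Hypothetical helper that only mimics the behavior described in this thread:
    a buffered event is emitted when the NEXT start line arrives.
    """
    start = re.compile(start_pattern)
    events, buffer = [], []
    for line in lines:
        # A new start line closes the event that is currently buffered.
        if start.search(line) and buffer:
            events.append("\n".join(buffer))
            buffer = []
        buffer.append(line)
    # Without an explicit flush, whatever is still buffered at EOF is lost.
    if flush_at_eof and buffer:
        events.append("\n".join(buffer))
    return events


# Assumed sample input: two stations, one tag per line.
sample = [
    "<station>", "<id>1</id>", "</station>",
    "<station>", "<id>2</id>", "</station>",
]

without_flush = group_events(sample, r"^<station>")
with_flush = group_events(sample, r"^<station>", flush_at_eof=True)

print(len(without_flush), len(with_flush))
```

With no flush at EOF only the first station comes out, which matches what I am seeing. The second station stays in the buffer because nothing after it ever matches the start pattern.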
Have you tried it? I'd suggest turning on debug logging and watching the output. You may want to change your stat_interval to something less frequent; the default of 1s will flood the log and make it hard to pull out the relevant data.