I have a 20-node cluster that indexes 1 TB of data per day. Right now we
keep only the last 3 days of indexes open, but the customer wants us to keep
6 months of indexes open.
We don't care about query execution times, only that indexing throughput
doesn't suffer.
Is there anything we can do with the current installation, without expanding
it with many more nodes?