Yes, sharding is necessary (12 shards?).
Also keep mind you'll enough for every shard and your index needs to
be optimized after indexing.
After successfully making this fast you should try a NGramFilter for
the wildcard thing.
Regards,
Peter.
On 22 Feb., 02:05, Rich Kroll kroll.r...@gmail.com wrote:
Elasticsearch allows for configuring the number of shards and replicas,
which directly affects search performance. Considering the map/reduce
fashion that ES queries the index, I would think you could achieve good
performance against your dataset. ES breaks your documents down in the
index, so I don't think you will get any benefit from attempting to do it
yourself. Another option may be to write a custom analyizer, which creates
tokens on characters. Then you would not need the wildcards.
On Feb 21, 2011 6:47 PM, "JonathanD" prometheus...@hotmail.com wrote: