So filters are applied during dataset retrieval, before the result set is loaded into memory?
Suppose I've got machines all over the world, and each of them is geo-centric (as well as language/locale-centric). Is there any way to make the sharding/clustering transparent, so that the site can be queried from any machine in the world, while data and searches are automatically directed only to the machines/clusters that cover the geographical location and locale where that data would be stored?
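
To make the question concrete, here is a rough sketch of the behaviour I'm after, not of any particular product's API. Everything in it is hypothetical: `GeoRouter`, `ShardKey`, the region and locale codes, and the cluster names are made up for illustration. The idea is that any front-end machine could call `route()` and be pointed at the one cluster that holds data for that region and locale.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class ShardKey:
    """Hypothetical routing key: a geographic region plus a locale."""
    region: str   # e.g. "eu"
    locale: str   # e.g. "de_DE"


@dataclass
class GeoRouter:
    """Hypothetical lookup table from (region, locale) to a cluster name."""
    clusters: dict = field(default_factory=dict)

    def register(self, region: str, locale: str, cluster: str) -> None:
        # Record which cluster stores data for this region/locale pair.
        self.clusters[ShardKey(region, locale)] = cluster

    def route(self, region: str, locale: str) -> str:
        # Return the single cluster responsible for this region/locale,
        # so a write or search never touches unrelated clusters.
        key = ShardKey(region, locale)
        if key not in self.clusters:
            raise KeyError(f"no cluster registered for {region}/{locale}")
        return self.clusters[key]


# Example setup with made-up cluster names.
router = GeoRouter()
router.register("eu", "de_DE", "cluster-frankfurt")
router.register("us", "en_US", "cluster-virginia")
```

With this in place, `router.route("eu", "de_DE")` would return `"cluster-frankfurt"` no matter which machine the request arrived at. What I'd like to know is whether the sharding layer can do this kind of routing for me, rather than my having to build it into the application.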