Hello,
I need to index lots of documents in the following format:
source: {
geobox: {
type: envelope
coordinates: [[-78.538,-0.363],[-78.53,-0.371]]
}
country: EC
name: -0.363-78.538_-0.371_-78.53
population: 231
}
My type is like this:
"cell" : {
"properties" : {
u"name": {'index': 'not_analyzed','type': 'string',},
u"country": {'index': 'not_analyzed','type': 'string',},
u"population": {'type': 'integer',},
u"geobox": {'type': 'geo_shape', "tree": "geohash",
"precision": "1m"},
}
}
I tried many tuning suggestions I found, including increasing JVM RAM to
16GB, mlockall: true...
None of my tuning changes seem to cause any dramatic improvement in
indexing performance. I seem to always index 500 documents somewhere
between 30 and 60 seconds....
am indexing on one VM, part of a 4 node cluster. index has 4 shards and
zero replicas. Note that even going from one node to four did not change
the indexing speed that much!
am using Python and PyES (thrift) for indexing.
Can I hope for much better performance with somehow? what would you
suggest? or is this good enough for geo_shape indexing?
Thanks a lot,
Mohamed.
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.