Problem with geo_point search queries

Hello, we are bulk indexing some geo_point data and that is going
fine. We are encountering a problem with this query to Elasticsearch:

{ "query" : { "filtered" : { "filter" : { "geo_distance" : { "coords" : [
53.441,
14.545
],
"distance" : "2km",
"distance_type" : "arc"
} },
"query" : { "match_all" : { } }
} },
"size" : 1
}

We get this error:
http://pastebin.com/raw.php?i=FeNbQMcx

We are observing a huge jump in heap memory (set to 1 GB min/max). These are
our settings (we are using a clean Elasticsearch configuration because we
can't change the production configuration for testing):

transport.tcp.compress: true
http.max_content_length: 1000mb
jmx.create_connector: true
index.cache.field.type: soft

Without index.cache.field.type we encounter the same problem.

Our _status:
http://pastebin.com/raw.php?i=ewZdfdTN

Sorry for my English.

Hi,

In order to execute geo-related filters, the values for them need to be
loaded into memory. You simply need to increase the memory allocated to
Elasticsearch or start more nodes in the cluster.
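
For example, something along these lines; this is only a rough sketch that
assumes a 0.19.x startup script which reads the ES_HEAP_SIZE environment
variable (older scripts use ES_MIN_MEM and ES_MAX_MEM instead), and 4g is
just an illustrative value:

# give the JVM a larger fixed heap before starting the node;
# pick a size that actually fits your geo data, 4g is only an example
export ES_HEAP_SIZE=4g
bin/elasticsearch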

I'm having the same issue, but simply adding nodes is not really a viable
option for us.

We are hitting this error on a node with around 4 million records (of which
only around 40% have a location) and 2 GB of RAM, but we expect to have
around 400 million records. We can't plausibly run 100 nodes for 400
million records. I had wondered if indexing lat_lon on the field would
help, but we also have more than one geo_point on some of the docs (around
20%), and according to the docs, lat_lon indexing does not work if there is
more than one value for the field.
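
For reference, the lat_lon option I mean would go on the geo_point mapping,
roughly like this (just a sketch from my reading of the docs; "coords" is
the field from the query above, and the "item" type name is only a
placeholder):

{
  "mappings" : {
    "item" : {
      "properties" : {
        "coords" : {
          "type" : "geo_point",
          "lat_lon" : true
        }
      }
    }
  }
}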

Is there any other approach that will get us there?
