Few questions as I take over from an external Dev

M_Gibbs · November 9, 2020, 5:38am

Hey all,

I'm starting to transition our ES from our external dev to myself. I'm no developer, but can generally learn most things.

I have a few questions just to stat getting my head around things.

Our current mapping does not include a geo_point field, but we do have the coordinates. It was just mapped as text. Can I update this field type without needing to create an entirely new indice/index?
Is it possible to create a new field, and programmatically update this field with new data, for the index to then be able to return that result in near, but not quite real time? E.g. we have some data that is not mapped with avatars. We'd like to get these avatars (company logos) and update on the fly. Is this possible?
We have 300+ million records. Each does have a persistent id. Can we simply update the indice with a new dataset (from S3) or do we need to create an entirely new index each time with the S3 data?
Is logstash considered the best resource for indexing data from S3?

Thank you!

warkolm · November 9, 2020, 5:48am

You will need to reindex to cast the current text into the new format
You can create the field, then update it later, yes. Elasticsearch is near-realtime though, so not sure what that has to do with returning the field in this use
Depends what you want to achieve. If the actual number of updates is low, then consider just updating records as needed. If there is a lot of updates, then it'd be more efficient to create a new index
Depends what you need to do with the data, if it's just pulling it in then take a look at https://www.elastic.co/guide/en/beats/filebeat/current/filebeat-input-s3.html

It sounds like this is user profile-like information, is that right?

M_Gibbs · November 10, 2020, 9:46pm

Thank you @warkolm for your answers.

Yes, it's a combo of company/profile data. I'm going to look at filebeat now.

Quick question before I go down that rabbit hole - do I need to first 'map' the fields before it's indexed, or as part of the indexing process I am able to do this within filebeat?

warkolm · November 10, 2020, 10:46pm

Elasticsearch will figure it out as best it can - dynamic mapping.
When you are starting it's usually a good idea to test a bit of data, then grab the mappings, tweak as you need, then setup a template.

M_Gibbs · November 10, 2020, 10:47pm

Got it. Unfortunately the dynamic mapping we used previously, didn't pick up coordinates as a geo_point so we can't use the radius/bounding box - which is one of the reasons we need to remap and reindex.

Is setting up a template available in filebeat?

warkolm · November 10, 2020, 10:49pm

Filebeat has built in templates. But you can also define your own.

system · December 8, 2020, 10:49pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
New field or update field after data insertion? Elasticsearch	1	326	March 6, 2020
Question about adding geo_point and updating mappings in general Elasticsearch	5	406	September 19, 2019
How to Create a Field (Location) Elasticsearch	2	280	June 3, 2019
How to map new created fields into existing index? Logstash	1	473	August 17, 2017
Geo_point logging from python to elasticsearch Elasticsearch	4	3468	July 5, 2017

Few questions as I take over from an external Dev

Related topics