Hi,
to start my level of knowledge with this kind of thing is very entry level. The over all task is that there is data in Apache Hive being pushed to an elastic index (that bit is working already) and I need to enrich it as it contains an identifier field but I need to add an extra field to every document that contains a more human friendly name based on the id number passed. This data all lives in an index on the cluster.
Origionally I was told to use logstash to enrich the data but while setting about this task I foud there is an ingest node option where by you update the data before it even gets indexed which seems way more effecient.
What I'm struggling with is how to take the ID field of the incoming request, query out the human friendly name and then insert that into the document before it is indexed.
I know the query, if I pick a single record I can use the dev tools in kibana I can write a query which returns the single document I need to allow me to do the enrichment
get entity_data/_search
{
"query": {
"match": {
"entityId": 100,
}
}
}
I'm just struggling to find online how I actually use that in the pipeline to allow me to inject the entity_name into the document.
If anyone can give me any guidance or point me to the resource I need to read through it would be much appreceated.
Kind regards
Ant