Type Mismatch between Ingest Pipeline and Kibana


(Siva) #1

Hello Team,

We are facing a type mismatch issue between the Elastic Stack ingest pipeline and Kibana. Kibana is not picking up the fields as type NUMBER and instead shows them as string. Here is what we've tried:

  • Initially used the ingest node grok processor to parse data from a CSV file with the DATA pattern; the field type in Kibana is shown as string.
  • Changed the values to the NUMBER pattern (for values like 1.0 and 0.003), but then the data is not parsed at all.
  • Then changed the values to BASE16FLOAT; the ingest node processor parses the data, however Kibana still shows them as type string. Tried a new ingest pipeline and new names for the fields, and the values in Kibana are still shown as string.

Any help or suggestions on why the ingest pipeline grok processor sometimes does not pick up NUMBER, and why Kibana then does not show the type as NUMBER, would be appreciated.
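For reference, a grok capture can be cast to a numeric JSON value with a type suffix on the semantic name; a minimal sketch of such a pipeline (the pipeline name, field, and pattern here are illustrative, not from this thread):

```json
PUT _ingest/pipeline/csv-numbers
{
  "description": "sketch: cast grok captures to numeric JSON values",
  "processors": [
    {
      "grok": {
        "field": "message",
        "patterns": [
          "%{DATA:name},%{NUMBER:value:float}"
        ]
      }
    }
  ]
}
```

Note that the :float suffix only changes the JSON value the processor emits; the index mapping still decides the field type that Kibana reports.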


(David Pilato) #2

Run

GET yourindexname/_mapping

And check what the mapping for that field is. If it is text, then that explains it, and you need to adapt your mapping.
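For a field that was dynamically mapped from a string value, the response would contain something like the fragment below (index, type, and field names are placeholders); a numeric field would show "type": "long" or "float" instead:

```json
{
  "yourindexname": {
    "mappings": {
      "doc": {
        "properties": {
          "Count": {
            "type": "keyword",
            "ignore_above": 1024
          }
        }
      }
    }
  }
}
```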


(Siva) #3

Thanks Dave..

Looking at the mapping , here is what I have,

"L75Pc": {
"type": "keyword",
"ignore_above": 1024
},
and so on for all the fields..

The grok processor has it defined as BASE16FLOAT. Is there a way we can change the field type?


(David Pilato) #4

You need to provide your own mapping and reindex.


(Siva) #5

Thanks Dave..

At which level does the reindex need to be done? Is it within the ingest grok processor?
Is Kibana picking up the default data type (which is string)? We tried deleting the index and re-creating it, however we still get the data type as string.

Are there any pointers or examples you can share? A pointer to a specific document would also be helpful.


(David Pilato) #6

Basically do:

DELETE myindex
PUT myindex
{
  "mappings": {
    "doc": {
      "properties": {
        "foo": {
          "type": "integer"
        }
      }
    }
  }
}
PUT myindex/doc/1
{
  "foo": 1
}

That should work.
Once you understand what I wrote as an example, apply this to your documents and generate a correct mapping.
Then index your source (whatever it is) into this index myindex, or use the reindex API to read from your bad index and write into myindex.
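The reindex step can be sketched as follows, assuming myindex already exists with the corrected mapping (the source index name is a placeholder):

```json
POST _reindex
{
  "source": {
    "index": "badindex"
  },
  "dest": {
    "index": "myindex"
  }
}
```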

Note that if you had been using the correct type in JSON in the first place, that would be even better.
I.e., instead of indexing:

{
  "foo": "1"
}

Index

{
  "foo": 1
}

(Siva) #7

Thanks Dave.

We have now removed the indexes completely and added the following ingest grok pattern, which works fine as long as data is available:

{
  "description": "test ingest pipeline data",
  "processors": [
    {
      "grok": {
        "field": "message",
        "patterns": [
          "^%{DATA:abc.timestamp},?%{DATA:Count:int}?,?%{DATA:Average:float}?"
        ]
      }
    }
  ]
}
However, whenever there is no data, it throws an ingest error on the empty string. Do you know if there is a better way to ignore empty strings (other than the '?')? Also, is there a way we can turn on debug for ingest pipelines?

Thanks,


(David Pilato) #8

Please format your code.

Do you know if there is better way to specify ignore empty strings (other than the '?').

Read: https://www.elastic.co/guide/en/elasticsearch/reference/master/handling-failure-in-pipelines.html
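Following that page, one approach is a per-processor on_failure block, so a non-matching (e.g. empty) message is tagged instead of failing the whole document; a sketch with hypothetical field names:

```json
{
  "description": "sketch: grok with failure handling",
  "processors": [
    {
      "grok": {
        "field": "message",
        "patterns": [
          "%{NUMBER:value:float}"
        ],
        "on_failure": [
          {
            "set": {
              "field": "grok_failure",
              "value": "pattern did not match"
            }
          }
        ]
      }
    }
  ]
}
```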

Also is there way we can turn on debug for ingest pipelines.

This might help: https://www.elastic.co/guide/en/elasticsearch/reference/master/simulate-pipeline-api.html#ingest-verbose-param
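For example, a verbose simulation shows the document after each processor, which makes it easy to see where an empty message fails (the pattern and sample docs are illustrative):

```json
POST _ingest/pipeline/_simulate?verbose
{
  "pipeline": {
    "processors": [
      {
        "grok": {
          "field": "message",
          "patterns": [
            "%{NUMBER:value:float}"
          ]
        }
      }
    ]
  },
  "docs": [
    { "_source": { "message": "1.5" } },
    { "_source": { "message": "" } }
  ]
}
```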


(Siva) #9

Thanks Dave,

The latest issue seems more of a specific problem,

As it stands we now have a Grok ingest pipeline,

{
  "description": "Pipeline for parsing log",
  "processors": [
    {
      "grok": {
        "field": "message",
        "patterns": [
          "^%{DATA:abc.timestamp}\\|%{DATA:abc.org}:%{DATA:abc.xycr}:%{INT:abc.counter}\\|%{DATA:Try}\\|%{DATA:ID}?$"
        ]
      }
    },
    {
      "convert": {
        "field": "Try",
        "type": "float",
        "ignore_missing": true
      }
    },
    {
      "convert": {
        "field": "ID",
        "type": "integer",
        "ignore_missing": true
      }
    }
  ]
}
The issue we are now facing is on the Kibana side, where the fields Try and ID are shown as Text rather than Numbers.

We did multiple tests after clearing the indices and noticed that whenever the values are present, they are treated as Numbers in Kibana; when the values are not present, I think the field is being treated as an empty string, and Kibana represents it as Text. Is there a way these fields can always be treated as numbers?

In the grok processor, we did try representing them as NUMBER rather than DATA, however it throws an exception; I think NUMBER always expects a value to be present.

We are running ELK version 5.6.0. Do you know if there is a way around this?
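Since dynamic mapping fixes a field's type from the first value it sees, one way to keep Try and ID numeric regardless of which document arrives first is an explicit index template; a sketch (template name and index pattern are placeholders; 5.x uses the "template" key rather than the later "index_patterns"):

```json
PUT _template/abc-logs
{
  "template": "myindex-*",
  "mappings": {
    "doc": {
      "properties": {
        "Try": { "type": "float" },
        "ID": { "type": "integer" }
      }
    }
  }
}
```

With this in place, documents where grok leaves Try or ID unset can no longer flip the mapping to a string type.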


(system) #10

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.