Beginner questions on Supervised ML

kvtang · January 31, 2021, 8:13am

Hi all,

Sorry if this question has been asked somewhere previously but I couldn't find any relevant information.
For supervised ML, during training phase, do we have to provide label for each event that is already in Elasticsearch? Because if the event itself already has the label, why do we still need Machine Learning to predict it?

Thank you.

Tom_Veasey · January 31, 2021, 3:30pm

The idea would be that you have some labeled data, the training data, stored in an Elasticsearch index. You can use the data frame API to access the functionality to train a model from this as you observe. Note that not all the data provided to a classification job needs to be labelled and any unlabelled data will have a prediction added using the model. Furthermore, once the model has been trained we provide tools to run inference using that model on unlabelled data in the stack. For example, you can run it in an ingest processor or as part of a pipeline aggregation. There is also nothing to stop you training models elsewhere and importing them, provided we support inference for them. This github repo provides python converters for supported types and we are continuing to work on supporting additional model types for inference. Hope this helps!

kvtang · February 1, 2021, 2:21pm

Thanks a lot for the explanation.
Is it possible to have this feature whereby user can labelled their data manually on Kibana? For example the data in Elasticsearch only contains some features and user can label each event as dog or cat on Kibana?
Cause right now i was wondering if my data in Elasticsearch does not have the label, how can i add this extra field to it...

valeriy42 · February 3, 2021, 8:30am

I am afraid, you cannot add feature values manually for individual docs in Kibana. You may be able to use the update by query API or runtime fields to add additional fields to your docs if you can define a rule/query that discriminates cats and dogs in your training data.

kvtang · February 6, 2021, 11:37am

Understand, thanks a lot for your explanation

system · March 6, 2021, 11:37am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Machile Learning - supervised learning Elasticsearch elastic-stack-machine-learning	2	484	July 11, 2019
Classification of String Data Elasticsearch elastic-stack-machine-learning	3	615	February 2, 2021
Machine learning algorithm in Elastic? Elasticsearch elastic-stack-machine-learning	7	2483	September 27, 2017
Machine Learning on ES fields Elasticsearch	1	483	June 15, 2017
Supervised machine learning plugins or tools for elasticsearch? Elasticsearch	9	3400	July 5, 2017

Beginner questions on Supervised ML

Related topics