The video is slightly old: there is now a dedicated job wizard that helps you configure this kind of job.
The video shows how to find anomalies in log file patterns, but as a side effect the categorization process also produces information such as the number of unique categories per data source. You can retrieve that with the Get categories API (see "Get categories API" in the Elasticsearch Guide [7.15]).
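As a rough sketch of how you could pull those category counts out, the snippet below calls the Get categories endpoint (`GET _ml/anomaly_detectors/<job_id>/results/categories`) and summarizes the response. The base URL, the job name `my_log_job`, and the helper names are assumptions for illustration only, and authentication is left out, so adapt it to your cluster.

```python
import json
import urllib.parse
import urllib.request


def categories_url(base_url, job_id, size=100):
    """Build the Get categories URL for an anomaly detection job."""
    query = urllib.parse.urlencode({"from": 0, "size": size})
    return (
        f"{base_url.rstrip('/')}/_ml/anomaly_detectors/"
        f"{urllib.parse.quote(job_id)}/results/categories?{query}"
    )


def fetch_categories(base_url, job_id, size=100):
    """Fetch one page of categories (no auth here; add headers as needed)."""
    with urllib.request.urlopen(categories_url(base_url, job_id, size)) as resp:
        return json.load(resp)


def summarize(response):
    """Return the category count and the categories ordered by match count."""
    cats = response.get("categories", [])
    ranked = sorted(
        (
            (c.get("num_matches", 0), c["category_id"], c.get("terms", ""))
            for c in cats
        ),
        reverse=True,
    )
    return {"unique_categories": response.get("count", len(cats)), "top": ranked}


if __name__ == "__main__":
    # Hypothetical cluster address and job name.
    result = summarize(fetch_categories("http://localhost:9200", "my_log_job"))
    print(result["unique_categories"])
    for num_matches, category_id, terms in result["top"][:10]:
        print(category_id, num_matches, terms)
```

If the job uses per-partition categorization, the same endpoint accepts a `partition_field_value` query parameter to restrict the results to one partition.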