Knowledge Base loading indefinitely

I was using the AI assistant, and after finding that I could feed it information about my business, I started creating entries in the “Knowledge Base” section.

Then I realized that I could give the assistant a role, in this case a cybersecurity expert. That worked perfectly: the assistant stopped giving predictable answers and suggesting queries, and instead behaved like a consultant that answered based on the queries I made.

Now, when I try to add information to the knowledge base, it stays in a loading state indefinitely and does not let me do anything.

I am currently using version 8.19.8.

The issue began when the Security AI Knowledge Base in Kibana remained stuck on the “Setup Knowledge Base” screen. After investigation, it was determined that the root cause was a failure in the Machine Learning native controller process on node my_node-2. The ELSER model deployment repeatedly failed with errors indicating that the PyTorch inference process could not start because the native controller process had stopped. This prevented Elastic’s semantic search model from running, which in turn caused the AI Knowledge Base setup to hang indefinitely.
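For anyone hitting the same symptom, a quick way to confirm that the model deployment is the problem is to look at the output of `GET _ml/trained_models/_stats`. The snippet below is only a rough sketch: it assumes the response has already been fetched and parsed into a dict, and the `sample` data, the model ID `my-elser-model`, and the reason text are hypothetical placeholders, not output from my cluster.

```python
def unhealthy_deployments(stats: dict) -> list[tuple[str, str, str]]:
    """Return (model_id, state, reason) for every deployment not in 'started'."""
    problems = []
    for model in stats.get("trained_model_stats", []):
        deployment = model.get("deployment_stats")
        if deployment is None:
            continue  # model exists but has no deployment
        state = deployment.get("state", "unknown")
        if state != "started":
            problems.append(
                (model.get("model_id", "?"), state, deployment.get("reason", ""))
            )
    return problems


# Hypothetical, hand-written sample shaped like a trained model stats response.
sample = {
    "trained_model_stats": [
        {
            "model_id": "my-elser-model",  # placeholder name
            "deployment_stats": {
                "state": "failed",
                "reason": "placeholder reason text",
            },
        },
        {"model_id": "some-undeployed-model"},  # no deployment_stats at all
    ]
}

for model_id, state, reason in unhealthy_deployments(sample):
    print(f"{model_id}: {state} ({reason})")
```

If this lists the ELSER deployment in any state other than `started`, the Knowledge Base setup screen is probably waiting on the model rather than on Kibana itself.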

The failure was not related to memory, disk space, or AppArmor restrictions. Instead, the ML native controller process had entered a broken state and could no longer spawn new inference processes. To resolve the issue safely in a production cluster with active shards and replicas, shard allocation was temporarily restricted to primaries only. Elasticsearch on the affected node was then restarted in a controlled manner. After the restart, the native controller process was properly recreated, the cluster recovered and reallocated shards successfully, and the overall cluster health returned to green. The ELSER deployment transitioned to a healthy “started” state, and the Knowledge Base interface began functioning normally again.
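The restart sequence described above can be sketched roughly as follows. This is a minimal outline, not the exact commands that were run: it only builds and prints the requests in a Dev Tools style and sends nothing to the cluster. The helper names `restart_plan` and `render` are made up for illustration, and the steps that reset allocation and wait for green follow the usual rolling-restart pattern, which I am assuming applies here.

```python
import json


def restart_plan(node_name: str) -> list[dict]:
    """Ordered steps for restarting one node with allocation limited to primaries."""
    settings_path = "/_cluster/settings"
    return [
        {
            "step": "restrict shard allocation to primaries",
            "method": "PUT",
            "path": settings_path,
            "body": {"persistent": {"cluster.routing.allocation.enable": "primaries"}},
        },
        {
            # Done outside the REST API, e.g. through the service manager.
            "step": f"restart Elasticsearch on {node_name} and wait for it to rejoin",
            "method": None,
            "path": None,
            "body": None,
        },
        {
            # Setting the value to null restores the default (all shards).
            "step": "re-enable shard allocation",
            "method": "PUT",
            "path": settings_path,
            "body": {"persistent": {"cluster.routing.allocation.enable": None}},
        },
        {
            "step": "wait for the cluster to return to green",
            "method": "GET",
            "path": "/_cluster/health?wait_for_status=green&timeout=60s",
            "body": None,
        },
    ]


def render(step: dict) -> str:
    """Format one step as a console-style request or a comment line."""
    if step["method"] is None:
        return f"# {step['step']}"
    text = f"{step['method']} {step['path']}"
    if step["body"] is not None:
        text += "\n" + json.dumps(step["body"])
    return text


for step in restart_plan("my_node-2"):
    print(render(step))
    print()
```

Keeping the plan as plain data makes it easy to review the order before touching a production cluster; the timeout value on the health check is arbitrary and should be adjusted to the cluster's size.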
