# KubeAI (/docs/runtime/components/kubeai)



Component Category [#component-category]

Inference / model serving orchestration

Component Description [#component-description]

KubeAI is a Kubernetes-native inference operator for deploying and scaling AI models in production.

Why It Is Used [#why-it-is-used]

In BullSequana AI Runtime, KubeAI is the operational layer for model-serving workloads on Kubernetes. It scales model-serving replicas, routes inference requests to them, and integrates model serving with the rest of the platform.

Learn More [#learn-more]

* [KubeAI documentation](https://www.kubeai.org/)
* [substratusai/kubeai on GitHub](https://github.com/substratusai/kubeai)

Interacts With [#interacts-with]

* `MinIO`, which provides object storage and dedicated credentials for KubeAI.
* `vLLM` and `FasterWhisper`, the model-serving engines that KubeAI orchestrates.
* `Model Installer`, which targets the KubeAI service endpoint to register and manage models.
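
To make the idea of registering a model more concrete, the sketch below builds a manifest shaped like a KubeAI `Model` custom resource. It is illustrative only. The field names (`features`, `url`, `engine`, `resourceProfile`, `minReplicas`) follow the upstream KubeAI `Model` resource as publicly documented and should be checked against the deployed version. The helper `build_model_manifest`, the model name, the `hf://` URL, and the resource profile value are hypothetical. How `Model Installer` actually talks to KubeAI is not described here, so this does not represent its implementation.

```python
import json

# Engines named on this page. Upstream KubeAI may support others.
ALLOWED_ENGINES = {"VLLM", "FasterWhisper"}


def build_model_manifest(name, url, engine, features, resource_profile, min_replicas=0):
    """Return a dict shaped like a KubeAI Model resource.

    Hypothetical helper: field names are assumed from the upstream
    KubeAI Model CRD and are not confirmed by this page.
    """
    if engine not in ALLOWED_ENGINES:
        raise ValueError(f"unsupported engine: {engine!r}")
    if min_replicas < 0:
        raise ValueError("min_replicas must be non-negative")
    return {
        "apiVersion": "kubeai.org/v1",
        "kind": "Model",
        "metadata": {"name": name},
        "spec": {
            "features": list(features),
            "url": url,
            "engine": engine,
            "resourceProfile": resource_profile,
            "minReplicas": min_replicas,
        },
    }


# Placeholder values for illustration, not taken from BullSequana AI Runtime.
example_manifest = build_model_manifest(
    name="example-chat-model",
    url="hf://example-org/example-model",
    engine="VLLM",
    features=["TextGeneration"],
    resource_profile="nvidia-gpu-l4:1",
)

print(json.dumps(example_manifest, indent=2))
```

A manifest like this would typically be applied to the cluster as a Kubernetes resource. With `minReplicas` set to `0`, the intent is that the model holds no replicas until requests arrive, assuming scale-from-zero is enabled in the deployment.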
