Hosted components
Cast AI components installed in your Kubernetes cluster and how they map to the umbrella chart operating modes.
When you connect a cluster, Cast AI installs several components into it. Which components are present depends on the operating mode you chose during onboarding. The umbrella Helm chart (castai-helm/castai) uses tag-based modes to control the component set:
| Mode | Tag | What it provides |
|---|---|---|
| Read-only | tags.readonly=true | Cost visibility and security telemetry |
| Workload Autoscaler | tags.workload-autoscaler=true | Right-sizing of workload resource requests |
| Node Autoscaler | tags.node-autoscaler=true | Automated node provisioning and bin-packing |
| Full | tags.full=true | Node Autoscaler + Workload Autoscaler combined |
Installation methodsThese components are installed automatically when you connect a cluster using castctl, the Cast AI console, or the umbrella Helm chart. You do not need to install them individually unless you have a specific reason to do so.
Components by mode
The table below shows which components each mode installs.
| Component | readonly | workload-autoscaler | node-autoscaler | full |
|---|---|---|---|---|
| castai-agent | Yes | Yes | Yes | Yes |
| castai-spot-handler | Yes | Yes | Yes | Yes |
| castai-kvisor | Yes | Yes | Yes | Yes |
| castai-cluster-controller | — | Yes | Yes | Yes |
| castai-evictor | — | Yes | Yes | Yes |
| castai-pod-mutator | — | Yes | Yes | Yes |
| castai-workload-autoscaler | — | Yes | — | Yes |
| castai-workload-autoscaler-exporter | — | Yes | — | Yes |
| castai-pod-pinner | — | — | Yes | Yes |
| castai-live (Container Live Migration) | — | — | Yes | Yes |
You can also enable or disable any component individually using autoscaler.<component>.enabled overrides in your Helm values. Explicit overrides always take precedence over the mode tag.
Shared base components
These components are installed in every mode, including read-only.
- castai-agent sends cluster state snapshots to the Cast AI platform.
- castai-spot-handler monitors Spot Instance interruption events from your cloud provider and reports them to Cast AI. This data improves Spot reliability and interruption prediction models. Spot Handler does not take any action on nodes or workloads. Learn more in the Spot Handler documentation.
- castai-kvisor is a security and telemetry agent. In read-only mode it collects network traffic flows, resource usage statistics, GPU metrics, and storage-related metrics. When security features are enabled, it additionally performs image vulnerability scanning and Kubernetes manifest linting. See the Kvisor documentation.
Operator-managed componentsWhen clusters are onboarded using the Cast AI Operator,
castai-agentandcastai-spot-handlerare installed and managed automatically. Updates can be applied through the Cast AI console without running additional scripts.
Autoscaling components
These components are added in the workload-autoscaler, node-autoscaler, and full modes.
- castai-cluster-controller executes actions received from the Cast AI platform, such as accepting newly created nodes into the cluster and managing Container Live Migration operations. See the Cluster Controller documentation.
- castai-evictor removes pods from underutilized nodes to reduce the overall number of cluster nodes. When Container Live Migration is enabled, Evictor automatically attempts to live-migrate eligible workloads before falling back to traditional eviction. See Evictor.
- castai-pod-mutator modifies pod specifications for improved efficiency, including GPU driver injection and resource adjustments. See Pod mutations.
Workload Autoscaler components
These components are added in the workload-autoscaler and full modes.
- castai-workload-autoscaler dynamically adjusts workload resource requests based on actual usage patterns. See Workload Autoscaler.
- castai-workload-autoscaler-exporter collects workload metrics from your cluster to support recommendation generation. It is installed automatically alongside the Workload Autoscaler.
Node Autoscaler components
These components are added in the node-autoscaler and full modes.
- castai-pod-pinner controls pod placement for optimal resource usage, ensuring workloads are placed on appropriate nodes. See Pod Pinner.
- castai-live (Container Live Migration controller) manages live migration operations, including workload eligibility assessment, migration orchestration, and specialized VPC CNI management. The umbrella chart installs this component in
node-autoscalerandfullmodes, but Container Live Migration must also be enabled in your node templates before it becomes active. Supported on EKS, GKE (partial support), and AKS (partial support). See Container Live Migration for requirements and limitations.
Karpenter integration
For clusters using Karpenter as their node provisioner, an additional component replaces the standard node autoscaling path.
- castai-kentroller coordinates Cast AI optimization and automation features with Karpenter. This component is installed automatically when a Karpenter-managed cluster is connected through the onboarding script or castctl. See Karpenter Enterprise suite.
Additional components
These components serve specialized use cases and are enabled independently of the mode tags.
- castai-db-optimizer monitors database performance and provides cost optimization recommendations. See Database Optimizer.
- castai-audit-logs-receiver captures cluster events for analysis and compliance reporting. See Audit Logs Receiver.
OMNI components
When OMNI is enabled for cluster extension to other regions and cloud providers, additional components are deployed in the castai-omni namespace:
- OMNI Agent manages edge location connections and node provisioning.
- Liqo components enable multi-cluster topology and virtual node functionality (controller-manager, crd-replicator, fabric, ipam, metric-agent, proxy, webhook).
Component upgrade methods
Cast AI components are upgraded using different methods. The table below outlines the upgrade approach for each component.
| Category | Component | Upgrade method | Notes |
|---|---|---|---|
| Core | castai-agent | Manual | Upgrade via helm upgrade or the upgrade script |
| castai-spot-handler | Manual | Upgrade via helm upgrade | |
| castai-kvisor | Manual | Upgrade via helm upgrade | |
| Autoscaling | castai-cluster-controller | Auto | Can self-update via actions from the Cast AI platform |
| castai-evictor | Auto | Automatically upgraded when new versions are available | |
| castai-pod-mutator | Manual | Upgrade via helm upgrade | |
| castai-pod-pinner | Auto | Automatically upgraded when new versions are available | |
| castai-live | Manual | Upgrade via helm upgrade | |
| Workload Autoscaling | castai-workload-autoscaler | Manual | Upgrade both charts together |
| castai-workload-autoscaler-exporter | Manual | Upgraded alongside the Workload Autoscaler | |
| Karpenter | castai-kentroller | Manual | Upgrade via helm upgrade |
| Other | castai-db-optimizer | Manual | Upgrade via helm upgrade |
| castai-audit-logs-receiver | Manual | Upgrade via helm upgrade | |
| castai-omni-agent | Manual | Liqo components update automatically with the agent |
Umbrella chart upgrades
When using the umbrella chart, upgrade all components at once:
helm repo update castai-helm
helm upgrade castai castai-helm/castai -n castai-agent --reset-then-reuse-values
NoteThe
--reset-then-reuse-valuesflag requires Helm v3.14.0 or higher.
Automatic upgrades
Components marked as "Auto" are upgraded by the Cast AI platform without manual intervention. The castai-cluster-controller can also update other components (castai-evictor, castai-spot-handler, castai-agent) when it has sufficient permissions. By default, it cannot apply updates that require permission changes. You can bind a role such as cluster-admin to the castai-cluster-controller service account to enable it to manage all Cast AI components automatically. See the Cluster Controller auto-update documentation.
Self-managed component options
For environments that require control over update schedules, several components offer self-managed installation options:
- Evictor: See Manually install Evictor
- Pod Pinner: See Self-managed Pod Pinner
Self-managed components can be updated using tools like Argo CD or Helm on your preferred schedule.
Next steps
Install or customize Cast AI using the umbrella chart or individual charts.
Manage component versions and automatic updates from the Cast AI console.
Automate component lifecycle management with the Cast AI Operator.
Resolve common issues with Cast AI components.
