Hosted components
CAST AI components hosted on customer clusters
The CAST AI connection process installs several components into a customer's cluster in phases, providing different levels of functionality:
- Phase 1: Provides visibility into connected clusters without the ability to tune them. This phase operates in a read-only mode.
- Phase 2: Enables full functionality of the CAST AI platform, primarily for cluster optimization. In this phase, CAST AI can instruct clusters and cloud providers to reorganize resources for optimal performance.
Phase 1 Component - CAST AI Kubernetes Agent
The CAST AI Agent is the first component installed when connecting a new cluster. It runs as a Pod in a dedicated CAST AI namespace:
$ kubectl get pods -n castai-agent
NAME                                READY   STATUS    RESTARTS   AGE
castai-agent-7f9d7ff65b-8qm7p       1/1     Running   0          78m
castai-agent-cpvpa-56f749fb-n2wzp   1/1     Running   0          22d
- The CAST AI Kubernetes Agent sends cluster state data (snapshots) to the CAST AI central platform.
- The Cluster Proportional Vertical Autoscaler adjusts the resources allocated to castai-agent Pods based on a predefined formula.
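To check the agent's health from a script, the Pod listing can be filtered for Pods that are not fully ready. The following is a minimal sketch, assuming the default kubectl get pods column layout (NAME, READY, STATUS, RESTARTS, AGE); not_ready_pods is a hypothetical helper name, not part of any CAST AI tooling.

```shell
# Sketch: print the names of Pods that are not Running or not fully ready.
# Reads `kubectl get pods` output on stdin and assumes the default columns:
# NAME READY STATUS RESTARTS AGE
not_ready_pods() {
  awk 'NR > 1 {
    split($2, ready, "/")            # READY looks like "1/1"
    if ($3 != "Running" || ready[1] != ready[2]) print $1
  }'
}

# Usage against a live cluster (requires kubectl access):
#   kubectl get pods -n castai-agent | not_ready_pods
```

An empty result means every Pod in the namespace is Running with all of its containers ready.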
Phase 2 Autoscaling Components
When a connected cluster is promoted to Phase 2, CAST AI installs additional components to enable cost savings through cluster management:
$ kubectl get pods -n castai-agent
NAME                                         READY   STATUS    RESTARTS   AGE
castai-agent-7f9d7ff65b-8qm7p                1/1     Running   0          80m
castai-agent-7f9d7ff65b-kf2zp                1/1     Running   0          5h7m
castai-agent-cpvpa-56f749fb-n2wzp            1/1     Running   0          22d
castai-cluster-controller-757997ff6c-r6x25   1/1     Running   0          27d
castai-cluster-controller-757997ff6c-xw54g   1/1     Running   0          27d
castai-evictor-5684748495-kl2q4              1/1     Running   0          22d
castai-kvisor-787c5dd946-gmzs5               1/1     Running   0          6d18h
castai-spot-handler-44shj                    1/1     Running   0          43m
- The Cluster Controller executes actions received from the central platform, such as accepting newly created nodes into the cluster.
- The Evictor removes pods from underutilized nodes to reduce the overall number of cluster nodes.
- The Spot Handler monitors scheduled events (provided by the cloud provider's Instance Metadata Service) and relays them to the central platform. It is installed as a DaemonSet rather than a regular Deployment.
Phase 2 Security Component - Kvisor
- Kvisor performs image vulnerability scanning, Kubernetes YAML manifest linting, and provides CIS security recommendations.
Component upgrade methods
CAST AI components installed in your cluster are upgraded using different methods. Understanding which components upgrade automatically versus those requiring manual intervention helps maintain optimal cluster operation.
The table below outlines the upgrade method for each CAST AI component:
Product | Component | Upgrade Method | Description |
---|---|---|---|
Cluster Autoscaling | Agent | Manual* | Must be manually upgraded by running the upgrade script or helm command |
Cluster Autoscaling | Evictor | Auto* | Automatically upgraded by CAST AI |
Cluster Autoscaling | Spot-handler | Manual* | Must be manually upgraded using the helm command |
Cluster Autoscaling | Cluster Controller | Auto* | Automatically upgraded by CAST AI |
Cluster Autoscaling | Pod Pinner | Auto | Automatically upgraded by CAST AI |
Workload Autoscaling | Workload Autoscaler | Manual | Must be manually upgraded using the helm command |
Security | kvisor | Manual | Must be manually upgraded using the helm command |
Reporting | gpu-metrics-exporter | Manual | Must be manually upgraded using the helm command |
Reporting | Egressd exporter | Manual | Must be manually upgraded using the helm command |
Generic | audit-logs-receiver | Manual | Must be manually upgraded using the helm command |

* See the "Automatic upgrades" section below.
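For scripted checks, the table can be captured as a small lookup. This is only a sketch: the lowercase keys are illustrative identifiers chosen for this example, not official component or chart names, and the asterisk caveat from the table is reduced to a comment.

```shell
# Sketch: upgrade method per component, transcribed from the table.
# Keys are hypothetical short names used only in this example.
upgrade_method() {
  case "$1" in
    evictor|cluster-controller)   echo "auto" ;;    # Auto* in the table
    pod-pinner)                   echo "auto" ;;
    agent|spot-handler)           echo "manual" ;;  # Manual* in the table
    workload-autoscaler|kvisor)   echo "manual" ;;
    gpu-metrics-exporter|egressd-exporter|audit-logs-receiver)
                                  echo "manual" ;;
    *)
      echo "unknown component: $1" >&2
      return 1 ;;
  esac
}
```

For example, upgrade_method kvisor prints manual, and an unrecognized name returns a non-zero status.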
Automatic upgrades
Components marked as "Auto" are automatically upgraded by CAST AI to ensure you always have the latest features and security updates. Cluster administrators do not need to take any action for these components.
By default, cluster-controller can update itself by receiving an update action (scheduled by CAST AI). However, it cannot update other components, such as castai-evictor, castai-spot-handler, or castai-agent. You can explicitly bind a role, such as cluster-admin, to the castai-cluster-controller service account. This allows cluster-controller to manage all other CAST AI components automatically. For more details, visit the Cluster controller auto-update documentation.
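A binding of this kind can be expressed as a standard Kubernetes ClusterRoleBinding. The sketch below only prints a manifest for review. It assumes the service account lives in the castai-agent namespace (as the Pod listings in this guide suggest), and the binding name is made up for illustration. Note that cluster-admin grants very broad permissions, so check the Cluster controller auto-update documentation for the recommended role before applying anything.

```shell
# Sketch: print a ClusterRoleBinding that grants cluster-admin to a service account.
# Arguments (both optional): namespace, service account name.
# The metadata.name value is illustrative, not an official CAST AI name.
cluster_admin_binding() {
  ns="${1:-castai-agent}"
  sa="${2:-castai-cluster-controller}"
  cat <<EOF
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: ${sa}-cluster-admin
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: cluster-admin
subjects:
  - kind: ServiceAccount
    name: ${sa}
    namespace: ${ns}
EOF
}

# Review the output first, then apply it (requires kubectl access):
#   cluster_admin_binding | kubectl apply -f -
```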
Manual upgrades
Components marked as "Manual" require cluster administrators to perform upgrades when new versions are released. These upgrades can typically be performed using helm commands or upgrade scripts provided in the component documentation.
For detailed upgrade instructions, refer to the dedicated documentation section of each manually upgraded component.
Note
Before upgrading a manually upgraded component, always check its release notes to understand potential impacts and required actions.
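A manual helm upgrade typically follows the pattern sketched below. Several names here are assumptions rather than facts taken from this page: the repository alias castai-helm, the convention that the release name matches the chart name, and the castai-agent namespace. Confirm the exact command in the component's own documentation. The helper name upgrade_component and the DRY_RUN variable are hypothetical.

```shell
# Sketch: upgrade one helm release, keeping its currently deployed values.
# Assumptions: repo alias "castai-helm", release name == chart name.
# Set DRY_RUN=1 to print the commands instead of running them.
upgrade_component() {
  release="$1"
  ns="${2:-castai-agent}"
  run="${DRY_RUN:+echo}"        # expands to "echo" only when DRY_RUN is set
  $run helm repo update &&
  $run helm upgrade "$release" "castai-helm/$release" -n "$ns" --reuse-values
}

# Example (prints the commands for review without touching the cluster):
#   DRY_RUN=1 upgrade_component castai-agent
```

The --reuse-values flag keeps the values from the existing release, so settings supplied at install time are not reset by the upgrade.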
User Roles
CAST AI offers the following predefined roles:
- Owner: Full access to clusters, billing, and organization management.
- Member: Full access to clusters, view-only access to billing.
- Analyst: Full access to cost monitoring, view-only access to clusters and billing.
- Viewer: View-only access to clusters and billing.
This guide outlines the key components and roles within the CAST AI platform, providing an overview of its functionality and integration with customer clusters.