Component control dashboard
The Component control feature in Cast AI provides a centralized hub for managing all Cast AI components across your connected Kubernetes clusters. Instead of navigating through multiple dashboards or using various CLI commands, you can now monitor, update, and troubleshoot all components from a single interface.
Overview
When you access the Component control dashboard, you'll be greeted with a comprehensive view of your Cast AI ecosystem. The dashboard presents a health snapshot of your components, showing how many are healthy, which need attention, and which might be disconnected across your infrastructure.
This visibility is crucial when managing multiple clusters, as it helps identify potential issues before they affect your operations. The dashboard is your command center for maintaining optimal Cast AI functionality throughout your organization.
Accessing Component Control
To access the Component management:
-
In the Cast AI console, select Manage Organization in the top right
-
Navigate to Component control in the left menu. Once there, you'll have access to all component management capabilities across your organization.
Understanding the Dashboard
The Component management dashboard is organized into several key sections that provide high-level insights and detailed component information.
Component Statuses Summary
At the top of the dashboard, you'll find a visual summary of component health across your organization. This section gives you an immediate understanding of your overall component health, broken down into three categories:
- Healthy components are running the latest version and functioning properly
- Warning status indicates components that need attention, typically requiring an update
- Disconnected components are not functioning or cannot be reached

This summary helps you quickly gauge the health of your infrastructure and prioritize which areas need attention.
Mothership Status
The Mothership Status indicator shows the connection health between your clusters and the Cast AI control plane. This information includes:
- Connection status (Active/Inactive)
- Last 30 Days Uptime percentage
A healthy connection ensures that your clusters can communicate with Cast AI services, which is essential for most features.
Component List
The main section of the dashboard displays a comprehensive list of all available Cast AI components. For each component, you'll see:
- The category it belongs to (Autoscaler, Security, Reporting, etc.)
- A brief description of its purpose and function
- The type of feature tier it belongs to (free or paid)
- Installation status across your clusters
- Health status across all installations
This list becomes your primary tool for managing the Cast AI component lifecycle in your organization.
Managing Components
The Component control interface makes it easy to drill down into specific components and take action when needed.
Viewing Component Details
When you need more information about a specific component, click on its row in the list. This takes you to a dedicated page showing:
- A detailed description of the component's purpose
- Current and latest available versions
- Installation status across all your clusters
- Actions you can take for each installation
Finding What You Need
As your infrastructure grows, you may have dozens of components across many clusters. The Component control dashboard includes powerful filtering and search capabilities:
- Use the search bar to find specific components by name or function quickly
- Filter by feature to focus on specific aspects of Cast AI functionality in your infrastructure
- Filter by health status to prioritize components that need attention
These tools help you focus on what matters most, especially when managing large-scale deployments. The example above filters only those Autoscaler-related components that need attention.
Installing and Updating Components
When a component needs to be installed or updated, Cast AI provides the necessary commands right in the interface. Here's how it works:
-
Identify and select a component from the list that needs attention
-
Click the Fix or Update button next to that component
-
A drawer opens with detailed information about the required action
-
Copy the provided script or helm command
-
Run the command in your terminal with access to your cluster
Note for Terraform-managed components
For components that were installed using Terraform, the Component control interface cannot generate direct update commands. In these cases, the drawer will provide a link to the relevant documentation, and you'll need to update your Terraform configuration manually according to the documentation guidelines.
This workflow makes it easy to keep your components up to date without having to manually search for the correct commands or configurations.
Component Ecosystem
Cast AI's automation functionality is delivered through a collection of specialized components, each serving a specific purpose in your Kubernetes optimization strategy. This way, you can choose only to run what is required for your use case in your infrastructure.
Autoscaler Components
The Autoscaler group forms the core of Cast AI's cost optimization capability:
Component | Function |
---|---|
castai-agent | Collects and reports cluster data for optimization. See About the read-only agent. |
evictor | Implements node deletion policies to reduce waste. See Evictor. |
spot-handler | Manages Spot Instance interruptions gracefully. See Spot handler. |
cluster-controller | Performs various Kubernetes management actions. See Cluster controller. |
pod-mutator | Modifies pod specifications for improved efficiency. See Pod mutations. |
pod-pinner | Controls pod placement for optimal resource usage. See Pod Pinner. |
These components work together to ensure your clusters maintain optimal performance while minimizing costs.
Workload Optimization
Workload optimization components fine-tune how your applications run:
Component | Function |
---|---|
workload-autoscaler | Dynamically adjusts workload resource requests. See Workload autoscaling. |
By optimizing at the workload level, these components help you get the most out of your infrastructure.
Security Components
Security components help maintain the safety and compliance of your environment:
Component | Function |
---|---|
kvisor | Performs security scanning and monitoring. See Kvisor security agent. |
audit-logs-receiver | Captures security events for analysis. See Audit log exporter. |
These tools improve your security posture without requiring separate security platforms.
Reporting Components
Reporting components provide visibility into specialized metrics:
Component | Function |
---|---|
castai-gpu-exporter | Captures GPU usage metrics. See GPU metrics exporter. |
egressd-exporter | Collects network traffic information. See egressd. |
This data helps you understand specific aspects of your infrastructure that traditional monitoring might miss.
AI Enabler Components
AI Enabler components help you optimize and manage Large Language Model (LLM) usage across your infrastructure:
Component | Function |
---|---|
ai-optimizer-proxy | Routes LLM requests to the most appropriate provider based on cost and performance. See AI Enabler. |
This component helps reduce LLM costs and provides flexibility in model selection.
Database Optimization
Database optimization components help improve the performance and reduce the cost of your database infrastructure:
Component | Function |
---|---|
db-optimizer | Monitors database performance and provides cost optimization recommendations. See Database Optimizer. |
By optimizing at the database level, this component helps you reduce database costs while maintaining or improving performance.
Troubleshooting Components
When components show warning or disconnected statuses, Component control makes troubleshooting straightforward:
- Click on the component row to view detailed information
- Check the specific error reported for each affected cluster
- Use the Fix or Update button to get the appropriate remediation command
- Run the command in your environment
- Verify that the status returns to healthy after a few minutes
If you need more diagnostic information, you can check component logs directly in your cluster:
kubectl logs -l app.kubernetes.io/name=<component-name> -n castai-agent
This direct access to logs helps you identify and resolve issues quickly.
Infrastructure as Code Support
For organizations that manage infrastructure as code, most Cast AI components can be deployed and managed through Terraform. However, the Component control dashboard doesn't currently generate a Terraform update configuration. Instead, when a Terraform-managed component needs attention, the dashboard will:
- Identify the component as Terraform-managed
- Provide a link to the relevant documentation
- Guide you to update your Terraform configuration manually
This approach ensures that your infrastructure-as-code workflow remains consistent while still benefiting from the monitoring capabilities of the Component control dashboard.
See Terraform provider for more information on managing Cast AI components with Terraform.
Updated 19 days ago