Autoscaler settings
Cluster CPU limits policy
The cluster CPU limits policy lets you set boundaries on the total number of vCPUs available across all worker nodes in your cluster. This policy helps manage cluster scaling and resource allocation.
How it works
- When enabled, this policy keeps the cluster CPU resources within the defined minimum and maximum limits.
- The cluster will not scale down below the minimum CPU limit or scale up above the maximum CPU limit, even if there are pending Pods.
Configuring CPU limits policy
You can configure the CPU limits policy through the Cast AI Console or one of our API endpoints.
The Cast AI Console
- Navigate to Cluster -> Autoscaler -> Settings.
- Under Limits, toggle the CPU limit policy to On.
- Set the minimum and maximum CPU cores as needed.

CPU limit policy
API
- Use the Cast AI policies API endpoint: /v1/kubernetes/clusters/{clusterId}/policies
- Set the CPU limit values accordingly:

```json
"clusterLimits": {
  "enabled": true,
  "cpu": {
    "minCores": 1,
    "maxCores": 20
  }
}
```

The new settings will propagate immediately after being applied.
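For example, you could apply these limits with a request along the following lines. This is a minimal sketch that assumes the policies endpoint accepts a PUT with the policy document and that $CASTAI_API_TOKEN and $CLUSTER_ID hold your API key and cluster ID; in practice, fetch the current policies first and change only the clusterLimits section so other settings are preserved.

```bash
# Sketch: enable cluster CPU limits via the policies API.
# Assumes a PUT with a policy document; $CASTAI_API_TOKEN and $CLUSTER_ID are placeholders.
curl -X PUT \
  -H "X-API-Key: $CASTAI_API_TOKEN" \
  -H "Content-Type: application/json" \
  "https://api.cast.ai/v1/kubernetes/clusters/$CLUSTER_ID/policies" \
  -d '{
    "clusterLimits": {
      "enabled": true,
      "cpu": {
        "minCores": 1,
        "maxCores": 20
      }
    }
  }'
```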
Scoped mode
Scoped mode limits node autoscaling by requiring Pods to have specific tolerations to trigger scale-up, and restricting downscaling to only Cast AI-provisioned nodes.
Only Pods that tolerate the scheduling.cast.ai/scoped-autoscaler taint will cause the autoscaler to add new nodes, and only nodes with the provisioner.cast.ai/managed-by=cast.ai label can be removed during downscaling.
When to use scoped mode
Scoped mode is useful in scenarios where:
- You want to maintain existing node groups while autoscaling only Cast AI-managed infrastructure
- You're gradually migrating from traditional node groups to Cast AI node templates
- You need to ensure certain workloads remain on specific, non-Cast-managed nodes
- You want to prevent the autoscaler from downscaling manually provisioned nodes
How scoped mode works
When scoped mode is enabled:
- Downscaling operations will only target nodes that have the label provisioner.cast.ai/managed-by=cast.ai
- The default-by-castai node template receives an additional taint: scheduling.cast.ai/scoped-autoscaler:NoSchedule
- Pods must tolerate this taint to be scheduled on nodes created from the default template
- Custom node templates do not automatically receive this taint and must be configured manually if needed
- Existing Cast AI-managed nodes are not automatically tainted when scoped mode is enabled
- Manually provisioned nodes and existing node groups remain untouched by autoscaling operations
This ensures that:
- Only nodes created and managed by Cast AI participate in autoscaling
- Existing node groups and manually managed nodes remain stable
- You maintain full control over which parts of your infrastructure are autoscaled
Enable scoped mode
Scoped mode can only be configured via the Cast AI API or Terraform and is not available through the Cast AI console.
Use the policies API endpoint to enable scoped mode:
```json
{
  "nodeDownscaler": {
    "enabled": true
  },
  "isScopedMode": true
}
```

Configure Pods for scoped mode
When scoped mode is enabled, Pods that should be scheduled on autoscaler-managed nodes must include the following toleration:
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-application
spec:
  template:
    spec:
      tolerations:
        - key: "scheduling.cast.ai/scoped-autoscaler"
          operator: "Exists"
          effect: "NoSchedule"
      # Additional Pod configuration
```

Pods without this toleration will not be scheduled on autoscaler-managed nodes and will remain on your existing node groups or manually provisioned nodes. The autoscaler will also not consider Pods lacking this toleration when deciding to scale up the cluster and add nodes for more capacity, even if these Pods are unschedulable.
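If workloads are stuck as unschedulable because they lack this toleration, a standard kubectl query (not specific to Cast AI) can help you spot them:

```bash
# List Pods stuck in Pending; in scoped mode these may be workloads
# missing the scoped-autoscaler toleration.
kubectl get pods --all-namespaces --field-selector=status.phase=Pending
```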
Existing workloads

When enabling scoped mode on a cluster with existing Cast AI-managed nodes, those nodes are not automatically tainted. You must update existing Pods with the appropriate toleration before the nodes are replaced or recreated with the scoped-autoscaler taint.
Identify which nodes are managed by Cast AI
You can identify which nodes are managed by the autoscaler by checking for the Cast AI management label:
```bash
kubectl get nodes -l provisioner.cast.ai/managed-by=cast.ai
```

Nodes without this label will be excluded from autoscaling operations when running in scoped mode.
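Conversely, to list nodes that are not managed by Cast AI and will therefore be left untouched by scoped autoscaling, you can negate the selector:

```bash
# Nodes without the Cast AI management label are ignored in scoped mode.
kubectl get nodes -l 'provisioner.cast.ai/managed-by!=cast.ai'
```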
Verify scoped mode operation
To verify that scoped mode is working correctly:
- Check the autoscaler policy configuration:

```bash
curl -H "X-API-Key: $CASTAI_API_TOKEN" \
  "https://api.cast.ai/v1/kubernetes/clusters/$CLUSTER_ID/policies"
```

Look for "isScopedMode": true.

- Verify nodes are tainted:

```bash
kubectl get nodes -l provisioner.cast.ai/managed-by=cast.ai -o json | \
  jq '.items[].spec.taints[] | select(.key=="scheduling.cast.ai/scoped-autoscaler")'
```
Disable scoped mode
When disabling scoped mode, the scheduling.cast.ai/scoped-autoscaler:NoSchedule taint is not automatically removed from the default-by-castai node template. You must manually remove the taint after disabling scoped mode.
```json
{
  "nodeDownscaler": {
    "enabled": true
  },
  "isScopedMode": false
}
```

After disabling scoped mode, manually remove the taint from your node templates to restore normal scheduling behavior.
Do not remove the scoped-autoscaler taint manually

Manually removing the taint on the default-by-castai node template without disabling scoped mode risks improper node downscaling, potentially deleting non-Cast AI nodes.
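If nodes provisioned while scoped mode was active still carry the taint after you disable it, one option is to clear it directly with kubectl. This is a minimal sketch, assuming scoped mode has already been disabled and the taint removed from the node template:

```bash
# Sketch: strip the scoped-autoscaler taint from existing Cast AI-managed nodes
# after scoped mode has been disabled (the trailing "-" removes the taint).
kubectl taint nodes -l provisioner.cast.ai/managed-by=cast.ai \
  scheduling.cast.ai/scoped-autoscaler:NoSchedule-
```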
Combine scoped mode with Evictor
Scoped mode can be used independently for the autoscaler or in combination with Evictor's scoped mode.
- Autoscaler scoped mode only: Autoscaler provisions and downscales only Cast AI-managed nodes, but Evictor can bin-pack workloads across all nodes
- Evictor scoped mode only: Evictor bin-packs only Cast AI-managed nodes, but the autoscaler can provision nodes for any workload and downscale any node
- Both in scoped mode: Complete isolation where both autoscaler and Evictor only affect Cast AI-managed nodes
For Evictor (via Helm):
```bash
helm upgrade --install castai-evictor castai-helm/castai-evictor -n castai-agent \
  --set dryRun=false,scopedMode=true
```

For more information on Evictor's scoped mode and configuration, see the Evictor documentation.
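To confirm the setting took effect, you can inspect the user-supplied values of the deployed release:

```bash
# Show the values set on the Evictor release, including scopedMode.
helm get values castai-evictor -n castai-agent
```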
