
Connection Pooling

DBO provides connection pooling for both PostgreSQL and MySQL databases. Select your database to get started.

PostgreSQL

Connection pooling via PgDog — includes application compatibility guidance.

MySQL

Connection pooling via ProxySQL — transparent to your application, no compatibility constraints.