November 17th, 2025

NVIDIA recommends that Kubernetes clusters enable Coherent Driver-Based Memory

Services

## Feature Feature NVIDIA recommends that Kubernetes clusters enable Coherent Driver-Based Memory Management (CDMM) to resolve memory over-reporting. CDMM is enabled by default on A4X nodes running the R580 GPU driver in GKE clusters with the following versions: * **1.33 or later**: 1.33.4-gke.1036000 or later * **1.32**: 1.32.8-gke.1108000 or later CDMM allows GPU memory to be managed through the driver instead of the operating system (OS), avoiding OS onlining of GPU memory, and exposing the GPU memory as a Non-Uniform Memory Access (NUMA) node to the OS. For more information about CDMM, see[Hardware and Software Support](https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-580-65-06/index.html#hardware-software-support). To create GKE clusters with A4X, see the following documents: * [Create an AI-optimized GKE cluster with default configuration](https://cloud.google.com/ai-hypercomputer/docs/create/gke-ai-hypercompute) * [Create a custom AI-optimized GKE cluster which uses A4X](https://cloud.google.com/ai-hypercomputer/docs/create/gke-ai-hypercompute-custom-a4x)