Maintained with ☕️ by
IcePanel logo

You can now determine the status and health of a TPU slice and partition by monitoring these new beta system metrics

Share

Services

## Feature Feature You can now determine the status and health of a TPU slice and partition by monitoring these new beta system metrics: * `kubernetes.io/accelerator/slice/state`: Indicates the current status of the slice. * `kubernetes.io/accelerator/partition/state`: Indicates the health of the partition. For more information, see the [GKE system metrics](https://docs.cloud.google.com/monitoring/api/metrics%5Fkubernetes#kubernetes-kubernetes) documentation.