Saxml on GKE is de-prioritized beginning April 24, 2025. This means the project won't get further updates
Share
Services
## Deprecate
Saxml on GKE is de-prioritized beginning April 24, 2025\. This means the project won't get further updates. Existing Saxml deployments will continue to function as is without disruption. We _strongly suggest_ that you migrate to [JetStream](https://github.com/google/JetStream), Google's up to date open source inference framework for high-performance LLM serving on TPUs and GPUs. JetStream offers continuous batching and quantization for better throughput and memory efficiency. For a migration example, see [Serve Gemma using TPUs on GKE with JetStream](https://cloud.google.com/kubernetes-engine/docs/tutorials/serve-gemma-tpu-jetstream).
What else is happening at Google Cloud Platform?
Read update
Services
Share
Google Distributed Cloud (software only) for VMware 1.31.400-gke.110 is now available for download
about 5 hours ago
Services
Share
Google Distributed Cloud (software only) for VMware 1.31.400-gke.110 is now available for download
about 5 hours ago
Services
Share
8 AlloyDB recommenders are now generally available (GA). For more information, see the following pages
about 7 hours ago
Services
Share
New SAP certifications: Additional M4 memory-optimized machine types
about 10 hours ago
Services
Share