Maintained with ☕️ by

October 16th, 2025

Google Cloud Platform

VLLM TPU vLLM TPU, a highly-efficient serving framework for large language models (LLM) that's

Share

Services

## Feature **vLLM TPU** [vLLM TPU](https://cloud.google.com/vertex-ai/generative-ai/docs/open-models/vllm/use-vllm-tpu), a highly-efficient serving framework for large language models (LLM) that's optimized for [Cloud TPU](https://cloud.google.com/tpu) hardware, is available through Model Garden.

What else is happening at Google Cloud Platform?

Anthos GKE - October 29th, 2025 []

about 2 hours ago

Services

Share

BigQuery - October 29th, 2025 []

about 3 hours ago

Services

Share

Cloud SQL for MySQL - October 28th, 2025 []

about 22 hours ago

Services

Share

1.27.2-asm.1 is now available for in-cluster Cloud Service Mesh

1 day ago

Services

Share

1.27.2-asm.1 is now available for in-cluster Cloud Service Mesh

1 day ago

Services

Share

Cloud Identity-Aware Proxy - October 28th, 2025 []

1 day ago

Services

Share