The AI assistant in Vertex AI Studio can help you refine and generate prompts
Share
Services
## Feature
The AI assistant in Vertex AI Studio can help you refine and generate prompts. This feature is in [Preview](https://cloud.google.com/products#product-launch-stages). To learn more, see [Use AI-powered prompt writing tools](https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/ai-powered-prompt-writing).
## Feature
[Prompt Guard](https://console.cloud.google.com/vertex-ai/publishers/meta/model-garden/prompt-guard) and [Flux](https://console.cloud.google.com/vertex-ai/publishers/black-forest-labs/model-garden/flux1-schnell) were added to [Model Garden](https://cloud.google.com/vertex-ai/generative-ai/docs/model-garden/explore-models).
## Feature
You can deploy Hugging Face models on Google Cloud that have [text embedding inference](https://huggingface.github.io/text-embeddings-inference/) enabled or [pytorch inference](https://huggingface.co/docs/inference-endpoints/supported%5Ftasks) enabled. For more information, see the [Hugging Face model deployment](http://console.cloud.google.com/vertex-ai/model-garden;action=deploy;hfSource=true) in the console.
## Change
Added multiple deployment settings (with A100-80G and H100) and sample requests for some popular models, including [Llama 3.1](https://console.cloud.google.com/vertex-ai/publishers/meta/model-garden/llama3%5F1), [Gemma 2](https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/gemma2), and [Mixtral](https://console.cloud.google.com/vertex-ai/publishers/mistral-ai/model-garden/mixtral).
## Change
Added dynamic LoRA serving for [Llama 3.1](https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/community/model%5Fgarden/model%5Fgarden%5Fpytorch%5Fllama3%5F1%5Fdeployment.ipynb) and [Stable Diffusion XL](https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/community/model%5Fgarden/model%5Fgarden%5Fpytorch%5Fstable%5Fdiffusion%5Fxl%5Flora.ipynb).
What else is happening at Google Cloud Platform?
The CPU allocation setting has been renamed to Billing in the Google Cloud console for Cloud Run services
December 13th, 2024
Services
Share
Google Kubernetes Engine (GKE) - December 13th, 2024 [Feature]
December 13th, 2024
Services
Share