Google Cloud - Community

A collection of technical articles and blogs published or curated by Google Cloud Developer Advocates. The views expressed are those of the authors and don't necessarily reflect those of Google.

Control your Generative AI costs with the Vertex API’s context caching

minherz
Google Cloud - Community
Nov 18, 2024

--

If you were looking to optimize costs when using generative models on Vertex AI please, the post that I wrote with my colleague Nim Jayawardena, can be helpful. The post explores how to use Vertex AI’s context caching to reduce the cost of using Gemini models with large, repeated contexts. It shows criteria for using context caching together with the code samples. Since the platform does not support posting content with multiple authors, you can find the post in leoy.blog or in the Nim’s Medium blog.

--

--

Google Cloud - Community
Google Cloud - Community

Published in Google Cloud - Community

A collection of technical articles and blogs published or curated by Google Cloud Developer Advocates. The views expressed are those of the authors and don't necessarily reflect those of Google.

minherz
minherz

Written by minherz

DevRel Engineer at Google Cloud. The opinions posted here are my own, and not those of my company.

No responses yet