Introduction
In my recent blog post, I’ve been exploring Vertex AI’s Gemini REST API and mainly talked
about the
streamGenerateContent
method which is a streaming API.
Recently, a new method appeared in Vertex AI docs:
generateContent
which is the non-streaming (unary) version of the API.
In this short blog post, I take a closer look at the new non-streaming
generateContent API and explain why it makes sense to use as a simpler API when
the latency is not super critical.




