It took me a while but I think the difference between Vertex and Gemini APIs is ...

happyopossum · 2025-06-25T23:14:57 1750893297

> I think the difference between Vertex and Gemini APIs is that Vertex is meant for existing GCP users and Gemini API for everyone else

Nahh, not really - Vertex has a HUGE feature surface, and can run a ton of models and frameworks. Gemini happens to be one of them, but you could also run non-google LLMs, non LLM stuff, run notebooks against your dataset, manage data flow and storage, and and and…

Gemini is “just” an LLM.

fooster · 2025-06-25T15:48:55 1750866535

The other difference is that reliability for the gemini api is garbage, whereas for vertex ai it is fantastic.

nikcub · 2025-06-25T20:09:07 1750882147

The key to running LLM services in prod is setting up Gemini in Vertex, Anthropic models on AWS Bedrock and OpenAI models on Azure. It's a completely different world in terms of uptime, latency and output performance.

shpat · 2025-06-26T00:32:51 1750897971

Have you had any luck getting your Claude quota bumped on Bedrock? I tried working through AWS support but got nowhere. Gave up and used Vertex + Gemini

com2kid · 2025-06-26T06:18:58 1750918738

Does OpenAI on azure still have that insane latency for content filtering? Last time I checked it added a huge # to time to first token, making azure hosting for real time scenarios impractical.

shakna · 2025-06-26T10:11:37 1750932697

Yes.

Unless you convince MS to let you at the "Provisioned Throughput" model. Which also requires being big enough for sales to listen to you.

throwaway1550 · 2025-06-26T16:35:58 1750955758

Ex-googler here. Google shipped their org hierarchy here.

Vertex API is managed by Vertex team in Google Cloud. This is a production ready infrastructure that is SRE managed but usually one or two steps from the bleeding edge.

Gemini API, Jules etc are built by Google Labs. This is close to the bleeding edge but not as production ready.

nprateem · 2025-06-25T19:13:44 1750878824

Which would all be fine except some models like Imagen 4 only work on vertex.