Getting a lot of "service unavailable" errors on gemini-2.0-flash

503 UNAVAILABLE. {'error': {'code': 503, 'message': 'The model is overloaded. Please try again later.', 'status': 'UNAVAILABLE'}}
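For anyone else hitting this: a 503 UNAVAILABLE is a transient error, so the usual mitigation is to retry with exponential backoff rather than fail immediately. A minimal sketch, where `ServiceUnavailable` is a placeholder for whatever exception type your SDK raises for 503s:

```python
import random
import time


class ServiceUnavailable(Exception):
    """Placeholder for the SDK's 503 UNAVAILABLE error type."""


def with_backoff(fn, max_retries=5, base_delay=1.0, max_delay=30.0):
    """Call fn(), retrying on 503s with exponential backoff plus jitter."""
    for attempt in range(max_retries):
        try:
            return fn()
        except ServiceUnavailable:
            if attempt == max_retries - 1:
                raise  # out of retries, surface the error
            delay = min(max_delay, base_delay * 2 ** attempt)
            time.sleep(delay + random.uniform(0, delay * 0.1))
```

You'd wrap the generate call itself, e.g. `with_backoff(lambda: client.models.generate_content(...))`, so users only see the error once retries are exhausted.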

Anyone else having the same issue?

4 Likes

Hey @zlmk - this issue should be resolved now. Please let me know if you’re continuing to see issues

3 Likes

At the moment I am getting persistent 503s when trying to use Gemini 2.0 Flash (I'm able to use Flash Lite).

1 Like

Facing the same error; tested multiple times just now.

Text input with structured output on 2.0 Flash.
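For context, "structured output" here means setting a response MIME type and schema in the generation config of the generateContent request. A minimal sketch of the request body, assuming the v1beta REST field names (the schema itself is just an example):

```json
{
  "contents": [{"parts": [{"text": "List two colors."}]}],
  "generationConfig": {
    "responseMimeType": "application/json",
    "responseSchema": {
      "type": "ARRAY",
      "items": {"type": "STRING"}
    }
  }
}
```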

1 Like

Sorry about that. I believe this should be resolved now - we just pushed another fix. Please continue to let me know if you’re still seeing issues!

2 Likes

Still get 503 errors.

Thanks for flagging, Toby. I’ll take a look

1 Like

still getting the 503 issue

1 Like

Hi @Toby_Hain & @Ahsham_Abdullah,

Are you still facing the 503 error issue?

As of early June, I was still encountering this issue. I'm thinking of upgrading to Google AI Pro and wondered if that would improve the situation.

As of June 23 I am getting 503s from gemini-2.0-flash on us-east1. This wasn't occurring on us-central1, but we had to switch to us-east1 after failing to stand up instances through Cloud Run in central. Cloud Run failed to provision the service last night and kept failing all night, so this morning we made the call to switch to us-east1. Overall, the service recovered, but we are now seeing persistent 503s and some other Gemini errors that were not present in central. @Vishal for visibility

Can you check if this is still an issue? 2.0 Flash was overloaded earlier today, which may have coincided with your switch to us-east1.

I don’t understand why you sell resources to OpenAI when your own services are frequently overloaded. Your APIs are unstable, unreliable, and have far higher latency than OpenAI’s equivalent models.

It is no longer an issue as of right now. I’ll be watching it for the next few days, as we have experienced spikes in 503s and other timeouts in the past. That was the reason for deploying in us-central1 originally, and we would have stuck with it if not for the issues provisioning Cloud Run instances over the last 24 hours. Thank you for the follow-up, Vishal. We are all clear now.

1 Like

OpenAI likely has dedicated resources; the rest of us can get similar guarantees by reserving instances. It’s a tradeoff between cost and availability.

Still happening, guys. This is a serious issue; how can people use it in production like this? Kindly fix it, thanks.

I keep getting it non-stop today… this is a serious problem.

1 Like

Yes, it’s unusable for me today. 2.0 Flash is failing with “model is overloaded,” and my fallback to Flash Lite is failing as well, though I’m not sure if it is overloaded or just not calling the functions correctly. Usually the fallback to the Lite model performs the function calling correctly.
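A fallback chain like the one described can be sketched as below. `generate` stands in for whatever SDK call you use, `ModelOverloaded` is a placeholder for its 503 error type, and the model names are just the ones discussed in this thread:

```python
class ModelOverloaded(Exception):
    """Placeholder for the 503 'model is overloaded' error."""


def generate_with_fallback(generate,
                           models=("gemini-2.0-flash", "gemini-2.0-flash-lite")):
    """Try each model in order, moving to the next only when one is overloaded."""
    last_err = None
    for model in models:
        try:
            return generate(model)
        except ModelOverloaded as err:
            last_err = err  # remember the failure, try the next model
    raise last_err
```

Note that this only catches the overload error; a lite model mis-handling function calls (as described above) would return normally and has to be caught by validating the response instead.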