Google resets Gemini quotas as Gemini 3.5 Flash returns

AIJune 3, 2026 at 12:00 PM

Read full story

Source: Android Authority

TLDR: SAN FRANCISCO—Google wiped Gemini quota counters back to zero for free and paid users, alongside a refreshed Gemini 3.5 Flash to prevent output drop offs.

Key Takeaways:

Google DeepMind is iterating Gemini inside Antigravity, chasing steadier coding performance without burning tokens on simple tasks.
Varun Mohan announced a refreshed Gemini 3.5 Flash deployment and said harder tasks now get higher endurance with much less output.
A full quota reset reduces immediate friction, but users will quickly notice whether the new model stops quality dips.

Quota counters going back to zero feels like a clean slate after a messy build cycle. If the refreshed Gemini 3.5 Flash holds its quality under pressure, users will notice fast.

No comments yet. Be the first to share your thoughts!

Quota counters going back to zero feels like a clean slate after a messy build cycle. If the refreshed Gemini 3.5 Flash holds its quality under pressure, users will notice fast.

Q&A

Why reset quota counters instead of just raising rate limits?

A reset zeroes out accumulated usage, giving immediate relief and making it easier to compare model behavior before and after the update.

What happens if users keep triggering the same quality drop offs?

Google will likely adjust prompting defaults, model routing, or guardrails again, because sudden degradation undermines user trust faster than occasional errors.

How could the new focus on producing less output change developer workflows?

Shorter answers can reduce token burn and speed iteration cycles, but developers may need more follow ups for edge cases.

What does higher endurance on hard tasks imply about the model internals?

It suggests improved stability over longer reasoning or multi step coding, likely through revised inference behavior and model tuning.

Could the Low effort Gemini variant’s issues lead to a permanent hybrid approach?

If the tradeoff between efficiency and correctness keeps shifting, Google may settle on dynamic routing, switching variants by task type.

Google resets Gemini quotas as Gemini 3.5 Flash returns

Key Takeaways:

Q&A

Why reset quota counters instead of just raising rate limits?

What happens if users keep triggering the same quality drop offs?

How could the new focus on producing less output change developer workflows?

What does higher endurance on hard tasks imply about the model internals?

Could the Low effort Gemini variant’s issues lead to a permanent hybrid approach?

Top in AI

Claude AI free tier in 2026: limits sharpen

GitHub Copilot usage billing sparks up to 100 fold shocks

Microsoft pushes Scout to “make people addicted”

Google gives Gemini Extended thinking to all users

Google tests buying Play Store code to power AI coding