Gemini 3.5 Flash added to the developer API; Google's migration doc warns it costs more than Gemini 3 Flash
New model SKU added to the Gemini developer API. Gemini 3 Flash Preview remains available at its prior pricing. Google's own migration guide at ai.google.dev/gemini-api/docs/interactions/whats-new-gemini-3.5 instructs developers to switch to the new model and explicitly notes that it is more expensive. The default reasoning effort changes from high to medium, and the Computer Use feature is not supported in 3.5 Flash.
Gemini did not respond inside the 30-day window. This receipt now reads as uncontested in the public record.
Additive change on the API surface; companion to the consumer-side model-swap receipt. Not a nerf in isolation because the old SKU is still available. WATCH LIST entry: if Google deprecates Gemini 3 Flash Preview within 60-90 days, this receipt converts to a forced migration at severity major. As of receipt time, the official Gemini API pricing page at ai.google.dev/gemini-api/docs/pricing does not list Gemini 3.5 Flash, so Google's developer migration guide is the only first-party source confirming that the new model is more expensive. Third-party sources report pricing of $1.50 per 1M input tokens and $9.00 per 1M output tokens, but the receipt does not treat that figure as primary-source-confirmed until Google's pricing page is updated.
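To make the reported rate concrete, here is a minimal sketch of the per-request arithmetic. It assumes the third-party figure of $1.50/$9.00 per 1M input/output tokens, which is not confirmed by Google's pricing page; the function name and the sample token counts are hypothetical and chosen only for illustration.

```python
# Third-party-reported rates in USD per 1M tokens.
# ASSUMPTION: not confirmed by Google's official pricing page.
REPORTED_INPUT_USD_PER_M = 1.50
REPORTED_OUTPUT_USD_PER_M = 9.00


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_rate: float = REPORTED_INPUT_USD_PER_M,
    output_rate: float = REPORTED_OUTPUT_USD_PER_M,
) -> float:
    """Estimate spend from token counts and per-1M-token rates."""
    input_cost = (input_tokens / 1_000_000) * input_rate
    output_cost = (output_tokens / 1_000_000) * output_rate
    return input_cost + output_cost


# Hypothetical workload: 2M input tokens and 0.5M output tokens.
# 2 * 1.50 + 0.5 * 9.00 = 7.50
print(estimate_cost_usd(2_000_000, 500_000))  # 7.5
```

The rates are passed as parameters so the same calculation can be rerun with first-party numbers once the pricing page lists the model.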
Easiest way to run a local LLM. Pair with Open WebUI for a full ChatGPT replacement.
The default self-hosted ChatGPT replacement. Massive ecosystem.
Permissive (MIT) open-weight model targeting ChatGPT/Claude-class chat and agentic coding without API lock-in - the strongest open option for a self-hosted stack.
Drop-in open-weight replacement for the OpenAI/Anthropic chat APIs. Flash is the locally-runnable variant; serve it behind Open WebUI.