[RETRACTED] Windsurf Flex-tier model swap
Retracted 2026-06-10: Windsurf has no "Flex tier" (Flex = promo credits) and NerfBench does not test Windsurf, so it cannot be the evidence. See /transparency.
Smaller variant · ~12% lower benchmark pass-rate
Windsurf did not respond inside the 30-day window. This receipt now reads as uncontested in the public record.
Discovered when independent benchmarks (including NerfBench) showed a sudden ~12% pass-rate drop on identical prompts.
Discovered when independent benchmarks (including NerfBench) showed a sudden ~12% pass-rate drop on identical prompts. Windsurf acknowledged the swap in a community thread but did not update marketing claims about model quality.
Open-source Claude Code / Cursor replacement that lives in the terminal. Pick when you want zero vendor lock-in and any LLM backend.
Permissive (MIT) open-weight model targeting ChatGPT/Claude-class chat and agentic coding without API lock-in - the strongest open option for a self-hosted stack.
ChatGPT's Code Interpreter, but local and against any model.
Near-frontier open-weight model to back a self-hosted ChatGPT/Claude replacement or an agentic coding stack.