Migrate from DeepSeek

DeepSeek Nerf Index19·30-day re-nerf forecast49%·critical receipts0

§ 02OPEN-SOURCE REPLACEMENTS6

Self-hosted, free, or freemium tools that cover the same workflow. Every entry is real software with a public repo and an active community.

OllamaFREE / OSS130k★MIT

Easiest way to run a local LLM. Pair with Open WebUI for full ChatGPT replacement.

Run Llama/Mistral/Gemma/DeepSeek locally
Ollama-compatible API for any client
Brew/winget/curl install

brew install ollama && ollama run llama3.3

! caveat: Needs 16GB+ RAM for usable models. GPU strongly recommended.

View repo ↗★ PAIDStep-by-step guide →

Open WebUIFREE / OSS95k★MIT

The default ChatGPT-replacement self-host. Massive ecosystem.

ChatGPT-style web UI for any backend (Ollama, OpenAI-compatible)
RAG with file uploads
Function calling, web search plugin

docker run -p 3000:8080 ghcr.io/open-webui/open-webui:main

! caveat: Best paired with Ollama running locally.

View repo ↗★ PAIDStep-by-step guide →

GLM-5.2 (open weights)FREE / OSSMIT

Permissive (MIT) open-weight model targeting ChatGPT/Claude-class chat and agentic coding without API lock-in - the strongest open option for a self-hosted stack.

Fully-open (MIT) flagship MoE from Z.ai / Zhipu AI (weights released Jun 2026)
~744B total / ~40B active parameters, 1M-token context (~5x GLM-5.1)
Strongest open-weight coding model at release - #2 on Code Arena, ~#3 on FrontierSWE behind only Fable 5 / Opus 4.8

ollama run glm-5.2 (or download weights from HuggingFace: zai-org/GLM-5.2, serve via vLLM/SGLang)

! caveat: MIT, fully open, no usage restrictions. At ~744B params it is multi-GPU / data-center scale - use a hosted provider if you lack the hardware. Note: the hosted Z.ai API routes through China (a data-residency consideration); self-hosting the open weights avoids that.

View repo ↗★ PAIDStep-by-step guide →

DeepSeek-V4 (open weights)FREE / OSSMIT

Drop-in open-weight replacement for the OpenAI/Anthropic chat APIs. Flash is the locally-runnable variant; serve it behind Open WebUI.

MIT-licensed open-weight MoE: V4-Flash (284B/13B active) + V4-Pro (1.6T/49B active)
1M-token context
Near-frontier quality at a fraction of API cost

Download from HuggingFace (deepseek-ai/DeepSeek-V4-Flash); serve via vLLM/SGLang

! caveat: Even Flash (284B) needs serious VRAM/multi-GPU; Pro (1.6T) is data-center scale. MIT license, weights fully open.

View repo ↗★ PAIDStep-by-step guide →

Mistral Large 3 (open weights)FREE / OSSApache-2.0

Unrestricted (Apache-2.0) open-weight model to replace the OpenAI/Anthropic/Gemini chat APIs in a self-hosted stack.

Apache-2.0 open-weight MoE from Mistral AI: 675B total / 41B active
256K context, multimodal (text + vision)
Production-grade general-purpose model

ollama run mistral-large-3  (or download weights from HuggingFace: mistralai/Mistral-Large-3-675B-Instruct-2512-BF16, serve via vLLM/SGLang)

! caveat: Apache-2.0, no usage restrictions. December 2025 release. At 675B params it is multi-GPU / data-center scale - use a hosted provider if you lack the hardware.

View repo ↗★ PAIDStep-by-step guide →

Qwen3.6 (open weights)FREE / OSSApache-2.0

Permissive (Apache-2.0) open-weight model to back a self-hosted ChatGPT/Claude replacement. The 35B-A3B MoE runs on a single high-RAM machine.

Apache-2.0 open-weight family from Alibaba's Qwen team
Qwen3.6-35B-A3B MoE (35B total, 3B active) + 27B dense
256K context, extensible to ~1M

ollama run qwen3.6 (or pull from HuggingFace: Qwen/Qwen3.6-35B-A3B)

! caveat: Larger variants need a GPU; the 35B MoE wants ~24GB+ to be comfortable.

View repo ↗★ PAIDStep-by-step guide →

§ 03TRACKED PAID COMPETITORS2

Other paid tools we track in the same category, ranked by current Nerf Index. Lowest = least likely to nerf you next.

#1OpenAI API17 · clean↓ 2 vs DeepSeek

1 major change on record.

critical · 0last 90d · 130d re-nerf · 57%

Full receipts →Vendor pricing ↗Side-by-side →★ PAIDStep-by-step migration →

#2Claude (Anthropic)49 · on watch↑ 30 vs DeepSeek

2 major changes on record · roughly one every 107 days.

critical · 0last 90d · 330d re-nerf · 33%

Full receipts →Vendor pricing ↗Side-by-side →★ PAIDStep-by-step migration →

★ PERSONAL Step-by-step migration playbook

DeepSeek → Ollama

6 steps· ~30 min· low risk· built for macOS · Linux · WSL

Migrate to→Ollama FREE Open WebUI FREE GLM-5.2 (open weights) FREE DeepSeek-V4 (open weights) FREE Mistral Large 3 (open weights) FREE OpenAI API PAID↗

Personal$9/mo

Unlock the full Ollama migration

Step one’s on us. The rest — config port, smoke test, the buried cancel path with a prefilled refund email, and the rollback — open the second you join.

All 6 steps, copy-paste ready
Every migration playbook — including ones we haven’t written yet
Refund email that’s clawed back $80+
gotnerfed ask — grounded help through every step

Unlock all playbooks →

Instant access · cancel anytime

FREEYou’re reading the free preview. Unlock once — every playbook opens, including future ones.See Personal →