Overview
DeepSeek-R1-0528 is the May 28, 2025 refresh of DeepSeek AI's R1 reasoning model. It is built on the DeepSeek-V3 mixture-of-experts architecture, with 671B total parameters and 37B activated per token, and ships under the MIT License. Rather than a new base model, this release is a post-training upgrade that uses more compute and tuned algorithms to push reasoning quality close to proprietary systems like OpenAI o3 and Gemini 2.5 Pro.
What it's good at
The model is tuned for hard reasoning. On AIME 2025 it scores 87.5% pass@1, up from 70% in the original R1, and its Codeforces rating climbs from 1530 to 1930. Coding benchmarks improve too: LiveCodeBench rises to 73.3 and SWE-Verified to 57.6. The 0528 update also reduces hallucination, supports system prompts, and adds stronger function calling, which makes it more usable for agentic and tool-driven workflows. The gains come from longer thinking, so it averages about 23K tokens per AIME question versus 12K before.
Running locally
This is a large model. The native FP8 weights are around 700 GB, so the full version expects a multi-GPU server or a high-RAM host. Community dynamic GGUF quants from teams like Unsloth bring it down to roughly 150-250 GB at 1.5-2 bit, which lets it run on CPU with enough system memory or a workstation with several cards. You can serve it with vLLM or SGLang, or run quantized builds through llama.cpp and Ollama. For single-GPU setups, DeepSeek also released a distilled DeepSeek-R1-0528-Qwen3-8B that runs like an 8B model.
License
DeepSeek-R1-0528 is released under the MIT License. That permits commercial use, modification, and redistribution, and DeepSeek explicitly allows distillation of the model's outputs to train other models. There is no separate acceptable-use addendum beyond standard MIT terms.

