Overview
SmolLM3-3B is a 3-billion-parameter language model from Hugging Face, released in 2025 as the third generation of the SmolLM family. It is a decoder-only transformer that uses grouped-query attention and NoPE in a 3:1 ratio, pretrained on roughly 11 trillion tokens across web, code, math, and reasoning data. The model was post-trained with 140B reasoning tokens, supervised fine-tuning, and Anchored Preference Optimization (APO). Hugging Face publishes the weights, the data mixture, and the training configs, so the whole pipeline is reproducible.
What it is good at
SmolLM3-3B ships with a hybrid reasoning design: you can toggle an extended thinking mode with /think or turn it off with /no_think. On instruction following it scores 76.7 on IFEval in no-think mode, ahead of Qwen2.5-3B and Llama3.1-3B. In thinking mode its math and reasoning scores climb sharply, for example AIME 2025 rising from 9.3 to 36.7 and GPQA Diamond reaching 41.7. It handles tool calling (92.3 BFCL) for agentic workflows and natively covers six languages: English, French, Spanish, German, Italian, and Portuguese, with additional Arabic, Chinese, and Russian data. Context reaches 128K tokens through YaRN extrapolation from a 64K training window.
Running locally
At 3B parameters the model is light. Full bf16 inference fits in about 6-8 GB of VRAM, and 4-bit quantization brings it under 8 GB so it runs on most consumer GPUs. The modeling code landed in transformers v4.53.0, and you can serve an OpenAI-compatible endpoint with vLLM or SGLang. For CPU, Apple Silicon, or edge use there are GGUF, ONNX, MLX, and ExecuTorch builds. Hugging Face recommends sampling at temperature 0.6 and top_p 0.95.
License
SmolLM3-3B is released under Apache 2.0. That allows commercial use, modification, and redistribution without a separate license fee, and there is no acceptable-use restriction beyond the standard Apache terms. As Hugging Face notes, outputs can still be inaccurate or biased, so the model should be treated as an assistive tool rather than an authoritative source.
