Overview

gpt-oss-20b is an open-weight language model from OpenAI, released in 2025 under the gpt-oss family. It uses a Mixture-of-Experts (MoE) design with about 21.5B total parameters, of which roughly 3.6B are active per token. That routing keeps inference light while the model still reaches across a large parameter pool, and it carries a 128K-token context window for long documents and extended chats.

The model runs fully on your own hardware through Atomic Chat. Nothing leaves your machine, so prompts, files, and outputs stay private, and it keeps working with the network off. For anyone who wants a capable reasoning model without sending data to a cloud API, gpt-oss-20b is built for that local-first setup.

What it is good at

gpt-oss-20b ships with tool calling, chain-of-thought reasoning, and code capabilities, which map to a few clear jobs:

Tool and function calling — the model can emit structured calls to external functions, so you can wire it into agents, scripts, or local apps that fetch data and run actions.
Step-by-step reasoning — it produces visible chain-of-thought and supports adjustable reasoning effort, useful for math, logic, and multi-step problems where you want to see the working.
Code generation and review — it writes functions, explains snippets, and helps debug across common languages, with the 128K context holding a sizable codebase in one session.

Running it locally

At 21.5B total parameters, gpt-oss-20b is sized for consumer hardware. With its 4-bit MXFP4 quantization it fits in about 16GB of memory, so a GPU like the RTX 5080 (16GB) handles it, and a 24GB card such as the RTX 4090 gives more headroom for longer context and the full 128K window. Around 24GB of system RAM works as a CPU fallback, though a GPU is much faster.

huggingface-cli download openai/gpt-oss-20b

You can load the weights with Hugging Face Transformers or serve them through vLLM, or skip the setup entirely and open gpt-oss-20b in Atomic Chat with a one-click download.

License

gpt-oss-20b is released under the Apache 2.0 license. That permits commercial use, modification, redistribution, and fine-tuning, with no fee and no restriction on building it into your own products, as long as you keep the license and attribution notices.

gpt-oss-20b

More models

At a glance

Overview

What it is good at

Running it locally

License

Frequently asked questions