WebWorld-8B

Updated

Thinking

Vision

Reasoning

Web

Run

An 8B open-weight multimodal model from Qwen built for web-agent and reasoning tasks, with vision input and a 40K context window. Runs fully on local hardware.

huggingface-cli download Qwen/WebWorld-8B

from transformers import AutoModel
model = AutoModel.from_pretrained("Qwen/WebWorld-8B")

More models

View all

Name	Size / Usage	Context	Input
MiniCPM-V 4.6	5213 GB	421K	Text, Image
anima	421 GB	31K	Text

At a glance

License: Apache 2.0 — free for commercial use
Context length: 40K tokens
Languages: English and Chinese
Minimum hardware: ~19 GB VRAM (16–24 GB range)
Strengths: web-agent reasoning, vision understanding and on-device inference

Overview

WebWorld-8B is a web world model from the Qwen team, fine-tuned from Qwen3-8B. It does not chat. Given the current state of a web page and an action, it predicts what the page becomes next — so it acts as a browser you can run in your head, in software, instead of clicking through the real site.

That solves a specific problem in agent training. Teaching a web agent on the live internet is slow and risky: pages rate-limit you, network latency stacks up, and a wrong click can submit a form or spend money. WebWorld replaces the live web with a simulator. The model was trained on more than a million real browser trajectories, and it keeps the page format you give it — A11y Tree, HTML, XML, Markdown, or plain language — across multi-turn rollouts of 30 or more steps.

What it is good at

WebWorld-8B is built for agent developers rather than end users. The common uses are:

Lookahead search — an agent proposes several candidate actions, WebWorld simulates the resulting page for each, a value model scores them, and only the best action runs on the real browser.
Training-data synthesis — it turns a handful of seed tasks into thousands of fine-tuning trajectories, keeping only the runs that succeed.
Offline evaluation — you can replay and test agent behavior without touching production sites.

Fine-tuning Qwen3-8B on WebWorld-generated trajectories has been reported to add +9.9% on MiniWob++ and +10.9% on WebArena. The 8B size is the entry point of the series; 14B and 32B variants trade more hardware for higher fidelity.

Running it locally

WebWorld-8B is an 8B model, so it fits on consumer hardware — roughly 16–24 GB of VRAM at full precision, and a 4-bit quantized build runs on a 16 GB laptop. Pull the weights from Hugging Face and load them through your preferred runtime:

huggingface-cli download Qwen/WebWorld-8B

From there you can serve it with Transformers or vLLM, or load it in a desktop app like Atomic Chat, where it downloads and runs in one step with nothing leaving your device.

License

WebWorld is released under Apache 2.0 — the weights, the dataset, and the demo code. That means free local use, fine-tuning, and commercial deployment. The research behind it is documented in the WebWorld paper.

Desktop

macOS

(M1 or better)

Download

Windows

(x64)

Download

Linux

(x86_64)

Download

Frequently asked questions

WebWorld-8B is an open-source web world model from Qwen, fine-tuned from Qwen3-8B. Instead of generating chat replies, it predicts the next state of a web page given the current page and an action, so developers can train and test web agents in a simulated browser rather than on the live web. It was trained on over a million real browser trajectories and supports A11y Tree, HTML, XML, Markdown and natural-language formats.

As an 8B model it needs roughly 16–24 GB of VRAM in full precision; a quantized 4-bit build runs comfortably on a 16 GB machine, including Apple Silicon Macs and recent Windows or Linux laptops. A discrete GPU speeds up generation but is not required — the model also runs CPU-only, just more slowly.

Yes. WebWorld-8B is released under the Apache 2.0 license, so the weights are free to download, run locally, fine-tune and use commercially. There are no API fees, per-token charges or subscriptions — once it is on your hardware, every run is free.

Yes. Once the weights are downloaded the model runs entirely on your own device — no internet connection, API key or account is required, and no data leaves the machine. That makes it suitable for private experiments and air-gapped environments.

Download the weights from Hugging Face with huggingface-cli download Qwen/WebWorld-8B, then load them through a local runtime such as Transformers or vLLM. In a desktop app like Atomic Chat you pick the model from the list and it downloads and loads in one step, with no command line needed.