WebWorld-8B

Updated
22.06.2026
Thinking
Vision
Reasoning
Web

An 8B open-weight multimodal model from Qwen built for web-agent and reasoning tasks, with vision input and a 40K context window. Runs fully on local hardware.

huggingface-cli download Qwen/WebWorld-8B
from transformers import AutoModel
model = AutoModel.from_pretrained("Qwen/WebWorld-8B")

More models

NameSize / UsageContextInput
MiniCPM-V 4.6
5213 GB421KText, Image
anima
421 GB31KText

At a glance

  • License: Apache 2.0 — free for commercial use
  • Context length: 40K tokens
  • Languages: English and Chinese
  • Minimum hardware: ~19 GB VRAM (16–24 GB range)
  • Strengths: web-agent reasoning, vision understanding and on-device inference

Overview

WebWorld-8B is a web world model from the Qwen team, fine-tuned from Qwen3-8B. It does not chat. Given the current state of a web page and an action, it predicts what the page becomes next — so it acts as a browser you can run in your head, in software, instead of clicking through the real site.

That solves a specific problem in agent training. Teaching a web agent on the live internet is slow and risky: pages rate-limit you, network latency stacks up, and a wrong click can submit a form or spend money. WebWorld replaces the live web with a simulator. The model was trained on more than a million real browser trajectories, and it keeps the page format you give it — A11y Tree, HTML, XML, Markdown, or plain language — across multi-turn rollouts of 30 or more steps.

What it is good at

WebWorld-8B is built for agent developers rather than end users. The common uses are:

  • Lookahead search — an agent proposes several candidate actions, WebWorld simulates the resulting page for each, a value model scores them, and only the best action runs on the real browser.
  • Training-data synthesis — it turns a handful of seed tasks into thousands of fine-tuning trajectories, keeping only the runs that succeed.
  • Offline evaluation — you can replay and test agent behavior without touching production sites.

Fine-tuning Qwen3-8B on WebWorld-generated trajectories has been reported to add +9.9% on MiniWob++ and +10.9% on WebArena. The 8B size is the entry point of the series; 14B and 32B variants trade more hardware for higher fidelity.

Running it locally

WebWorld-8B is an 8B model, so it fits on consumer hardware — roughly 16–24 GB of VRAM at full precision, and a 4-bit quantized build runs on a 16 GB laptop. Pull the weights from Hugging Face and load them through your preferred runtime:

huggingface-cli download Qwen/WebWorld-8B

From there you can serve it with Transformers or vLLM, or load it in a desktop app like Atomic Chat, where it downloads and runs in one step with nothing leaving your device.

License

WebWorld is released under Apache 2.0 — the weights, the dataset, and the demo code. That means free local use, fine-tuning, and commercial deployment. The research behind it is documented in the WebWorld paper.

Desktop
macOS
(M1 or better)
Download
Windows
(x64)
Download
Linux
(x86_64)
Download

Frequently asked questions

WebWorld-8B is an open-source web world model from Qwen, fine-tuned from Qwen3-8B. Instead of generating chat replies, it predicts the next state of a web page given the current page and an action, so developers can train and test web agents in a simulated browser rather than on the live web. It was trained on over a million real browser trajectories and supports A11y Tree, HTML, XML, Markdown and natural-language formats.

As an 8B model it needs roughly 16–24 GB of VRAM in full precision; a quantized 4-bit build runs comfortably on a 16 GB machine, including Apple Silicon Macs and recent Windows or Linux laptops. A discrete GPU speeds up generation but is not required — the model also runs CPU-only, just more slowly.

Yes. WebWorld-8B is released under the Apache 2.0 license, so the weights are free to download, run locally, fine-tune and use commercially. There are no API fees, per-token charges or subscriptions — once it is on your hardware, every run is free.

Yes. Once the weights are downloaded the model runs entirely on your own device — no internet connection, API key or account is required, and no data leaves the machine. That makes it suitable for private experiments and air-gapped environments.

Download the weights from Hugging Face with huggingface-cli download Qwen/WebWorld-8B, then load them through a local runtime such as Transformers or vLLM. In a desktop app like Atomic Chat you pick the model from the list and it downloads and loads in one step, with no command line needed.