Agent World Model
Collection
4 items
β’
Updated
β’
4
Zhaoyang Wang1,
Canwen Xu2,
Boyi Liu2,
Yite Wang2,
Siwei Han1,
Zhewei Yao2,
Huaxiu Yao1,
Yuxiong He2
1UNC-Chapel Hill 2Snowflake AI Research
Arctic-AWM-8B is a multi-turn tool-use agent model trained with agentic reinforcement learning on Qwen3-8B, using the fully synthetic environments from AgentWorldModel-1K.
The model is trained to interact with tool-use environments exposed via a unified MCP (Model Context Protocol) interface, enabling strong multi-turn agentic capabilities.
For detailed usage of the model, please visit https://github.com/Snowflake-Labs/agent-world-model.
Related resources are also available, please check:
| Resource | Link |
|---|---|
| π Paper | π arxiv.org/abs/2602.10090 |
| π» Code | π» Snowflake-Labs/agent-world-model |
| π¦ AgentWorldModel-1K | π€ Snowflake/AgentWorldModel-1K |
| π€ Arctic-AWM-4B | π€ Snowflake/Arctic-AWM-4B |
| π€ Arctic-AWM-8B | π€ Snowflake/Arctic-AWM-8B |
| π€ Arctic-AWM-14B | π€ Snowflake/Arctic-AWM-14B |
If you find this resource useful, please kindly cite:
@article{wang2026agentworldmodelinfinity,
title={Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning},
author={Zhaoyang Wang and Canwen Xu and Boyi Liu and Yite Wang and Siwei Han and Zhewei Yao and Huaxiu Yao and Yuxiong He},
year={2026},
eprint={2602.10090},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2602.10090},
}