Arctic-AWM-8B

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Zhaoyang Wang1, Canwen Xu2, Boyi Liu2, Yite Wang2, Siwei Han1,
Zhewei Yao2, Huaxiu Yao1, Yuxiong He2

1UNC-Chapel Hill   2Snowflake AI Research  

Overview

Arctic-AWM-8B is a multi-turn tool-use agent model trained with agentic reinforcement learning on Qwen3-8B, using the fully synthetic environments from AgentWorldModel-1K.

The model is trained to interact with tool-use environments exposed via a unified MCP (Model Context Protocol) interface, enabling strong multi-turn agentic capabilities.

For detailed usage of the model, please visit https://github.com/Snowflake-Labs/agent-world-model.

Resources

Related resources are also available, please check:

Resource Link
πŸ“„ Paper πŸ“„ arxiv.org/abs/2602.10090
πŸ’» Code πŸ’» Snowflake-Labs/agent-world-model
πŸ“¦ AgentWorldModel-1K πŸ€— Snowflake/AgentWorldModel-1K
πŸ€– Arctic-AWM-4B πŸ€— Snowflake/Arctic-AWM-4B
πŸ€– Arctic-AWM-8B πŸ€— Snowflake/Arctic-AWM-8B
πŸ€– Arctic-AWM-14B πŸ€— Snowflake/Arctic-AWM-14B

Citation

If you find this resource useful, please kindly cite:

@article{wang2026agentworldmodelinfinity,
      title={Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning}, 
      author={Zhaoyang Wang and Canwen Xu and Boyi Liu and Yite Wang and Siwei Han and Zhewei Yao and Huaxiu Yao and Yuxiong He},
      year={2026},
      eprint={2602.10090},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2602.10090}, 
}
Downloads last month
19
Safetensors
Model size
8B params
Tensor type
BF16
Β·
Video Preview
loading

Model tree for Snowflake/Arctic-AWM-8B

Base model

Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Finetuned
(448)
this model
Quantizations
2 models

Collection including Snowflake/Arctic-AWM-8B

Paper for Snowflake/Arctic-AWM-8B