MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games
Paper: arXiv 2510.15414
MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs
Accepted by ICLR 2026
Note: This paper has been updated to v2 on arXiv as "MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs".