MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published 12 days ago • 61
Endless Terminals: Scaling RL Environments for Terminal Agents Paper • 2601.16443 • Published 22 days ago • 16
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 147
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published about 1 month ago • 90
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge Paper • 2601.08808 • Published Jan 13 • 39
Introducing OptiMind, a research model designed for optimization Article • 30 days ago • 34
Building the Open Agent Ecosystem Together: Introducing OpenEnv Article • Oct 23, 2025 • 148