Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 4 days ago • 38
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 11 days ago • 32
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published Oct 30, 2025 • 29
SIGMA: An AI-Empowered Training Stack on Early-Life Hardware Paper • 2512.13488 • Published Dec 15, 2025
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published Dec 3, 2025 • 154
Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data Paper • 2510.25804 • Published Oct 29, 2025 • 1
Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection Paper • 2510.18909 • Published Oct 21, 2025 • 5