Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 4 days ago • 57
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights Paper • 2603.12228 • Published 4 days ago • 8
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published 4 days ago • 45
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published 6 days ago • 64
EvoSkill: Automated Skill Discovery for Multi-Agent Systems Paper • 2603.02766 • Published 14 days ago • 1
Scalable Training of Mixture-of-Experts Models with Megatron Core Paper • 2603.07685 • Published 8 days ago • 1
OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning Paper • 2603.08655 • Published 7 days ago • 3
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 10 days ago • 105
AgentIR: Reasoning-Aware Retrieval for Deep Research Agents Paper • 2603.04384 • Published 12 days ago • 3
NOBLE: Accelerating Transformers with Nonlinear Low-Rank Branches Paper • 2603.06492 • Published 10 days ago • 2
Progressive Residual Warmup for Language Model Pretraining Paper • 2603.05369 • Published 11 days ago • 33
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning Paper • 2603.03790 • Published 13 days ago • 114
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Paper • 2603.04257 • Published 12 days ago • 19
If You Want Coherence, Orchestrate a Team of Rivals: Multi-Agent Models of Organizational Intelligence Paper • 2601.14351 • Published Jan 20 • 1
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare Paper • 2602.06717 • Published Feb 6 • 72