MemFly: On-the-Fly Memory Optimization via Information Bottleneck • Paper • 2602.07885 • Published 8 days ago • 7 • 3
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • Paper • 2602.11451 • Published 4 days ago • 14 • 2
The Pensieve Paradigm: Stateful Language Models Mastering Their Own Context • Paper • 2602.12108 • Published 4 days ago • 13 • 4
RISE: Self-Improving Robot Policy with Compositional World Model • Paper • 2602.11075 • Published 5 days ago • 26 • 2
ScalSelect: Scalable Training-Free Multimodal Data Selection for Efficient Visual Instruction Tuning • Paper • 2602.11636 • Published 4 days ago • 2 • 2
Dreaming in Code for Curriculum Learning in Open-Ended Worlds • Paper • 2602.08194 • Published 7 days ago • 6 • 2
Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm • Paper • 2602.11543 • Published 4 days ago • 4 • 3
P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling • Paper • 2602.12116 • Published 4 days ago • 4 • 3
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies • Paper • 2602.09877 • Published 6 days ago • 185 • 9
Budget-Constrained Agentic Large Language Models: Intention-Based Planning for Costly Tool Use • Paper • 2602.11541 • Published 4 days ago • 3 • 2