World Modeling with Probabilistic Structure Integration Paper • 2509.09737 • Published Sep 10, 2025 • 13
Representing Speech Through Autoregressive Prediction of Cochlear Tokens Paper • 2508.11598 • Published Aug 15, 2025 • 17
Taming generative video models for zero-shot optical flow extraction Paper • 2507.09082 • Published Jul 11, 2025 • 12
3D Scene Understanding Through Local Random Access Sequence Modeling Paper • 2504.03875 • Published Apr 4, 2025 • 5