GENIUS: Generative Fluid Intelligence Evaluation Suite Paper • 2602.11144 • Published 9 days ago • 53
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark Paper • 2510.26802 • Published Oct 30, 2025 • 34