meituan-longcat/LongCat-Flash-Prover
561B • Updated • 10 • 8
$V_{0.5}$: Generalist Value Model as a Prior for Sparse RL Rollouts
ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training