FUfu99/DeepSeek-R1-Distill-Qwen-7B-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Feb 25, 2025 • 7
FUfu99/Qwen-2.5-Math-7B-SimpleRL-Zero-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Feb 22, 2025 • 6
FUfu99/Qwen-2.5-Math-7B-SimpleRL-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Feb 22, 2025 • 5
FUfu99/deepseek-math-7b-instruct-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Feb 22, 2025 • 6