Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch Paper • 2512.02395 • Published Dec 2, 2025 • 47
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 176
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping Paper • 2510.18927 • Published Oct 21, 2025 • 83
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10, 2025 • 190
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published Aug 11, 2025 • 75
Skywork-R1V3 Collection Advanced multimodal reasoning model • 7 items • Updated Aug 8, 2025 • 14
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation Paper • 2508.03320 • Published Aug 5, 2025 • 62
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! +1 Jun 6, 2025 • 55
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy Paper • 2507.01352 • Published Jul 2, 2025 • 56
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs Paper • 2506.19290 • Published Jun 24, 2025 • 52
Matrix-Game: Interactive World Foundation Model Paper • 2506.18701 • Published Jun 23, 2025 • 72
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning Paper • 2505.07263 • Published May 12, 2025 • 30
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community +5 Dec 9, 2024 • 69
Medical QA Datasets Collection A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22, 2025 • 47
Infinity Instruct Collection Scaling Instruction Selection and Synthesis to Enhance Language Models • 17 items • Updated Dec 1, 2025 • 9