random

fakerbaby

AI & ML interests

NLP, RL, VLM

Recent Activity

upvoted an article 29 days ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 564

upvoted a paper about 1 month ago

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 47

upvoted a paper 2 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

upvoted a paper 3 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 83

upvoted a paper 4 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

upvoted a paper 5 months ago

Matrix-3D: Omnidirectional Explorable 3D World Generation

Paper • 2508.08086 • Published Aug 11, 2025 • 75

upvoted a collection 5 months ago

Skywork-R1V3

Collection

Advanced multimodal reasoning model • 7 items • Updated Aug 8, 2025 • 14

upvoted a paper 5 months ago

Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation

Paper • 2508.03320 • Published Aug 5, 2025 • 62

upvoted a paper 6 months ago

Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published Jul 8, 2025 • 72

upvoted an article 6 months ago

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Article • Jun 6, 2025

upvoted a paper 6 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2, 2025 • 56

upvoted 3 papers 7 months ago

upvoted an article 8 months ago

Vision Language Models (Better, faster, stronger)

Article • May 12, 2025 • 582

upvoted 2 papers 8 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 154

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Paper • 2505.07263 • Published May 12, 2025 • 30

upvoted an article 9 months ago

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Article • Dec 9, 2024

upvoted a collection about 1 year ago

Medical QA Datasets

Collection

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22, 2025 • 47

upvoted a collection over 1 year ago

Infinity Instruct

Collection

Scaling Instruction Selection and Synthesis to Enhance Language Models • 17 items • Updated Dec 1, 2025 • 9
