12 234 749

Steffen Röcker PRO

sroecker
https://x.com/sroecker

AI & ML interests

Local models

Recent Activity

liked a model about 1 hour ago
mistralai/Mistral-Small-4-119B-2603-eagle
liked a model about 1 hour ago
mistralai/Mistral-Small-4-119B-2603
upvoted an article 4 days ago
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Organizations

Hugging Face Discord Community

sroecker's collections (1)

RLHF
  • The Importance of Online Data: Understanding Preference Fine-tuning via Coverage

    Paper • 2406.01462 • Published Jun 3, 2024 • 6
  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28, 2025 • 124