Rui
Yalimu
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
Diversity or Precision? A Deep Dive into Next Token Prediction
commented on
a paper
3 months ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy
Gradient
upvoted
a
paper
3 months ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy
Gradient