Minrui Xu
RolandXMR
AI & ML interests
None yet
Recent Activity
upvoted a paper about 8 hours ago
Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model
upvoted a paper 3 days ago
Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning
upvoted a paper 4 days ago
Improving Data and Reward Design for Scientific Reasoning in Large Language Models