PeterPP's picture

8

PeterPP

ZhSh1230

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

upvoted a paper 3 days ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

upvoted a paper 3 days ago

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published 2 days ago • 25

upvoted 2 papers 3 days ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published 4 days ago • 56

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published 5 days ago • 39

upvoted a paper 5 days ago

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published 12 days ago • 32

upvoted a paper 9 days ago

A2Eval: Agentic and Automated Evaluation for Embodied Brain

Paper • 2602.01640 • Published 12 days ago • 8

upvoted a paper 3 months ago

Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

Paper • 2511.23319 • Published Nov 28, 2025 • 24

upvoted a paper 4 months ago

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27, 2025 • 30

upvoted a paper 5 months ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 146