2 12

Jiaxin Huang

teapot123

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

upvoted a paper 4 days ago

Training Data Efficiency in Multimodal Process Reward Models

upvoted a paper 5 days ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

View all activity

Organizations

upvoted a paper about 9 hours ago

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published 10 days ago • 33

upvoted a paper 4 days ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published 5 days ago • 74

upvoted a paper 5 days ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published 6 days ago • 24

upvoted a paper about 1 month ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published Jan 8 • 30

upvoted a paper 3 months ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 43

upvoted 2 papers 6 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 84

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 130

upvoted a paper 8 months ago

POSS: Position Specialist Generates Better Draft for Speculative Decoding

Paper • 2506.03566 • Published Jun 4, 2025 • 6

upvoted a paper 10 months ago

CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Paper • 2504.00043 • Published Mar 30, 2025 • 10

upvoted a paper 11 months ago

Efficient Test-Time Scaling via Self-Calibration

Paper • 2503.00031 • Published Feb 25, 2025 • 15

upvoted a paper about 1 year ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16, 2025 • 41

upvoted a paper over 1 year ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Paper • 2410.09724 • Published Oct 13, 2024 • 3

Jiaxin Huang

AI & ML interests

Recent Activity

Organizations

teapot123's activity