On Vacation 🏝️

3 9

xuxin

xx18

https://xinxu-ustc.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 24 hours ago

EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

upvoted a paper 29 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

upvoted a paper about 1 month ago

EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

Organizations

Collections 1

Papers 16

arxiv:2511.15248

arxiv:2510.00553

arxiv:2509.26226

arxiv:2509.14646

Models 15

xx18/TFPI-Qwen3-4B-Thinking-2507-Stage3

Text Generation • 4B • Updated Nov 7, 2025 • 24

xx18/DirectRL_Qwen3-4B_baseline2

Text Generation • 4B • Updated Nov 7, 2025 • 9

xx18/DirectRL_Qwen3-4B_baseline1

Text Generation • 4B • Updated Nov 7, 2025 • 5

xx18/TFPI-Qwen3-4B-Stage3_then_RL

Text Generation • 4B • Updated Nov 7, 2025 • 6

xx18/TFPI-Qwen3-4B-Stage3

Text Generation • 4B • Updated Nov 7, 2025 • 9

xx18/TFPI-Qwen3-4B-Stage2

Text Generation • 4B • Updated Nov 7, 2025 • 12

xx18/TFPI-Qwen3-4B-Stage1

Text Generation • 4B • Updated Nov 7, 2025 • 5

xx18/DirectRL_DeepSeek-Qwen-1.5B_baseline2

Text Generation • 2B • Updated Nov 7, 2025 • 4

xx18/DirectRL_DeepSeek-Qwen-1.5B_baseline1

Text Generation • 2B • Updated Nov 7, 2025 • 4

xx18/TFPI-DeepSeek-Qwen-1.5B-Stage3_then_RL

Text Generation • 2B • Updated Nov 7, 2025 • 4

Datasets 2

xx18/TFPI-EVA

Preview • Updated Sep 28, 2025 • 52

xx18/R2PE

Viewer • Updated Feb 21, 2024 • 38.7k • 1.12k • 2