hangyu guo's picture

hangyu guo

Rosiness

·

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 7 days ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

upvoted a paper 13 days ago

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

upvoted a paper 13 days ago

WorldCache: Content-Aware Caching for Accelerated Video World Models

View all activity

Organizations

upvoted a paper 7 days ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published 11 days ago • 48

upvoted 2 papers 13 days ago

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 13 days ago • 28

WorldCache: Content-Aware Caching for Accelerated Video World Models

Paper • 2603.22286 • Published 13 days ago • 4

upvoted a paper 14 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published 19 days ago • 107

upvoted 2 papers 18 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 19 days ago • 136

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 24 days ago • 32

authored a paper 20 days ago

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Paper • 2603.13391 • Published 26 days ago • 19

upvoted a paper 26 days ago

\$OneMillion-Bench: How Far are Language Agents from Human Experts?

Paper • 2603.07980 • Published 28 days ago • 27

upvoted a paper 27 days ago

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Paper • 2603.03447 • Published Mar 3 • 37

upvoted 5 papers about 1 month ago

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Paper • 2602.23166 • Published Feb 26 • 44

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Paper • 2603.02578 • Published Mar 3 • 25

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published Mar 2 • 64

Spectral Condition for μP under Width-Depth Scaling

Paper • 2603.00541 • Published Feb 28 • 15

K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

Paper • 2602.19128 • Published Feb 22 • 7

upvoted 6 papers about 2 months ago

DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Paper • 2602.11089 • Published Feb 11 • 18

CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion

Paper • 2602.10999 • Published Feb 11 • 10

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published Feb 10 • 36

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published Feb 11 • 31

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 202

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 72