Daixuan Cheng's picture

Daixuan Cheng

daixuancheng

·

https://cdxeve.github.io

DaixuanC45443

AI & ML interests

I study LLMs, from Pre-Training to Agent.

Recent Activity

authored a paper 12 days ago

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

upvoted a paper 13 days ago

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

upvoted a collection 21 days ago

View all activity

Organizations

None yet

upvoted a paper 13 days ago

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Paper • 2603.03194 • Published 13 days ago • 54

upvoted a collection 21 days ago

LLM-in-Sandbox

Data and models for the paper: LLM-in-Sandbox Elicits General Agentic Intelligence. Feel free to open an issue if you have any questions or problems! • 3 items • Updated 21 days ago • 1

upvoted 3 papers about 1 month ago

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

Paper • 2602.03411 • Published Feb 3 • 37

SWE-World: Building Software Engineering Agents in Docker-Free Environments

Paper • 2602.03419 • Published Feb 3 • 40

Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning

Paper • 2602.00759 • Published Jan 31 • 5

upvoted 11 collections about 2 months ago

Agentic

14 items • Updated Jan 24 • 2

Agents

12 items • Updated Jan 25 • 1

AI-papers

5 items • Updated Jan 24 • 1

Ai-general

50 items • Updated Jan 29 • 3

Agent

100 items • Updated 11 days ago • 12

2026

174 items • Updated Feb 8 • 4

Coding

3 items • Updated Feb 4 • 1

Agents

13 items • Updated Feb 10 • 4

Training-Free

3 items • Updated Jan 30 • 1

LLM

12 items • Updated Feb 10 • 3

Agent

50 items • Updated 11 days ago • 3

upvoted a paper about 2 months ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 85

upvoted 2 papers 5 months ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 106

BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59

upvoted a paper 6 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 117