Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents Paper • 2512.08870 • Published 26 days ago • 3
HyperAgent: Leveraging Hypergraphs for Topology Optimization in Multi-Agent Communication Paper • 2510.10611 • Published Oct 12, 2025 • 4
GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search Paper • 2510.10581 • Published Oct 12, 2025 • 2
LongCodeZip: Compress Long Context for Code Language Models Paper • 2510.00446 • Published Oct 1, 2025 • 106
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models Paper • 2509.26628 • Published Sep 30, 2025 • 16
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? Paper • 2509.16941 • Published Sep 21, 2025 • 21
SWE-QA: Can Language Models Answer Repository-level Code Questions? Paper • 2509.14635 • Published Sep 18, 2025 • 34
Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal Paper • 2508.05988 • Published Aug 8, 2025 • 19
EVOC2RUST: A Skeleton-guided Framework for Project-Level C-to-Rust Translation Paper • 2508.04295 • Published Aug 6, 2025 • 7
SWE-Exp: Experience-Driven Software Issue Resolution Paper • 2507.23361 • Published Jul 31, 2025 • 13
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution Paper • 2507.23348 • Published Jul 31, 2025 • 11
Detecting Machine-Generated Code: Unveiling Patterns in AI-Generated Programming Article • Jul 2, 2025 • 2
Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning Paper • 2410.03103 • Published Oct 4, 2024 • 9
Internal Consistency and Self-Feedback in Large Language Models: A Survey Paper • 2407.14507 • Published Jul 19, 2024 • 46
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging Paper • 2410.01215 • Published Oct 2, 2024 • 39