Tim Dingman's picture

1 63 1

Tim Dingman

tdingman

https://timdingman.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 1 day ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

upvoted a paper 20 days ago

Solar Open Technical Report

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 22 days ago • 145

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 27 days ago • 218

upvoted a paper 20 days ago

Solar Open Technical Report

Paper • 2601.07022 • Published 24 days ago • 65

upvoted a paper 27 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 121

upvoted a paper 30 days ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 148

upvoted a paper 2 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 255

upvoted 7 papers 3 months ago

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

Paper • 2511.07685 • Published Nov 10, 2025 • 10

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published Nov 11, 2025 • 34

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 122

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 106

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 146

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 273

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 101

upvoted 5 papers 4 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 116

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 104

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 125

upvoted 2 papers 5 months ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31, 2025 • 115

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129