1 19 13

dma2077 PRO

dma2077

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper about 1 month ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

published a model about 1 month ago

dma2077/eva_vit

View all activity

Organizations

upvoted a paper 22 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 95

upvoted a paper about 1 month ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 281

published a model about 1 month ago

dma2077/eva_vit

Updated Nov 25, 2025

liked a dataset about 2 months ago

HPLT/HPLT3.0

Updated Nov 14, 2025 • 78 • 10

upvoted a paper 2 months ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27, 2025 • 84

updated a dataset 2 months ago

dma2077/proof

Viewer • Updated Oct 28, 2025 • 10k • 5

published a dataset 2 months ago

dma2077/proof

Viewer • Updated Oct 28, 2025 • 10k • 5

liked a dataset 2 months ago

nick007x/github-code-2025

Viewer • Updated Oct 15, 2025 • 147M • 4.55k • 111

upvoted 2 papers 4 months ago

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Paper • 2509.13160 • Published Sep 16, 2025 • 29

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 124

liked 2 models 4 months ago

Goedel-LM/Goedel-Prover-V2-32B

Text Generation • 33B • Updated Aug 27, 2025 • 14.4k • 60

deepseek-ai/DeepSeek-Prover-V2-7B

7B • Updated Apr 30, 2025 • 45.9k • 134

updated a model 5 months ago

dma2077/qwen_classify_model

Updated Aug 3, 2025

published a model 5 months ago

dma2077/qwen_classify_model

Updated Aug 3, 2025

updated a dataset 6 months ago

dma2077/ana

Updated Jul 22, 2025 • 32

published a dataset 6 months ago

dma2077/ana

Updated Jul 22, 2025 • 32

upvoted 3 papers 6 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9, 2025 • 23

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8, 2025 • 58

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 130

upvoted a paper 7 months ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15, 2025 • 63

dma2077 PRO

AI & ML interests

Recent Activity

Organizations

dma2077's activity