
BadCat

Foresta
·
  • Aegis1863

AI & ML interests

LLMs Deep learning Reinforcement learning

Recent Activity

upvoted a paper 3 days ago
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
upvoted an article 7 days ago
From GRPO to DAPO and GSPO: What, Why, and How
upvoted a paper 2 months ago
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

Organizations

None yet

upvoted a paper 3 days ago

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published 4 days ago • 24
upvoted an article 7 days ago

From GRPO to DAPO and GSPO: What, Why, and How

Article • Aug 9, 2025 • 101
upvoted 2 papers 2 months ago

AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

Paper • 2601.04767 • Published Jan 8 • 28

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published Dec 29, 2025 • 28
upvoted a paper 5 months ago

Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?

Paper • 2510.06036 • Published Oct 7, 2025 • 7
upvoted a paper 7 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12, 2025 • 33