Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dnotitia 's Collections
4B SFT Experiments
Aether
Private Datasets (SFT - 2511)
Private Datasets (DPO - 2511)
Qwen3-ChatTemplate
DNA 2.1
DNA 2.0
DNA 2.0 (RC2)
DNA 2.0 (RC1)
DNA-R1
DNA 1.0
HMC
Smoothie Qwen3
Smoothie Qwen2.5
Private Models
Private Datasets (DNA 2.0)
Private Datasets (DNA 2.0 Evaluation)
Private Datasets (Qwen3 Korean)
Private Datasets (SFT)
Private Datasets (CoT)
Private Datasets (Only Answer)
Private Datasets (MATH)
Private Datasets (Reasoning, ko)
Private Datasets (Reasoning, en)
Private Datasets (CPT)
Private Datasets (DPO)
Private Datasets (Coding)
Private Datasets (RL, GRPO)
Private Datasets (Smoothie Qwen)

DNA-R1

updated Jan 26

Reasoning model distilled from DeepSeek-R1, enhanced with GRPO using supplementary reasoning datasets.

Upvote
2

  • dnotitia/DNA-R1

    Text Generation • Updated Mar 11, 2025 • 123 • 41
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs