Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
12
20
Xu Zhihao
naiweizi
Follow
mamasihan's profile picture
didiforhugface's profile picture
Jhonny999's profile picture
3 followers
·
0 following
AI & ML interests
Trustworthy AI
Recent Activity
upvoted
a
paper
6 days ago
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
upvoted
a
paper
12 days ago
TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas
upvoted
a
paper
27 days ago
LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model
View all activity
Organizations
None yet
Collections
1
Reward Consistency Model
naiweizi/dpo-harmless_saferlhf
Updated
Jun 18, 2025
•
3
Reward Consistency Model
naiweizi/dpo-harmless_saferlhf
Updated
Jun 18, 2025
•
3
Papers
5
arxiv:
2602.03786
arxiv:
2601.10355
arxiv:
2507.11316
arxiv:
2504.15585
Expand 5 papers
models
12
Sort: Recently updated
naiweizi/r1-qwen-7b-sft_meta
8B
•
Updated
Nov 21, 2025
•
3
naiweizi/R1-Qwen-7B-SFT-Meta
Updated
Nov 21, 2025
naiweizi/R1-Qwen-1_5B-Cold_Start-OpenR1_Math-priority
2B
•
Updated
Jul 18, 2025
•
1
naiweizi/dpo-harmless_saferlhf
Updated
Jun 18, 2025
•
3
naiweizi/mistral-dpo-helpful-vanilla-1e-4
Updated
May 6, 2025
naiweizi/mistral-dpo-harmless-vanilla-2e-4
Updated
May 6, 2025
•
1
naiweizi/test
Text Generation
•
8B
•
Updated
Apr 21, 2025
naiweizi/dpo-harmless_helpful-vanilla
Updated
Apr 14, 2025
naiweizi/dpo-harmless_helpful-rc_armo
Updated
Apr 14, 2025
naiweizi/dpo-harmless_helpful-mixed
Updated
Apr 14, 2025
View 12 models
datasets
2
Sort: Recently updated
naiweizi/RC_single_objective
Preview
•
Updated
Jun 4, 2025
•
13
naiweizi/pref_dataset
Preview
•
Updated
Apr 14, 2025
•
14