Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
Yurun Yuan
RyanYr
Follow
KenCao2007's profile picture
xuanfeiren's profile picture
21world's profile picture
6 followers
·
2 following
yurun-yuan
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States
upvoted
a
paper
10 days ago
POLCA: Stochastic Generative Optimization with LLM
updated
a model
11 days ago
RyanYr/slf-dstl_regular_Q2.5-1.5B-It_tooluse_OPD
View all activity
Organizations
None yet
RyanYr
's models
12
Sort: Recently updated
RyanYr/slf-dstl_regular_Q2.5-1.5B-It_tooluse_OPD
Updated
11 days ago
RyanYr/slf-dstl_regular_Q2.5-1.5B-It_science_OPD
Updated
11 days ago
RyanYr/slf-dstl_Q2.5-1.5B-It_science_OPD
Updated
11 days ago
RyanYr/slf-dstl_Q2.5-1.5B-It_tooluse_SFT
2B
•
Updated
11 days ago
•
62
RyanYr/slf-dstl_Q2.5-1.5B-It_science_SFT
2B
•
Updated
11 days ago
•
79
RyanYr/pg-dapo-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
27 days ago
•
16
RyanYr/pg-dapo-qwen2.5math-1.5B-base-n8_actor
Updated
28 days ago
•
37
RyanYr/grpo-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25
RyanYr/grpo-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25
RyanYr/pg-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 24
RyanYr/pg-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 23
RyanYr/grpo-dapo-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 21