Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop ๐
96.8
TFLOPS
10
20
aayush garg
PRO
garg-aayush
Follow
somusan's profile picture
leokmax's profile picture
sunilsv's profile picture
16 followers
ยท
16 following
https://aayushgarg.dev/
Aayush_ander
garg-aayush
aayush-garg-8b26a734
AI & ML interests
None yet
Recent Activity
published
an
article
7 days ago
Understanding GRPO: PPO without the critic
upvoted
an
article
8 days ago
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
published
an
article
9 days ago
Deriving the DPO Loss from First Principles
View all activity
Organizations
garg-aayush
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
an
article
7 days ago
view article
Article
Understanding GRPO: PPO without the critic
7 days ago
โข
1
published
an
article
9 days ago
view article
Article
Deriving the DPO Loss from First Principles
9 days ago
โข
6
published
an
article
14 days ago
view article
Article
Deriving the PPO Loss from First Principles
14 days ago
โข
33
published
an
article
about 1 month ago
view article
Article
What I Learned Building SFT from the Ground Up
Dec 3, 2025
โข
1