Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
12
11
Lior Baruch
LBK95
Follow
Gargaz's profile picture
21world's profile picture
2 followers
·
2 following
Lior-Baruch
AI & ML interests
DL
Recent Activity
published
a model
about 1 hour ago
LBK95/grpo-OracleReward_Async_4responses_V1
updated
a model
2 days ago
LBK95/grpo-OracleReward_Async_2responses_V1
published
a model
3 days ago
LBK95/grpo-OracleReward_Async_2responses_V1
View all activity
Organizations
None yet
LBK95
's models
127
Sort: Recently updated
LBK95/grpo-OracleReward_Async_4responses_V1
Updated
about 1 hour ago
LBK95/grpo-OracleReward_Async_2responses_V1
Updated
2 days ago
LBK95/grpo-OracleReward_Async_V1
Updated
3 days ago
LBK95/grpo-OracleReward_V1
Updated
4 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.15
Updated
6 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.14
Updated
6 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.13
Updated
6 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.12
Updated
6 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.11
Updated
7 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.10
Updated
7 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.9
Updated
7 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.8
Updated
7 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.7
Updated
7 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.6
Updated
8 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.5
Updated
8 days ago
LBK95/Llama-3.2-1B-Instruct-Reward-Model-Finetuned_V1.4
Updated
8 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.4
Updated
8 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.3
Updated
8 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.2
Updated
8 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.1
Updated
8 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1
Updated
10 days ago
LBK95/grpo-smoketest-ultrachat-LengthReward
Updated
11 days ago
LBK95/grpo-smoketest-Instruct-Skywork-Reward
Updated
12 days ago
LBK95/grpo-smoketest-Skywork-Reward
Updated
12 days ago
LBK95/Llama-3.2-1B-hf_PPO-LookAhead-5_V1_Second_beta-0
Text Generation
•
Updated
15 days ago
•
10
LBK95/Llama-3.2-1B-hf_PPO-LookAhead-5_V1_Second
Updated
Dec 9, 2025
LBK95/Llama-3.2-1B-hf_RewardModel_LookAhead-5_V1_Second
Updated
Dec 8, 2025
LBK95/Llama-3.2-1B-hf_PPO-LookAhead-5_V1
Text Generation
•
Updated
Dec 5, 2025
LBK95/Llama-3.2-1B-hf_RewardModel_LookAhead-5_V1
Updated
Dec 4, 2025
LBK95/Llama-3.2-1B-hf_PPO-LookAhead-5_V1_NoStopString_V2
Text Generation
•
Updated
Dec 3, 2025
•
3
Previous
1
2
3
...
5
Next