Lior Baruch
LBK95
2 followers · 2 following
Lior-Baruch
AI & ML interests
DL
Recent Activity
Updated a model 1 day ago: LBK95/GRPO_Iterative_Q1Q2_Llama32-1B_LA5_MCL12_G8_full
Published a model 1 day ago: LBK95/GRPO_Iterative_Q1Q2_Llama32-1B_LA5_MCL12_G8_full
Updated a model 2 days ago: LBK95/PTO_Iterative_Q1Q2_Llama32-1B_LA5_MCL12_M8_PTgreedy_full
Organizations
None yet
LBK95's models (137)
LBK95/PTO_Iterative_Q1Q2_Llama32-1B_LA5_MCL12_M8_PTgreedy_full · Updated 1 day ago
LBK95/GRPO_Iterative_Q1Q2_Llama32-1B_LA0_MCL12_G8_full · Updated 1 day ago
LBK95/GRPO_Iterative_Q1Q2_Llama32-1B_LA5_MCL12_G8_full · Updated 1 day ago
LBK95/PTO_Iterative_Q1Q2_Llama32-1B_LA0_MCL12_M8_PTgreedy_full · Updated 2 days ago
LBK95/GRPO_Iterative_Q1Q2_Llama32-1B_LA5_MCL12_G8_full_Archive_V2 · Updated 5 days ago
LBK95/GRPO_Oracle_Llama32-1B-Instruct_LA5_G4_V2 · Updated Feb 18
LBK95/GRPO_Oracle_Llama32-1B_LA5_G4_V2 · Updated Feb 18
LBK95/GRPO_Oracle_Llama32-1B_LA5_G4_V1 · Updated Feb 1
LBK95/GRPO-OracleRM_Q1Q2_V1-adapter-v1 · Updated Jan 28
LBK95/grpo-OracleRM_Async_4responses_V1-adapter-v1 · Updated Jan 25
LBK95/grpo-OracleRM_Async_4responses_V1 · Updated Jan 22
LBK95/grpo-OracleReward_Async_2responses_V1 · Updated Jan 10
LBK95/grpo-OracleReward_Async_V1 · Updated Jan 9
LBK95/grpo-OracleReward_V1 · Updated Jan 8
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.15 · Updated Jan 6
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.14 · Updated Jan 6
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.13 · Updated Jan 6
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.12 · Updated Jan 6
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.11 · Updated Jan 5
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.10 · Updated Jan 5
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.9 · Updated Jan 5
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.8 · Updated Jan 5
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.7 · Updated Jan 5
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.6 · Updated Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.5 · Updated Jan 4
LBK95/Llama-3.2-1B-Instruct-Reward-Model-Finetuned_V1.4 · Updated Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.4 · Updated Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.3 · Updated Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.2 · Updated Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.1 · Updated Jan 4