Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Pedro Ribeiro
BRlkl
Follow
0 followers
·
5 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
3 days ago
BRlkl/sft-7
published
a dataset
3 days ago
BRlkl/sft-7
updated
a model
3 days ago
BRlkl/GRPO-6-harder_70
View all activity
Organizations
BRlkl
's models
147
Sort: Recently updated
BRlkl/TCC-state
Updated
3 days ago
BRlkl/GRPO-6-harder_70
Updated
3 days ago
BRlkl/GRPO-6-harder_60_verifiable_20
Updated
4 days ago
BRlkl/GRPO-6-harder_60_verifiable_10
Updated
4 days ago
BRlkl/GRPO-6-harder_60
Updated
5 days ago
BRlkl/GRPO-6-harder_50
Updated
6 days ago
BRlkl/GRPO-6-harder_40
Updated
6 days ago
BRlkl/fixed-14b
15B
•
Updated
7 days ago
•
18
BRlkl/GRPO-6-harder_30
Updated
7 days ago
BRlkl/GRPO-6-harder_20
Updated
8 days ago
BRlkl/GRPO-6-harder_10
Updated
8 days ago
BRlkl/GRPO-6_40
Updated
9 days ago
BRlkl/GRPO-6_30
Updated
10 days ago
BRlkl/GRPO-6_20
Updated
12 days ago
BRlkl/GRPO-6.1_10
Updated
13 days ago
BRlkl/GRPO-6_10
Updated
16 days ago
BRlkl/orchestrator-qwen3-4b-lora-new
Updated
17 days ago
BRlkl/GRPO-6_5
Updated
17 days ago
BRlkl/orchestrator-qwen3.5-2b-lora-sft-longer
Updated
20 days ago
BRlkl/orchestrator-qwen3.5-2b-lora-sft-larger
Updated
20 days ago
BRlkl/orchestrator-qwen3.5-2b-lora-sft
Updated
20 days ago
BRlkl/GRPO-5-sft-bootstrap-3
Updated
25 days ago
BRlkl/GRPO-5-sft-bootstrap-qwen3-4b-thinking-2507
Updated
25 days ago
BRlkl/distill-sft-grpo-4_70-full
Text Generation
•
4B
•
Updated
29 days ago
•
271
BRlkl/distill-sft-qwen3-4b-full
Text Generation
•
4B
•
Updated
29 days ago
•
209
BRlkl/distill-sft-qwen3-0.6b-full
Text Generation
•
0.6B
•
Updated
29 days ago
•
315
BRlkl/distill-sft-qwen3-8b-full
Text Generation
•
8B
•
Updated
30 days ago
•
296
BRlkl/distill-sft-qwen3-32b-full
Updated
30 days ago
BRlkl/GRPO-5-sft-bootstrap-2
Updated
Mar 24
BRlkl/GRPO-5-sft-bootstrap
Updated
Mar 24
Previous
1
2
3
...
5
Next