Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
khazarai
's Collections
Benchmarks
CoT
Az-Language
GRPO
Text-to-Speech Models
RLHF
SFT
RLHF
updated
Sep 12
Reinforcement Learning with Human Feedback
Upvote
1
khazarai/datascience-RLHF
Text Generation
•
Updated
Sep 9
•
32
•
1
khazarai/Social-RLHF
Text Generation
•
Updated
Sep 11
•
31
•
1
khazarai/Psychology-RLHF
Text Generation
•
Updated
Sep 11
•
32
•
1
Upvote
1
Share collection
View history
Collection guide
Browse collections