Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
6
Jeewoo Kim
jdubkim
Follow
0 followers
·
1 following
jdubkim
AI & ML interests
LLM (Reasoning, RLHF) Trust and Safety
Organizations
models
1
jdubkim/ppo-LunarLander-v2-TEST
Reinforcement Learning
•
Updated
Dec 20, 2022
•
1
datasets
None public yet