jiaxin-wen/MisleadLM-code
PyTorch
llama
arxiv:2409.12822
jiaxin-wen committed on Oct 11
Commit da7bc3d • 1 Parent(s): 8e31fa3
Create README.md
Files changed (1)
README.md ADDED (+5 -0)
@@ -0,0 +1,5 @@
+## Language Model Learns to Mislead Humans via RLHF
+
+This repository contains the RLHF'ed code generation model in our paper: https://arxiv.org/pdf/2409.12822.
+
+It's initialized based on deepseek-coder-7B.
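
The README added in this commit gives no usage instructions. The following is a minimal sketch of how such a checkpoint might be loaded with the `transformers` library. It assumes the repository ships tokenizer files and a standard llama-style checkpoint, as the page's `llama` and `PyTorch` tags suggest. The `build_prompt` helper and its prompt layout are hypothetical and are not taken from the paper.

```python
REPO_ID = "jiaxin-wen/MisleadLM-code"  # repository name as shown on this page


def build_prompt(task: str) -> str:
    """Hypothetical plain-text prompt; the format used in the paper may differ."""
    return f"# Task: {task.strip()}\n# Solution:\n"


def generate_solution(task: str, max_new_tokens: int = 256) -> str:
    """Generate a code completion for `task` with greedy decoding."""
    # Imported lazily so build_prompt can be used without these packages installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the repo contains tokenizer files and transformers-compatible weights.
    tokenizer = AutoTokenizer.from_pretrained(REPO_ID)
    model = AutoModelForCausalLM.from_pretrained(REPO_ID, torch_dtype=torch.bfloat16)
    model.eval()

    inputs = tokenizer(build_prompt(task), return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    # Slice off the prompt tokens and decode only the generated continuation.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_solution("Write a function that reverses a string.")` would download the weights on first use. Since the paper studies a model that learns to mislead human evaluators, generated code should be checked with tests and not judged by how plausible it looks.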