Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ContextualAI
/
archangel_ppo_llama7b
like
0
Follow
ContextualAI
54
Text Generation
Transformers
Safetensors
stanfordnlp/SHP
Anthropic/hh-rlhf
OpenAssistant/oasst1
English
llama
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
20054e1
archangel_ppo_llama7b
Commit History
Upload README.md with huggingface_hub
20054e1
xwinxu
commited on
Jan 9
Upload README.md with huggingface_hub
8a79581
xwinxu
commited on
Jan 9
Upload README.md with huggingface_hub
591cc5b
xwinxu
commited on
Jan 8
Upload LlamaForCausalLM
334c254
xwinxu
commited on
Jan 8
Upload tokenizer
f2e1b03
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
a17ddfd
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
7134ae1
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
9c04901
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
ff5e385
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
1ce3a64
xwinxu
commited on
Dec 6, 2023
Upload README.md with huggingface_hub
c5cbad6
xwinxu
commited on
Dec 6, 2023
Upload LlamaForCausalLM
37cf3a3
stas
commited on
Nov 26, 2023
Upload tokenizer
ce7f3ef
stas
commited on
Nov 26, 2023
initial commit
3782ae8
stas
commited on
Nov 26, 2023