Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
anakin87
/
gemma-2b-orpo
like
28
Text Generation
Transformers
Safetensors
alvarobartt/dpo-mix-7k-simplified
English
gemma
trl
orpo
Generated from Trainer
conversational
Eval Results
text-generation-inference
Inference Endpoints
arxiv:
2403.07691
License:
gemma-terms-of-use
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
main
gemma-2b-orpo
Commit History
Update README.md
bf6bfe3
verified
anakin87
commited on
May 6
Update README.md
1f9a318
verified
anakin87
commited on
May 6
add evaluation on Open LLM Leaderboard
7569a46
verified
anakin87
commited on
May 6
link to GGUF version
76e5b9c
verified
anakin87
commited on
Apr 6
Upload tokenizer.model
b946408
verified
anakin87
commited on
Apr 6
improve readme
1a06bf8
anakin87
commited on
Mar 26
retry nb visualization
f18f009
anakin87
commited on
Mar 26
improve notebook visualization
c8b9386
anakin87
commited on
Mar 26
fix
cd3951b
anakin87
commited on
Mar 25
fixes
189413b
anakin87
commited on
Mar 25
improve readme
ce4ba3c
anakin87
commited on
Mar 25
Upload gemma-2b-orpo.png
159c797
verified
anakin87
commited on
Mar 25
material
4db7146
anakin87
commited on
Mar 25
Update README.md
5cbf999
verified
anakin87
commited on
Mar 25
little change
15a13e0
anakin87
commited on
Mar 25
End of training
7fbb0bb
verified
anakin87
commited on
Mar 24
Training in progress, epoch 2
b6e4162
verified
anakin87
commited on
Mar 24
Training in progress, epoch 2
d241042
verified
anakin87
commited on
Mar 24
Training in progress, epoch 0
3c43e7b
verified
anakin87
commited on
Mar 24
initial commit
dcc2f5c
verified
anakin87
commited on
Mar 24