Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
guoqiang-x
/
zephyr-7b-dpo-qlora
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-dpo-qlora
Commit History
End of training
0fedd69
verified
guoqiang-x
commited on
Oct 23
Model save
eebedcd
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3821
5c4b6b8
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3800
035a2ce
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3700
2322851
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3600
744cb38
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3500
71880ab
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3400
c307546
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3300
0c36061
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3200
8bdde61
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3100
03eaab7
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 3000
ba26e0c
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 2900
4773d13
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 2800
70d33b4
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 2700
a44d0bc
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 2600
6bf8ec1
verified
guoqiang-x
commited on
Oct 23
Training in progress, step 2500
be1c0fe
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 2400
361e34f
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 2300
df3e834
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 2200
4a051aa
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 2100
be4a5c6
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 2000
a31cd35
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1900
aa46805
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1800
dd7c950
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1700
a6b2094
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1600
b8a136f
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1500
30c7fcf
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1400
da72df2
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1300
9569e01
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1200
cc49a6a
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1100
6b2f261
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 1000
75cc83e
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 900
7b1d74a
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 800
e24d8a8
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 700
46597ec
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 600
bae60fd
verified
guoqiang-x
commited on
Oct 22
Training in progress, step 500
b733f05
verified
guoqiang-x
commited on
Oct 21
Training in progress, step 400
cb973e4
verified
guoqiang-x
commited on
Oct 21
Training in progress, step 300
1a89e79
verified
guoqiang-x
commited on
Oct 21
Training in progress, step 200
bae25ff
verified
guoqiang-x
commited on
Oct 21
Training in progress, step 100
8e87dce
verified
guoqiang-x
commited on
Oct 21
initial commit
9ea24d4
verified
guoqiang-x
commited on
Sep 16