martimfasantos/tinyllama-1.1b-chat-dpo-qlora
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
llama
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
License: apache-2.0
Commit History
End of training | bc5dc81 | verified | martimfasantos committed on Apr 24
Model save | 44b1dd3 | verified | martimfasantos committed on Apr 24
Training in progress, step 3800 | 6be9751 | verified | martimfasantos committed on Apr 24
Training in progress, step 3700 | 3c9c89c | verified | martimfasantos committed on Apr 24
Training in progress, step 3600 | 3dc2ed5 | verified | martimfasantos committed on Apr 24
Training in progress, step 3500 | 3498428 | verified | martimfasantos committed on Apr 24
Training in progress, step 3400 | 14f5ec1 | verified | martimfasantos committed on Apr 24
Training in progress, step 3300 | bf57e55 | verified | martimfasantos committed on Apr 24
Training in progress, step 3200 | dc829eb | verified | martimfasantos committed on Apr 24
Training in progress, step 3100 | 0164ea8 | verified | martimfasantos committed on Apr 24
Training in progress, step 3000 | 8b463a8 | verified | martimfasantos committed on Apr 24
Training in progress, step 2900 | e118f46 | verified | martimfasantos committed on Apr 24
Training in progress, step 2800 | f22eac4 | verified | martimfasantos committed on Apr 24
Training in progress, step 2700 | f2ef03d | verified | martimfasantos committed on Apr 24
Training in progress, step 2600 | 5ef7801 | verified | martimfasantos committed on Apr 24
Training in progress, step 2500 | 5aca70f | verified | martimfasantos committed on Apr 24
Training in progress, step 2400 | c519894 | verified | martimfasantos committed on Apr 24
Training in progress, step 2300 | c6c9c0a | verified | martimfasantos committed on Apr 24
Training in progress, step 2100 | 063b760 | verified | martimfasantos committed on Apr 24
Training in progress, step 2000 | d338e2f | verified | martimfasantos committed on Apr 24
Training in progress, step 1900 | 435cce9 | verified | martimfasantos committed on Apr 24
Training in progress, step 1800 | 6fb4b6c | verified | martimfasantos committed on Apr 24
Training in progress, step 1500 | 22232e4 | verified | martimfasantos committed on Apr 24
Training in progress, step 1400 | 90dd99f | verified | martimfasantos committed on Apr 24
Training in progress, step 1300 | f09d87e | verified | martimfasantos committed on Apr 24
Training in progress, step 1200 | be15821 | verified | martimfasantos committed on Apr 24
Training in progress, step 1100 | cecab76 | verified | martimfasantos committed on Apr 24
Training in progress, step 1000 | d1437ef | verified | martimfasantos committed on Apr 24
Training in progress, step 900 | 404e8df | verified | martimfasantos committed on Apr 24
Training in progress, step 800 | 27c1362 | verified | martimfasantos committed on Apr 24
Training in progress, step 700 | 7a4e829 | verified | martimfasantos committed on Apr 24
Training in progress, step 600 | 926449b | verified | martimfasantos committed on Apr 24
Training in progress, step 500 | 7d446c9 | verified | martimfasantos committed on Apr 24
Training in progress, step 400 | 8be7b6d | verified | martimfasantos committed on Apr 24
Training in progress, step 300 | 1fb6177 | verified | martimfasantos committed on Apr 24
Training in progress, step 200 | 42ff024 | verified | martimfasantos committed on Apr 24
Training in progress, step 100 | 1188cf0 | verified | martimfasantos committed on Apr 24
initial commit | b1dd477 | verified | martimfasantos committed on Apr 23